Gene CERAT41308 (A0A2K5KU96)

E Cercocebus atys | Heparan-alpha-glucosaminide N-acetyltransferase [HGSNAT]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
CERAT41306 635 18 UniProtKB/TrEMBL A0A2K5KU52
CERAT41307 633 18 UniProtKB/TrEMBL A0A2K5KU85
CERAT41308 reference isoform 638 17 UniProtKB/TrEMBL A0A2K5KU96


Protein Sequence

Download: Fasta
MSGAGRALVV LLLAASVLSA ALLAPGGSSE RDAQAAPPRD LDKKRHVELK MDQALLLIHN    60
ELLGANLTVY WKSECCYHCL FQVLVNVPQS PKAGKPSVAA ASVSTQHGAI LQLNNTLEEK   120
EVCRLEYRFG EFGNYSLLVK NIHNGVSEIA CDLAVNEDPV DSNLPVSIAF LIGLAVIIVI   180
SFLKLLLSLD DFNSWISKAI SSRETDRLIN SELGSPSRTD PLDGDVQPAV WHLSVPPPRL   240
RSVDTFRGIA LILMVFVNYG GGKYWYFKHA NKKNWKWPPI NNWILFRFVF IMGSSIFLSM   300
TSILQRGCSK FRLLGKIAWR SFLLICIGII IVNPNYCLGP LSWDKVRIPG VLQRLGVTYF   360
VVAVLELLFA KPVPEHCASE RSCLSLRDIT SSWPQWLLIL ALEGLWLGLT FLLPVPGCPT   420
GYLGPGGIGD FGKYPNCTGG AAGYIDRLLL GDDHLYQHPS STVLYHTEVA YDPEGILGTI   480
NSIVMAFLGV QAGKILLYYK AQTKDILIRF TAWCCILGLI SVVLTKVSEN EGFIPVNKNL   540
WSLSYVTTLS SFAFFILLVL YPVVDVKGLW TGTPFFYPGM NSILVYVGHE VFENYFPFQW   600
KLKDNQSHKE HLTQNLVATA LWVLIAYILY RKKIFWKI                           638

cDNA Sequence

Download: Fasta
ATGAGCGGTG CGGGCAGGGC GCTGGTCGTG CTGCTGCTGG CCGCGTCAGT GCTGAGCGCG    60
GCGCTGCTGG CTCCCGGCGG CTCTTCGGAG CGCGATGCCC AGGCCGCGCC GCCTCGAGAC   120
TTAGACAAAA AAAGACATGT AGAGCTGAAG ATGGATCAGG CTTTGCTACT CATCCATAAT   180
GAACTTCTTG GGGCCAACTT GACTGTCTAC TGGAAATCTG AATGCTGTTA TCACTGCTTG   240
TTTCAGGTGC TGGTAAACGT TCCTCAGAGT CCAAAAGCCG GGAAGCCTAG CGTTGCAGCC   300
GCCTCTGTCA GCACCCAGCA CGGAGCTATC CTGCAGCTGA ACAACACCTT GGAAGAGAAA   360
GAAGTTTGTA GACTGGAATA CAGATTTGGA GAATTTGGAA ACTATTCTCT CTTGGTAAAG   420
AATATCCATA ATGGAGTTAG TGAAATTGCC TGTGACCTGG CTGTGAACGA GGATCCAGTT   480
GATAGTAACC TTCCTGTGAG CATTGCATTC CTTATTGGTC TTGCTGTCAT CATTGTGATA   540
TCCTTTCTGA AGCTCTTGTT GAGTTTGGAT GACTTTAACA GTTGGATTTC TAAAGCCATA   600
AGTTCTCGAG AAACTGATCG CCTCATCAAT TCTGAGCTGG GATCTCCCAG CAGGACAGAC   660
CCTCTAGATG GTGATGTCCA GCCAGCAGTG TGGCATCTGT CTGTCCCGCC GCCCCGCCTC   720
CGCAGCGTGG ACACCTTCAG AGGGATTGCT CTCATACTCA TGGTCTTTGT CAATTATGGA   780
GGAGGAAAAT ATTGGTACTT CAAACACGCA AACAAAAAAA ATTGGAAATG GCCACCTATT   840
AATAACTGGA TTCTTTTTAG GTTTGTATTT ATTATGGGAT CTTCCATTTT TCTATCAATG   900
ACTTCTATAC TGCAACGGGG ATGTTCAAAA TTCAGATTGC TGGGGAAGAT TGCATGGAGG   960
AGTTTCCTGT TAATCTGCAT AGGAATTATC ATTGTGAATC CCAATTATTG CCTTGGTCCA  1020
TTGTCTTGGG ACAAGGTGCG CATTCCTGGT GTGCTGCAGC GCTTGGGAGT GACATACTTT  1080
GTGGTTGCTG TGTTGGAGCT CCTCTTTGCT AAACCTGTGC CTGAACATTG TGCCTCGGAG  1140
AGGAGCTGCC TTTCTCTTCG AGACATCACA TCCAGCTGGC CTCAGTGGCT GCTCATCCTG  1200
GCACTGGAAG GCCTGTGGCT GGGCTTGACA TTCCTCCTGC CAGTCCCTGG GTGCCCTACT  1260
GGTTATCTTG GTCCTGGGGG CATTGGAGAT TTTGGCAAGT ATCCCAATTG CACTGGAGGA  1320
GCTGCGGGCT ACATCGACCG CCTGCTGCTG GGAGACGATC ACCTTTACCA GCACCCATCT  1380
TCTACTGTGC TTTACCACAC CGAGGTGGCC TATGACCCTG AGGGCATCCT GGGCACCATC  1440
AACTCCATCG TGATGGCCTT TTTAGGAGTT CAGGCAGGAA AAATACTATT GTATTACAAG  1500
GCTCAGACCA AAGACATCCT GATTCGATTC ACTGCTTGGT GTTGTATTCT TGGGCTCATT  1560
TCTGTTGTTC TGACGAAAGT TTCTGAAAAT GAAGGCTTTA TTCCAGTAAA CAAAAACCTC  1620
TGGTCCCTTT CGTATGTCAC CACGCTCAGT TCTTTTGCCT TCTTCATCCT GTTGGTCCTG  1680
TACCCGGTTG TGGATGTGAA GGGGCTGTGG ACGGGAACTC CATTCTTTTA TCCAGGAATG  1740
AATTCTATTC TGGTGTACGT CGGCCACGAG GTGTTTGAGA ACTACTTCCC CTTTCAGTGG  1800
AAGCTGAAAG ACAACCAGTC CCACAAGGAG CACCTGACTC AGAACCTCGT TGCCACTGCC  1860
CTCTGGGTGC TCATTGCCTA CATCCTCTAT AGAAAGAAGA TTTTTTGGAA AATCTGA     1917