E Cercocebus atys | Heparan-alpha-glucosaminide N-acetyltransferase [HGSNAT]
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
CERAT41306 | 635 | 18 | ![]() | |||
CERAT41307 | 633 | 18 | ![]() | |||
CERAT41308 reference isoform | 638 | 17 | ![]() |
MSGAGRALVV LLLAASVLSA ALLAPGGSSE RDAQAAPPRD LDKKRHVELK MDQALLLIHN 60 ELLGANLTVY WKSECCYHCL FQVLVNVPQS PKAGKPSVAA ASVSTQHGAI LQLNNTLEEK 120 EVCRLEYRFG EFGNYSLLVK NIHNGVSEIA CDLAVNEDPV DSNLPVSIAF LIGLAVIIVI 180 SFLKLLLSLD DFNSWISKAI SSRETDRLIN SELGSPSRTD PLDGDVQPAV WHLSVPPPRL 240 RSVDTFRGIA LILMVFVNYG GGKYWYFKHA NKKNWKWPPI NNWILFRFVF IMGSSIFLSM 300 TSILQRGCSK FRLLGKIAWR SFLLICIGII IVNPNYCLGP LSWDKVRIPG VLQRLGVTYF 360 VVAVLELLFA KPVPEHCASE RSCLSLRDIT SSWPQWLLIL ALEGLWLGLT FLLPVPGCPT 420 GYLGPGGIGD FGKYPNCTGG AAGYIDRLLL GDDHLYQHPS STVLYHTEVA YDPEGILGTI 480 NSIVMAFLGV QAGKILLYYK AQTKDILIRF TAWCCILGLI SVVLTKVSEN EGFIPVNKNL 540 WSLSYVTTLS SFAFFILLVL YPVVDVKGLW TGTPFFYPGM NSILVYVGHE VFENYFPFQW 600 KLKDNQSHKE HLTQNLVATA LWVLIAYILY RKKIFWKI 638
ATGAGCGGTG CGGGCAGGGC GCTGGTCGTG CTGCTGCTGG CCGCGTCAGT GCTGAGCGCG 60 GCGCTGCTGG CTCCCGGCGG CTCTTCGGAG CGCGATGCCC AGGCCGCGCC GCCTCGAGAC 120 TTAGACAAAA AAAGACATGT AGAGCTGAAG ATGGATCAGG CTTTGCTACT CATCCATAAT 180 GAACTTCTTG GGGCCAACTT GACTGTCTAC TGGAAATCTG AATGCTGTTA TCACTGCTTG 240 TTTCAGGTGC TGGTAAACGT TCCTCAGAGT CCAAAAGCCG GGAAGCCTAG CGTTGCAGCC 300 GCCTCTGTCA GCACCCAGCA CGGAGCTATC CTGCAGCTGA ACAACACCTT GGAAGAGAAA 360 GAAGTTTGTA GACTGGAATA CAGATTTGGA GAATTTGGAA ACTATTCTCT CTTGGTAAAG 420 AATATCCATA ATGGAGTTAG TGAAATTGCC TGTGACCTGG CTGTGAACGA GGATCCAGTT 480 GATAGTAACC TTCCTGTGAG CATTGCATTC CTTATTGGTC TTGCTGTCAT CATTGTGATA 540 TCCTTTCTGA AGCTCTTGTT GAGTTTGGAT GACTTTAACA GTTGGATTTC TAAAGCCATA 600 AGTTCTCGAG AAACTGATCG CCTCATCAAT TCTGAGCTGG GATCTCCCAG CAGGACAGAC 660 CCTCTAGATG GTGATGTCCA GCCAGCAGTG TGGCATCTGT CTGTCCCGCC GCCCCGCCTC 720 CGCAGCGTGG ACACCTTCAG AGGGATTGCT CTCATACTCA TGGTCTTTGT CAATTATGGA 780 GGAGGAAAAT ATTGGTACTT CAAACACGCA AACAAAAAAA ATTGGAAATG GCCACCTATT 840 AATAACTGGA TTCTTTTTAG GTTTGTATTT ATTATGGGAT CTTCCATTTT TCTATCAATG 900 ACTTCTATAC TGCAACGGGG ATGTTCAAAA TTCAGATTGC TGGGGAAGAT TGCATGGAGG 960 AGTTTCCTGT TAATCTGCAT AGGAATTATC ATTGTGAATC CCAATTATTG CCTTGGTCCA 1020 TTGTCTTGGG ACAAGGTGCG CATTCCTGGT GTGCTGCAGC GCTTGGGAGT GACATACTTT 1080 GTGGTTGCTG TGTTGGAGCT CCTCTTTGCT AAACCTGTGC CTGAACATTG TGCCTCGGAG 1140 AGGAGCTGCC TTTCTCTTCG AGACATCACA TCCAGCTGGC CTCAGTGGCT GCTCATCCTG 1200 GCACTGGAAG GCCTGTGGCT GGGCTTGACA TTCCTCCTGC CAGTCCCTGG GTGCCCTACT 1260 GGTTATCTTG GTCCTGGGGG CATTGGAGAT TTTGGCAAGT ATCCCAATTG CACTGGAGGA 1320 GCTGCGGGCT ACATCGACCG CCTGCTGCTG GGAGACGATC ACCTTTACCA GCACCCATCT 1380 TCTACTGTGC TTTACCACAC CGAGGTGGCC TATGACCCTG AGGGCATCCT GGGCACCATC 1440 AACTCCATCG TGATGGCCTT TTTAGGAGTT CAGGCAGGAA AAATACTATT GTATTACAAG 1500 GCTCAGACCA AAGACATCCT GATTCGATTC ACTGCTTGGT GTTGTATTCT TGGGCTCATT 1560 TCTGTTGTTC TGACGAAAGT TTCTGAAAAT GAAGGCTTTA TTCCAGTAAA CAAAAACCTC 1620 TGGTCCCTTT CGTATGTCAC CACGCTCAGT TCTTTTGCCT TCTTCATCCT GTTGGTCCTG 1680 TACCCGGTTG TGGATGTGAA GGGGCTGTGG ACGGGAACTC CATTCTTTTA TCCAGGAATG 1740 AATTCTATTC TGGTGTACGT CGGCCACGAG GTGTTTGAGA ACTACTTCCC CTTTCAGTGG 1800 AAGCTGAAAG ACAACCAGTC CCACAAGGAG CACCTGACTC AGAACCTCGT TGCCACTGCC 1860 CTCTGGGTGC TCATTGCCTA CATCCTCTAT AGAAAGAAGA TTTTTTGGAA AATCTGA 1917