Gene PANTR42498 (A0A2I3RBX0)

E Pan troglodytes | Heparan-alpha-glucosaminide N-acetyltransferase [HGSNAT]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
PANTR42496 692 19 UniProtKB/TrEMBL A0A2I3SAL8
PANTR42497 663 18 UniProtKB/TrEMBL H2QW44
PANTR42498 reference isoform 616 18 UniProtKB/TrEMBL A0A2I3RBX0


Protein Sequence

Download: Fasta
RAWPTRSYFI GLLGPGGSLV NLDKKRHAEL KMDQALLLIH NELLWTNLTV YWKSECCYHC    60
LFQVLVNVPQ SPKAGKPSAA AASVSTQHGS ILQLNDTLEE KEVCRLEYRF GEFGNYSLLV   120
KNIHNGVSEI ACDLAVNEDP VDSNLPVSIA FLIGLAVIIV ISFLRLLLSL DDFNNWISKA   180
ISSRETDRLI NSELGSPSRT DPLDGDVQPA TWRLSALPPR LRSVDTFRGI ALILMVFVNY   240
GGGKYWYFKH ASWNGLTVAD LVFPWFVFIM GSSIFLSMTS ILQRGCSKFR LLGKIAWRSF   300
LLICIGIIIV NPNYCLGPLS WDKVRIPGVL QRLGVTYFVV AVLELLFAKP VPEHCASERS   360
CLSLRDITSS WPQWLLILVL EGLWLGLTFL LPVPGCPTGY LGPGGIGDFG KYPNCTGGAA   420
GYIDRLLLGD DHLYQHPSSA VLYHTEVAYD PEGILGTINS IVMAFLGVQA GKILLYYKAR   480
TKDILIRFTA WCCILGLISV ALTKVSENEG FIPVNKNLWS LSYVTTLSSF AFFILLVLYP   540
VVDVKGLWTG TPFFYPGMNS ILVYVGHEVF ENYFPFQWKL KDNQSHKEHL TQNIVATALW   600
VLIAYILYRK KIFWKI                                                   616

cDNA Sequence

Download: Fasta
CGTGCCTGGC CCACCAGGAG CTATTTCATA GGGCTCTTGG GGCCTGGTGG TTCTTTGGTA    60
AACTTAGACA AAAAAAGACA TGCAGAGCTG AAGATGGATC AGGCTTTGCT ACTCATCCAT   120
AATGAACTTC TCTGGACCAA CTTGACCGTC TACTGGAAAT CTGAATGCTG TTATCATTGC   180
TTGTTTCAGG TTCTGGTAAA CGTTCCTCAG AGTCCAAAAG CAGGGAAGCC TAGTGCTGCA   240
GCTGCCTCTG TCAGCACCCA GCACGGATCT ATCCTGCAGC TGAACGACAC CTTGGAAGAG   300
AAAGAAGTTT GTAGGTTGGA ATACAGATTT GGAGAATTTG GAAACTATTC TCTCTTGGTA   360
AAGAACATCC ATAATGGAGT TAGTGAAATT GCCTGTGACC TGGCTGTGAA CGAGGATCCA   420
GTTGATAGTA ACCTTCCTGT GAGCATTGCA TTCCTTATTG GTCTTGCTGT CATCATTGTG   480
ATATCCTTTC TGAGGCTCTT GTTGAGTTTG GATGACTTTA ACAATTGGAT TTCTAAAGCC   540
ATAAGTTCTC GAGAAACTGA TCGCCTCATC AATTCTGAGC TGGGATCTCC CAGCAGGACA   600
GACCCTCTCG ATGGTGATGT TCAGCCAGCA ACGTGGCGTC TATCTGCCCT GCCGCCCCGC   660
CTCCGCAGCG TGGACACCTT CAGGGGGATT GCTCTTATAC TCATGGTCTT TGTCAATTAT   720
GGAGGAGGAA AATATTGGTA CTTCAAACAC GCAAGTTGGA ATGGGCTGAC AGTGGCTGAC   780
CTCGTGTTCC CGTGGTTTGT ATTTATTATG GGATCTTCCA TTTTTCTATC GATGACTTCT   840
ATACTGCAAC GGGGATGTTC AAAATTCAGA TTGCTGGGGA AGATTGCATG GAGGAGTTTC   900
CTGTTAATCT GCATAGGAAT TATCATTGTG AATCCCAATT ATTGCCTTGG TCCATTGTCT   960
TGGGACAAGG TGCGCATTCC TGGTGTGCTG CAGCGATTGG GAGTGACATA CTTTGTGGTT  1020
GCTGTGTTGG AGCTCCTCTT TGCTAAACCT GTGCCTGAAC ATTGTGCCTC GGAGAGGAGC  1080
TGCCTTTCTC TTCGAGACAT CACGTCCAGC TGGCCCCAGT GGCTGCTCAT CCTGGTGCTG  1140
GAAGGCCTGT GGCTGGGCTT GACATTCCTC CTGCCAGTCC CTGGGTGCCC TACTGGTTAT  1200
CTTGGTCCTG GGGGCATTGG AGATTTTGGC AAGTATCCAA ATTGCACTGG AGGAGCTGCG  1260
GGCTACATCG ACCGCCTGCT GCTGGGAGAC GATCACCTTT ACCAGCACCC ATCTTCTGCT  1320
GTGCTTTACC ACACCGAGGT GGCCTATGAC CCCGAGGGCA TCCTGGGCAC CATCAACTCC  1380
ATCGTGATGG CCTTTTTAGG AGTTCAGGCA GGAAAAATAC TATTGTATTA CAAGGCTCGG  1440
ACCAAAGACA TCCTGATTCG ATTCACTGCT TGGTGTTGTA TTCTTGGGCT CATTTCTGTT  1500
GCTCTGACGA AGGTTTCTGA AAATGAAGGC TTTATTCCAG TAAACAAAAA TCTCTGGTCC  1560
CTTTCGTATG TCACTACGCT CAGTTCTTTT GCCTTCTTCA TTCTGCTGGT CCTGTACCCA  1620
GTTGTGGATG TGAAGGGGCT GTGGACAGGA ACCCCATTCT TTTATCCAGG AATGAATTCC  1680
ATTCTGGTAT ACGTCGGCCA CGAGGTGTTT GAGAACTACT TCCCCTTTCA GTGGAAGCTG  1740
AAGGACAACC AGTCCCACAA GGAGCACCTG ACTCAGAACA TCGTCGCCAC TGCCCTCTGG  1800
GTGCTCATTG CCTACATCCT CTATAGAAAG AAGATTTTTT GGAAAATCTG A           1851