Gene PROCO27657 (A0A2K6GMS5)

E Propithecus coquereli | Heparan-alpha-glucosaminide N-acetyltransferase [HGSNAT]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
PROCO27657 reference isoform 633 18 UniProtKB/TrEMBL A0A2K6GMS5


Protein Sequence

Download: Fasta
MRGARGGCWA LAALLLAASV LSAARLVPGG SPERGAQAAS PRDLDKKRPA ELKMDQALLL    60
IHNELLGTNL TVYWKSEWCY QCLFQVLVSV SPRGGPGGPG MAAVSVSTQH GSILQLNDTL   120
AEKEICRLQY KFGEFGNYSL LVKHVHNGVH EVACDLVVNE NPVDSDLPVS VAFLIGLALI   180
IALSLLRLLF SLDDFNNWIS KAINSQETDH LINSELGSPS RADPLGGDIQ PATWHPSTPP   240
PRLRCVDTFR GLSLILMVFV NYGGGKYWYF KHASWNGLTV ADLVFPWFVF IMGSSIFLSM   300
TSILQRGCSK FRLLGKIAWR SFLLICIGII IVNPNYCLGP LSWDKVRIPG VLQRLGVTYF   360
VVAVLELLFA KPVPENCALE RSCFSLGDVT SSWPQWLLIL MLESIWLGLT FFLPVPGCPP   420
GGIGDLGEHP NCTGGAAGYI DRLLLGDDHL HQRPSCTVLY HTEVAYDPEG ILGTINSIVM   480
AFLGVQAGKI LLYYKDQTKA ILIRFTAWGC ILGLISVALT KGSENEGFIP VNKNLWSISY   540
VTTLGSFAFF ILLVLYPVVD VRGLWTGAPF FYPGMNSILV YVGHEVFENY FPFQWKLEDN   600
QSHKEHLTQN IVATAVWVLI AYILYRKKIF WKI                                633

cDNA Sequence

Download: Fasta
ATGCGCGGGG CGCGGGGCGG CTGCTGGGCG CTGGCCGCGC TGCTGCTCGC CGCGTCGGTG    60
CTGAGCGCCG CGCGGTTGGT CCCTGGCGGC TCCCCGGAGC GCGGCGCCCA GGCCGCGTCC   120
CCGCGAGATT TAGACAAAAA AAGGCCTGCG GAGCTGAAGA TGGATCAGGC TTTGCTGCTC   180
ATCCACAACG AACTACTCGG GACAAACTTG ACTGTCTACT GGAAATCGGA ATGGTGCTAT   240
CAGTGCTTGT TTCAGGTGCT GGTCAGCGTC TCTCCGAGGG GAGGACCTGG GGGGCCCGGC   300
ATGGCAGCCG TCTCTGTCAG CACCCAGCAC GGCTCCATCC TGCAGCTGAA CGACACCCTG   360
GCAGAGAAAG AAATTTGTAG GCTGCAATAC AAATTTGGAG AGTTTGGAAA CTATTCTCTT   420
TTGGTGAAGC ACGTCCATAA CGGAGTCCAT GAGGTCGCCT GTGACCTAGT CGTCAACGAG   480
AATCCAGTTG ATAGTGATCT TCCTGTGAGT GTTGCATTCC TCATTGGTCT AGCACTCATC   540
ATTGCGCTGT CCCTTCTGAG GCTCTTGTTT AGTTTGGACG ACTTTAATAA TTGGATTTCT   600
AAGGCAATAA ATTCTCAAGA AACAGATCAC CTCATCAATT CTGAGCTGGG ATCTCCAAGC   660
AGGGCAGACC CTCTTGGCGG CGACATCCAG CCAGCCACAT GGCACCCGTC CACCCCTCCA   720
CCGCGCCTCC GCTGTGTGGA CACATTCAGA GGGCTATCTC TCATACTCAT GGTCTTCGTC   780
AATTACGGAG GAGGAAAATA CTGGTACTTC AAGCATGCGA GTTGGAATGG ACTGACGGTG   840
GCTGACCTTG TTTTCCCGTG GTTTGTATTT ATTATGGGAT CTTCAATTTT TCTATCAATG   900
ACATCTATAC TGCAACGGGG ATGTTCCAAA TTCAGATTGC TGGGGAAAAT TGCATGGAGG   960
AGTTTCCTAT TAATCTGTAT AGGAATTATC ATTGTGAATC CCAATTATTG CCTTGGTCCA  1020
TTGTCTTGGG ATAAGGTGCG AATTCCCGGT GTTCTGCAGC GGTTGGGAGT GACTTACTTC  1080
GTAGTTGCTG TGTTGGAGCT TCTCTTTGCT AAACCGGTGC CTGAAAATTG TGCCTTGGAG  1140
AGGAGCTGCT TTTCTCTTGG AGACGTCACG TCCTCCTGGC CCCAGTGGCT CCTCATTCTG  1200
ATGCTGGAAA GCATCTGGCT GGGCTTGACG TTCTTCCTGC CGGTTCCTGG GTGCCCGCCC  1260
GGCGGCATCG GAGATTTGGG GGAGCATCCA AATTGCACCG GAGGAGCTGC TGGCTACATC  1320
GACCGCCTGC TGCTGGGGGA CGATCACCTG CATCAGCGCC CTTCTTGCAC CGTGCTTTAT  1380
CACACTGAGG TGGCCTATGA TCCAGAAGGA ATCCTAGGGA CCATCAACTC CATTGTGATG  1440
GCCTTTTTAG GAGTTCAGGC AGGAAAAATT CTATTGTATT ACAAGGATCA GACCAAAGCT  1500
ATCCTGATCA GATTCACTGC CTGGGGTTGT ATTCTAGGGC TCATTTCTGT TGCTTTGACA  1560
AAAGGTTCTG AAAACGAAGG CTTTATTCCA GTAAACAAAA ACCTCTGGTC CATATCGTAC  1620
GTCACCACGC TCGGCTCTTT TGCCTTCTTC ATCCTGCTGG TCCTGTACCC GGTTGTGGAT  1680
GTGAGGGGGC TGTGGACAGG AGCCCCGTTC TTTTATCCAG GAATGAATTC TATTCTGGTG  1740
TATGTAGGCC ACGAGGTGTT TGAGAACTAC TTCCCCTTTC AGTGGAAGCT GGAGGATAAC  1800
CAGTCGCACA AGGAACATCT AACTCAGAAC ATAGTCGCCA CTGCAGTCTG GGTGCTCATC  1860
GCCTACATTC TCTATAGAAA GAAGATTTTT TGGAAAATCT GA                     1902