Gene CAPHI18095 (A0A452DLB3)

E Capra hircus | Heparan-alpha-glucosaminide N-acetyltransferase [HGSNAT]

Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID                      Sequence length  Number of exons  Cross reference
CAPHI18093                      154              5                Ensembl ENSCHIG00000000180.1
CAPHI18094                      641              18               UniProtKB/TrEMBL A0A452DKA5
CAPHI18096                      341              10               UniProtKB/TrEMBL A0A452DKB5
CAPHI18095 (reference isoform)  631              18               UniProtKB/TrEMBL A0A452DLB3


Protein Sequence

HAGSLVATYG LLMAAYGILF PNQGFPGGSD GKASVYNVGD LDKKKHVELK MDQALLLIHN    60
ELPETNLTVY WNFDRCYHCL LQVLATVPQS RKAGRPGVAA VAVGTQHGSI LQLNDTAEDK   120
EVCRLEYKFG EFGNYSLLVK HVRDGVSEIA CDLVVNKEPV DSNLPVCIAF LVGMVLVIVV   180
SFLRFLLSLE DFQNWISKAI NSRETDRLIN SELGSPSRAS DPQPEAWRRS AAPLRLRCVD   240
TFRGMALILM VFVNYGGGKY WYFKHSSWNG LTVADLVFPW FVFIMGASIF LSMTSILQRG   300
CSKFRLLGKI VWRSLLLICI GIFVVNPNYC LGPLSWEKAR IPGVLQRLGA TYFVVAVLEL   360
LFAKPVPETC ASERSCFSLL DITASWPQWL FVLILEGVWL ALTFFLPVPG CPTGYLGPGG   420
IGDGGRYRNC TGGAAGYVDR LLLGDQHLYQ HPSSAVLYHT EVAYDPEGIL GTINSIVMAF   480
LGVQAGKILL YYKDQTRGIL IRFAAWGCLL GLVSVALTKA SENEGFIPVN KNLWSVSYVT   540
TLSSLAFLIL LALYPVVDVK GLWTGAPFFY PGMNSILVYV GHEVFANYFP FQWKLGDQQS   600
HKEHLVQNTV ATALWVLIAY VLYKKKVFWK I                                  631

cDNA Sequence

CATGCAGGAT CTTTAGTTGC GACATATGGA CTTTTAATGG CGGCATATGG AATCTTGTTC    60
CCTAACCAGG GGTTCCCTGG TGGCTCAGAT GGTAAAGCGT CTGTCTACAA TGTGGGAGAC   120
CTGGACAAAA AGAAGCACGT GGAGCTGAAG ATGGACCAGG CTTTGCTGCT CATCCATAAT   180
GAACTGCCCG AGACAAACCT GACCGTCTAC TGGAATTTCG ATCGCTGTTA CCATTGCCTG   240
CTTCAGGTTC TGGCCACCGT CCCGCAGAGC AGGAAGGCCG GGCGGCCGGG CGTCGCAGCC   300
GTGGCCGTGG GCACCCAGCA CGGGTCCATC CTGCAGCTGA ACGACACCGC GGAGGACAAG   360
GAAGTCTGTA GGCTGGAGTA CAAATTTGGA GAATTTGGAA ACTATTCTCT CTTGGTAAAG   420
CACGTCCGTG ATGGAGTCAG TGAAATTGCT TGTGACCTGG TCGTCAACAA GGAGCCGGTT   480
GACAGTAACC TTCCTGTGTG TATCGCGTTC CTTGTTGGCA TGGTGCTCGT CATCGTGGTG   540
TCCTTCCTGA GGTTCTTGCT GAGTTTGGAA GACTTCCAGA ATTGGATTTC CAAAGCAATC   600
AATTCTCGGG AAACCGATCG CCTCATCAAT TCCGAGCTGG GGTCCCCGAG CAGAGCGAGT   660
GACCCCCAAC CAGAGGCCTG GCGTCGGTCA GCGGCCCCGC TGCGCCTCCG CTGCGTGGAC   720
ACGTTCCGAG GGATGGCACT CATCCTCATG GTCTTCGTGA ACTATGGAGG CGGGAAATAC   780
TGGTACTTCA AGCACTCGAG CTGGAACGGG CTGACGGTGG CCGACCTTGT GTTCCCATGG   840
TTTGTGTTTA TTATGGGAGC TTCGATTTTT CTGTCGATGA CTTCCATTCT GCAGCGGGGA   900
TGTTCAAAAT TCAGACTACT GGGAAAGATC GTGTGGAGGA GTTTGCTGTT AATCTGTATA   960
GGAATTTTCG TTGTGAACCC CAATTATTGC CTTGGTCCAC TGTCCTGGGA GAAGGCGCGC  1020
ATCCCCGGCG TGCTCCAGCG CCTGGGGGCC ACCTACTTCG TGGTCGCCGT GTTGGAGCTG  1080
CTCTTCGCCA AGCCCGTGCC CGAGACCTGC GCCTCGGAGA GAAGCTGCTT TTCTCTGCTG  1140
GATATCACGG CCAGTTGGCC CCAGTGGCTC TTCGTGCTGA TACTGGAAGG CGTCTGGCTG  1200
GCCTTGACCT TCTTCTTACC GGTTCCTGGG TGCCCCACCG GTTACCTGGG GCCCGGCGGC  1260
ATCGGAGACG GGGGCAGGTA CCGGAACTGC ACGGGCGGCG CCGCGGGCTA CGTGGACCGC  1320
CTGCTCCTGG GCGACCAGCA CCTCTACCAG CACCCTTCTT CCGCTGTGCT TTACCACACC  1380
GAGGTGGCCT ATGACCCAGA GGGCATCCTG GGGACCATCA ACTCCATCGT GATGGCATTT  1440
TTGGGAGTTC AGGCAGGGAA GATACTCCTG TATTACAAGG ACCAGACCAG AGGCATCCTA  1500
ATCAGATTCG CTGCCTGGGG TTGTCTTCTT GGGCTTGTTT CGGTGGCTCT GACGAAAGCA  1560
TCTGAAAATG AAGGCTTTAT TCCAGTAAAC AAAAACCTCT GGTCCGTCTC CTACGTCACC  1620
ACGCTGAGCT CCTTGGCCTT CCTCATCCTG CTGGCCCTCT ACCCCGTGGT GGATGTCAAG  1680
GGGCTGTGGA CGGGAGCCCC GTTCTTCTAC CCGGGGATGA ACTCGATCCT GGTGTACGTG  1740
GGCCATGAGG TCTTCGCCAA CTACTTCCCC TTCCAGTGGA AGCTGGGGGA CCAGCAGTCG  1800
CACAAGGAGC ACCTCGTGCA GAACACGGTC GCCACCGCCC TGTGGGTGCT CATCGCCTAC  1860
GTTCTCTATA AGAAGAAGGT GTTCTGGAAA ATCTGA                            1896
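
The two listings are length-consistent: 1896 nucleotides equal (631 + 1) x 3, that is, 631 codons plus a terminal stop codon (the cDNA ends in TGA). The following is a minimal sketch of how that check could be scripted, assuming the sequences are copied exactly as laid out above (whitespace-separated groups followed by a running position count). The function names are hypothetical and are not part of any database tooling.

```python
def parse_numbered_block(text: str) -> str:
    """Join the sequence groups of a numbered listing into one string.

    Assumes each line holds whitespace-separated letter groups followed by
    a running position count; purely alphabetic tokens are kept and the
    numeric counter is dropped.
    """
    groups = []
    for line in text.splitlines():
        groups.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(groups)


def cdna_matches_protein(cdna: str, protein: str) -> bool:
    """Check that the cDNA is a whole number of codons covering the
    protein plus exactly one terminal stop codon (TAA, TAG or TGA)."""
    stop_codons = {"TAA", "TAG", "TGA"}
    return (
        len(cdna) % 3 == 0
        and len(cdna) // 3 == len(protein) + 1
        and cdna[-3:].upper() in stop_codons
    )


# Example using only the final line of each listing above.
protein_tail = parse_numbered_block("HKEHLVQNTV ATALWVLIAY VLYKKKVFWK I    631")
cdna_tail = parse_numbered_block("GTTCTCTATA AGAAGAAGGT GTTCTGGAAA ATCTGA    1896")
print(len(protein_tail), len(cdna_tail))
```

Passing the full protein and cDNA listings through `parse_numbered_block` and then to `cdna_matches_protein` would apply the same check to the complete record. The check covers lengths and the stop codon only; it does not translate the cDNA.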