Gene CAPHI28684 (A0A452E363)

E Capra hircus | Polypeptide N-acetylgalactosaminyltransferase [GALNT7]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
CAPHI28685 657 12 UniProtKB/TrEMBL A0A452E387
CAPHI28684 reference isoform 657 12 UniProtKB/TrEMBL A0A452E363


Protein Sequence

Download: Fasta
MRLKIGLLLR SLLVVGSFLG LVVLWSSLSP RPDEPSPLSR MREDRAVNNP MPDRGGNGLA    60
LGDDRFKPVV PWPHVEGVEV DLESIRRRNK ARHEQERRSG GEDQKDIMQR QYLTFKPQTF   120
TYHDPVLRPG VLGNFEPKEP EPPGVVGGPG EKAKPLVLGP EFKHAVQASI KEFGFNMVAS   180
DMISLDRSVN DLRQEECKYW HYDENLLTAS IIIVFHNEGW STLMRTVHSV IKRTPRKYLA   240
EIVLIDDFSN KEHLKEKLDD YIKLWNGLVK VFRNERREGL IQARSIGAQK AKLGQVLIYL   300
DAHCEVAVNW YAPLVAPISK DRTICTVPLI DVINGNTYEI VPQGGGDEDG YARGAWDWSM   360
LWKRVPLTPR EKRLRKTKTE PYRSPAMAGG LFAIERDFFF ELGLYDPGLQ IWGGENFEIS   420
YKIWQCGGKL LFVPCSRVGH IYRLEGWQGN PPPVYVGSSP TLKNYVRVVE VWWDEYKDYF   480
YASRPESKAL AYGDISELKK FREDHNCKSF KWFMEEIAYD ITSHYPLPPK NVDWGEIRGF   540
ETVYCIDSMG KTNGGFVELG PCHRMGGNQL FRINEANQLM QYDQCLTKGP DGSKVMITHC   600
NLNEFKEWQY FKNLHRLTHI SSGKCLDRSE VLHQVFISDC DSSKMTQKWE INNIHSV      657

cDNA Sequence

Download: Fasta
ATGAGGCTGA AGATCGGGCT CCTCTTACGC AGTTTGCTGG TGGTGGGCAG CTTCCTGGGG    60
CTGGTGGTCC TCTGGTCCTC CCTGTCCCCG CGGCCGGACG AGCCGAGCCC GCTGAGCAGG   120
ATGAGGGAAG ACAGAGCTGT TAATAACCCC ATGCCTGACA GAGGGGGCAA CGGACTAGCC   180
CTCGGGGATG ACAGATTCAA ACCCGTGGTG CCCTGGCCTC ACGTGGAAGG CGTAGAAGTG   240
GACTTGGAGT CTATTCGAAG AAGGAACAAG GCCAGACATG AACAAGAGCG CCGCTCCGGA   300
GGAGAGGACC AGAAAGACAT CATGCAGAGG CAATATCTCA CGTTTAAGCC CCAGACTTTC   360
ACCTACCATG ATCCTGTGCT CCGCCCGGGG GTCCTCGGTA ACTTTGAGCC CAAAGAGCCT   420
GAGCCTCCCG GAGTGGTCGG TGGGCCTGGA GAGAAAGCCA AGCCGTTGGT TCTGGGACCA   480
GAATTCAAAC ACGCAGTTCA AGCCAGCATT AAAGAGTTTG GATTTAACAT GGTGGCTAGT   540
GACATGATCT CCTTGGACCG CAGTGTGAAC GACTTGCGAC AGGAAGAATG CAAGTACTGG   600
CATTACGATG AAAACCTGCT CACAGCCAGC ATCATCATCG TCTTCCATAA TGAAGGGTGG   660
TCCACCCTCA TGAGAACAGT CCACAGTGTA ATTAAAAGGA CTCCAAGGAA ATACTTAGCA   720
GAAATCGTGT TAATTGATGA TTTCAGTAAC AAAGAACACT TAAAAGAAAA ACTGGATGAC   780
TATATTAAAC TGTGGAATGG CCTAGTCAAG GTATTTCGAA ATGAGAGAAG AGAAGGTTTA   840
ATTCAGGCAC GAAGTATCGG TGCCCAGAAG GCCAAACTTG GACAGGTTTT AATATACCTC   900
GATGCCCACT GTGAGGTGGC GGTTAACTGG TATGCGCCGC TTGTAGCTCC CATATCTAAG   960
GACAGAACCA TTTGCACTGT GCCGCTCATA GATGTCATAA ATGGCAACAC ATATGAGATT  1020
GTGCCCCAAG GGGGTGGTGA CGAAGATGGG TATGCCCGAG GAGCATGGGA CTGGAGTATG  1080
CTCTGGAAAC GGGTGCCTCT GACCCCCCGA GAGAAGAGAC TGAGAAAGAC AAAAACTGAA  1140
CCATATCGGT CCCCAGCCAT GGCTGGAGGG TTATTCGCCA TCGAAAGAGA CTTCTTCTTT  1200
GAGCTGGGTC TCTACGATCC AGGCCTCCAG ATTTGGGGTG GTGAAAACTT TGAGATCTCA  1260
TACAAGATCT GGCAATGTGG AGGCAAGTTG CTGTTTGTCC CTTGTTCCCG TGTTGGGCAC  1320
ATCTACCGTC TTGAGGGCTG GCAGGGGAAC CCCCCACCCG TCTATGTCGG GTCTTCTCCT  1380
ACCCTAAAGA ATTATGTTAG AGTTGTGGAG GTCTGGTGGG ATGAGTACAA AGACTACTTC  1440
TATGCCAGTC GCCCTGAATC GAAGGCCTTA GCATATGGGG ATATATCAGA GCTGAAGAAG  1500
TTTCGAGAAG ATCACAACTG TAAGAGTTTC AAATGGTTCA TGGAAGAAAT AGCTTATGAC  1560
ATCACCTCAC ACTACCCTCT GCCGCCCAAA AATGTCGACT GGGGAGAAAT CAGAGGCTTT  1620
GAAACTGTGT ACTGCATCGA CAGCATGGGG AAAACCAATG GAGGCTTTGT GGAGCTGGGC  1680
CCCTGCCACA GGATGGGCGG GAATCAGCTT TTCCGAATCA ATGAAGCCAA TCAGCTCATG  1740
CAGTACGACC AGTGTTTGAC GAAGGGGCCT GATGGATCCA AAGTTATGAT TACACACTGC  1800
AATCTGAATG AATTTAAGGA ATGGCAGTAC TTCAAGAACC TGCACAGACT GACTCACATT  1860
TCTTCTGGAA AGTGCTTGGA TCGCTCGGAG GTCCTCCATC AAGTATTCAT CTCCGACTGC  1920
GACTCCAGCA AAATGACTCA AAAGTGGGAG ATAAATAACA TCCATAGTGT TTAG        1974