Sarcophilus harrisii
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference |
---|---|---|---|---|---|
SARHA14412 reference isoform | 631 | 11 | | | |
MGHPELVFVS LICCLAAASA AKLGAVYTEG GFVEGVNKKL GLLGDSVDIF RGIPFAAPPK 60 ALEIPKRHPG WQGTLKAKEF KKRCLQATIT QDDTYGSEDC LYLNIWIPQG RKQVSRDLPV 120 MIWIYGGAFL MGAGHGANFL KNYLYDGEEI ATRGNVIVVT FNYRVGPLGF LSTGDSNLPG 180 NFGLWDQHMA IAWVKRNIAA FGGDPNNITI FGESAGGASV SIQTLTPHNK GLIKRAISQS 240 GVALSPWVIQ KNPLFWAKRI ASKVGCPLDD TAKMAKCFKI TDPRALTLAY KMPLAGMEYP 300 MLHYLSFVPV IDGDFIPDDP VNLFANAADI DYIIGTNNMD GHIFASIDMP AINKASQSIK 360 EEDFYKLVSG LTITKGLPGA KATFDFYTQL WSQDSSQETK KKTVVDFETD ILFLVPTKIA 420 LAQHIANAKS GRTYSYLFSH PSRMPVYPDW VGADHADDIQ YVFGKPFATP LGYRAQDRTV 480 SKTLIAYWTN FAKTGDPNMG NSAVPTHWSP YTVENGNYLE INKKVTANSM KQHLRTDYLR 540 FWTLTYQALP TVNQEDKDDV PENNVPEGSV PEGSVPEGSV PEGSVPEGSV PEGSVPEGSV 600 PEGSVPEDKV PVPPAADPEV SAPIAPVADS I 631
ATGGGGCACC CAGAGTTGGT GTTCGTCAGC CTCATCTGTT GCCTAGCAGC TGCTAGTGCA 60 GCTAAGTTGG GAGCTGTCTA CACCGAAGGG GGCTTTGTTG AAGGTGTCAA TAAGAAGTTG 120 GGTCTCCTTG GTGACTCGGT AGACATTTTC AGGGGCATCC CTTTTGCTGC TCCCCCCAAG 180 GCCTTGGAGA TCCCCAAGCG TCACCCTGGA TGGCAAGGAA CCCTGAAGGC CAAGGAATTC 240 AAGAAACGAT GCCTTCAAGC CACCATCACT CAGGATGACA CCTACGGCAG TGAGGACTGC 300 CTCTACCTCA ACATCTGGAT TCCACAGGGA AGGAAGCAAG TTTCCAGAGA CCTGCCAGTC 360 ATGATCTGGA TCTATGGTGG GGCCTTCCTA ATGGGGGCAG GCCATGGTGC CAACTTTCTG 420 AAGAATTACC TGTATGATGG GGAGGAGATT GCCACCCGAG GAAACGTCAT TGTGGTTACC 480 TTCAACTACC GTGTTGGGCC CCTGGGCTTC CTCAGCACTG GAGACTCCAA CCTGCCAGGT 540 AACTTTGGTC TTTGGGATCA GCACATGGCT ATTGCATGGG TGAAGAGGAA CATCGCAGCC 600 TTCGGTGGAG ACCCCAACAA CATCACCATC TTTGGGGAGT CAGCTGGTGG GGCCAGTGTC 660 TCCATCCAGA CTTTAACTCC CCACAATAAG GGACTCATCA AGAGAGCCAT CAGTCAGAGC 720 GGTGTGGCCC TGAGTCCTTG GGTCATCCAG AAAAACCCCC TCTTCTGGGC TAAAAGGATC 780 GCCAGCAAAG TGGGATGCCC GCTGGATGAT ACCGCCAAGA TGGCCAAATG CTTCAAGATC 840 ACTGATCCCC GAGCTCTGAC TTTGGCCTAT AAGATGCCTC TAGCTGGCAT GGAATACCCT 900 ATGCTGCATT ATCTGAGCTT TGTGCCAGTC ATCGATGGAG ACTTCATTCC TGATGATCCC 960 GTCAACCTCT TCGCCAACGC GGCCGATATC GACTACATCA TCGGCACCAA CAACATGGAC 1020 GGCCACATCT TTGCCAGCAT CGATATGCCT GCCATCAACA AGGCCTCCCA GAGTATCAAA 1080 GAGGAAGACT TTTACAAGCT GGTCTCTGGT CTCACCATAA CAAAAGGTCT TCCTGGTGCC 1140 AAGGCCACCT TTGACTTCTA CACCCAACTC TGGAGCCAAG ACTCCTCCCA GGAGACCAAG 1200 AAAAAGACTG TGGTGGACTT TGAGACTGAC ATCCTCTTTC TGGTGCCCAC AAAGATTGCC 1260 TTGGCCCAGC ACATAGCCAA TGCCAAGAGC GGCAGAACTT ACTCCTACCT GTTCTCCCAT 1320 CCTTCTCGAA TGCCTGTTTA CCCTGACTGG GTGGGGGCCG ACCATGCTGA CGATATCCAG 1380 TACGTCTTTG GCAAGCCTTT TGCCACACCC TTGGGTTATC GGGCCCAGGA CCGGACCGTC 1440 TCGAAGACCC TCATTGCTTA CTGGACTAAC TTTGCCAAAA CTGGGGACCC CAACATGGGC 1500 AATTCTGCCG TCCCTACCCA CTGGTCTCCT TATACTGTGG AGAATGGTAA CTACCTGGAA 1560 ATCAACAAGA AAGTGACTGC CAACTCCATG AAACAGCACC TAAGAACCGA CTATCTGCGC 1620 TTCTGGACCC TCACTTATCA GGCCCTCCCC ACAGTGAACC AGGAAGACAA AGACGATGTT 1680 CCTGAGAATA ATGTTCCCGA GGGCAGCGTC CCCGAGGGCA GCGTCCCCGA GGGCAGCGTT 1740 CCCGAGGGCA GCGTCCCAGA GGGCAGCGTT CCCGAGGGCA GCGTTCCTGA GGGCAGTGTC 1800 CCTGAGGGCA GCGTCCCAGA GGACAAAGTT CCTGTACCTC CAGCAGCTGA CCCTGAGGTG 1860 TCTGCTCCAA TTGCCCCAGT GGCTGACTCC ATATAA 1896
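The two sequence blocks above can be cross-checked against each other: a 631-residue protein plus one stop codon should correspond to (631 + 1) × 3 = 1896 nucleotides, which matches the final position marker of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code applies; the helper names `strip_position_markers` and `translate` are hypothetical and not part of the source record. Only the first and last segments of the coding sequence are used here, copied from the record.

```python
import re

# Standard genetic code (NCBI translation table 1), built in TCAG order.
# Assumption: the record does not state which table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def strip_position_markers(block: str) -> str:
    """Remove position numbers and whitespace from a numbered sequence block."""
    return re.sub(r"[\d\s]+", "", block).upper()


def translate(cds: str) -> str:
    """Translate complete codons from the start of cds; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of the coding sequence, as listed in the record.
head_nt = strip_position_markers(
    "ATGGGGCACC CAGAGTTGGT GTTCGTCAGC CTCATCTGTT GCCTAGCAGC TGCTAGTGCA 60"
)
# Last 36 nucleotides (positions 1861-1896), which start in frame.
tail_nt = strip_position_markers("TCTGCTCCAA TTGCCCCAGT GGCTGACTCC ATATAA 1896")

print(translate(head_nt))  # MGHPELVFVSLICCLAAASA
print(translate(tail_nt))  # SAPIAPVADSI*

# Length consistency: 631 residues plus one stop codon, three bases each.
protein_length = 631
cds_length = 1896
print((protein_length + 1) * 3 == cds_length)  # True
```

Both translated segments agree with the start and end of the listed protein sequence, and the trailing `*` corresponds to the terminal TAA codon.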