Sarcophilus harrisii
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
SARHA16600 reference isoform | 788 | 25 | |
EKMLKNGGPL SPKDFAQLQE YMEYSTKKVS DVLALFNEDG ELAQYLQGDS IGYEGFQLFL 60 KAYLEVNNVP PQLSQALFGS FQTIESQKQE RRGDLGCYLT GGDKDKGGLD QGRPLTLLNS 120 PQPEVALICL NDVSCYFSLL EGGRPEDKLE FTFKLYDTDR NGLLDSSEVD RIITQMMRVA 180 RYLDWDVTEL RPILQEMMKE IDYDGSGTVS LAEWVRGGST TVPLLVLLGL EVSLKDDGQH 240 MWRLKQFSRP AYCNVCETML LGITKRGLSC TFCKFTVHER CAKKASPCEV STYAKSKKDT 300 GAQTHVWVRG GCDSGRCDCC QKKIRNFQSL TGLHCVWCHF QIHEDCLSTM SPECDCGMLR 360 DHILPPSAIY PCILKIQPSS LLFTSASEQD RQNLSARNVD ELQHLVPDGQ ALRIIPVPNT 420 HPLLVFVNPK SGGKQGERVL RKFQYLLNPR QVYNLAKGGP EPGLKFFKDL PDFRVLVCGG 480 DGTVGWILDA IDKASFPNPP PVAVLPLGTG NDLARCLRWG GGYDGENLSK ILKDLELSET 540 VYMDRWSVEV IPLDPQEKSD PVPYNIINNY FSIGVDASIA HRFHIMREKH PEKFNSRMKN 600 KLWYLEFATS ESIFSTCKKL EESVSVEICG TPLTLSDLSL EGIAVLNIPS MHGGSNLWGD 660 KKRPSKDVQG LDLASVPPEA ITNPEALKTC VQDLSDKRLE VVGLEGAIEM GQIYTRLKNA 720 GHRIAKCSQI TFRTKKALPM QIDGEPWMQA PCTIQITHKN QMPMLMGPPP RSSNFFNLCN 780 RRRNRDQQ 788
GAAAAGATGT TGAAAAATGG GGGCCCCTTA AGCCCCAAGG ACTTTGCTCA ATTACAGGAA 60 TATATGGAAT ATTCAACCAA GAAAGTCAGT GATGTCCTAG CTCTGTTCAA TGAAGATGGA 120 GAATTGGCTC AGTATCTCCA GGGAGATTCC ATAGGATACG AAGGGTTTCA GCTCTTCCTG 180 AAAGCCTACC TGGAGGTGAA TAATGTTCCT CCACAACTGA GCCAAGCACT TTTTGGGTCC 240 TTCCAGACTA TAGAGTCTCA AAAACAAGAA AGGAGGGGTG ATTTAGGCTG TTATTTGACC 300 GGAGGGGACA AAGATAAAGG CGGGCTGGAT CAGGGAAGAC CTCTGACCTT GCTGAACTCT 360 CCTCAACCAG AAGTAGCCTT GATCTGTCTC AATGATGTCT CCTGCTATTT CTCCTTACTG 420 GAGGGTGGAC GGCCTGAAGA CAAGCTAGAA TTTACCTTCA AGCTATATGA TACAGACAGA 480 AATGGACTTT TGGACAGCTC GGAAGTGGAC AGAATTATCA CCCAGATGAT GCGTGTGGCA 540 AGATATCTGG ACTGGGATGT AACTGAGCTG AGACCAATTT TGCAGGAGAT GATGAAAGAG 600 ATTGACTATG ATGGCAGTGG TACAGTCTCT CTGGCTGAGT GGGTACGAGG TGGATCTACC 660 ACTGTGCCTC TGCTGGTGCT GTTGGGACTA GAGGTGAGCT TGAAGGATGA TGGGCAGCAC 720 ATGTGGCGGC TGAAGCAATT TTCCCGTCCA GCTTATTGTA ATGTATGTGA GACAATGCTG 780 CTCGGGATAA CCAAGCGGGG ACTCTCTTGT ACCTTTTGTA AATTCACAGT ACATGAGCGC 840 TGTGCCAAGA AGGCTTCACC CTGTGAAGTC AGCACCTATG CCAAGTCCAA GAAAGACACT 900 GGTGCTCAGA CCCATGTGTG GGTTCGTGGA GGCTGTGACT CTGGACGCTG TGACTGTTGC 960 CAAAAGAAGA TACGGAATTT CCAAAGCCTG ACAGGACTGC ATTGTGTGTG GTGCCATTTC 1020 CAGATCCATG AGGACTGTCT CTCTACCATG AGCCCTGAAT GTGACTGTGG GATGCTCCGT 1080 GACCACATCC TGCCTCCCTC AGCCATCTAT CCTTGTATCC TGAAGATTCA GCCTTCCTCC 1140 CTTCTGTTTA CCTCTGCCTC TGAACAGGAC CGCCAGAACT TGAGCGCCAG GAATGTGGAT 1200 GAATTACAAC ATTTGGTCCC TGATGGGCAG GCCCTGCGGA TTATTCCGGT CCCCAACACC 1260 CACCCACTCC TCGTCTTTGT AAACCCCAAA AGTGGTGGGA AGCAAGGGGA AAGGGTCCTT 1320 CGAAAGTTCC AGTACCTGCT GAATCCACGA CAGGTTTACA ACCTTGCAAA GGGGGGTCCT 1380 GAACCAGGGC TCAAGTTCTT CAAGGACCTC CCAGATTTCC GGGTATTGGT ATGTGGAGGA 1440 GACGGCACAG TAGGCTGGAT CCTAGATGCC ATTGACAAGG CCAGCTTCCC TAACCCACCA 1500 CCAGTGGCTG TGCTTCCTCT AGGCACAGGG AATGACCTAG CTCGATGCCT AAGATGGGGA 1560 GGAGGTTATG ATGGGGAGAA CTTATCGAAG ATACTCAAGG ACTTAGAGTT AAGTGAAACA 1620 GTATATATGG ATCGATGGTC TGTGGAAGTG ATTCCTCTGG ACCCCCAAGA AAAGAGTGAT 1680 CCAGTCCCCT ACAACATCAT 
CAACAATTAC TTCTCCATTG GTGTGGATGC CTCTATTGCT 1740 CACCGATTTC ACATCATGAG GGAGAAACAT CCTGAGAAGT TCAACAGCAG GATGAAGAAC 1800 AAGCTGTGGT ACTTGGAGTT CGCCACATCC GAGTCCATCT TCTCCACATG CAAAAAGCTG 1860 GAAGAGTCTG TTTCAGTGGA GATCTGTGGG ACACCTTTGA CACTGAGTGA CCTATCCTTG 1920 GAGGGCATTG CAGTCCTAAA TATTCCCAGC ATGCATGGAG GCTCCAATCT CTGGGGAGAC 1980 AAAAAGCGAC CCTCCAAAGA TGTCCAGGGT TTAGATTTGG CCTCTGTGCC TCCTGAAGCA 2040 ATCACCAACC CTGAAGCCCT GAAAACCTGT GTACAAGATC TGAGTGACAA GCGACTAGAA 2100 GTGGTAGGAC TAGAAGGAGC AATTGAAATG GGCCAGATAT ATACCAGGCT GAAGAATGCC 2160 GGACACCGGA TTGCTAAATG CTCCCAAATT ACTTTTCGGA CTAAAAAAGC CCTCCCCATG 2220 CAGATCGACG GGGAACCCTG GATGCAGGCA CCATGTACTA TCCAGATCAC GCACAAGAAT 2280 CAAATGCCTA TGCTTATGGG CCCCCCACCA CGCTCCTCCA ACTTCTTCAA CCTTTGTAAC 2340 AGGAGAAGAA ACAGGGATCA GCAGNNN 2367