Entry THECC11562 (A0A061G268)

E Theobroma cacao


General Information

Description
Dentin sialophosphoprotein-related, putative isoform 1
Organism
THECC - Theobroma cacao (Taxon-ID: 3641)
Locus
3join(26923688..26923882, 26923968..26926234, 26926319..26927558)
Number of exons
3

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MMSTVGLELT SFINPDLTWK TVSKGNRSGT RRTRKLGAKN LTMGMGLADK NARTAEDVTV    60
SESEKLGVDV LGRRFSDKVE QVPIKKRRFM FRSTSPPPPL TPLLHLEASG QDVDFQSASG   120
KNSGSNSAQR RRLKKTDILT KSTVAVDDGK FSEVINDVED FSGIEILAAA ACSDSMGDDV   180
TENEGNTLLE ASTQERIESS ASAIPLEETT ASLETPCCSP KDSVNEGKTE GSSSQDNSSA   240
ALQTACCSPK VSVMEGKTEG SSSQDNSSAA LQTACCSPKV SVMEGKTEGS SSQDNSSAAL   300
HESLGDRDNP TAGRSIPLPD DRLLWDLNLS MDAWPCDGGN IDSQKDAVDN TSVRSEELQT   360
KEPQDIENDT MNRVVSSDVD GNDECNKMTS DLKIMPVGTD DLSGEKQESE GCSGSIENKT   420
EHVPVPVVDA ENSLICAAAE TNAPTEAGNM DQCLSHSPLP GLDKSTPGSE GNRETSFSTH   480
NIELNTVGCI SEAEVGKTVR GENAQVEESD VASPYVPVLE TVANDVQKTS VNEDDKDHGI   540
DSGLHDVKSF AQDLDNPRPL EPPEDEHANG TEEMDTCHPS PKSEDMSISD DYIVEAMDRT   600
DGASSTYTAQ TDSDTHVRSE ALLQKSSRNF VATSGAGEFS AHEACRNYVN GPTSCLDKAN   660
LNDLSNESHD SAVSQDKVLT VGIGTHSEVQ AGYDSQFEDG ELRESDVHCW EEAEQVDYDT   720
EFEEERSFGL EAESGEKKLQ AERGSSPDVT GNFKYCETGD ALRENSVSLK MRTVEVSDGE   780
TKKTDCLDGS NVRDYDFKVT KRELLSRVEG SLSSDAVHRS RSDNFDGSFP RAEREAGSDK   840
FMGRDRSASH MRGRSPVGGH YFNPSASYWD SKRQNSPIYH GPYNFGRPRP KSVVESRGYP   900
MATDQASSEA TGVARPDNRI NRQYVGSSNG LYRPLFRRRS PVERDDSYGM HTRMATVRDT   960
SPDRTRFRRY PQGFSRGIRD EYLRHIPDDG TEYFSRMPHR LGRRERSISP HGRPHYTLPY  1020
KKTRSRSRSR SPIGWLLPRD RNEGSRRRSR SPDFRSDARV DRVRLPFPKR FAADYGEEFI  1080
SPPRSRISPQ RNSRMFEDRN AGLDHFRGRK SPMRMFRQGQ RLDQGHPIRR SNSDDYFRHM  1140
IRPRRFPDMA GGGKGCKYEG SDDDKHGSRY EMIHRVRRYD TDGAVRRFRY NAEDSYVANN  1200
SLTVTNAIGV SSRRPDDAPR TASEDRSFKM QQP                               1233

Coding Sequence

Download: Fasta
ATGATGAGCA CAGTAGGTCT GGAACTTACA AGTTTTATAA ACCCTGACTT GACTTGGAAG    60
ACAGTGTCGA AGGGGAATAG GAGTGGCACC AGGCGTACAA GGAAACTGGG TGCAAAAAAC   120
TTGACGATGG GAATGGGGCT AGCTGATAAA AATGCCAGAA CAGCGGAGGA TGTTACAGTT   180
TCTGAGTCTG AGAAGCTTGG TGTGGATGTT TTAGGACGGC GCTTTAGTGA CAAAGTAGAG   240
CAGGTGCCAA TTAAGAAAAG AAGATTTATG TTCCGGTCTA CATCACCTCC ACCACCACTG   300
ACCCCTTTGC TGCACCTTGA AGCCTCTGGA CAGGATGTAG ATTTTCAATC TGCATCGGGT   360
AAAAATTCTG GTTCAAATTC AGCTCAGAGG CGACGGCTCA AGAAAACTGA TATTTTGACC   420
AAGTCAACTG TTGCTGTTGA TGATGGGAAG TTTTCAGAGG TAATAAATGA TGTCGAGGAT   480
TTCTCAGGGA TTGAAATACT TGCAGCTGCT GCTTGCAGTG ATAGTATGGG AGATGATGTT   540
ACTGAAAATG AGGGAAATAC GTTACTAGAA GCATCAACAC AAGAGAGAAT TGAGTCTTCA   600
GCTTCTGCAA TACCTTTGGA AGAAACTACT GCTTCATTGG AGACACCTTG TTGCTCCCCA   660
AAAGATTCGG TAAATGAAGG TAAAACTGAG GGTTCATCTT CCCAGGATAA TTCTTCTGCT   720
GCTTTGCAGA CAGCTTGTTG CTCCCCAAAA GTTTCAGTAA TGGAAGGTAA AACTGAGGGT   780
TCATCTTCCC AGGATAATTC TTCTGCTGCT TTGCAGACAG CTTGTTGCTC CCCAAAAGTT   840
TCAGTAATGG AAGGTAAAAC TGAGGGTTCA TCTTCCCAGG ATAATTCTTC TGCTGCTTTG   900
CATGAATCTC TTGGAGATAG GGATAACCCA ACAGCAGGAA GGTCCATCCC CTTGCCAGAT   960
GATAGGTTAC TCTGGGATTT AAATCTTTCA ATGGATGCTT GGCCTTGTGA TGGTGGGAAT  1020
ATTGATTCTC AAAAGGATGC TGTGGATAAT ACATCTGTGA GAAGTGAAGA GCTGCAGACT  1080
AAGGAACCTC AAGATATCGA AAATGATACT ATGAACAGAG TGGTTTCATC TGATGTTGAT  1140
GGCAACGACG AGTGTAACAA GATGACTTCA GACTTGAAAA TCATGCCTGT TGGAACTGAT  1200
GATTTGAGTG GAGAGAAACA GGAATCTGAA GGATGCTCTG GCTCCATTGA GAATAAAACT  1260
GAACATGTTC CAGTGCCAGT GGTTGATGCT GAGAATTCCT TGATCTGTGC TGCTGCTGAG  1320
ACAAATGCCC CCACTGAGGC AGGAAACATG GATCAATGCC TAAGTCATTC TCCTCTTCCT  1380
GGATTGGATA AGAGCACACC TGGGTCTGAG GGAAACCGAG AAACATCATT TTCAACTCAT  1440
AATATTGAGC TGAACACAGT AGGGTGTATT TCTGAGGCAG AGGTGGGCAA AACTGTTCGT  1500
GGAGAAAATG CTCAGGTCGA GGAGTCTGAT GTTGCTTCTC CCTATGTACC AGTTTTAGAG  1560
ACGGTTGCTA ATGACGTTCA GAAAACCAGT GTTAATGAAG ATGATAAGGA TCATGGAATA  1620
GATTCTGGCT TGCATGATGT TAAAAGTTTT GCTCAGGACC TGGATAATCC TCGACCTCTA  1680
GAACCCCCTG AAGATGAGCA TGCAAATGGG ACAGAGGAGA TGGACACATG CCATCCTTCA  1740
CCAAAGTCTG AAGATATGTC CATATCGGAT GATTATATTG TGGAGGCTAT GGACCGAACT  1800
GATGGAGCTT CTTCAACGTA TACTGCTCAA ACTGATTCTG ATACTCATGT TAGGTCTGAG  1860
GCACTGTTGC AAAAGTCTTC CAGAAATTTT GTTGCTACTT CAGGAGCTGG TGAGTTTTCT  1920
GCTCATGAGG CATGTAGAAA TTATGTTAAT GGTCCTACAA GCTGCTTGGA TAAGGCCAAC  1980
CTAAATGATC TTTCTAATGA AAGCCATGAC TCTGCTGTTT CTCAAGATAA GGTATTGACA  2040
GTTGGAATCG GAACCCACTC AGAAGTTCAG GCTGGTTATG ATTCACAGTT TGAGGATGGG  2100
GAGTTGAGGG AATCAGATGT TCACTGTTGG GAGGAAGCCG AACAGGTGGA CTATGACACT  2160
GAATTTGAGG AAGAAAGATC ATTTGGCTTG GAAGCTGAGA GTGGTGAGAA GAAATTACAG  2220
GCAGAAAGAG GGTCAAGTCC TGATGTAACT GGAAATTTTA AATATTGCGA AACTGGAGAT  2280
GCTTTGAGGG AAAATTCAGT GAGCTTAAAG ATGAGAACTG TGGAAGTGTC TGATGGTGAA  2340
ACAAAGAAAA CCGATTGCTT GGATGGGTCT AATGTCAGAG ATTATGATTT TAAGGTAACG  2400
AAAAGGGAGT TGCTTTCCCG TGTTGAAGGA TCATTATCCT CTGATGCTGT CCATAGAAGC  2460
AGGTCTGACA ATTTTGATGG TTCGTTTCCT CGAGCTGAGA GGGAGGCTGG TTCTGACAAA  2520
TTCATGGGCA GGGATAGATC TGCTTCACAT ATGCGTGGCA GAAGCCCAGT AGGTGGTCAC  2580
TATTTCAATC CTTCAGCTAG TTATTGGGAT TCTAAACGGC AGAATTCACC TATTTATCAT  2640
GGTCCTTACA ATTTTGGGCG CCCCAGACCA AAAAGTGTTG TGGAGAGCCG TGGATATCCG  2700
ATGGCCACAG ACCAAGCATC TTCAGAAGCT ACTGGTGTTG CAAGACCTGA CAACCGCATT  2760
AACAGGCAAT ATGTGGGCTC CTCCAATGGT TTGTATAGAC CTCTCTTTAG AAGGAGATCA  2820
CCAGTCGAAA GAGATGATTC CTATGGTATG CATACAAGAA TGGCAACTGT AAGAGATACT  2880
AGTCCTGATC GGACTAGGTT TAGAAGATAC CCACAGGGGT TTAGTAGAGG CATCAGAGAT  2940
GAATATCTCA GGCATATTCC TGATGATGGC ACAGAATATT TTAGTCGCAT GCCACATCGT  3000
CTAGGCAGGA GAGAAAGAAG CATTTCCCCC CATGGTAGAC CTCATTATAC TTTACCTTAC  3060
AAGAAAACTC GGTCAAGATC AAGGAGCCGC TCTCCCATTG GTTGGTTATT GCCAAGGGAT  3120
CGAAATGAGG GTTCAAGACG TCGCAGCAGG TCACCAGATT TTAGGTCTGA TGCTAGAGTA  3180
GATAGGGTGA GGTTGCCATT TCCAAAGCGT TTTGCAGCAG ATTATGGAGA GGAGTTTATT  3240
TCCCCACCAA GAAGTCGGAT TTCACCACAG CGAAATTCCA GGATGTTTGA GGATCGAAAT  3300
GCTGGCTTAG ACCATTTTAG GGGCCGGAAA TCACCTATGA GGATGTTTCG GCAGGGCCAA  3360
AGGTTAGATC AAGGGCACCC AATTCGGAGA TCGAATTCAG ATGATTACTT CAGACACATG  3420
ATTCGACCCA GAAGATTTCC TGATATGGCT GGTGGTGGTA AAGGATGTAA ATATGAAGGC  3480
AGTGATGATG ATAAACATGG CAGCAGATAT GAGATGATTC ATCGAGTGAG GCGGTATGAT  3540
ACTGATGGTG CAGTCCGGCG GTTTCGGTAT AATGCAGAAG ATTCATATGT GGCTAATAAC  3600
TCTTTAACTG TGACTAATGC AATTGGGGTC TCATCTAGGA GGCCTGATGA TGCACCTAGG  3660
ACAGCTAGTG AAGATAGGTC ATTTAAGATG CAACAACCGT GA                     3702