Entry THECC19222 (A0A061F458)

E Theobroma cacao


General Information

Description
Transducin/WD40 repeat-like superfamily protein, putative
Organism
THECC - Theobroma cacao (Taxon-ID: 3641)
Locus
5join(complement(39391698..39392417), complement(39390584..39390791), complement(39390181..39390307), complement(39389716..39389818))
Number of exons
4

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MSSSGETLPQ SDPDSSPGPT QALNQNPLTI HHVSFNQDNT CFSAATDCGF LVFSTEPYGP    60
QFRRDFNAGL SLVSMLFRFQ LFALVGSSPS PAAANTNADT KALLWDDNVS RCVGELSFRS   120
PIRSLRLRRD TIVVALLHKI YVYNLSDFKL LHQLETTSNP KGLCEVSQVT GPMVLVCPGL   180
QKGAVRVENY GSKRSTFINA HSSNITCLAL SYDGRVLATA STKGTLIRVF NALDGTLIQE   240
VRRGADRAEI FSLAFSSTAQ WLAVSSDKGT VHVFSLKVDS VVLGNDRSSS ASESPLSNQS   300
ALSSLSILKG VLPKYFSSEW SVAQFRLPEG THYIVAFGQQ KNTVMIIGMD GSFLRCKFDP   360
VNGGQMTQLE AHNFLKPEET FSKSD                                         385

Coding Sequence

Download: Fasta
ATGTCGTCCT CCGGCGAGAC CCTTCCCCAA TCCGACCCGG ACTCGTCTCC GGGTCCAACC    60
CAAGCCCTAA ACCAGAACCC ATTAACCATC CACCACGTCT CCTTCAACCA AGACAATACC   120
TGCTTCTCCG CGGCCACCGA CTGTGGCTTC CTCGTCTTCA GCACCGAACC TTATGGCCCT   180
CAATTTCGTC GCGACTTCAA CGCCGGTCTT TCCCTCGTCT CCATGCTGTT CCGCTTCCAA   240
CTCTTCGCCC TCGTGGGCTC CTCTCCGTCC CCCGCCGCCG CCAACACGAA CGCCGACACC   300
AAAGCCCTCC TTTGGGACGA CAACGTCTCC CGCTGCGTCG GCGAGCTCTC CTTTCGTTCC   360
CCAATCCGCT CCCTCCGCCT CCGCCGCGAC ACAATCGTCG TCGCCCTCCT CCACAAAATC   420
TACGTCTACA ACTTGTCGGA CTTCAAACTG TTGCACCAGC TCGAAACGAC GTCGAATCCG   480
AAGGGGTTGT GCGAGGTTTC GCAGGTGACT GGACCCATGG TGTTGGTCTG CCCGGGGTTG   540
CAGAAGGGAG CAGTGAGAGT GGAGAATTAT GGGAGTAAAA GGTCTACATT TATAAATGCT   600
CATAGTTCGA ACATCACGTG CTTGGCTTTG TCTTATGATG GCAGGGTCTT GGCCACGGCT   660
AGTACTAAAG GAACTTTGAT TAGAGTTTTT AATGCTTTGG ATGGGACATT GATTCAAGAG   720
GTAAGGAGAG GGGCAGACCG AGCAGAGATA TTCAGCCTTG CTTTCTCTTC CACTGCCCAG   780
TGGCTAGCTG TCTCAAGTGA CAAAGGAACG GTCCATGTCT TCAGCCTCAA GGTTGATTCT   840
GTAGTTTTAG GGAATGATAG GTCAAGCAGT GCATCTGAAT CACCTCTTTC TAATCAATCA   900
GCTTTATCAT CCCTTTCTAT TTTAAAAGGT GTATTGCCAA AGTATTTCAG CTCAGAGTGG   960
TCAGTGGCTC AATTTCGGCT GCCTGAAGGC ACACACTACA TTGTTGCCTT TGGACAACAA  1020
AAAAACACAG TCATGATTAT TGGCATGGAT GGAAGCTTCC TTCGATGCAA GTTTGACCCA  1080
GTGAATGGTG GACAAATGAC TCAACTTGAA GCTCACAATT TTTTAAAGCC AGAAGAAACA  1140
TTTTCGAAGT CAGACTGA                                                1158