Entry THECC16672 (A0A061ESL9)

Theobroma cacao


General Information

Description
Hydroxycinnamoyl CoA shikimate/quinate hydroxycinnamoyltransferase
Organism
THECC - Theobroma cacao (Taxon-ID: 3641)
Locus
5 join(2874386..2874796, 2875393..2876295)
Number of exons
2
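
The locus intervals and the exon count can be cross-checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive, as in GenBank-style `join(...)` locations; the helper name `parse_join` is hypothetical and not part of any database API.

```python
import re

def parse_join(location: str) -> list[tuple[int, int]]:
    """Extract (start, end) pairs from a 'join(a..b, c..d)' location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

# Location string copied from the Locus field of this entry.
exons = parse_join("join(2874386..2874796, 2875393..2876295)")

# Assumption: both endpoints are included in each interval.
exon_lengths = [end - start + 1 for start, end in exons]
total_cds_length = sum(exon_lengths)

print(len(exons), exon_lengths, total_cds_length)
```

Under that assumption the two exons are 411 and 903 nt long, 1314 nt in total, which corresponds to 438 codons: 437 residues plus a stop codon.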

IDs and Cross-references

Domain Architecture

Gene Ontology

Protein Sequence

MEVTMKESTM VCPAEETPNQ RLWVSNLDLV VTRYHICTVY FYKPNGSSDF FDTKVLKESL    60
GKILVPFYPI AGRLGYDENG RLEIICNAKG VLFIEAETTS IMDDLVKEFK DDSEGPQLVP   120
KIDYSGGISS YPLLGLQVTT FKCGGISLGV STQHTMVDGS SGLHFINSWA NTVRGCSPNI   180
APFLDRTLLR ARHPPIPKFR HVEFEPSPSL QTMFSTSECQ PSPKPSIVSI FTITADKLNA   240
LKAKVNGNSN SNTKYSTYSI LTAHIWRCAT KARDLLEDQQ LKLNMPVDGR NRLHPPFPPG   300
YIGNVIFMAA LVALAGELLS ESFIDTVKRI HKILKEMDDE YLRSEIDYIE KAPDIEAIRR   360
GSQTMRCPSL VINSWILLPI HEADFGWGRP IFMRPANIVH EGIVYILPSP TKDGSLTLVT   420
RLETSHMKLF GKLLYEF                                                  437

Coding Sequence

ATGGAGGTTA CCATGAAGGA GTCAACAATG GTATGTCCGG CTGAAGAAAC TCCAAACCAA    60
AGGCTATGGG TCTCTAACTT GGACTTAGTG GTGACAAGAT ACCATATTTG TACTGTATAC   120
TTCTACAAGC CAAATGGTTC TTCTGATTTT TTTGACACAA AAGTGTTGAA GGAGTCTTTA   180
GGTAAGATTC TCGTGCCATT TTACCCCATA GCAGGAAGGC TGGGATATGA TGAAAATGGA   240
AGACTTGAAA TAATCTGCAA TGCAAAGGGA GTGTTATTCA TAGAGGCTGA GACTACTTCT   300
ATCATGGATG ATTTAGTTAA AGAATTTAAA GATGATTCAG AAGGTCCTCA ACTAGTTCCA   360
AAAATAGACT ATTCTGGAGG AATTTCTTCC TATCCTCTAC TCGGGTTACA GGTAACTACT   420
TTCAAATGTG GAGGAATTTC TCTTGGAGTT TCTACTCAAC ACACAATGGT AGATGGTTCA   480
TCTGGACTTC ATTTCATTAA TAGCTGGGCC AACACAGTAC GAGGATGCTC CCCTAACATT   540
GCTCCGTTCC TGGATCGTAC CCTTCTTCGA GCTCGGCACC CACCGATTCC AAAATTTCGT   600
CATGTTGAAT TTGAGCCATC TCCTTCTTTA CAAACAATGT TTTCAACTTC AGAATGCCAG   660
CCAAGCCCCA AGCCATCAAT TGTATCTATC TTTACGATCA CAGCCGATAA GCTAAATGCC   720
CTCAAAGCCA AAGTAAATGG AAATTCAAAT AGTAATACAA AATATAGCAC CTACAGTATC   780
TTAACTGCAC ACATATGGCG TTGCGCAACC AAAGCGAGAG ACCTCTTAGA GGATCAGCAA   840
CTCAAGTTAA ACATGCCAGT TGATGGACGG AATAGATTGC ATCCTCCTTT CCCTCCTGGC   900
TACATTGGCA ATGTAATCTT CATGGCTGCA CTCGTTGCTC TAGCCGGTGA ACTTCTATCA   960
GAATCATTCA TAGATACCGT CAAGAGAATT CATAAAATAT TGAAAGAAAT GGATGATGAA  1020
TATCTGAGAT CTGAAATTGA CTACATAGAA AAAGCTCCTG ACATAGAAGC TATCAGGCGA  1080
GGGTCACAAA CTATGCGGTG CCCGAGCCTT GTTATCAACA GCTGGATACT GTTGCCTATC  1140
CATGAAGCAG ATTTCGGATG GGGTCGTCCT ATTTTTATGA GGCCCGCAAA TATTGTCCAT  1200
GAAGGAATAG TATACATACT TCCAAGCCCA ACCAAGGATG GTAGCTTGAC ATTGGTGACA  1260
CGTCTAGAGA CATCTCACAT GAAACTCTTT GGTAAGCTTC TTTACGAATT CTGA        1314
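
The sequence blocks above are displayed in groups of ten with a running position counter at the end of each line. The following Python sketch shows one way to strip that layout and check that the coding sequence translates to the listed protein. It assumes the standard genetic code (NCBI translation table 1); the function names `clean_block` and `translate` are hypothetical, and only the first and last lines of the coding sequence are used to keep the example short.

```python
import re
from itertools import product

def clean_block(text: str) -> str:
    """Drop spaces, line breaks and position counters, keeping only letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))

# First and last display lines of the coding sequence, copied from this entry.
first_line = "ATGGAGGTTA CCATGAAGGA GTCAACAATG GTATGTCCGG CTGAAGAAAC TCCAAACCAA    60"
last_line = "CGTCTAGAGA CATCTCACAT GAAACTCTTT GGTAAGCTTC TTTACGAATT CTGA        1314"

n_terminus = translate(clean_block(first_line))
c_terminus = translate(clean_block(last_line))
print(n_terminus)
print(c_terminus)
```

The last display line starts at position 1261, which is in frame (1260 is a multiple of 3), so it can be translated on its own. The two translated fragments correspond to the start and end of the protein sequence listed above, with the trailing `*` marking the TGA stop codon.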