Entry THECC29317 (A0A061GZ49)


General Information

Description
Sorting nexin 2A
Organism
THECC - Theobroma cacao (Taxon-ID: 3641)
Locus
9 join(41612528..41613616, 41613719..41613928, 41614277..41614537, 41615167..41615253, 41615984..41616064)
Number of exons
5
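
The Locus and Number of exons fields can be cross-checked with a short script. The following is a minimal sketch, not part of the original entry. It assumes the leading 9 in the Locus field is a chromosome or scaffold label and that both coordinates of each segment are inclusive; the function names parse_join and exon_lengths are hypothetical.

```python
import re

# Locus string copied from the entry, without the leading "9"
# (assumed here to be a chromosome/scaffold label).
LOCUS = ("join(41612528..41613616, 41613719..41613928, 41614277..41614537, "
         "41615167..41615253, 41615984..41616064)")

def parse_join(locus):
    """Return a list of (start, end) pairs from a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", locus)]

def exon_lengths(segments):
    """Length of each segment, treating both coordinates as inclusive (assumption)."""
    return [end - start + 1 for start, end in segments]

segments = parse_join(LOCUS)
lengths = exon_lengths(segments)
print(len(segments))  # 5
print(lengths)        # [1089, 210, 261, 87, 81]
print(sum(lengths))   # 1728
```

Under these assumptions the five segments sum to 1728 nt, which matches the final position counter of the Coding Sequence listing and equals 575 codons plus one stop codon.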

IDs and Cross-references

Domain Architecture

Gene Ontology

Protein Sequence

MMGSENQGFE EAHLFASREE MENLVLDEPL SNHSSNNNNH QNNNSYSSYR SATSSLSDTT    60
HHPLSPPILA TPADSDPLLS PPLYRNPNAS DNNSYIEPPS YADVIFSPFD ENSVNEINGL   120
ESTSQNPESS LTLSRSPSSS SDYIKITVSN PKKEQETTNS IVPGGNAYYT YLITTRTNIP   180
EFGGSEFSVR RRFKDVVTLS DRLAESYRGF FIPPRPDKNV VESQVMQKQE FVEQRRVALE   240
KYLRRLAEHP VIRLSDELKV FLKVEGRLPL ATSTDVASRM LDGAVKLPKQ LFGESTAVVA   300
PHEVVQPAKG GMDLLRLFKE LKQSVANDWG GSKPPVVEED KVFMEKKEWI HDLEQQLSNA   360
SQQAEALVKA QQDMGETMGE LGLAFIKLTK FENEEGRFNS QKVRAADMKC LATAAVKASR   420
FYRELNAQTV KHLDTLHEYL GLMLAVHSAF SDRSSALLTV QTLLSELSTL HSKAEKLEAA   480
SSKIFGGDKS RIRKIEELRE SIRVTENAKN AAISEYERIK ENNKFELERF DKERRTDLFN   540
MLKGFVVNQV GYAEKISNVW AKVAEETSGY ANDSS                              575
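
The listing above is laid out in blocks of ten residues with a running position counter at the end of each line. To recover a plain sequence string, the counters and spaces have to be stripped. The sketch below shows one way to do that; the function name clean_sequence_block is hypothetical, and only the first two lines of the listing are used as sample input.

```python
import re

def clean_sequence_block(text):
    """Strip trailing position counters and whitespace from a blocked listing."""
    parts = []
    for line in text.splitlines():
        # Drop the running position at the end of the line (e.g. "    60"),
        # then remove the spaces between the ten-residue blocks.
        line = re.sub(r"\s+\d+\s*$", "", line)
        parts.append("".join(line.split()))
    return "".join(parts)

# First two lines of the protein listing, copied verbatim from the entry.
sample = """\
MMGSENQGFE EAHLFASREE MENLVLDEPL SNHSSNNNNH QNNNSYSSYR SATSSLSDTT    60
HHPLSPPILA TPADSDPLLS PPLYRNPNAS DNNSYIEPPS YADVIFSPFD ENSVNEINGL   120
"""

protein_start = clean_sequence_block(sample)
print(len(protein_start))   # 120
print(protein_start[:20])   # MMGSENQGFEEAHLFASREE
```

The same function can be applied to the nucleotide listing, since neither alphabet contains digits.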

Coding Sequence

ATGATGGGCT CTGAGAACCA GGGCTTCGAA GAAGCTCACC TTTTCGCTTC CCGTGAAGAA    60
ATGGAGAATC TGGTTCTTGA CGAGCCACTC AGCAACCACA GCAGCAACAA CAACAACCAC   120
CAGAACAATA ATTCCTACTC GAGTTATCGG AGCGCCACGT CGTCGCTCTC CGACACGACG   180
CACCACCCGC TGTCCCCGCC GATCCTCGCG ACTCCGGCCG ACTCAGATCC CTTGCTTTCC   240
CCTCCGCTCT ATCGTAACCC GAACGCATCT GATAACAACT CCTACATCGA GCCTCCTTCG   300
TACGCTGACG TCATCTTTAG TCCTTTCGAT GAGAACTCAG TCAACGAAAT CAACGGTCTC   360
GAATCGACTT CTCAGAATCC AGAATCTTCG TTGACGCTCT CCAGATCGCC ATCGTCCAGC   420
TCAGATTACA TCAAGATTAC GGTTTCCAAC CCCAAGAAAG AGCAAGAAAC CACTAATTCA   480
ATTGTGCCGG GAGGTAATGC TTATTACACT TACTTAATTA CGACGAGAAC GAATATTCCT   540
GAATTTGGAG GATCCGAATT TAGTGTCAGG AGAAGGTTTA AAGATGTAGT CACGTTATCG   600
GATCGGTTAG CGGAATCTTA CCGTGGATTT TTTATTCCGC CGAGGCCGGA TAAAAATGTG   660
GTGGAATCTC AGGTGATGCA GAAGCAAGAA TTCGTGGAGC AAAGAAGGGT TGCATTGGAG   720
AAATATTTGA GGAGGCTGGC GGAGCATCCT GTGATAAGAC TAAGTGATGA ATTGAAGGTG   780
TTTTTGAAGG TGGAAGGGAG GCTTCCGTTG GCGACGAGCA CTGATGTGGC TTCACGGATG   840
CTTGATGGTG CCGTGAAGTT ACCGAAGCAG TTGTTTGGGG AAAGTACTGC GGTGGTGGCG   900
CCACATGAGG TGGTGCAGCC TGCCAAAGGT GGTATGGATT TGTTGCGGTT GTTTAAGGAA   960
TTGAAACAGT CTGTTGCAAA TGATTGGGGT GGTTCAAAAC CGCCCGTGGT AGAGGAGGAT  1020
AAGGTGTTTA TGGAGAAGAA AGAGTGGATT CATGATCTTG AGCAGCAACT CAGCAATGCT  1080
TCTCAGCAGG CTGAGGCACT TGTTAAAGCA CAGCAAGATA TGGGAGAGAC AATGGGAGAA  1140
TTAGGATTGG CATTTATTAA GCTGACTAAG TTTGAGAATG AGGAGGGTAG GTTCAATTCG  1200
CAAAAAGTAA GGGCTGCTGA CATGAAGTGT TTGGCAACTG CTGCAGTCAA AGCTAGCAGA  1260
TTTTATCGAG AATTGAATGC ACAGACCGTC AAGCATTTGG ATACACTTCA TGAATATCTT  1320
GGGCTTATGT TAGCTGTGCA CAGTGCATTT TCGGATCGAT CAAGTGCTTT ATTGACAGTG  1380
CAAACTCTCC TATCAGAACT ATCTACGTTG CATTCAAAGG CTGAAAAACT CGAAGCTGCG  1440
TCATCAAAAA TATTTGGCGG TGACAAATCA AGGATTCGCA AGATAGAGGA GTTGAGAGAA  1500
AGTATACGGG TTACTGAAAA TGCTAAAAAT GCTGCCATAA GTGAATATGA GCGGATCAAG  1560
GAAAACAACA AGTTTGAGCT TGAGAGGTTT GACAAAGAAA GACGCACTGA CCTCTTCAAC  1620
ATGTTGAAGG GCTTTGTTGT TAATCAGGTG GGATATGCTG AGAAAATTTC AAATGTGTGG  1680
GCAAAGGTTG CAGAGGAGAC CAGTGGATAT GCAAATGATA GCTCTTAA               1728
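
As a consistency check between the two listings, the coding sequence can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1); the entry itself does not state which table applies. The function name translate is hypothetical, and only the first and last lines of the coding sequence are used, both of which start on a codon boundary (positions 1 and 1681).

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))

# First and last lines of the coding sequence, with spaces and counters removed.
first_line = "ATGATGGGCTCTGAGAACCAGGGCTTCGAAGAAGCTCACCTTTTCGCTTCCCGTGAAGAA"
last_line = "GCAAAGGTTGCAGAGGAGACCAGTGGATATGCAAATGATAGCTCTTAA"

print(translate(first_line))  # MMGSENQGFEEAHLFASREE
print(translate(last_line))   # AKVAEETSGYANDSS*
```

Both translations agree with the start and end of the Protein Sequence listing, and the final codon TAA is a stop codon, which accounts for the 1728 nt length against 575 residues.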