Entry THECC11013 (A0A061FYU3)

Theobroma cacao


General Information

Description
Arginosuccinate synthase family isoform 2
Organism
THECC - Theobroma cacao (Taxon-ID: 3641)
Locus
3 join(complement(23469185..23469251), complement(23468827..23468885), complement(23468663..23468744), complement(23467577..23467718), complement(23467295..23467343), complement(23467013..23467195), complement(23466530..23466604), complement(23466216..23466401), complement(23466009..23466097), complement(23465376..23465928))
Number of exons
10
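
The locus above is a set of "start..end" ranges, so the exon count and the total coding length can be checked directly from it. The following is a minimal sketch, not part of the original entry: the names LOCUS, parse_exons and exon_lengths are hypothetical, coordinates are assumed to be 1-based and inclusive, and the leading "3" before "join" (assumed to be a chromosome or scaffold label) is left out.

```python
import re

# Locus string copied from the entry. The leading "3" is omitted here
# (assumption: it is a chromosome/scaffold label, not a coordinate).
LOCUS = (
    "join(complement(23469185..23469251), complement(23468827..23468885), "
    "complement(23468663..23468744), complement(23467577..23467718), "
    "complement(23467295..23467343), complement(23467013..23467195), "
    "complement(23466530..23466604), complement(23466216..23466401), "
    "complement(23466009..23466097), complement(23465376..23465928))"
)


def parse_exons(locus):
    """Return a (start, end) tuple for every 'start..end' range in the locus."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", locus)]


def exon_lengths(exons):
    """Length of each exon, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]


exons = parse_exons(LOCUS)
lengths = exon_lengths(exons)

print(len(exons))    # 10, matching "Number of exons"
print(sum(lengths))  # 1485, matching the coding sequence length
```

The summed exon length of 1485 nt corresponds to 495 codons, that is 494 amino acids plus one stop codon, which agrees with the protein and coding sequence lengths listed in this entry.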

Protein Sequence

MAQLKALSPN SSINLACYGP RRNTLLFSDS LSCSRKLSSF LEVSGRASAL HGRAILSNNG    60
SVTLAASNRG IRAVLSSDRE MEVSTATKAG GLRGKLNKVV LAYSGGLDTS VIVPWLRENY   120
GCEVVCFTAD VGQGIKELDG LEEKAKASGA CQLVVKDLKE EFVRDYIFPC LRAGAIYERK   180
YLLGTSMARP VIAKAMVDVA REVGADAVSH GCTGKGNDQV RFELTFFALN PELNVVAPWR   240
EWDITGREDA IEYAKKHNVP VPVTKKSIYS RDRNLWHLSH EGDILEEPAN EPKKDMYMMS   300
VDPEDAPDQP EYLEIGIVSG IPVSVNGKNL SPASLLAELN EIGGRHGVGR IDMVENRLVG   360
MKSRGVYETP GGTILFNAVR ELESLTLDRE TIQVKDSLAL KYAELVYAGR WFDPLRESMD   420
AFMEKITETT TGSVTLKLYK GSVSVTGRTS PHSLYRQDIS SFESGQIYDQ ADAAGFIRLY   480
GLPIRVRAML EKGI                                                     494

Coding Sequence

ATGGCTCAGC TTAAGGCCTT ATCGCCAAAT TCGTCGATTA ACCTGGCCTG TTACGGGCCC    60
AGAAGAAACA CATTGCTCTT TTCTGATAGC TTGTCTTGCT CTCGAAAGTT GTCTTCTTTC   120
CTAGAGGTAA GTGGAAGGGC AAGTGCACTC CATGGCCGTG CAATTCTAAG CAACAATGGT   180
TCTGTTACTC TAGCTGCTAG TAATCGAGGA ATAAGAGCCG TTTTATCCAG TGACAGAGAG   240
ATGGAAGTTT CCACAGCTAC AAAGGCTGGA GGGCTGCGAG GCAAATTAAA CAAAGTTGTT   300
TTAGCCTATA GTGGTGGCTT AGACACTTCT GTGATTGTGC CATGGCTAAG GGAGAATTAT   360
GGTTGTGAGG TTGTTTGCTT CACAGCCGAT GTTGGCCAAG GCATAAAGGA ATTGGACGGT   420
TTGGAAGAAA AGGCCAAGGC AAGCGGGGCT TGCCAGTTAG TAGTGAAGGA CTTAAAGGAG   480
GAATTTGTGA GAGACTACAT ATTTCCTTGC TTGCGAGCTG GCGCCATTTA TGAAAGGAAA   540
TACTTGTTAG GGACCTCAAT GGCCCGGCCT GTTATTGCAA AGGCCATGGT GGATGTTGCC   600
AGAGAGGTTG GGGCAGATGC TGTTTCTCAT GGATGTACGG GGAAAGGAAA TGATCAGGTT   660
CGCTTTGAGC TTACATTCTT TGCTCTGAAT CCTGAATTAA ATGTTGTGGC TCCTTGGAGA   720
GAATGGGATA TTACAGGGAG AGAAGATGCT ATTGAATATG CTAAGAAGCA TAATGTTCCT   780
GTTCCAGTGA CGAAGAAATC AATTTACAGC AGAGACAGGA ACTTATGGCA CTTAAGCCAT   840
GAGGGAGATA TCTTGGAGGA GCCAGCCAAT GAACCAAAGA AAGATATGTA CATGATGAGT   900
GTTGACCCAG AAGATGCACC TGATCAACCT GAATATTTGG AAATTGGAAT TGTTTCTGGT   960
ATCCCTGTTT CAGTTAATGG AAAGAATCTT TCACCGGCTT CTCTTCTTGC TGAACTCAAT  1020
GAGATTGGCG GGAGACATGG AGTTGGTCGT ATTGACATGG TTGAAAACCG GCTTGTTGGT  1080
ATGAAAAGTC GTGGAGTCTA TGAAACTCCT GGTGGTACCA TTCTATTCAA TGCTGTTCGT  1140
GAGCTGGAGT CTCTAACACT TGACCGGGAA ACCATTCAAG TTAAAGATTC ACTTGCCCTC  1200
AAGTATGCTG AGCTAGTTTA TGCTGGAAGG TGGTTTGACC CACTTCGTGA GTCCATGGAT  1260
GCATTTATGG AGAAGATTAC TGAAACAACC ACCGGTTCTG TCACTCTGAA ACTATACAAA  1320
GGATCTGTTT CTGTAACAGG TCGGACCAGT CCCCATAGCT TGTACAGGCA GGATATCTCC  1380
TCCTTTGAGA GTGGACAGAT ATATGATCAA GCTGATGCTG CTGGGTTTAT TAGGCTGTAT  1440
GGTCTTCCGA TAAGGGTCAG GGCAATGCTT GAGAAGGGCA TCTGA                  1485
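
The coding sequence and the protein sequence in this entry can be cross-checked by translating one into the other. The following is a minimal sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), and the helper names clean_block and translate are hypothetical, not part of any database tooling. Codons containing characters other than A, C, G, T are rendered as "X".

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1); amino acids listed in the
# codon order TTT, TTC, TTA, TTG, TCT, ... GGG. "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_block(lines):
    """Join block-formatted sequence lines, keeping letters only.

    This drops the spaces between 10-character groups and the running
    position count at the end of each line.
    """
    return "".join(ch for line in lines for ch in line if ch.isalpha()).upper()


def translate(cds):
    """Translate a nucleotide string codon by codon; incomplete tail is ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First line of the coding sequence block, copied from the entry.
first_line = (
    "ATGGCTCAGC TTAAGGCCTT ATCGCCAAAT TCGTCGATTA ACCTGGCCTG TTACGGGCCC    60"
)
peptide = translate(clean_block([first_line]))
print(peptide)  # MAQLKALSPNSSINLACYGP
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing all lines of the coding sequence block to clean_block should give a 1485 nt string whose translation ends in "*", since the final codon is TGA.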