Entry THAOC08730 (K0T203)

Thalassiosira oceanica


General Information

Description
transcript_id=EJK72653
Organism
THAOC - Thalassiosira oceanica (Taxon-ID: 159749)
Locus
supercontig_To_g05567 complement(2451..6596)
Number of exons
1
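
The locus range and the exon count can be cross-checked against the sequence lengths listed further down in this entry. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the entry); `locus_span` is a hypothetical helper name.

```python
import re


def locus_span(locus: str) -> tuple[int, int, int]:
    """Return (start, end, length) for the 'start..end' range in a locus string.

    Assumes 1-based, inclusive coordinates (an assumption about this entry).
    """
    match = re.search(r"(\d+)\.\.(\d+)", locus)
    if match is None:
        raise ValueError(f"no start..end range found in {locus!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


start, end, length = locus_span("complement(2451..6596)")

# With a single exon, the genomic span should equal the coding sequence length.
cds_length = 4146       # last position counter in the Coding Sequence block
protein_length = 1381   # last position counter in the Protein Sequence block

print(length == cds_length)                 # span matches the CDS
print(cds_length == 3 * protein_length + 3) # codons plus one stop codon
```

Under that assumption both checks hold: the span is 4146 nt, which is 1381 codons plus a stop codon.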

Protein Sequence

MNTQGLTDET VHSMMSLPAE DVGHPQSKRS TNGGMRTSRR RQMMELGEGE VPSATVIRSS    60
APKVHTSKSK GSSSNAGPRS GPAVAASSQS TTDQHNNTID GSMKRGARPI VISENIRERP   120
AGRCGSSITN GPKRASRFRQ RNRSKIGAVD VPTKGGFPSL DAAPVGKFTR KGRVNSSSME   180
ASLPSQRVNA VTHTKQQQPD SSSESALAHM SSEEIADGIA EVESVLSAAS IAFLKRRGKQ   240
KMSDKPKDNR LKETNAIRPT PGYNHLSRGD QRDQYEKQFI SSLLSTVRTP EDMERVYSEA   300
VEKGLAPELP SSSLDIEEPN VTCSENRDRH RSIQIATSLL RSTAHRQRLL GARSLCEILE   360
EDVGTFTESG PKRTAYPELL PVALRCLLDD AVATYRTSSG RLLLTFAIRC IAAITKLCVH   420
PLHAAVMSID DDSKDPFSLY QTHFMSDISH YPPGGKLYPP TKIAPIETNS LSNACYRSDS   480
SAATAESDSK AFYDDPAWTL LSRMRIIPCI ADVIPSLPND PSTGITIQSM LDILAMLSVR   540
LPGAASAIAM HERILPFILS HCLAPAGVKL SKEESMSQNE ENLFRTDLAV PAIKLLSLLA   600
RQSKDIAELD LFQKEAISDV LAILCSDAMD REETRLQIWC LIYVRIIMRY GLATSAIQAI   660
ISIATPKVAL LERNDHVCVH YLQLFAVICH QSNSILSSGV IAEDVSDGAL MSLVWLASSA   720
RSCASTMTRT MAQSKNSSDA EATLLITSQM NLLASYFQAS KSSEEGDSVP VISDEQCDNV   780
VSEVIASEVC TSMLSTAMTA AFLPSWDNAV DAHADLPLPE EAAACSFVTT LSTFVRVIGL   840
ENLSGNAGNV LIDKILERME RANQSRPPVA RSNLSNPSRQ SWLVEAEFSI LMVVCEWLHS   900
NQERVDGLRP HLAAFTYSLL GRLSIGHEAL AQAIFFQPEL FQLSNSQRSE NVQKIFTREF   960
NTEGQVQQLR HSMFIYPPCV SGPTASTSLR CKADTSGRSE DVLPLGKLWM WNTLSSTVTE  1020
HGSTRQIVDM ISHSLGLLTS LECTNNPRYV EGISRGTKLY HLANICLFPE SVISDEFVGP  1080
TAQELFQILT TTPDLQDEIS LVVDFIRACY DHSRLSRETP RSSDSKEDGE SVKTLLGSSP  1140
SETVGGPLSQ KELKALEDFV DDMCGAYVEF GAQYDAFTLF VRFFLRPWFP PKVTASVLSR  1200
LHPILNLLTV DGESRPDLML SLRTSVRGGL PGVDPAARRD PSGVLDAHAL SLRRREKTPS  1260
RDDYHYLLST AVLGRNLASS SRRCECGVGA MRSRLRDVPA GVVYDVFRVA ESFLQGDGSR  1320
GSLVECALSV CCDQTLALES QGRDVRDEWQ RGAVEDAKWE RAMEGLQSVN EVQPTEDMHK  1380
Y                                                                  1381

Coding Sequence

ATGAACACTC AGGGGCTCAC AGACGAGACA GTCCATTCCA TGATGTCACT GCCCGCTGAG    60
GATGTGGGAC ACCCTCAAAG TAAGAGATCG ACGAACGGAG GAATGAGAAC AAGTCGCAGG   120
CGGCAGATGA TGGAGCTTGG CGAAGGAGAG GTCCCATCCG CGACAGTGAT TCGATCGAGT   180
GCTCCTAAAG TCCATACCTC AAAATCCAAG GGGAGCTCGT CCAATGCCGG TCCCCGATCA   240
GGGCCCGCTG TCGCGGCATC CTCACAATCC ACAACAGATC AACATAATAA CACCATCGAT   300
GGATCGATGA AACGCGGCGC TCGACCTATC GTTATATCGG AAAATATTCG AGAAAGACCT   360
GCGGGGAGGT GTGGCAGTTC TATCACGAAT GGGCCAAAGC GAGCATCTAG ATTTAGACAG   420
CGCAATCGAT CGAAAATTGG CGCTGTGGAT GTACCAACGA AAGGGGGATT TCCATCACTA   480
GACGCGGCCC CAGTCGGAAA GTTCACGAGG AAGGGTAGAG TCAACTCCTC CAGCATGGAA   540
GCATCCCTTC CATCGCAGCG GGTTAATGCC GTGACCCATA CGAAACAGCA ACAGCCGGAT   600
TCGTCTTCTG AATCGGCCCT TGCGCATATG TCCAGTGAGG AGATAGCTGA TGGCATAGCA   660
GAAGTTGAAT CAGTTCTGTC CGCTGCATCC ATAGCATTCC TTAAAAGGCG AGGGAAGCAG   720
AAAATGTCTG ACAAGCCGAA GGACAATCGA CTGAAGGAAA CGAACGCTAT TCGGCCCACT   780
CCAGGCTACA ATCATCTTTC CAGAGGAGAT CAGAGGGACC AGTATGAAAA GCAATTTATT   840
TCGTCTCTCC TGTCGACCGT TCGGACCCCA GAGGATATGG AGCGTGTGTA CAGCGAGGCA   900
GTGGAGAAAG GGCTAGCGCC CGAGCTTCCA TCTTCGTCCC TCGACATTGA AGAACCGAAT   960
GTTACATGTA GCGAAAACCG CGACAGGCAC AGATCGATAC AAATAGCGAC ATCACTTTTG  1020
CGATCGACCG CTCATAGGCA GCGGCTTTTG GGCGCAAGGA GTCTTTGCGA GATTCTAGAG  1080
GAGGACGTGG GCACTTTTAC TGAGAGTGGG CCAAAGCGCA CAGCCTACCC TGAACTGCTT  1140
CCCGTTGCCC TTCGATGCCT CCTGGACGAT GCCGTGGCAA CCTACCGCAC TTCTTCGGGG  1200
AGACTTTTGC TAACATTCGC CATCCGGTGT ATTGCCGCCA TCACGAAACT TTGCGTGCAC  1260
CCATTACACG CCGCGGTGAT GTCGATTGAT GACGACAGCA AAGACCCCTT CAGTCTGTAC  1320
CAGACACATT TTATGAGCGA TATATCCCAC TATCCGCCGG GGGGCAAACT CTACCCGCCA  1380
ACAAAAATTG CGCCGATCGA AACTAACTCG CTCAGCAATG CATGCTATCG TTCTGATTCG  1440
TCAGCCGCAA CGGCTGAATC AGATTCGAAG GCATTCTACG ACGACCCCGC TTGGACATTG  1500
CTTTCTCGGA TGAGGATCAT TCCATGTATT GCGGATGTTA TCCCAAGTCT TCCGAACGAT  1560
CCATCAACCG GCATCACCAT TCAGAGTATG CTAGATATAT TGGCAATGCT TTCAGTTCGC  1620
CTACCCGGAG CAGCGAGTGC AATCGCCATG CACGAACGTA TTTTGCCGTT CATCCTCTCA  1680
CACTGCCTCG CGCCGGCCGG AGTCAAATTG TCAAAGGAAG AGTCGATGAG TCAGAACGAA  1740
GAGAACCTGT TTAGAACTGA CCTAGCCGTT CCAGCTATCA AGTTACTGTC GCTGTTGGCC  1800
CGCCAGTCAA AGGATATCGC CGAGCTCGAC CTTTTCCAGA AGGAAGCCAT TTCAGATGTA  1860
CTTGCGATTT TGTGTTCAGA TGCGATGGAT CGCGAGGAGA CGCGACTTCA AATCTGGTGC  1920
TTGATCTATG TGAGAATCAT AATGCGATAC GGCTTGGCAA CATCTGCCAT CCAAGCAATC  1980
ATCAGCATCG CGACGCCGAA GGTGGCTTTA TTAGAGCGAA ATGACCATGT TTGCGTGCAC  2040
TACCTTCAGC TATTCGCTGT CATTTGCCAC CAATCGAACA GTATACTATC GAGTGGGGTT  2100
ATTGCGGAGG ATGTATCCGA TGGAGCACTC ATGTCTTTAG TTTGGCTCGC GTCGTCAGCT  2160
AGGAGCTGCG CGTCAACCAT GACCCGCACC ATGGCCCAAT CGAAAAACTC AAGTGACGCA  2220
GAAGCTACCC TGCTCATCAC ATCTCAGATG AATCTTCTCG CGTCGTACTT TCAGGCGTCC  2280
AAGTCAAGTG AGGAGGGAGA CAGCGTTCCA GTGATTTCCG ACGAACAATG CGATAATGTG  2340
GTTTCTGAAG TGATCGCGTC CGAAGTATGC ACATCCATGT TATCGACGGC AATGACGGCG  2400
GCGTTTTTGC CCTCTTGGGA CAACGCAGTA GATGCGCACG CAGACCTCCC GCTACCAGAA  2460
GAGGCCGCGG CGTGTTCGTT TGTCACGACG CTGTCAACTT TCGTTCGAGT GATCGGTCTG  2520
GAAAACTTGT CTGGAAATGC AGGGAACGTA CTCATTGACA AAATTCTCGA GCGCATGGAG  2580
CGCGCGAATC AAAGCAGACC TCCCGTTGCG CGCTCGAACT TGTCCAACCC TTCTCGCCAA  2640
TCCTGGCTTG TCGAGGCCGA ATTTTCGATA TTGATGGTAG TGTGTGAATG GCTGCATTCG  2700
AATCAAGAGA GGGTGGATGG CTTGCGTCCC CATTTGGCAG CTTTTACCTA TTCTCTGTTG  2760
GGGCGGCTTA GCATCGGCCA CGAGGCCCTG GCGCAAGCCA TTTTTTTCCA GCCCGAGCTG  2820
TTTCAATTGT CTAATAGCCA GAGAAGCGAG AACGTCCAGA AGATTTTCAC ACGGGAATTC  2880
AACACGGAAG GACAGGTGCA ACAGTTAAGG CACAGCATGT TCATCTACCC TCCCTGCGTA  2940
TCAGGACCAA CAGCTTCGAC TTCATTGCGT TGTAAGGCCG ATACATCAGG GAGGTCAGAA  3000
GACGTCCTCC CCCTTGGCAA GCTGTGGATG TGGAATACGT TATCGAGTAC CGTGACCGAA  3060
CACGGCAGTA CCCGGCAAAT TGTCGACATG ATATCCCATT CGCTCGGACT TTTGACCAGC  3120
CTCGAATGCA CCAACAATCC TAGATACGTC GAGGGGATTT CGAGAGGGAC CAAGCTGTAC  3180
CACCTTGCGA ACATCTGCCT CTTTCCCGAG TCCGTAATCA GCGACGAATT CGTCGGCCCG  3240
ACAGCACAGG AGCTCTTCCA GATTCTGACA ACGACACCCG ATCTCCAGGA CGAGATTTCC  3300
TTGGTGGTCG ATTTCATACG AGCGTGCTAC GATCATTCCC GACTGTCAAG AGAGACGCCG  3360
AGATCCTCCG ACTCGAAGGA GGACGGCGAG TCGGTCAAGA CACTCCTCGG ATCGTCGCCG  3420
TCGGAGACTG TCGGCGGTCC GCTGAGCCAG AAGGAGCTCA AGGCCCTCGA GGACTTCGTC  3480
GACGACATGT GCGGCGCGTA CGTCGAGTTC GGCGCTCAGT ACGACGCGTT CACGCTGTTC  3540
GTCCGGTTCT TCCTGCGCCC CTGGTTTCCG CCCAAGGTAA CCGCGTCGGT CCTGTCGAGG  3600
CTCCATCCCA TCCTCAACCT GCTCACGGTC GATGGCGAAT CGAGGCCCGA TCTCATGCTG  3660
TCCCTACGGA CGTCGGTCCG CGGCGGTTTA CCCGGCGTCG ACCCCGCGGC CCGCCGTGAT  3720
CCGAGCGGCG TCCTTGACGC ACATGCTCTC TCCCTGAGGA GGCGCGAGAA GACCCCGTCC  3780
CGGGACGACT ACCACTACCT CCTCTCGACG GCCGTCCTCG GTCGGAACCT GGCGTCGAGC  3840
TCGAGGCGGT GCGAGTGTGG CGTCGGCGCC ATGAGGAGCA GGCTCCGGGA CGTCCCGGCT  3900
GGGGTCGTCT ACGACGTGTT CCGTGTCGCC GAGTCATTCC TCCAAGGGGA CGGATCGAGG  3960
GGCAGCCTCG TCGAGTGCGC GCTGTCAGTG TGCTGCGACC AGACCCTGGC GCTGGAGTCG  4020
CAGGGCCGGG ACGTGCGGGA CGAGTGGCAG CGGGGAGCGG TCGAGGATGC CAAATGGGAG  4080
AGGGCTATGG AGGGGCTCCA ATCGGTGAAT GAGGTGCAGC CAACGGAAGA CATGCATAAA  4140
TATTAA                                                             4146
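
The sequence blocks above are laid out in groups of ten residues with a running position counter at the end of each line. A minimal sketch for turning such a block back into a plain sequence string is shown below; `clean_sequence_block` is a hypothetical helper name, and the example input is only the last two lines of the coding sequence.

```python
def clean_sequence_block(block: str) -> str:
    """Strip spaces, line breaks and position counters from a formatted block.

    Keeps alphabetic characters only, so it works for both the protein and
    the nucleotide layout used in this entry.
    """
    return "".join(ch for ch in block if ch.isalpha()).upper()


# Last two lines of the Coding Sequence block, copied from the entry.
tail = """
AGGGCTATGG AGGGGCTCCA ATCGGTGAAT GAGGTGCAGC CAACGGAAGA CATGCATAAA  4140
TATTAA                                                             4146
"""

cleaned = clean_sequence_block(tail)
print(len(cleaned))             # number of nucleotides recovered
print(cleaned.endswith("TAA"))  # the CDS ends in a TAA stop codon
```

Applied to a whole block, the length of the cleaned string should equal the final position counter (1381 for the protein, 4146 for the coding sequence).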