Entry THAPS09161 (EED91530)

E Thalassiosira pseudonana


General Information

Description
Predicted protein [Source:UniProtKB/TrEMBL;Acc:B8C5M0]
Organism
THAPS - Thalassiosira pseudonana (Taxon-ID: 296543)
Locus
6join(complement(759979..761924), complement(759528..759909), complement(759352..759461), complement(759114..759261))
Number of exons
4

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MNSLKERIVH IVDPTNPTHI PLFLPRISVS NADNEDIRPF QSTDDIDRVM DYANMPDPPQ    60
PCLVMHCEAP GSGIDDGEGD GTDGKDDDGG VVFPKRLAHL RGGYIFLCKP TIGQDDYGDG   120
DGADKRGKIA CIPLAGCTVE FPPGGRRVFR EHAHTGARAG YSMAIHVKSL EVEKQQKTTC   180
FIVLDSLSHR EAWAAAIRMR CDVGNKVTVL RPGGMAGSRM MSVARDGYDQ GYYNEKGHLA   240
TTYGSGVGNR QSKKPKHWGL RKSQQIDGGA GAGGGSGNAD LDAALEQFGQ PGFLEEKLVH   300
QFLQQHAVNE LPGECDKLEK WMDVIKNGLR GAVLEQYEYF VEASREMSTM GREVAWLRNL   360
VERQQETLMS MKNIDFGAGL EDVEEGVGYE DGYGYLSDED EMMGDFGIDG DDSDASSVAS   420
SSSEEGTSTK TPMRNNIRNR RRESRKGTQG DILVSPIDED SPARPSVVGS SAYIEIPPWI   480
SDVTEEVEAL VKESRYTDAT DLILKAKAEV SDILAAHEQP TPPVVTAQTS LSSRNPIVAT   540
TTLSTPGGSP QKKLHKKQQA LLHRTSLQID ALMDRMSKRL AENLRRKNEA LKASAKRERA   600
DPLSTLAPLA SPVCLNDDAV ALQLLVKLGR HQEAATAYAA RRSLVLSECL HERPISSPVG   660
MDAVIYAAQL SSSFFSSLAM AVEGFLDLFD APDCGKGGAD DDTSLNSRSH FMGGGGKRIP   720
SGALSAVVLW CDSELAKFTN VFGSSRLLGS LSLSPPGLRR TEDKAEGKNS LAKEREHSIE   780
VASKCIDQAF LFASENLDSI GLPLTPKLAE NMRPRLKGCE SEVASLLDAR WKSLAYDWVV   840
ESNVQKRTSS SRPIAIDDRR V                                             861

Coding Sequence

Download: Fasta
ATGAACTCCC TCAAAGAACG AATCGTCCAC ATCGTCGACC CAACAAATCC AACCCACATT    60
CCCCTCTTCC TCCCTCGCAT CTCTGTCTCC AACGCCGACA ATGAAGACAT TCGCCCCTTC   120
CAATCCACAG ACGATATTGA TCGTGTCATG GATTACGCCA ACATGCCTGA TCCTCCACAG   180
CCATGTTTAG TAATGCACTG TGAGGCCCCG GGATCGGGAA TTGACGATGG CGAAGGAGAC   240
GGTACGGATG GGAAAGATGA TGATGGGGGT GTTGTGTTTC CAAAGAGACT TGCTCATTTG   300
AGAGGGGGGT ATATATTTTT ATGCAAGCCT ACCATAGGCC AAGACGACTA TGGTGATGGT   360
GACGGCGCCG ATAAACGGGG AAAGATTGCA TGTATTCCAC TTGCGGGATG TACGGTCGAG   420
TTTCCTCCGG GTGGGAGAAG GGTATTTCGT GAGCATGCCC ATACGGGTGC CAGAGCAGGA   480
TATTCAATGG CTATTCATGT CAAGTCTTTA GAGGTCGAGA AACAGCAGAA GACGACGTGC   540
TTCATCGTGT TGGATAGTTT GAGTCATCGT GAAGCATGGG CAGCTGCTAT TCGTATGCGA   600
TGTGATGTGG GGAACAAGGT TACAGTATTG CGTCCTGGTG GAATGGCGGG GAGTAGGATG   660
ATGAGTGTTG CTAGAGATGG ATACGATCAG GGTTATTACA ATGAAAAGGG ACATCTTGCA   720
ACGACTTATG GATCAGGCGT GGGTAATCGA CAGAGTAAGA AACCAAAGCA TTGGGGGTTG   780
CGTAAATCAC AACAGATAGA TGGAGGAGCA GGGGCAGGAG GTGGAAGTGG CAATGCTGAT   840
TTGGATGCGG CACTAGAACA GTTTGGTCAA CCTGGCTTTC TGGAGGAGAA GTTGGTACAT   900
CAATTCTTAC AACAGCATGC AGTCAATGAA TTGCCGGGTG AATGCGACAA ACTAGAGAAG   960
TGGATGGATG TTATTAAGAA TGGATTGAGG GGTGCAGTTT TGGAACAATA CGAGTACTTT  1020
GTGGAGGCAT CGCGTGAAAT GTCAACCATG GGAAGGGAAG TGGCGTGGTT ACGTAATTTG  1080
GTAGAGAGAC AGCAAGAGAC TTTGATGAGT ATGAAGAACA TTGATTTTGG AGCTGGATTG  1140
GAGGACGTTG AGGAAGGAGT GGGATATGAG GATGGATACG GCTATCTCAG TGACGAGGAC  1200
GAGATGATGG GTGACTTTGG TATCGATGGC GACGACAGTG ATGCATCGAG TGTGGCCTCT  1260
TCATCGTCGG AAGAAGGAAC ATCTACCAAA ACACCAATGA GAAACAATAT ACGGAATCGC  1320
CGGAGAGAAT CGCGCAAAGG TACCCAAGGA GATATTCTCG TGTCTCCCAT AGACGAAGAC  1380
TCACCAGCTC GTCCAAGTGT TGTCGGATCA TCTGCGTACA TTGAGATACC ACCGTGGATT  1440
AGTGATGTGA CGGAGGAAGT TGAAGCTCTA GTAAAAGAGA GTCGGTACAC CGATGCTACA  1500
GACCTCATAC TCAAAGCAAA GGCCGAAGTA TCTGATATAC TCGCAGCGCA CGAACAACCA  1560
ACTCCACCCG TGGTGACTGC TCAAACTTCT CTCTCCTCGA GAAATCCCAT CGTTGCAACA  1620
ACTACATTAT CGACACCTGG TGGATCTCCA CAGAAGAAAC TTCATAAGAA GCAGCAGGCT  1680
CTTCTTCACC GAACCAGCCT CCAAATTGAT GCTCTCATGG ATCGCATGTC GAAACGTCTA  1740
GCCGAGAACT TGAGACGAAA GAACGAAGCT CTCAAAGCAT CTGCAAAGAG AGAACGTGCC  1800
GATCCTTTGT CGACGCTAGC TCCCTTGGCA TCGCCTGTAT GCCTAAATGA TGATGCGGTT  1860
GCGTTGCAGT TGCTTGTAAA GTTGGGAAGA CATCAGGAGG CTGCCACTGC TTATGCAGCC  1920
AGGAGGAGTT TGGTGTTATC GGAATGCCTA CACGAAAGGC CGATATCTAG TCCTGTTGGA  1980
ATGGATGCTG TCATATACGC TGCTCAGTTG AGTTCAAGCT TCTTCTCATC CCTCGCCATG  2040
GCGGTCGAGG GATTTCTCGA TCTTTTTGAT GCACCGGATT GCGGTAAGGG TGGTGCAGAC  2100
GACGACACGT CTCTGAACTC TAGGTCTCAT TTTATGGGAG GAGGAGGGAA GCGAATTCCA  2160
TCGGGGGCAC TATCGGCGGT TGTGTTGTGG TGTGATTCTG AGTTGGCCAA GTTTACTAAT  2220
GTGTTTGGGA GTTCACGTTT ACTTGGCAGT CTCTCGTTGT CACCTCCAGG ATTACGAAGA  2280
ACCGAAGATA AGGCGGAAGG AAAGAATTCT CTCGCAAAGG AACGGGAGCA TTCAATTGAG  2340
GTAGCATCGA AATGTATCGA TCAAGCGTTT CTCTTTGCTT CTGAGAACTT GGATTCAATC  2400
GGACTTCCTT TGACTCCCAA ACTGGCTGAG AACATGAGAC CTAGATTGAA AGGTTGCGAG  2460
TCGGAGGTCG CTTCACTTTT AGATGCCCGT TGGAAGAGCC TTGCTTATGA TTGGGTTGTT  2520
GAAAGCAATG TTCAGAAACG CACCTCTTCT TCTCGCCCAA TAGCGATTGA CGACAGACGT  2580
GTGTGA                                                             2586