Entry THAPS03444 (EED88271)

Thalassiosira pseudonana


General Information

Description
Predicted protein [Source:UniProtKB/TrEMBL;Acc:B8CDV2]
Organism
THAPS - Thalassiosira pseudonana (Taxon-ID: 296543)
Locus
17join(complement(50110..50740), complement(49960..50010), complement(47740..49841))
Number of exons
3
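
The Locus string and the exon count above can be cross-checked with a short script. This is a minimal sketch, not the database's own parser: it assumes the leading "17" is a chromosome or scaffold label, that each start..end pair is 1-based and inclusive, and that complement(...) marks the reverse strand. The function names parse_join_location and segment_lengths are hypothetical.

```python
import re


def parse_join_location(location):
    """Split a 'join(...)' locus string into (label, [(start, end, strand), ...])."""
    # Everything before 'join(' is taken to be the sequence label (assumption).
    label, sep, body = location.partition("join(")
    if not sep:
        raise ValueError("expected a join(...) location")
    segments = []
    # Each segment is 'start..end', optionally wrapped in complement(...).
    for comp, start, end in re.findall(r"(complement\()?(\d+)\.\.(\d+)", body):
        strand = "-" if comp else "+"
        segments.append((int(start), int(end), strand))
    return label.strip(), segments


def segment_lengths(segments):
    """Length of each segment, treating coordinates as 1-based and inclusive."""
    return [end - start + 1 for start, end, _ in segments]


locus = ("17join(complement(50110..50740), complement(49960..50010), "
         "complement(47740..49841))")
label, segments = parse_join_location(locus)
lengths = segment_lengths(segments)
print(label, len(segments), lengths, sum(lengths))
# prints: 17 3 [631, 51, 2102] 2784
```

Under these assumptions the three segments agree with the stated exon count, and their combined length of 2784 nt equals the final running count of the coding sequence in this entry.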

IDs and Cross-references

Domain Architecture

Gene Ontology

Protein Sequence

MEAAKDITNT TSDNDVENPT VNAEDTIGNV SEDQTVHSPV TSGRSISKGV KLGAVLVLLT    60
VAVVLALTLG LDIGSTGGEG GVPVNGAAVQ QGSGSASSSS VGTDTATTST SVTEDSLIAP   120
SDNTPYFTST GPLPSKLKMI SPAITNGYET CSDLEKDIVE AMKHSANSII MEQVETGEMY   180
ASCDPENDDW YSAYYEEHYG YDYHYWEGYY DDDDDSEKNN DTEFDLFYDD GAVDISRTKK   240
QVRSSSPSMH HKRNAPLPRP RTTDKSSSHI RSNGPSSNQQ RLSTSSKPED NFHTNTPVDG   300
VDPADKVKSD GKFVYAAYGD VLYVWNATDA TQGVSITAML GNATDCHWNE TEAEPCNYVS   360
KPNIQALLLS GGRLTAIVDQ FSYVYPLPEN YTSPIISDST KLSVQVYDVS NLKLGTPLKM   420
LAHEELNGSF FDGHSVDDKA LIISQVLVDS YQFTEDLYRY DSQYCGLSSS EYTARAAEVA   480
ASKVEAFAKQ LVDELKLINN GCSNIFQVSM MQGSSNGTNG NSSMPDLTGG NLISGFTQVT   540
TFDMASDFGS DGSISLSVAG AFNEGWISVT SVGDGFIATV TNGYTYDYST EKSYYNTYLL   600
GFDTTGATAE PFCYGSVPGW LSNKFSLDLW DGHLRVATTA YDDWTSNTSN KIFVLRIPEN   660
GSVMETVGET EHLGEDNDII YSVRFIEKRA YVVTYGAVDP FIIVDLSDHT EPKTVGELEI   720
PGHSSYLQKL EVDGEHFILG IGSQVTNETT WESSLKLTLF DVKDPSSPRV AAEHLAANLS   780
TDAEYDFQAI RYLTESKKLV IPFSSYVDGI STDGFMVFDV ASDEIELAYA IMPPSETDYC   840
WYEASVPPRM LVVQSKLTTV KGHSVVNADL STGSVVWELD LDVGFNYSVC EIYEYDYDEY   900
YYNQSWGTYS PTYVPTRPPV SDDDGAL                                       927
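
The sequence above is laid out in blocks of ten residues, with a running residue count at the end of each line. A minimal sketch for turning that layout back into a plain string, while checking each running count, might look like the following. The helper name read_numbered_blocks is hypothetical, and the layout rule (whitespace-separated blocks, last field is the cumulative count) is inferred from the listing itself.

```python
def read_numbered_blocks(lines):
    """Join block-formatted sequence lines, verifying each trailing running count."""
    pieces = []
    total = 0
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        # Last field is the cumulative count; the rest are sequence blocks.
        *blocks, count = fields
        chunk = "".join(blocks)
        total += len(chunk)
        if total != int(count):
            raise ValueError(f"running count {count} does not match {total}")
        pieces.append(chunk)
    return "".join(pieces)


# First two lines of the protein listing, copied from the entry.
sample = [
    "MEAAKDITNT TSDNDVENPT VNAEDTIGNV SEDQTVHSPV TSGRSISKGV KLGAVLVLLT    60",
    "VAVVLALTLG LDIGSTGGEG GVPVNGAAVQ QGSGSASSSS VGTDTATTST SVTEDSLIAP   120",
]
protein_start = read_numbered_blocks(sample)
print(len(protein_start))
# prints: 120
```

The same function applies to the coding sequence listing, since it uses the same block layout.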

Coding Sequence

ATGGAAGCAG CCAAAGACAT CACCAATACA ACTAGCGACA ATGACGTCGA GAACCCAACC    60
GTTAACGCTG AAGATACCAT CGGCAATGTG TCTGAAGATC AGACGGTACA CTCGCCAGTT   120
ACAAGCGGCC GATCTATCTC CAAGGGAGTC AAGTTGGGGG CCGTTCTCGT ACTTCTCACT   180
GTGGCTGTTG TTCTAGCTCT GACACTTGGA CTCGACATTG GATCTACCGG TGGAGAAGGA   240
GGTGTTCCAG TTAATGGAGC TGCGGTGCAA CAAGGAAGTG GAAGTGCATC CTCCTCATCC   300
GTTGGTACTG ATACCGCTAC TACCTCTACT TCCGTCACCG AAGACTCTCT CATCGCCCCC   360
TCTGACAACA CTCCGTACTT CACATCCACC GGACCACTCC CATCCAAACT CAAGATGATT   420
TCGCCCGCCA TCACCAACGG TTACGAAACA TGCTCTGATC TTGAAAAGGA TATTGTCGAG   480
GCGATGAAGC ATTCTGCCAA CAGTATCATT ATGGAGCAAG TTGAGACGGG TGAGATGTAT   540
GCAAGTTGTG ATCCAGAGAA TGATGATTGG TATTCTGCTT ATTATGAAGA GCATTATGGA   600
TATGATTATC ACTACTGGGA AGGATATTAT GACGATGACG ACGACTCGGA AAAGAACAAT   660
GATACTGAAT TTGATCTTTT TTACGACGAC GGTGCTGTTG ATATCAGCAG AACAAAGAAG   720
CAAGTGAGAT CCTCTTCTCC TTCAATGCAC CACAAAAGGA ATGCTCCACT TCCCCGCCCT   780
CGCACGACGG ATAAATCCTC CTCCCACATC CGTAGCAACG GTCCTTCATC TAATCAACAA   840
CGTTTGTCAA CGTCCAGCAA GCCCGAAGAT AACTTCCATA CCAACACTCC AGTTGACGGT   900
GTGGATCCAG CGGACAAGGT CAAATCCGAT GGAAAGTTCG TCTATGCAGC CTATGGGGAT   960
GTGCTCTATG TATGGAACGC TACAGATGCC ACTCAAGGAG TATCCATCAC CGCCATGCTT  1020
GGTAATGCAA CTGATTGTCA CTGGAACGAG ACTGAAGCTG AACCTTGCAA CTACGTCTCG  1080
AAGCCAAATA TCCAAGCACT GCTCTTGAGT GGAGGTCGCT TGACTGCGAT TGTAGATCAG  1140
TTCTCCTACG TGTATCCACT TCCCGAGAAC TACACTAGCC CCATCATTTC GGATTCCACC  1200
AAGCTCTCTG TGCAGGTCTA TGATGTTTCC AACTTGAAAC TCGGGACCCC TCTCAAGATG  1260
CTAGCACACG AGGAGTTGAA TGGATCCTTC TTCGATGGTC ATTCTGTTGA CGACAAAGCT  1320
CTCATCATTA GTCAAGTGTT AGTTGATTCC TATCAGTTCA CTGAGGATTT GTATCGTTAT  1380
GACTCTCAGT ACTGTGGTTT GAGTTCATCC GAGTATACTG CCCGTGCTGC TGAAGTTGCT  1440
GCATCAAAGG TTGAGGCGTT TGCCAAGCAA CTGGTCGATG AACTAAAGCT TATCAACAAC  1500
GGTTGCTCCA ACATCTTTCA AGTGTCCATG ATGCAGGGCT CTTCCAACGG CACCAACGGT  1560
AACAGCTCGA TGCCTGACTT GACCGGAGGC AACCTAATCA GTGGCTTTAC TCAAGTTACC  1620
ACCTTTGACA TGGCATCTGA TTTTGGTTCC GATGGAAGCA TTTCGTTGTC AGTTGCTGGC  1680
GCCTTTAACG AAGGATGGAT TTCAGTGACC TCTGTTGGGG ACGGATTCAT CGCTACTGTA  1740
ACGAATGGCT ACACATATGA CTATTCAACT GAAAAGTCTT ACTACAACAC ATACCTCCTC  1800
GGCTTTGACA CGACCGGAGC CACCGCAGAG CCATTCTGCT ACGGAAGTGT TCCCGGATGG  1860
CTGAGTAATA AGTTCTCTTT GGATCTATGG GATGGGCATT TACGTGTTGC CACGACGGCA  1920
TACGACGATT GGACATCCAA CACTTCGAAT AAGATCTTTG TTCTTCGTAT TCCGGAGAAT  1980
GGATCAGTCA TGGAGACCGT TGGAGAGACG GAGCATCTTG GGGAGGACAA TGACATTATC  2040
TATTCGGTTA GATTCATTGA AAAAAGGGCT TACGTCGTCA CGTATGGAGC TGTTGATCCC  2100
TTCATCATTG TTGACTTGTC CGATCATACG GAACCAAAGA CTGTAGGCGA GCTGGAGATC  2160
CCTGGGCACT CTTCATACTT GCAAAAGCTT GAAGTCGACG GTGAACACTT TATTCTCGGC  2220
ATTGGATCTC AGGTAACGAA CGAGACGACC TGGGAATCTT CATTGAAGCT CACATTGTTT  2280
GATGTAAAGG ATCCATCTTC ACCCAGAGTT GCAGCTGAAC ATCTTGCTGC CAACTTATCG  2340
ACTGATGCTG AGTACGACTT CCAGGCGATC AGATACTTGA CTGAGTCAAA GAAGCTCGTC  2400
ATTCCATTCT CATCGTATGT TGACGGTATC TCTACCGATG GATTCATGGT GTTTGATGTT  2460
GCGAGCGATG AGATTGAGCT TGCATACGCC ATTATGCCTC CCTCTGAAAC TGACTATTGC  2520
TGGTACGAAG CCTCCGTTCC TCCAAGGATG CTCGTTGTTC AGTCCAAACT CACGACCGTC  2580
AAAGGACACA GCGTTGTCAA TGCAGATTTG AGTACGGGAA GTGTTGTATG GGAGTTGGAT  2640
TTGGATGTTG GATTCAACTA TTCCGTTTGT GAGATCTACG AGTATGACTA TGACGAGTAC  2700
TACTACAATC AATCGTGGGG CACTTACTCC CCCACATATG TTCCTACCAG GCCTCCTGTG  2760
AGCGATGATG ATGGTGCATT GTAA                                         2784
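
The coding sequence and the protein sequence in this entry can be checked against each other by translating codons. The sketch below assumes the standard genetic code applies to this nuclear gene (an assumption, not stated in the entry). The table is built from the conventional TCAG codon ordering, and the translate function is a hypothetical helper.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Start and end of the coding sequence, copied from the listing.
print(translate("ATGGAAGCAG CCAAAGACAT C"))
# prints: MEAAKDI
print(translate("GGTGCATTGT AA"))
# prints: GAL
```

Both fragments match the corresponding ends of the protein listing (MEAAKDI... and ...GAL). The lengths are also consistent: 927 residues plus one stop codon give 3 × 928 = 2784 nucleotides.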