Entry GUITH17971 (EKX48407)

E Guillardia theta


General Information

Description
hypothetical protein
Organism
GUITH - Guillardia theta (Taxon-ID: 55529)
Locus
GUITHscaffold_22join(complement(763626..765392), complement(763532..763577), complement(763426..763481), complement(760278..763373), complement(759311..760220), complement(759123..759234), complement(758841..759074), complement(757467..758787), complement(754524..757415), complement(754419..754466), complement(751327..754368))
Number of exons
11

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MGDESHSKPS DPRIDYISEQ TSTLLPCKLE IAQRIKDDEK GVKLLNGFFD DPSEKLLWIY    60
LQSKDQCVVT ASPPQNFKTK AIYLLKSNGS VSKENFVQDM SFCEFVGQPL EQFLAISQEV   120
YFPLLTNPRN QEGWPEVITK EVTESIHRFL AQTYITIGNT QGKTLLPLPP NHGAALMDAA   180
GRDKDRIHVL ESAVVMWTKQ IKNILKMDPE SALKQGNPGP EVELEFWKNK AENLNSLHNQ   240
LVGEKIRKVV KVLEVTKSTY FPAFNRLCKE VAQARMEAND NLVYLEPMVK YVNMLGAEQF   300
QDLIPHFKPT MHTLLLIWKN SKFYNTPARF VVILRELCNT LISQACNFVS GATLFENIEE   360
PQDAVDNLKL CLKVGVAFKS AYFDYKSKAN IECPSNPWRF QNSALFSRLD SFLERCHDVL   420
DLMQTILQFN KLGGDRGVEI GGTKGKMLTT SVRQTHVDFL QSIKVFIEAK YDIMDVESKQ   480
FDDDFYTFRC QINELERRLA SVITQAFDDS VTVLMRFKLL DSFEGLLERE IIQADLEKKN   540
ADLLNAYAMD LKQVQEIFVM FKDNPPINDN APPRAGAVTW VRGLMERVQE PMNRLKAMNK   600
SILDSEEGKE CIRAYSAILN SLQEYERQHR EEWALEIENT SAEKLKQSLL RRDPNTSWVY   660
VNFDPALVCL LRETKYFLLL GLEVPQSAMT IYSMAETYRQ QTGNLDLIVN IYNSILDTVL   720
DVERPLIQQK LDGIDKVLLK GIKHLNWKSN QISDFITQAM SLVKETKAIL TDIKSNVAST   780
QSVLNTWSEN VMMDRKAGKT YTVDEFQQIH FTLMKNREKT IEKGGEELHK HLDASNKVIA   840
IARSKPEWLA YVDYVNDIII EGLVVAIVKS AQYLRNQMDK EEIANQELAP LIEVNMKLVG   900
KSVNFEPPMV SQASGKGIRH MVTGWLNQFI NISTLVARID TPEEDGDYLA DMHENGSIRL   960
VTSQVSRNLA ISEKDCEDFR QEFEKFSFLW RHDINEVFNL FLNGEDPPEF HKWQKFAEGA  1020
SSPSLEAFEE QIIKYKSLED QVRELPATKI VGWLKIDARP VKNSLTTLIS KWSYKFLEFL  1080
LNRVITSVQD VSSFVESTDQ KLDMEVSDQD SLLEVMNFLT QIRRRTEAID NMFEPLRQTC  1140
SLLKKYNITV EEPVLEALDH VPQAWDKLKK KAVVTKEKHS KAQTAEAEKL KKKSKEFEER  1200
VAEFFAYFKK EMPFTFTEQY DISYATIDRV HHGPRDEKNQ LGSVRELITD AKRLNELQEL  1260
FELYVIEYRE AMQCEKEARM LKELWDMISM IVDTFNLWKK TRWDEIDVEY LSEACKTLAK  1320
HVKGMNKNLK NWPGYKGLED TVKNMQTSLP LVEELHHPAM RERHWKQLMK TTGVSFVMDE  1380
NFSLGALLAL KLHQFEDDVM EIVDKAQKEL TIEKQLKKIE DTWKVQSFSF DLQPDGEMYL  1440
LRIDESVMEC LEDSVVQLGN LQGSKYVQNN ASFLEIVSNW QRKIGNVDSV IQAWTEVQKK  1500
WQNLQSIFVG SADIRVQLPE DSKRFDSVDV EFKDLMKEAA NVTNAVDSCN VDGRLEKIEL  1560
MLAGLEKCEK ALADYLETKR LAYPRFYFVA SADLLDILSK GSNPQLILKH LPKCFDNIST  1620
LEFKKDGDGH PTKDSIGMYS GEKEYVTFHA PFTCEGPVET WLFNLTVHTH ECLKHILVEA  1680
VGVYEEKPKT EFVFDWCAMI AGTCSRIAYR EEVDFTFEQM EEGNEQAMRE YSGKQIEQLT  1740
SFVTLILGEL SSNDRKKIVM IVTVDVHARD VVLELIETKA ENNQCFLWMS QLKFSLDEKT  1800
HLGKINICDY ECFFGYEYIG NCGCLVVTPL TDRCYITLTQ AMRLILGGAP AGPAGTGKTE  1860
TTKDLGRALG IMVYVFNCSD QMDYKSMGQI FKGLAQAGAW GCFDEFNRID ISVLSVTSTQ  1920
YKCILDAIRA KKRRFIFEDD DIPLNHEPFC CGFITMNPGY AGRTELPESV KALFRPCAMV  1980
TPDMDLIAEI MLMSEGYAEG KVLARKFMIL YRLSEALLSA QRHYDWKLRA VKTTLAVAGG  2040
LRRLDRENTE DKVLLRALRD FNLGKLVADD VGIFMGMLND LFPRSLDLVP RNRDLDFEAK  2100
LAEAATLLRL QPEDIFILKC TQLREILVVR WSVFVLGPAG CGKSEMLKTL SKAQNIFGEK  2160
STMNYLNPKS VTRNELYGYI HPATREWKDG LISQIFRDLA NTTTVKHEWI VLDGDIDAEW  2220
IESMNTVMDD NKTLTLASNE RIPLTPPMRL LLEIENMREA SPATVSRGGV IFLNDTDVGW  2280
APYIASWVQN RELEGERTTL TKLFEEYMEK TINFVFRNFR MICPLPKINA AMTTCYILEG  2340
VLGNGESFAQ HQKQLGPEDG LKLIEQVFIY ALMWGAGGAL THDKSGDYKE QFNKWFRTEF  2400
TKCPFPEEGS IFDYVVDMEN NCWQHWNDRI AAYVHQPGIV FGNIYVNTLQ TQRITAILDL  2460
VVPYRRPVCF VGSAGTGKTT IMKDKLRNID SEQFLTLGIN FNSFTDSMIA QMAMESILEK  2520
KTGRVFGPPG SKRLIYFIDD LNMPVVDKYG TQQPIALLRQ AFDYAAWYDR IKLQPKEIQK  2580
VQFLSCMNPT AGSFVIDPRL QRSYMTFSVM MPSNEVLMSI YSAILKGHLS NGFAPEVLKW  2640
AENTVQATID LYTQVSKIFL PTAVKFHYLF NLRELANVMQ GVCRSRPQTC GSGVLLARLW  2700
YHECERVFLD RMIAEDDVDK FKQILNDTNK KYWSELNQEK LMERPNYFTT FAVPGVDDSD  2760
RPYANISDID KLCGIVEDRL REYNESNAVM NLVMFEQAVE HVCRITRVIE LPRGNALLVG  2820
VGGSGKQSLA RLSAFIVGYE SYQITVTSSY GMADFKENLM GLYIKSGVKG IGMAFILTDG  2880
QIVNEKFMVL INDFLASGNI ADLMPKEEKD NCANAVRGEV KQAGLIDTAE NCWDFFIEKV  2940
RKNLHMVLCF SPVGDSFRIR ARQFPALVND TVYDYFMGWS QEALMKVANR FIKEVEAITS  3000
EEGLQNNVAL HMAHVHRSVE NCSLEFFDAE RRYNYTTPKS FLDLISLYKS MLAAKKEGIK  3060
VLRERLENGL EKMNSAAEQV AELQENLVKD MAVVEAKKAA TDELIVIVGQ ETAVAEEQKA  3120
AAAIEEEKCS KIAEEVMAFQ AECDKDLAAA EPVIQEAEAA LNTLDKKSLT ELKALSSPPA  3180
GVDDVTSAVM VLMGGGKIPK DLSWNAAKKM MGNVDQFLNS LINFDKDNTP EIACTWCENN  3240
VINKPYFNTA TMQAKSQAAA GMTSWVINIC KYFRIYQYVE PKRKLLNEAN QRLEEANTKL  3300
AAVRKQVAEL EEKLADLTRQ FEEATQEKNE AIAAAEKTQN KANMADRLVN GLADEKVRWS  3360
QSIERFGVQE RNYVGDVMIA SAFVSYIGAF NLAFREKLVN TLWITDMIEK QIPMTEGIQP  3420
LDMLCDSATV AIWNSEGLPT DTVSTQNGAI MTNCQRWPLM IDPQLQGIKW IKNKYTKEVK  3480
EMKVVQQTQD KYINFIELAM SNGEPIMIEN VSESIDAVLE PVMMRAVIRR GRALVIKLGD  3540
KEVEYDENFR LFLQTKLSNP HYKPEIAAQT SLINFMITLD GLEEQLLNKV VEKERPDLGA  3600
QKAQLVEDQN GFNIKLKQLE DDLLYSLSNS QGDILEDIAL IEGLEQTKIT STEIKQKQEL  3660
GKITEQEIAT AMESYRAVAI RGALMYFLVD QLWVLSHMYR FSMANFVTMF KKGMDNADIY  3720
EEGETPPDQP EEGTTMTPAQ LKSRVERLID KSCYTVFQYI SQGLFERHKL IFAAQLCFRV  3780
LARKGELDQR MFEFLIRAPK NLSSDNPLKE WLDDAAWGAI AALQDITEPV NFASLPEEMV  3840
SSAKRFREWY ELERPEDAGL PGDWKKLAEF PKLLIIRCLR TDRMGEALST FVRKEMGEKY  3900
VTSVPFSLPR SFEDAAPDTP IFFILSPGVD PVKDTEEIGK KYGISYDAGN FGLVSLGQGQ  3960
EPVAEKIVET AYKNGGWGFL QNIHLTPRWT AGWLEKRCDD LSSAHQDFRL FLSAEAAMLP  4020
INILQVCIKL TNEPPEGLKP NLLKALLPFD DAFYEQCSKV GELRSITFMT CFFHAIILER  4080
RKFGPQGWNV LYPFNMGDLI SCAQVCVNYL EANTKIPWAD LRYIFGEIMY GGHVTNNFDR  4140
RLVSSYLESY LTPELLDGFQ IYPGFTTPSN TLNTKQTIEY VQEVMPQESP IAYGMHPNAE  4200
IGFRFSQAEA MFQSIVELMP RSAAGGGGMT LQDKAKAALD DTLEKLPDQF VMVDILERIE  4260
ERTPYVNVFL QEIERMVILT TEIRRSLIEL DQGLKGDLQI TDKMEKLMTS LAENKVPESW  4320
ENIAYPSKRP YSSWLINLLD RQKQLDVWTG ELGLPKCTWI SGLFSPQSFL TAVMQTTARR  4380
NEWALDRTVN QTEVTRFMEP SQIPGFNKEG AYVNGLIMEG ARWDEKTGTI EDSRPKELFA  4440
KMPVVLIKAV PADKVETGVY QCPVFKTQIR GAGGGKDTFV FLAGLKTKQK PSKWIQAGVA  4500
LLMDVVI                                                            4507

Coding Sequence

Download: Fasta
ATGGGAGACG AGTCACATTC GAAGCCCTCT GATCCCCGAA TCGATTACAT CAGCGAACAG    60
ACTTCCACGC TGCTTCCCTG TAAACTCGAA ATCGCTCAGC GAATCAAGGA CGATGAAAAA   120
GGTGTAAAGC TGCTCAATGG CTTTTTCGAT GATCCCTCAG AGAAATTGCT TTGGATATAC   180
TTGCAATCGA AAGATCAATG CGTTGTGACA GCTTCTCCGC CTCAAAATTT CAAAACAAAG   240
GCGATCTATT TGTTGAAGAG CAATGGATCC GTTTCCAAGG AAAACTTTGT TCAAGATATG   300
AGTTTCTGTG AATTTGTTGG GCAGCCACTT GAGCAATTCT TGGCTATCTC GCAAGAAGTT   360
TATTTTCCTC TTTTGACTAA CCCAAGGAAC CAAGAAGGCT GGCCAGAAGT GATCACGAAA   420
GAGGTGACCG AAAGTATTCA TCGCTTTCTC GCGCAAACGT ACATCACTAT CGGCAACACG   480
CAGGGGAAAA CCCTTTTGCC TTTACCACCA AATCATGGTG CAGCGCTGAT GGATGCAGCT   540
GGAAGGGACA AAGATAGAAT TCATGTTCTG GAATCTGCAG TTGTCATGTG GACCAAGCAA   600
ATCAAGAATA TTCTGAAGAT GGATCCTGAA AGTGCGCTCA AGCAAGGAAA TCCTGGACCA   660
GAAGTCGAAC TAGAATTTTG GAAAAACAAG GCAGAGAATT TGAACTCTTT GCACAATCAA   720
CTTGTTGGTG AAAAGATTAG GAAAGTTGTC AAGGTCCTGG AGGTCACCAA AAGCACCTAT   780
TTTCCCGCGT TCAATCGATT GTGCAAAGAA GTAGCACAGG CAAGGATGGA GGCAAACGAT   840
AATCTTGTGT ACTTGGAGCC GATGGTCAAA TATGTCAACA TGCTTGGAGC TGAACAATTC   900
CAAGATTTAA TTCCACATTT CAAACCGACC ATGCATACAC TTCTTTTGAT ATGGAAGAAT   960
TCAAAATTTT ACAATACGCC AGCTCGTTTT GTTGTCATTT TGAGAGAATT ATGTAACACT  1020
TTGATCTCAC AAGCCTGCAA CTTTGTATCT GGTGCGACTC TTTTCGAGAA TATTGAAGAA  1080
CCTCAAGACG CTGTTGATAA TCTAAAGCTG TGCTTGAAGG TTGGGGTTGC ATTTAAAAGC  1140
GCATACTTCG ACTACAAGAG CAAAGCAAAC ATAGAATGTC CTTCAAATCC ATGGAGATTT  1200
CAGAACTCAG CTCTTTTTTC ACGATTAGAT TCATTCCTCG AACGTTGCCA CGATGTTTTG  1260
GATCTGATGC AGACCATTTT GCAATTTAAC AAGCTAGGCG GTGATCGTGG TGTTGAAATT  1320
GGTGGAACTA AAGGAAAAAT GTTGACGACC AGCGTTCGAC AGACTCACGT TGATTTCCTT  1380
CAGTCTATCA AAGTATTTAT TGAAGCAAAA TATGATATCA TGGACGTTGA GTCCAAGCAG  1440
TTTGATGATG ACTTTTACAC TTTTCGCTGC CAGATCAATG AACTTGAGAG GAGGCTTGCT  1500
TCAGTCATCA CACAGGCTTT CGATGACAGC GTGACAGTTC TAATGAGGTT CAAGTTGTTA  1560
GACAGCTTTG AGGGTTTGTT GGAAAGGGAA ATCATACAGG CAGACCTTGA AAAGAAGAAT  1620
GCAGATCTGT TGAATGCCTA CGCCATGGAT CTAAAACAAG TTCAGGAGAT ATTTGTCATG  1680
TTCAAGGACA ACCCTCCTAT CAATGATAAT GCTCCTCCAA GAGCCGGAGC GGTGACGTGG  1740
GTGCGAGGAT TGATGGAGAG AGTTCAGGAA CCGATGAATC GACTAAAGGC TATGAACAAG  1800
TCAATACTTG ATAGCGAGGA AGGAAAAGAG TGCATTCGTG CCTACTCTGC TATCTTGAAT  1860
AGTCTCCAGG AATACGAACG CCAACACCGA GAAGAATGGG CTTTGGAGAT TGAAAACACT  1920
TCTGCTGAGA AACTGAAGCA GTCTCTTTTG AGACGTGATC CGAACACATC TTGGGTGTAC  1980
GTCAACTTTG ATCCTGCCTT AGTCTGTCTT CTCCGTGAAA CCAAATATTT CTTGTTGTTG  2040
GGCTTGGAGG TCCCTCAATC TGCAATGACG ATATATTCTA TGGCGGAAAC TTATCGTCAA  2100
CAGACTGGAA ATCTTGATTT GATTGTCAAC ATTTATAACT CAATCCTTGA CACGGTTCTT  2160
GATGTTGAGA GGCCGTTGAT CCAGCAGAAG CTTGATGGGA TCGACAAAGT TCTTCTTAAA  2220
GGTATAAAAC ATCTCAATTG GAAAAGCAAC CAGATCTCGG ATTTCATCAC TCAGGCAATG  2280
TCATTGGTAA AAGAGACCAA GGCTATTTTG ACCGATATTA AGAGCAACGT GGCATCAACC  2340
CAATCTGTTC TAAATACATG GTCTGAAAAT GTTATGATGG ATAGGAAAGC TGGAAAGACC  2400
TACACTGTGG ACGAGTTTCA ACAAATTCAC TTCACCCTTA TGAAGAATAG GGAGAAAACT  2460
ATTGAGAAAG GTGGCGAAGA GCTTCACAAG CACTTGGATG CATCGAACAA AGTTATTGCA  2520
ATTGCACGGA GCAAGCCGGA ATGGCTTGCC TATGTTGATT ACGTGAATGA CATCATCATT  2580
GAAGGGTTGG TTGTTGCAAT TGTCAAATCT GCGCAATACC TGCGAAATCA GATGGACAAA  2640
GAGGAAATTG CAAATCAGGA ACTGGCTCCT CTTATTGAAG TCAACATGAA GCTGGTTGGG  2700
AAAAGTGTGA ATTTTGAGCC TCCAATGGTA TCTCAGGCAA GTGGAAAGGG AATCAGACAT  2760
ATGGTCACTG GATGGCTGAA TCAGTTTATC AATATTTCCA CACTAGTAGC TCGTATTGAC  2820
ACACCAGAAG AGGATGGCGA CTATCTTGCG GATATGCATG AGAATGGTTC TATTCGTCTT  2880
GTAACCAGTC AAGTGAGTAG AAACCTTGCA ATTAGCGAAA AAGATTGTGA AGACTTTCGT  2940
CAAGAATTTG AAAAGTTTTC TTTCTTGTGG AGGCATGACA TCAACGAAGT TTTCAATCTG  3000
TTCTTGAATG GCGAAGATCC TCCAGAGTTC CATAAATGGC AGAAATTTGC CGAAGGTGCA  3060
TCATCTCCAT CGTTGGAAGC TTTTGAGGAG CAGATCATCA AGTACAAGAG TCTTGAAGAC  3120
CAAGTAAGGG AGCTTCCTGC TACCAAAATT GTCGGATGGC TCAAAATTGA TGCTCGCCCT  3180
GTCAAGAATT CATTGACAAC ACTTATCTCC AAGTGGTCGT ATAAATTCTT GGAGTTTCTT  3240
CTAAACAGAG TAATCACCTC GGTGCAAGAT GTATCCTCAT TCGTTGAATC AACAGATCAG  3300
AAATTAGACA TGGAAGTCAG TGATCAAGAC TCTTTGCTTG AAGTGATGAA TTTTCTCACT  3360
CAAATTCGTC GAAGAACCGA GGCCATCGAC AACATGTTTG AGCCTTTGAG ACAGACTTGT  3420
TCATTGCTAA AGAAGTACAA CATTACAGTG GAAGAACCTG TTCTTGAAGC CTTGGATCAT  3480
GTACCTCAGG CATGGGATAA GCTGAAGAAG AAAGCTGTAG TTACCAAAGA GAAGCACAGC  3540
AAAGCTCAAA CAGCAGAAGC AGAAAAACTA AAGAAGAAGT CCAAAGAATT CGAGGAGCGT  3600
GTTGCGGAGT TCTTTGCATA CTTCAAGAAA GAGATGCCTT TCACTTTCAC AGAACAATAC  3660
GACATTTCGT ACGCTACAAT TGATCGTGTT CATCATGGAC CGAGAGATGA AAAGAATCAG  3720
TTGGGAAGTG TGCGAGAGCT TATCACTGAT GCAAAGAGAT TGAATGAGCT GCAGGAGTTG  3780
TTTGAGCTTT ATGTCATCGA GTATAGAGAG GCCATGCAGT GCGAGAAGGA AGCACGAATG  3840
TTGAAGGAAT TGTGGGACAT GATTTCGATG ATAGTTGACA CATTCAATCT GTGGAAGAAG  3900
ACGAGATGGG ATGAAATTGA TGTAGAGTAC CTCAGTGAAG CATGCAAGAC TCTTGCAAAG  3960
CATGTGAAAG GGATGAACAA GAATTTAAAG AACTGGCCAG GATACAAGGG TTTGGAGGAT  4020
ACTGTGAAGA ACATGCAGAC ATCGCTACCT CTGGTTGAGG AGCTGCACCA TCCTGCTATG  4080
CGCGAAAGGC ATTGGAAGCA GTTGATGAAG ACAACCGGCG TCTCATTTGT GATGGATGAA  4140
AACTTCAGCT TGGGAGCTCT GCTTGCTCTT AAACTCCATC AATTCGAAGA TGATGTCATG  4200
GAAATTGTAG ATAAAGCTCA GAAAGAATTG ACGATTGAAA AACAGCTCAA GAAAATTGAA  4260
GATACTTGGA AGGTTCAGAG CTTCAGCTTT GATTTGCAAC CAGATGGAGA AATGTACCTC  4320
TTGAGAATTG ACGAAAGTGT AATGGAGTGT CTCGAAGATA GCGTAGTACA GCTTGGAAAT  4380
CTTCAAGGGA GCAAGTATGT TCAGAACAAT GCTTCATTCC TTGAGATCGT CTCGAATTGG  4440
CAGCGCAAAA TTGGTAATGT TGATAGTGTT ATTCAGGCCT GGACTGAGGT TCAAAAGAAA  4500
TGGCAAAATC TGCAGAGTAT TTTTGTTGGA TCGGCAGATA TCAGGGTACA ACTGCCTGAA  4560
GACTCAAAGC GCTTTGACAG CGTAGATGTA GAATTCAAGG ATCTCATGAA AGAAGCAGCA  4620
AATGTAACCA ATGCGGTAGA TTCTTGCAAT GTCGATGGAC GATTAGAAAA GATCGAACTT  4680
ATGCTTGCTG GATTAGAAAA GTGCGAGAAA GCGCTCGCAG ATTATCTGGA GACAAAAAGG  4740
CTCGCATATC CCAGGTTCTA CTTTGTTGCA TCAGCAGATT TGTTGGATAT TCTGAGCAAG  4800
GGAAGCAATC CGCAGCTGAT ACTCAAGCAT TTGCCGAAAT GCTTCGACAA CATCTCAACT  4860
CTGGAGTTCA AGAAGGATGG AGATGGTCAT CCAACGAAGG ATTCAATTGG AATGTATTCA  4920
GGAGAAAAGG AATATGTCAC ATTTCATGCT CCCTTCACGT GCGAGGGTCC GGTTGAGACA  4980
TGGTTGTTCA ATTTGACTGT CCATACACAT GAATGCCTGA AGCATATTCT CGTGGAGGCA  5040
GTCGGTGTAT ATGAAGAAAA GCCGAAGACA GAATTCGTCT TTGATTGGTG CGCGATGATT  5100
GCTGGAACGT GTTCGCGTAT AGCTTACCGC GAAGAAGTGG ATTTCACTTT TGAACAGATG  5160
GAAGAAGGAA ACGAGCAGGC CATGCGCGAG TATAGCGGCA AACAAATCGA GCAGTTGACT  5220
TCCTTTGTCA CTCTGATCCT GGGCGAACTC AGCTCGAATG ATCGCAAGAA GATTGTCATG  5280
ATTGTCACAG TCGATGTGCA CGCTCGTGAT GTGGTACTGG AACTGATCGA AACAAAGGCA  5340
GAAAACAATC AATGCTTCCT GTGGATGTCC CAGCTCAAAT TCAGCTTAGA TGAAAAGACT  5400
CATTTGGGAA AGATCAATAT CTGCGACTAC GAGTGTTTCT TTGGCTATGA GTACATTGGC  5460
AATTGTGGAT GTCTTGTCGT CACACCACTT ACTGACCGCT GTTATATCAC TCTGACCCAG  5520
GCTATGCGAC TCATTCTCGG AGGTGCTCCT GCAGGACCCG CTGGAACTGG AAAAACAGAG  5580
ACAACCAAAG ATCTTGGACG AGCTCTGGGT ATTATGGTGT ACGTGTTCAA TTGCTCGGAT  5640
CAGATGGACT ATAAGAGCAT GGGCCAGATC TTCAAAGGGC TAGCACAAGC AGGGGCTTGG  5700
GGTTGCTTTG ACGAGTTCAA TCGGATCGAC ATTTCCGTTC TCTCCGTGAC GAGCACACAA  5760
TACAAATGCA TCCTAGATGC TATTCGCGCA AAGAAACGTC GATTTATCTT CGAGGACGAT  5820
GACATCCCCT TGAATCATGA GCCTTTCTGT TGCGGATTCA TTACTATGAA TCCCGGTTAT  5880
GCGGGTAGAA CTGAATTACC AGAAAGTGTA AAAGCGTTGT TTCGTCCCTG TGCAATGGTT  5940
ACTCCGGATA TGGATCTCAT CGCAGAAATT ATGCTGATGT CAGAAGGGTA TGCGGAAGGA  6000
AAGGTCTTAG CGCGAAAGTT TATGATCTTG TACCGGTTGA GTGAGGCCTT GCTGTCAGCC  6060
CAGCGTCACT ATGATTGGAA ATTGAGGGCT GTAAAGACGA CACTTGCTGT TGCTGGTGGA  6120
TTACGAAGAT TGGACCGTGA GAATACAGAG GACAAAGTGC TTCTGAGGGC TCTCCGTGAT  6180
TTTAATCTGG GAAAGCTTGT GGCGGATGAC GTTGGAATTT TCATGGGTAT GCTCAACGAT  6240
TTGTTTCCTC GTTCTCTCGA CTTGGTGCCA AGAAATAGGG ACCTTGACTT CGAAGCAAAG  6300
TTGGCAGAGG CTGCCACTTT GTTGCGATTG CAACCAGAGG ACATTTTTAT TCTGAAGTGC  6360
ACGCAATTAC GTGAAATCTT GGTCGTCCGT TGGTCAGTTT TTGTGCTTGG TCCAGCAGGT  6420
TGCGGTAAGT CAGAGATGTT GAAGACTTTG TCCAAGGCTC AGAATATCTT TGGGGAGAAG  6480
AGTACAATGA ACTATTTGAA TCCCAAGTCA GTAACCAGAA ATGAACTGTA TGGTTACATT  6540
CATCCAGCCA CCAGAGAATG GAAGGATGGT TTGATTTCCC AGATTTTCCG TGATTTAGCC  6600
AACACTACAA CTGTTAAACA TGAATGGATT GTTTTGGACG GAGACATCGA TGCTGAATGG  6660
ATTGAGTCGA TGAACACTGT TATGGACGAT AACAAGACCC TGACGCTGGC AAGTAATGAA  6720
AGGATTCCCC TTACGCCGCC AATGAGGTTG CTCTTGGAGA TCGAAAACAT GCGCGAGGCT  6780
TCACCCGCGA CCGTTTCTCG TGGTGGAGTG ATTTTTCTCA ACGACACAGA TGTCGGGTGG  6840
GCACCATACA TTGCTTCCTG GGTTCAGAAC CGAGAGCTTG AAGGTGAACG TACAACCCTA  6900
ACTAAGCTCT TTGAGGAATA CATGGAAAAG ACCATCAATT TTGTTTTCCG TAACTTCAGG  6960
ATGATATGTC CGCTGCCGAA GATTAATGCA GCGATGACAA CATGCTATAT CCTTGAAGGA  7020
GTTCTTGGAA ATGGTGAATC GTTTGCTCAA CATCAGAAGC AGCTAGGACC TGAAGATGGA  7080
TTGAAGCTCA TTGAGCAAGT GTTCATCTAT GCACTCATGT GGGGAGCTGG TGGTGCTCTC  7140
ACGCATGACA AATCTGGAGA TTACAAGGAA CAGTTCAACA AGTGGTTTCG CACTGAATTC  7200
ACTAAATGTC CATTTCCAGA AGAAGGATCT ATCTTTGATT ACGTTGTCGA CATGGAGAAC  7260
AACTGCTGGC AGCATTGGAA TGACAGGATT GCTGCCTATG TGCATCAACC AGGAATCGTG  7320
TTCGGAAACA TCTATGTCAA TACTTTGCAA ACTCAACGCA TCACAGCTAT TCTTGATTTG  7380
GTTGTTCCGT ATAGGAGGCC TGTATGTTTT GTTGGTAGTG CAGGAACGGG CAAGACAACA  7440
ATCATGAAAG ATAAATTGCG AAACATTGAT TCCGAACAAT TCCTCACCTT AGGAATCAAC  7500
TTCAATTCCT TTACTGACTC CATGATTGCA CAAATGGCAA TGGAAAGTAT TTTGGAAAAG  7560
AAGACTGGCA GAGTGTTTGG TCCACCAGGA TCGAAACGGC TAATTTACTT CATTGATGAC  7620
CTGAACATGC CTGTGGTGGA CAAGTATGGA ACACAGCAAC CGATAGCTCT GCTCAGACAG  7680
GCCTTCGACT ATGCTGCATG GTACGATAGG ATCAAGTTGC AACCAAAAGA AATCCAAAAA  7740
GTGCAATTCC TCTCATGCAT GAATCCAACT GCCGGTTCCT TTGTCATCGA TCCCAGGTTG  7800
CAACGATCGT ATATGACCTT TTCTGTAATG ATGCCTTCGA ACGAAGTTCT TATGAGCATC  7860
TACTCTGCTA TTCTTAAAGG GCATCTTAGC AATGGTTTTG CTCCCGAGGT GCTAAAGTGG  7920
GCTGAAAATA CTGTGCAAGC AACAATAGAT TTGTACACGC AGGTTTCAAA GATCTTCCTA  7980
CCTACTGCCG TGAAGTTTCA TTATCTCTTC AATCTCCGTG AGTTGGCAAA CGTCATGCAA  8040
GGAGTTTGTC GTTCCAGGCC TCAGACTTGC GGCTCAGGAG TTCTTCTTGC TCGCTTGTGG  8100
TACCATGAAT GTGAGAGAGT TTTCCTCGAT CGAATGATCG CTGAAGATGA TGTGGACAAA  8160
TTTAAGCAGA TACTTAATGA CACCAACAAA AAGTATTGGT CCGAATTAAA TCAGGAGAAG  8220
TTGATGGAAA GGCCAAACTA CTTCACCACG TTCGCTGTAC CTGGAGTAGA CGATTCTGAT  8280
AGGCCATATG CAAACATCAG TGACATTGAC AAGCTTTGTG GAATTGTGGA AGATCGATTG  8340
AGGGAATACA ACGAAAGCAA TGCAGTGATG AATCTTGTTA TGTTCGAACA AGCTGTGGAG  8400
CATGTCTGCA GAATTACACG AGTTATTGAA TTGCCTAGAG GAAATGCCTT ACTCGTTGGG  8460
GTAGGTGGAA GCGGCAAACA AAGTCTTGCA AGACTTTCTG CCTTTATTGT TGGTTACGAG  8520
TCATATCAGA TAACTGTCAC CTCTTCATAT GGAATGGCTG ATTTCAAGGA GAATTTGATG  8580
GGGTTGTACA TTAAGTCTGG TGTGAAGGGA ATCGGGATGG CTTTCATCCT AACAGATGGA  8640
CAGATCGTGA ACGAGAAATT CATGGTTCTC ATAAATGACT TCTTGGCTTC GGGCAATATT  8700
GCAGACCTGA TGCCGAAAGA AGAAAAGGAC AACTGTGCCA ACGCTGTGCG AGGTGAAGTC  8760
AAACAGGCAG GATTAATAGA CACAGCTGAG AATTGTTGGG ATTTCTTCAT CGAGAAGGTG  8820
AGAAAGAACC TTCATATGGT CCTGTGCTTC AGCCCTGTCG GTGACTCCTT CCGCATTCGC  8880
GCTCGACAAT TTCCTGCCTT GGTAAATGAC ACTGTTTATG ACTACTTCAT GGGTTGGTCT  8940
CAAGAAGCCC TCATGAAAGT GGCAAATAGA TTCATTAAGG AGGTTGAGGC GATCACATCT  9000
GAGGAAGGGC TTCAAAACAA CGTTGCATTA CACATGGCGC ATGTCCATCG TTCTGTCGAG  9060
AATTGCTCTC TTGAATTCTT CGATGCGGAG AGAAGATACA ATTATACAAC CCCAAAATCA  9120
TTCTTAGACC TAATTTCCTT GTACAAAAGC ATGTTAGCAG CAAAGAAGGA AGGCATCAAA  9180
GTTCTTCGTG AGCGTCTAGA GAACGGACTT GAGAAGATGA ATAGTGCTGC CGAGCAGGTA  9240
GCCGAGCTCC AGGAGAATTT GGTGAAAGAC ATGGCAGTTG TTGAAGCAAA GAAAGCAGCT  9300
ACCGATGAGC TCATAGTTAT TGTTGGTCAG GAGACCGCGG TTGCGGAGGA GCAAAAAGCT  9360
GCTGCTGCCA TCGAAGAGGA GAAATGCAGC AAGATTGCAG AAGAGGTTAT GGCATTTCAG  9420
GCTGAGTGCG ATAAAGATCT GGCAGCGGCT GAGCCCGTTA TCCAGGAGGC AGAGGCAGCT  9480
TTGAACACGT TGGACAAGAA GTCGTTGACA GAATTGAAAG CGTTGTCCTC CCCTCCAGCT  9540
GGAGTCGATG ATGTCACAAG TGCTGTCATG GTATTGATGG GAGGTGGAAA GATCCCGAAG  9600
GACTTGAGCT GGAATGCTGC CAAGAAGATG ATGGGCAACG TTGATCAATT CTTGAATTCT  9660
TTGATCAACT TTGACAAGGA CAACACCCCT GAGATTGCTT GCACGTGGTG TGAAAATAAT  9720
GTTATAAACA AACCTTACTT CAATACTGCA ACAATGCAAG CCAAGTCGCA GGCTGCTGCT  9780
GGGATGACCT CCTGGGTTAT TAACATCTGC AAGTATTTCC GAATTTATCA GTACGTCGAA  9840
CCAAAACGAA AGTTGCTGAA TGAAGCCAAC CAGCGTTTAG AAGAGGCCAA CACCAAACTT  9900
GCAGCTGTTC GAAAGCAAGT TGCTGAGCTG GAAGAGAAGC TTGCTGACCT TACAAGACAA  9960
TTCGAAGAAG CAACACAAGA AAAGAATGAG GCCATCGCAG CAGCAGAAAA GACGCAGAAC 10020
AAGGCAAACA TGGCAGACAG GTTGGTGAAC GGCTTGGCTG ACGAGAAGGT CAGATGGTCG 10080
CAGTCTATTG AGAGGTTTGG AGTCCAGGAG CGCAACTACG TCGGAGATGT GATGATTGCC 10140
TCTGCTTTTG TTTCTTACAT TGGAGCTTTC AATCTTGCTT TCAGAGAGAA GCTCGTCAAT 10200
ACTCTTTGGA TCACAGACAT GATCGAGAAG CAGATCCCGA TGACCGAAGG CATTCAACCT 10260
CTCGACATGC TTTGTGATTC TGCAACCGTT GCTATCTGGA ATAGTGAAGG CCTCCCGACC 10320
GATACTGTCT CAACGCAAAA CGGCGCCATC ATGACAAATT GTCAAAGATG GCCTCTCATG 10380
ATTGATCCTC AGTTGCAAGG AATCAAGTGG ATTAAGAACA AGTACACTAA GGAGGTGAAA 10440
GAAATGAAAG TTGTTCAACA GACTCAAGAC AAATATATCA ATTTTATTGA GCTTGCAATG 10500
AGCAATGGAG AACCAATCAT GATCGAGAAT GTCTCTGAGT CCATCGATGC CGTGCTTGAA 10560
CCTGTCATGA TGCGTGCTGT GATTCGAAGG GGAAGAGCGC TTGTCATCAA ACTTGGAGAT 10620
AAGGAAGTTG AATACGACGA GAACTTCAGG CTGTTCTTGC AAACGAAGCT GAGCAATCCG 10680
CATTACAAAC CAGAAATTGC TGCTCAAACC AGTTTGATCA ATTTCATGAT CACTCTCGAT 10740
GGTCTTGAAG AGCAGCTGCT GAACAAAGTC GTTGAGAAAG AAAGGCCAGA TTTGGGTGCG 10800
CAGAAAGCGC AGTTGGTAGA AGATCAGAAT GGCTTCAACA TCAAGCTGAA GCAGTTAGAA 10860
GATGATTTGC TTTACAGCTT GAGCAATTCG CAGGGTGATA TTTTGGAGGA TATTGCACTC 10920
ATTGAAGGCC TCGAACAAAC CAAAATCACT TCGACTGAGA TCAAACAGAA GCAAGAATTA 10980
GGCAAGATCA CGGAGCAGGA GATTGCCACG GCAATGGAAA GTTACAGAGC AGTTGCCATT 11040
CGAGGTGCAC TAATGTACTT CCTGGTCGAT CAACTTTGGG TTTTGTCGCA CATGTACAGA 11100
TTTTCCATGG CAAATTTCGT CACGATGTTT AAGAAAGGAA TGGACAACGC AGACATTTAT 11160
GAGGAAGGAG AGACTCCGCC GGATCAGCCG GAGGAGGGAA CCACGATGAC TCCTGCTCAA 11220
CTGAAATCAA GAGTTGAACG TCTGATAGAC AAGTCTTGCT ATACAGTCTT CCAGTATATT 11280
TCCCAAGGCT TGTTTGAAAG GCACAAGCTC ATCTTTGCTG CTCAACTTTG CTTCCGTGTG 11340
CTGGCAAGGA AGGGTGAGCT GGATCAACGA ATGTTTGAAT TCTTGATTCG AGCACCTAAG 11400
AACTTGTCTT CGGACAATCC ATTGAAGGAA TGGTTAGACG ATGCTGCCTG GGGAGCTATT 11460
GCGGCTCTTC AGGATATCAC CGAACCAGTC AACTTTGCAT CGTTGCCGGA GGAGATGGTC 11520
TCGTCCGCAA AGCGATTCAG GGAGTGGTAT GAGCTCGAGA GGCCCGAAGA CGCCGGACTA 11580
CCGGGAGACT GGAAGAAGCT CGCTGAGTTC CCGAAGCTGT TGATCATTCG CTGCCTCCGC 11640
ACAGACAGGA TGGGAGAGGC CTTGTCAACG TTTGTGAGGA AGGAGATGGG AGAAAAGTAC 11700
GTCACTTCTG TTCCTTTCAG TCTCCCAAGA TCGTTCGAGG ATGCTGCGCC AGACACGCCG 11760
ATCTTCTTCA TTTTGTCTCC TGGTGTCGAC CCTGTGAAGG ACACAGAAGA GATAGGAAAG 11820
AAGTACGGGA TCTCATATGA TGCTGGTAAT TTCGGACTTG TTTCCCTGGG ACAAGGACAA 11880
GAGCCTGTTG CAGAGAAGAT TGTCGAGACA GCTTACAAGA ATGGTGGATG GGGATTTTTG 11940
CAGAATATTC ACCTGACACC AAGATGGACA GCCGGCTGGC TTGAGAAGCG ATGTGATGAT 12000
TTATCTAGTG CTCATCAGGA TTTCAGGCTC TTTTTATCTG CCGAAGCTGC TATGCTACCT 12060
ATCAACATTC TTCAAGTTTG TATCAAGTTG ACGAACGAAC CCCCAGAAGG GTTGAAGCCT 12120
AACCTCCTCA AGGCTCTGCT TCCGTTTGAT GATGCATTCT ATGAACAATG CTCCAAGGTT 12180
GGTGAACTGC GCAGCATCAC ATTCATGACG TGCTTCTTCC ATGCCATCAT TCTTGAAAGG 12240
AGAAAGTTCG GCCCACAAGG ATGGAACGTG TTGTATCCTT TCAACATGGG CGATCTCATC 12300
AGCTGCGCGC AAGTCTGCGT CAACTACCTC GAGGCAAATA CCAAGATTCC ATGGGCTGAC 12360
TTGCGATACA TCTTCGGAGA AATCATGTAC GGTGGACACG TAACAAATAA CTTTGATCGT 12420
CGGCTTGTCA GCTCGTACCT CGAGTCCTAC CTCACTCCAG AGCTCTTGGA TGGCTTCCAG 12480
ATCTACCCGG GTTTCACGAC TCCGTCCAAC ACGCTCAACA CCAAACAAAC GATCGAATAC 12540
GTGCAGGAGG TGATGCCGCA AGAGAGTCCC ATCGCTTACG GCATGCACCC TAACGCAGAG 12600
ATCGGCTTCA GGTTTTCTCA AGCTGAGGCG ATGTTCCAGT CCATCGTAGA GCTGATGCCC 12660
AGAAGTGCAG CTGGTGGTGG GGGGATGACG TTGCAGGACA AGGCAAAGGC AGCATTGGAT 12720
GATACACTCG AGAAGCTGCC AGACCAGTTC GTCATGGTGG ATATTCTTGA GCGCATTGAA 12780
GAGCGCACGC CATACGTGAA CGTGTTCTTG CAAGAAATAG AACGAATGGT CATCTTGACG 12840
ACTGAAATTC GTCGCTCGTT GATAGAGCTG GATCAGGGTT TGAAGGGAGA TCTTCAGATC 12900
ACTGACAAGA TGGAGAAGCT CATGACCTCT TTGGCCGAGA ACAAGGTACC TGAGAGCTGG 12960
GAGAACATCG CATACCCTTC GAAGAGACCA TACTCGTCGT GGTTGATCAA CCTGCTCGAT 13020
CGGCAGAAGC AGCTAGATGT ATGGACGGGC GAGCTTGGCC TGCCCAAGTG CACCTGGATA 13080
TCTGGCTTGT TCAGCCCGCA GTCGTTCCTC ACTGCTGTGA TGCAGACGAC AGCGCGAAGA 13140
AACGAATGGG CATTAGATCG CACGGTGAAT CAAACTGAGG TGACGCGCTT CATGGAGCCT 13200
TCACAGATCC CCGGTTTTAA CAAGGAAGGC GCGTACGTGA ATGGCCTCAT CATGGAAGGT 13260
GCCAGGTGGG ACGAGAAGAC AGGAACAATC GAAGACAGTC GTCCTAAGGA GCTTTTCGCA 13320
AAGATGCCTG TTGTTCTCAT CAAGGCAGTG CCTGCGGACA AGGTCGAGAC GGGAGTTTAT 13380
CAGTGCCCTG TCTTCAAGAC TCAAATCCGT GGAGCAGGAG GAGGAAAGGA CACGTTTGTG 13440
TTCCTTGCTG GCCTCAAGAC GAAGCAGAAG CCAAGCAAGT GGATCCAGGC TGGTGTCGCG 13500
CTTCTCATGG ACGTAGTGAT CTAA                                        13524