Entry HELRO02723 (T1FQ78)

E Helobdella robusta


General Information

Description
jgi|Helro1|188646
Organism
HELRO - Helobdella robusta (Taxon-ID: 6412)
Locus
scaffold_3join(complement(5346588..5346762), complement(5346449..5346497), complement(5346254..5346380), complement(5345988..5346162), complement(5345825..5345900), complement(5345573..5345692), complement(5345355..5345480), complement(5344860..5345055), complement(5343870..5344049), complement(5343503..5343745), complement(5343191..5343383), complement(5342282..5342472), complement(5341053..5341284), complement(5340771..5340979), complement(5340503..5340594), complement(5340184..5340298), complement(5339939..5340109), complement(5339564..5339826), complement(5339133..5339334), complement(5338762..5338968), complement(5338517..5338634), complement(5338297..5338372), complement(5338006..5338128), complement(5337780..5337888), complement(5337454..5337652), complement(5337276..5337367), complement(5336983..5337074), complement(5336521..5336722), complement(5336285..5336437), complement(5335938..5336072), complement(5335703..5335790), complement(5335350..5335480), complement(5335016..5335225), complement(5334657..5334842), complement(5334413..5334574), complement(5333888..5334013), complement(5333567..5333724), complement(5333176..5333406), complement(5332987..5333053), complement(5332806..5332857), complement(5332479..5332612), complement(5332174..5332400), complement(5331856..5332062), complement(5331338..5331465), complement(5331060..5331184), complement(5330747..5330932), complement(5330494..5330661), complement(5329844..5330263), complement(5329393..5329560), complement(5329081..5329292), complement(5328685..5328892), complement(5328294..5328554), complement(5327986..5328187), complement(5327276..5327769), complement(5326895..5327137), complement(5326649..5326809), complement(5326422..5326579), complement(5325928..5326200), complement(5325620..5325711), complement(5325401..5325517), complement(5325069..5325159), complement(5324789..5324895), complement(5324296..5324439), complement(5323942..5324105), complement(5323706..5323859), complement(5323427..5323558), complement(5323158..5323286), complement(5322916..5323052), complement(5322756..5322844), complement(5322445..5322563), complement(5321380..5322228))
Number of exons
71

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MWERTLYPPI INEMSWTLAA PFQTNAYCAT PSQSIASNYT PSLQNPKMSK WLKKSRPSVL    60
ANQNRFKKNT KVRKKSGPEL KRPTSRSMTP MEQMEVISVM HKEHTNPLNE MSANDRERYS   120
YYITNMIPEN MLAPMLPDQM NNFKQLIFPV ALRSRTAELE KEVLEDYKNS LKISIVNYIL   180
EDPNQRTRLN IHATPWQWPV RVIRAPVPWH DKLKSISHFI KNHLQLLNPI MDRLQLLWVK   240
EYSNTKFISK EGLLNNNLPM LPAEMKKYIE DCCWNMREQL KKNWVPTVAD ILIEMRSLWQ   300
FMVPRTDDGS TVRVQEFFSC IAALMSLQLR SLVIRSLDEY LAFFEDYAAG NDFQPPFNDS   360
MFLMKSCLLF KIRVQDLQIE FYPLFDEVEE NIIASIREII YCAEGIPRVD HKLFPDLNYL   420
SMTIKSIKFK ESIVTRYVEK AVSIFRKNTL GPKLYIAEYT KYLELVGDQA KNEIRSFTNQ   480
PQSLEKFNEK LQNIQNLKNE LASLRVTVPM NLVCLDCFDI NDIFFKKTNA LQEHLLQFLV   540
DENRETNKRH LTFFEDLKLH CSLFNWPDNI VITLDNAQTM LNIYRDKCEE RLQKQVEDFR   600
QKLNDINKEI ETFRKKEITS LDEMKINVDI LTRITDNLVD ARNELVAINF DESLLEWEES   660
KFPILNQMFA AKDPYEKLWK NALNFTIKSE IWLNGPFRNV MADVVNDELA EMFRVMHKLT   720
KIFVDAGPRT VAERFRNKIE KFKINLPLLN VISNPGIRER HWQAMSDIVG FDLKPGPDVS   780
LAQMIDYGLM IYVDKLEEIG AAAAKEYTLE KNLNKMVSEW QNLQFELVMY RDTEVYILSA   840
IDDIQLLLDD HIIKAQTMKG SSFIKPLEAE ITDWESKLVS MQNILDNWLK CQATWLYLEP   900
IFSSEDIMAQ MPEEGQKFAI VDGIWKEVMY GAAKDKRCLV ATGQPNMLKK LIEANALLED   960
VQKGLNNYLE KKRLFFPRFF FLSNDELLEI LSETKDPTRV QPHLKKCFEG IAKLEFNQDL  1020
EILEMVSSEN ESVSFYQKIS TAKAKGMVEK WLLQVEEVMV NSVKKVVGEA FDAYASTPRK  1080
EWVVEWPGQV VICVGSAYWT MEVSNSFKGP QGLTAYAKQC NRQIEEIVEL VRGNVITTGT  1140
RITLGALIVI DVHARDVVLN LIEKKIHSYH DFNWLSQLRY YWDKKELYVQ MITTNLVYGY  1200
EYLGNTSRLV ITPLTDRCYR TLMGALKLNL GGAPEGPAGT GKTETSKDLA KAVAKQCVVF  1260
NCSDGLDYKA MGKFFKGLAQ SGAWACFDEF NRIEVEVLSV VAQQIHCIQT AIAQNVKRFL  1320
FEGTDLMLNN TCAIFITMNP GYAGRQELPD NLKVLFRTVA MMVPDYALIG EISLYSMGFV  1380
NARGLAQKIV ATYKLCSEQL SSQHHYDYGM RAVKSVLTAA GNLKLKYPDH NESVVLLKAL  1440
MDVNLPKFLA QDIPLFEGII ADLFPGVKAT HPDFDDFFTA VKNRIASNEL QPVPWFISKI  1500
TQIYEMILVR HGLMIVGEPM GGKTKAYQVL AGALTDLSLM DPPLENQVFY KIINPKAITM  1560
GQLYGQFDLV SHEWTDGVLA TTFRKYAINT LEDRKWIVFD GPVDAVWIEN MNTVLDDNKK  1620
LCLMSGEIIQ MSVNMNMIFE PADLEQASPA TVSRCGMIYL EPLQLGWMPF KDSYLEHVFP  1680
KAAEADHVQQ VSDMFDWLVQ PCLDFIKSDC KVLLSTSPIH LVFTFIRLFH SLLDEIIQEY  1740
KNPTNMTLTS AQITLWLQGL FQFALVWSIG STVSGDSRKK FDAFFRNLVY GLNAKCPKPK  1800
SSKISRNNSF PERGLVYDFC FEKKASGIWV EWIEKLDKIN VSFPSNAKIS DLIIPTRETA  1860
RQFFFLNTYV LHNIPLLFVG PTGTGKSVIT IDYIFNLSKE SFIKNMINFS ARTSANQTQD  1920
IIMGKLDRRR KGLYGPPVGK KCIVFVDDLN MPAKEKYGAQ PPIELLRLVG LKCLLLVWQW  1980
IDHGHWYNKK DTSKLFLADM LFLGAMGPPG GGRNDITGRF TRHLQLMSID EFDDSTLIKI  2040
FSTIIDWHFS KGFDVIFLHQ GKALVQATLK VYKDLFYGFL PTPTKSHYIF NLRDFFRVIS  2100
GVILVSQQKL TDVEKLIRLW FHEVYRVFSD RLIESADSSR FFSIVVEACS THFKVNIDKL  2160
FSHLTNGTRL QENHVRSLFF GDYLLPDAEK IYDEILDIDE LFRVMNSYLE DYNAVSRAPM  2220
RLVMFRFAIE HISRVSRVLK QEKGHVLLVG IGGSGKMSVT KLATFMADCE LFRVEITKNY  2280
GTNEWRDDLK KLLLKAGLEG RYCCFLFSDG QIRDEGFMED ISMILNSGDV PNLFGADEKS  2340
EIVEKIQNVA KNEGKTIEVN PLALFNFFTD RVKQNLHIAL TMSPIGDAFR NRLRMFPSLI  2400
NCCTIDWFHS WPDDALEMVA NKFFEDLHLD EAIHKETVIM CKHFHLSVIE LSTMFYNELR  2460
RHNYVTPTSY LELILTFKKL LNEKRESITT LQMRYKTGLE KLQFAASQVT VMQMELTNLH  2520
PQLLRTSTET EELMIKIEQD TVEVEAKKEV VAADEAVANE AAARAQSIKD ECENDLAEAI  2580
PALESAISAL NTLKPSDITE VKTMKNPPGV VKLVMEAVCI MLNIKPDRKP DGSGKMFDDY  2640
WSASQKLLGD MKFLDKLKQY DKDNIPPPVI KKIREKFIPS SDFEPSIVKN ASKACEGLCK  2700
WVRAIDVYDR VIKIVAPKKI KLADAENELA LQMSKLNLKR DQLKQVADKL QMLNDEFEKM  2760
TQKKKELEEN IEICSKKLDR AEKLIGGLGG EKTRWTINEM KLSEQLFNVI GDVLLSAALV  2820
AYMGAFTFNY RQSCIKQWHE MCLARNIPCS ANYSLMVTLG DPITIRAWQI AGLPVDSFSI  2880
ENGIIVSNSR RWPLMIDPQA QANKWVKKME AENKIAVIKL SDSSYIRQLE NALQFGFPVL  2940
MESIGEELDA MLEPILQKSI FRQQGVDYIK FGDNVIEYSF NFRFYMTTRL RNPHYLPEVS  3000
VKVCLLNFMI TPQGLEDQLL GIVAATEKPD LEEKKNELIL ASASNKKKLQ EIEDKILQVL  3060
SMSQGNILED ETAIEILSSS KVLSEEISAK QEIATLTEEE IDGTRNGYQP IAVHSSILFF  3120
CISDLANIEP MYQYSLTWFI NLYLNSIYKS EQSEMLEQRL ENLKQHFTYS IYKNVCRSLF  3180
EKDKLLFSVI LTVGILKGRN EVDDSLWRFL LTGGVALNNP NPNSFSKWLS DKSWSEIVRL  3240
SDHEYFPNFM KNFVESVQTW KILYDSPAPH LMKFPEPYSV HVNNMQSLVL LRVLRPDKMV  3300
PALQNFIKAN LGQQYIEPPT FDLDGSFSDS NCCTPLIFIL SPGADPMAAL LKFAEDKGFG  3360
GPKIQTISLG QGQGPIASKM IDEAIATGTW VVLQNCHLAT SWMPSLEKIC EEVITIEKTK  3420
PDFRLWLTSY ATPAFPVVIL QNGVKMTNEP PKGLRSNLLR SFLNDPISDP AFFNGCKTAR  3480
RWKKMLFGLC FFHAVVQERR KFGPLGWNIP YEFNESDLRI SMRQMQMFLN DYEELPLTAL  3540
TYLTGECNYG GRVTDDKDRR LLTSILSIFY TSEIVFFESY KFSPSGLYYS PPEGSYQNYI  3600
EYIKSLPLIA TPEVFGLHDN ADITKDNKET VELFNSILLT LPRLATKTGE KSSSDTVFDL  3660
AGSILAEVPE KFDIEEVNNK YPVIYSESMN TVLRQELIRY NKLVHVIRKT LVNLRKAIKG  3720
LVVMSLELDD IFNSMLIGKV PLAWASKSYP SLKPLGSYIN DLILRLKFFQ SWIDDGIPIS  3780
FWVSGFYFTQ SFFTGVFQNY ARKYLIPIDM LGYQYDMMYS DVVSEKPEDG AYIYGLFLEG  3840
ARFDNERMIL AESHPKILFV SMPIIWLRPG KIDDFLVRRV YSCPVYKTTE RRGTLSTTGH  3900
STNFVLMLDI PSDKPERHWI NRGVAAICQL DN                                3932

Coding Sequence

Download: Fasta
ATGTGGGAGA GGACCTTGTA TCCTCCAATA ATAAATGAAA TGTCATGGAC ATTAGCGGCT    60
CCATTTCAAA CAAATGCATA TTGTGCAACA CCCAGCCAAT CTATTGCTAG CAATTATACA   120
CCATCACTTC AGAATCCAAA AATGTCAAAA TGGTTGAAGA AAAGTAGACC ATCTGTTCTA   180
GCAAATCAAA ATCGTTTTAA GAAGAATACA AAAGTACGAA AAAAAAGTGG TCCAGAGTTG   240
AAGAGGCCAA CAAGTCGATC AATGACTCCC ATGGAACAGA TGGAAGTGAT CAGCGTCATG   300
CACAAAGAAC ATACCAACCC ACTCAATGAA ATGTCTGCTA ACGATAGAGA AAGGTACAGT   360
TATTACATCA CCAACATGAT ACCCGAAAAC ATGCTGGCTC CCATGCTTCC TGATCAGATG   420
AACAATTTCA AACAACTCAT CTTTCCAGTG GCGCTCAGGA GCAGGACTGC TGAGCTGGAA   480
AAGGAAGTGC TGGAAGATTA TAAAAACAGT TTGAAGATCT CTATAGTAAA CTACATACTG   540
GAGGACCCAA ATCAACGAAC CAGACTGAAT ATACATGCGA CTCCATGGCA ATGGCCTGTC   600
AGAGTCATTC GTGCTCCAGT TCCGTGGCAT GATAAGTTGA AGTCCATCTC CCACTTCATT   660
AAAAATCACT TGCAACTGTT GAACCCCATT ATGGACAGAC TGCAGCTGCT TTGGGTCAAA   720
GAATATTCCA ACACAAAATT TATAAGCAAA GAAGGATTGT TGAACAACAA TTTACCAATG   780
CTGCCAGCAG AAATGAAGAA ATACATTGAG GATTGTTGCT GGAACATGAG GGAGCAACTC   840
AAAAAGAACT GGGTTCCAAC AGTGGCGGAT ATATTGATAG AGATGAGGTC GTTGTGGCAG   900
TTCATGGTTC CGAGAACAGA CGACGGATCG ACTGTGAGGG TTCAAGAATT TTTTTCCTGC   960
ATTGCTGCCC TCATGTCTCT GCAACTCAGA TCGCTGGTCA TCAGGTCTCT GGATGAGTAT  1020
TTGGCCTTCT TTGAGGATTA TGCCGCAGGA AATGATTTTC AGCCACCTTT CAACGATTCC  1080
ATGTTCCTCA TGAAGTCGTG TTTGTTGTTC AAGATTCGCG TGCAAGATCT ACAGATTGAA  1140
TTCTATCCAT TGTTTGATGA GGTTGAGGAA AATATCATTG CATCTATCAG GGAGATTATC  1200
TACTGCGCCG AGGGCATACC CAGAGTGGAC CACAAATTGT TCCCAGATCT CAACTATCTA  1260
TCCATGACGA TTAAATCAAT CAAATTTAAG GAATCGATTG TTACAAGATA CGTTGAAAAA  1320
GCTGTCAGCA TATTTCGGAA GAACACCCTT GGTCCCAAGT TATACATAGC AGAATATACA  1380
AAATATTTGG AACTTGTGGG GGATCAAGCC AAAAATGAAA TAAGATCCTT CACTAACCAG  1440
CCTCAATCAC TTGAAAAGTT TAACGAGAAA CTACAAAACA TCCAAAACTT AAAGAACGAG  1500
TTGGCGTCAC TGAGAGTAAC AGTGCCAATG AATTTGGTGT GTCTCGATTG CTTTGACATC  1560
AATGATATCT TCTTCAAGAA AACGAACGCA CTGCAAGAAC ATCTGCTGCA ATTTCTTGTC  1620
GACGAGAACA GAGAGACCAA TAAGAGGCAT CTAACTTTTT TCGAAGATCT GAAGTTGCAT  1680
TGCAGCCTGT TTAACTGGCC GGACAACATT GTTATCACAC TGGACAACGC GCAAACCATG  1740
TTAAACATCT ACAGAGATAA GTGTGAGGAA AGGCTGCAGA AGCAGGTTGA AGACTTCCGA  1800
CAAAAACTTA ACGACATCAA TAAAGAAATT GAAACATTTA GGAAGAAGGA GATAACAAGT  1860
CTTGACGAGA TGAAGATCAA TGTAGACATA TTGACTCGCA TCACGGATAA TCTCGTTGAT  1920
GCCAGAAATG AGCTGGTGGC GATCAACTTC GATGAAAGTT TGCTGGAGTG GGAGGAATCC  1980
AAGTTCCCGA TACTCAATCA AATGTTTGCG GCTAAGGATC CTTATGAGAA ACTTTGGAAG  2040
AATGCGCTGA ACTTCACTAT CAAAAGTGAA ATATGGCTGA ACGGTCCATT CAGGAATGTG  2100
ATGGCAGATG TTGTTAACGA CGAGTTAGCA GAAATGTTCA GAGTGATGCA CAAGCTGACA  2160
AAAATATTTG TTGATGCTGG CCCTCGAACG GTAGCTGAAC GATTTCGCAA CAAGATTGAA  2220
AAATTCAAAA TCAATTTACC GTTACTTAAT GTTATCAGCA ACCCAGGAAT CAGAGAGAGG  2280
CATTGGCAGG CTATGAGCGA TATAGTAGGG TTTGACTTGA AACCTGGACC AGATGTTTCC  2340
TTGGCTCAAA TGATTGACTA TGGATTAATG ATTTATGTTG ATAAATTGGA AGAGATTGGA  2400
GCAGCTGCCG CAAAAGAGTA CACTTTGGAA AAGAATTTGA ATAAGATGGT CAGTGAGTGG  2460
CAGAACCTGC AGTTTGAGTT GGTCATGTAC AGAGATACCG AGGTGTACAT TCTTTCTGCC  2520
ATTGATGATA TCCAGCTGTT GCTGGATGAC CACATCATCA AGGCGCAGAC GATGAAAGGA  2580
TCTTCGTTCA TTAAACCACT TGAAGCCGAA ATCACAGATT GGGAATCTAA ACTTGTATCT  2640
ATGCAGAACA TTCTAGATAA TTGGCTAAAG TGCCAAGCTA CTTGGTTGTA CTTAGAGCCC  2700
ATATTCAGTT CTGAAGACAT AATGGCTCAA ATGCCAGAGG AAGGCCAGAA GTTTGCCATA  2760
GTAGATGGCA TATGGAAGGA GGTTATGTAT GGAGCTGCGA AGGACAAGAG ATGCTTGGTG  2820
GCCACCGGTC AGCCCAACAT GCTGAAGAAG TTGATTGAGG CCAATGCGTT GTTGGAAGAT  2880
GTTCAGAAAG GTCTGAATAA TTATCTAGAG AAGAAGAGAT TGTTCTTCCC AAGATTTTTC  2940
TTCTTATCGA ACGACGAATT GTTGGAGATT TTGTCAGAAA CAAAGGATCC AACGCGAGTT  3000
CAGCCGCACC TGAAAAAATG TTTTGAAGGA ATCGCCAAGT TGGAATTCAA CCAAGATCTA  3060
GAGATTCTTG AGATGGTGTC TTCTGAGAAC GAATCTGTCA GCTTCTATCA GAAAATATCC  3120
ACTGCAAAGG CCAAGGGTAT GGTGGAGAAG TGGTTGCTTC AGGTGGAGGA AGTGATGGTG  3180
AACAGCGTGA AGAAGGTGGT TGGTGAGGCG TTCGACGCTT ATGCTTCAAC ACCACGAAAA  3240
GAGTGGGTGG TTGAGTGGCC AGGGCAAGTG GTCATCTGCG TTGGGTCAGC CTATTGGACC  3300
ATGGAAGTCA GCAATTCATT CAAAGGCCCG CAAGGATTGA CTGCGTATGC TAAACAGTGC  3360
AACAGACAGA TTGAGGAAAT AGTAGAGTTG GTGAGAGGAA ATGTGATAAC GACAGGAACC  3420
AGGATAACGT TGGGGGCTCT CATTGTGATT GATGTACACG CTCGTGACGT TGTTCTCAAT  3480
CTCATTGAAA AGAAGATACA CAGTTATCAT GATTTCAACT GGCTGAGTCA ACTCAGATAT  3540
TATTGGGACA AAAAGGAGTT GTACGTGCAG ATGATAACGA CGAACCTGGT CTATGGTTAC  3600
GAATATCTGG GCAACACATC GAGACTCGTC ATCACTCCAC TGACGGACAG ATGTTACAGA  3660
ACATTGATGG GGGCGTTGAA GTTGAACCTC GGAGGAGCAC CTGAGGGACC TGCAGGTACT  3720
GGAAAGACTG AAACTTCCAA AGACCTTGCC AAAGCTGTTG CTAAGCAGTG TGTGGTCTTC  3780
AACTGTTCTG ACGGGTTAGA TTACAAAGCA ATGGGAAAGT TTTTCAAAGG ACTGGCGCAG  3840
TCAGGAGCCT GGGCTTGTTT TGATGAGTTC AACAGGATTG AAGTTGAAGT ACTTAGTGTT  3900
GTGGCACAAC AGATTCACTG CATTCAAACA GCCATTGCCC AAAATGTTAA GAGATTTTTA  3960
TTCGAAGGAA CTGACTTGAT GTTGAATAAC ACTTGCGCCA TCTTCATCAC CATGAATCCC  4020
GGGTATGCAG GCAGACAAGA ACTTCCTGAT AACTTGAAAG TTCTGTTTAG AACAGTTGCA  4080
ATGATGGTGC CAGACTATGC GCTCATAGGC GAGATTTCTT TGTACTCCAT GGGATTCGTC  4140
AATGCAAGAG GATTGGCTCA GAAGATAGTG GCAACATATA AGCTGTGCTC GGAGCAGTTG  4200
TCATCCCAGC ACCACTACGA TTACGGAATG CGAGCCGTCA AATCCGTTTT AACGGCAGCT  4260
GGTAACTTGA AGTTGAAGTA CCCGGACCAC AATGAGAGTG TTGTTTTATT GAAGGCACTC  4320
ATGGACGTCA ACCTACCCAA GTTCCTCGCT CAGGACATTC CATTGTTTGA GGGGATCATT  4380
GCCGATTTGT TTCCAGGAGT GAAGGCCACC CACCCTGATT TTGATGACTT TTTCACCGCC  4440
GTTAAGAACA GGATTGCTTC AAACGAACTT CAACCCGTGC CATGGTTTAT CAGCAAAATT  4500
ACTCAGATAT ACGAGATGAT ATTGGTTCGA CACGGTCTGA TGATCGTTGG AGAACCAATG  4560
GGAGGCAAAA CAAAGGCCTA CCAAGTGCTG GCTGGTGCCC TCACTGACCT CTCCCTGATG  4620
GACCCACCCC TCGAGAACCA GGTGTTTTAC AAAATAATCA ATCCCAAAGC AATAACCATG  4680
GGACAGTTGT ATGGACAGTT TGATCTTGTT TCTCATGAAT GGACTGATGG AGTGTTAGCT  4740
ACGACATTCA GGAAATATGC CATAAACACA TTGGAAGATA GGAAGTGGAT TGTGTTCGAT  4800
GGACCTGTTG ATGCCGTTTG GATCGAAAAT ATGAACACCG TGCTGGACGA CAATAAAAAG  4860
TTGTGTTTGA TGAGTGGTGA GATAATTCAA ATGTCGGTAA ACATGAACAT GATATTCGAG  4920
CCGGCTGACT TGGAGCAGGC ATCGCCAGCC ACAGTCAGTC GGTGTGGGAT GATTTATTTG  4980
GAGCCACTGC AGCTCGGTTG GATGCCTTTC AAAGATTCCT ACCTGGAACA TGTCTTCCCT  5040
AAAGCGGCTG AAGCTGACCA CGTCCAACAA GTGTCCGATA TGTTTGATTG GTTGGTGCAA  5100
CCCTGTCTAG ACTTCATAAA ATCTGACTGC AAGGTGTTGT TGAGTACTTC TCCCATCCAC  5160
CTGGTATTCA CGTTCATCAG GTTGTTCCAT TCCCTCCTCG ACGAGATCAT ACAAGAGTAC  5220
AAGAACCCAA CCAACATGAC ACTCACAAGC GCTCAGATAA CATTGTGGCT GCAAGGTTTG  5280
TTTCAATTCG CTCTGGTTTG GTCAATCGGC AGTACAGTGT CAGGTGATTC TCGGAAAAAG  5340
TTTGATGCCT TCTTCAGAAA TCTCGTTTAC GGGTTAAATG CCAAGTGTCC TAAACCTAAA  5400
AGCTCTAAAA TTTCTAGGAA CAATTCCTTC CCCGAACGTG GCCTGGTGTA TGATTTTTGT  5460
TTTGAGAAGA AAGCAAGTGG AATCTGGGTG GAGTGGATTG AAAAGTTGGA CAAAATCAAC  5520
GTCAGCTTTC CCTCCAATGC CAAAATCAGT GACCTAATCA TACCAACGCG TGAAACGGCG  5580
CGTCAGTTCT TCTTCTTGAA CACCTACGTG CTTCATAACA TTCCCCTCTT GTTCGTTGGA  5640
CCCACAGGAA CAGGAAAGTC TGTCATAACC ATCGACTACA TCTTTAATCT CTCCAAAGAA  5700
TCCTTCATAA AGAACATGAT CAACTTCTCA GCTCGGACGT CAGCCAATCA GACGCAGGAC  5760
ATCATAATGG GCAAGTTGGA TAGACGAAGG AAGGGTTTGT ACGGTCCACC TGTTGGCAAG  5820
AAATGCATCG TGTTTGTTGA CGACCTGAAC ATGCCAGCCA AAGAAAAATA TGGGGCACAA  5880
CCACCAATCG AGTTGCTCAG GCTAGTTGGG CTAAAATGTT TGCTTCTAGT GTGGCAGTGG  5940
ATTGACCATG GTCACTGGTA TAACAAGAAG GACACCTCCA AGCTTTTCCT TGCCGATATG  6000
TTGTTTCTGG GAGCCATGGG TCCACCAGGC GGGGGTAGGA ATGACATCAC AGGTAGATTT  6060
ACGCGACATT TGCAACTGAT GTCCATTGAT GAGTTCGATG ATAGCACGTT GATTAAAATC  6120
TTCTCGACAA TTATTGATTG GCACTTTTCT AAAGGTTTCG ACGTCATATT TCTTCATCAG  6180
GGCAAGGCTC TTGTACAAGC AACATTAAAA GTATACAAAG ATTTGTTTTA TGGCTTCCTA  6240
CCGACACCTA CAAAGAGCCA CTACATTTTC AATTTGAGAG ATTTTTTCAG AGTCATCAGC  6300
GGCGTCATCT TGGTTTCTCA GCAGAAGTTG ACAGACGTCG AAAAGCTCAT CAGACTCTGG  6360
TTTCACGAGG TCTACAGGGT CTTTAGTGAT AGGTTGATTG AAAGTGCTGA CAGCTCTCGC  6420
TTCTTTTCGA TCGTCGTTGA GGCGTGCAGC ACGCACTTCA AGGTGAACAT CGACAAGTTG  6480
TTCAGTCACC TGACGAACGG CACGCGTCTG CAAGAGAACC ACGTGCGAAG TTTGTTCTTC  6540
GGCGATTACC TGCTGCCCGA TGCAGAAAAG ATTTATGACG AAATACTGGA CATTGATGAA  6600
TTGTTCAGGG TCATGAATAG TTACTTGGAA GATTACAACG CTGTAAGCAG GGCACCGATG  6660
AGATTGGTCA TGTTTCGGTT TGCGATTGAG CACATATCAA GGGTCAGCAG GGTACTGAAG  6720
CAAGAGAAGG GGCACGTTCT ACTAGTTGGT ATCGGCGGCA GTGGGAAGAT GAGTGTGACG  6780
AAACTTGCGA CCTTCATGGC CGATTGTGAG TTGTTTCGAG TTGAAATCAC AAAAAATTAC  6840
GGCACCAACG AATGGAGGGA TGACCTTAAA AAGCTACTGT TGAAAGCCGG CTTAGAAGGA  6900
AGGTACTGTT GTTTTCTGTT CAGTGACGGT CAGATAAGAG ATGAAGGTTT TATGGAAGAC  6960
ATCAGCATGA TACTAAATTC AGGCGATGTT CCAAATTTAT TTGGGGCGGA TGAAAAATCA  7020
GAAATTGTCG AAAAAATTCA AAACGTTGCT AAAAATGAGG GGAAAACAAT AGAAGTGAAC  7080
CCGCTGGCTC TCTTCAACTT CTTCACAGAC CGAGTGAAAC AGAACCTGCA CATAGCGCTG  7140
ACCATGAGTC CCATCGGTGA TGCCTTCAGG AACAGGCTTC GTATGTTTCC ATCTCTCATC  7200
AACTGCTGCA CCATTGACTG GTTCCATTCG TGGCCAGATG ATGCTCTGGA GATGGTTGCC  7260
AACAAGTTTT TTGAAGACTT GCACCTCGAT GAAGCAATCC ACAAAGAAAC TGTTATCATG  7320
TGCAAACATT TCCATCTCAG TGTCATAGAA TTATCTACAA TGTTCTACAA CGAACTGCGA  7380
CGTCACAACT ACGTCACCCC GACGTCATAT TTGGAGTTGA TATTAACTTT TAAGAAGTTA  7440
TTAAATGAGA AAAGAGAGAG CATAACTACA TTGCAGATGA GGTACAAAAC TGGTTTGGAG  7500
AAATTGCAGT TTGCAGCCTC CCAAGTGACC GTCATGCAGA TGGAACTGAC CAACTTGCAT  7560
CCACAACTTC TTAGGACGAG CACCGAGACT GAGGAGCTGA TGATTAAAAT TGAACAGGAC  7620
ACTGTAGAAG TGGAAGCCAA GAAAGAGGTT GTTGCAGCCG ACGAAGCTGT GGCTAATGAA  7680
GCAGCCGCCA GAGCGCAGAG TATCAAGGAT GAATGTGAGA ATGATTTGGC GGAGGCCATC  7740
CCAGCTTTGG AGTCTGCAAT TTCAGCTCTC AATACTTTGA AACCAAGTGA CATCACAGAA  7800
GTTAAAACAA TGAAGAATCC GCCCGGCGTG GTAAAGTTGG TGATGGAGGC AGTGTGCATC  7860
ATGCTAAATA TCAAACCGGA TCGCAAACCA GATGGCAGTG GAAAAATGTT TGATGACTAC  7920
TGGAGTGCTT CACAGAAACT TCTTGGCGAC ATGAAATTTC TTGATAAATT GAAACAATAC  7980
GATAAAGATA ACATTCCTCC TCCTGTAATC AAGAAGATAA GAGAAAAATT TATACCTAGC  8040
TCAGATTTTG AACCATCGAT AGTGAAGAAC GCTTCGAAGG CGTGTGAAGG ACTGTGCAAG  8100
TGGGTGAGGG CCATCGACGT TTACGACAGG GTCATCAAAA TTGTGGCGCC AAAAAAAATC  8160
AAGCTGGCCG ATGCTGAAAA TGAACTGGCT CTTCAGATGT CCAAGTTGAA TTTGAAACGA  8220
GACCAGTTGA AGCAGGTAGC CGACAAGTTA CAGATGTTGA ATGACGAGTT TGAAAAAATG  8280
ACTCAAAAGA AGAAAGAGTT AGAAGAGAAC ATTGAGATTT GTTCGAAGAA GTTAGACAGG  8340
GCGGAGAAGT TGATAGGAGG ATTAGGAGGA GAGAAAACAA GATGGACCAT CAACGAGATG  8400
AAGTTGTCTG AGCAGCTGTT CAATGTTATT GGAGATGTAC TTCTAAGTGC TGCTCTTGTC  8460
GCGTACATGG GGGCATTCAC TTTTAATTAC AGACAGTCTT GCATAAAACA ATGGCATGAA  8520
ATGTGCTTGG CTCGAAACAT TCCATGTTCA GCGAACTACT CCCTGATGGT CACCTTGGGA  8580
GACCCCATCA CCATCAGGGC GTGGCAAATA GCTGGCCTGC CTGTCGATAG TTTCAGTATT  8640
GAGAATGGCA TCATTGTATC GAACTCAAGA AGGTGGCCTC TTATGATTGA TCCGCAAGCC  8700
CAGGCAAACA AGTGGGTCAA AAAGATGGAA GCTGAAAATA AAATAGCAGT GATTAAGTTA  8760
TCAGACAGTA GTTACATTCG ACAGCTTGAA AATGCACTGC AGTTTGGCTT TCCAGTGCTA  8820
ATGGAAAGTA TAGGGGAAGA ATTGGATGCC ATGTTGGAGC CAATTTTACA AAAATCAATT  8880
TTCAGACAGC AGGGAGTTGA TTATATAAAA TTTGGAGATA ATGTCATTGA ATATTCATTT  8940
AATTTCCGTT TCTACATGAC AACTCGTCTG AGAAATCCTC ACTACCTGCC CGAAGTCTCT  9000
GTGAAGGTCT GCCTGCTGAA CTTCATGATC ACCCCGCAAG GGCTTGAAGA TCAGCTGCTG  9060
GGCATTGTGG CTGCTACTGA AAAACCTGAT TTGGAAGAAA AGAAAAATGA ACTGATACTG  9120
GCAAGTGCAT CCAACAAAAA GAAGTTACAG GAAATTGAAG ATAAAATTTT GCAAGTGTTG  9180
TCGATGTCAC AGGGCAATAT TTTGGAAGAT GAGACGGCCA TTGAAATTCT GTCATCATCC  9240
AAAGTGTTGT CCGAAGAAAT ATCCGCCAAG CAAGAAATTG CCACATTGAC TGAAGAAGAA  9300
ATAGATGGCA CTAGAAATGG ATACCAACCG ATTGCTGTAC ATTCGTCCAT TCTGTTCTTC  9360
TGTATATCCG ATCTTGCAAA TATCGAGCCT ATGTACCAAT ATTCACTGAC ATGGTTCATT  9420
AACCTTTATC TCAACTCCAT CTATAAAAGC GAACAGTCTG AAATGTTGGA GCAAAGATTG  9480
GAAAATTTAA AACAACATTT TACCTATTCC ATTTACAAAA ATGTCTGCAG ATCACTCTTT  9540
GAGAAAGACA AACTTCTCTT TTCAGTAATT TTAACGGTGG GCATACTTAA AGGAAGGAAC  9600
GAGGTGGATG ACAGTCTGTG GAGGTTCCTG TTAACAGGTG GGGTTGCTCT AAATAATCCT  9660
AATCCCAATT CTTTCTCTAA ATGGCTCAGT GACAAATCTT GGTCGGAAAT TGTCAGATTG  9720
TCAGATCATG AATATTTTCC AAATTTTATG AAAAACTTTG TGGAATCAGT GCAGACTTGG  9780
AAGATTTTAT ATGATTCTCC TGCCCCGCAC CTCATGAAAT TTCCGGAGCC CTATTCGGTT  9840
CACGTCAACA ACATGCAGTC ACTCGTTCTT CTGCGGGTCC TCAGGCCAGA CAAGATGGTG  9900
CCCGCCCTAC AAAACTTCAT CAAAGCGAAT CTCGGCCAAC AGTACATCGA GCCTCCCACA  9960
TTCGATCTCG ATGGATCATT TTCCGATTCA AACTGCTGCA CTCCGCTCAT TTTCATTTTA 10020
TCCCCTGGCG CTGATCCAAT GGCGGCCTTG TTGAAATTTG CAGAAGACAA GGGCTTTGGC 10080
GGTCCAAAAA TCCAAACAAT ATCTCTTGGT CAAGGACAGG GTCCGATAGC ATCCAAGATG 10140
ATTGATGAGG CAATAGCTAC GGGCACGTGG GTTGTACTTC AGAACTGCCA CCTGGCTACC 10200
AGTTGGATGC CTAGTCTTGA AAAAATATGT GAAGAGGTGA TAACGATTGA GAAGACGAAG 10260
CCGGACTTTC GTTTGTGGTT GACCAGTTAT GCTACTCCAG CTTTTCCAGT CGTTATTCTG 10320
CAAAATGGAG TGAAGATGAC TAATGAGCCT CCAAAAGGCC TGAGATCAAA TTTGCTGAGG 10380
TCTTTCCTCA ACGATCCCAT ATCTGATCCT GCATTCTTTA ATGGATGCAA AACTGCCAGG 10440
CGATGGAAGA AGATGTTGTT TGGCTTGTGC TTCTTTCACG CTGTCGTACA GGAAAGAAGA 10500
AAGTTTGGAC CGCTTGGTTG GAACATTCCC TACGAGTTCA ATGAATCTGA TTTGAGAATC 10560
AGTATGAGGC AGATGCAGAT GTTTTTGAAC GATTACGAAG AACTTCCACT AACTGCTCTG 10620
ACGTATCTGA CTGGTGAATG CAACTACGGG GGTCGTGTCA CTGATGACAA AGATAGAAGG 10680
CTGCTCACTT CAATTTTATC AATATTCTAT ACTAGTGAAA TCGTCTTCTT TGAATCTTAC 10740
AAGTTTTCTC CAAGTGGCTT GTACTATTCG CCACCAGAAG GCTCTTACCA AAATTACATT 10800
GAATACATCA AATCACTACC TTTGATCGCA ACTCCAGAAG TTTTCGGCCT GCACGATAAT 10860
GCAGATATCA CTAAGGACAA CAAAGAAACA GTCGAGTTGT TTAACAGTAT TTTGCTGACG 10920
CTGCCTCGCT TGGCCACGAA AACAGGAGAA AAGAGTTCGT CAGACACCGT CTTCGACCTG 10980
GCGGGCAGTA TTCTAGCGGA GGTTCCTGAG AAGTTTGACA TTGAAGAAGT GAACAATAAA 11040
TATCCAGTGA TCTACTCGGA ATCGATGAAT ACAGTTTTGA GGCAGGAGCT AATAAGATAC 11100
AACAAACTCG TACACGTCAT ACGCAAAACT CTCGTCAACC TCAGGAAAGC CATCAAGGGA 11160
TTAGTGGTCA TGTCTCTGGA GCTGGACGAT ATATTCAACA GCATGTTAAT TGGCAAGGTA 11220
CCATTGGCCT GGGCCTCCAA GTCATATCCG TCCTTGAAAC CTCTCGGAAG TTATATTAAT 11280
GATTTGATCT TGAGGTTAAA ATTTTTCCAA TCGTGGATTG ACGATGGCAT TCCAATTAGT 11340
TTCTGGGTCT CGGGATTTTA TTTCACGCAA TCCTTTTTCA CCGGTGTATT TCAAAATTAT 11400
GCTCGCAAAT ACCTGATTCC AATTGATATG CTGGGATACC AATATGACAT GATGTACAGT 11460
GATGTGGTCA GCGAAAAACC GGAGGATGGT GCATATATAT ATGGTTTGTT CTTAGAGGGA 11520
GCCAGATTTG ATAACGAGCG CATGATCCTC GCCGAATCAC ATCCCAAAAT TTTGTTTGTT 11580
TCCATGCCCA TTATCTGGCT GCGACCTGGA AAAATTGACG ACTTCCTCGT GCGGCGAGTC 11640
TACAGTTGTC CTGTTTATAA GACGACGGAA CGCAGGGGCA CCTTGAGTAC GACGGGACAT 11700
TCAACTAATT TCGTACTAAT GTTGGATATC CCATCTGACA AGCCTGAAAG ACATTGGATT 11760
AATAGGGGAG TTGCCGCTAT TTGCCAACTC GATAATTAA                        11799