Entry PARTE37616 (A0D7F8)

E Paramecium tetraurelia


General Information

Organism
PARTE - Paramecium tetraurelia (Taxon-ID: 5888)
Locus
CT868318join(742509..742679, 742705..754158, 754195..755310, 755338..756156)
Number of exons
4

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MEESETQLNV KVQEQGKLYS ANEIENFNQY LSAICLSLLI IDKDQWNVAC HEDVNQQNIC    60
QFLSDSQIKA LIVSKTVENE KFNIQIRSEY EASNNYAHTI CFLKRHTFQY DNQLQPQQFS   120
NHVQVINVGY AESQGGANPF TLSHNYVQNC FIPIFTQYKG EIDKKRIVDQ SSYNDLIKKL   180
NEVNLAFIKC RQNVEVPEII LQFDPRIKEA VKQRGGKPTI EDAAQLNKPD IVQSISQTVT   240
RWISDINQIS NTKLELTNAS IVDEINYWMS MERSLFFIEN QLKQPEVDFT IEVLTQAKKM   300
NITAQFKEIA LKQSLQKCQS CNQFMKEFPI NNLLIATNLV EIKDAMIQIF QHMKKLSNIQ   360
ETYTIPRSLQ LAESFSRELT NEMIKYFKGF QILHIKYVDF KGLIIKTQEI FSQWDEEYKI   420
FKQSIVKKSV HQKDQYGQFE HIKLQKQIQH IQRLREMHEN LKEVIEQIIQ NDQEEQKENV   480
QQFATLQEIQ QAYDIFKNVE VFDLSRDGED QFFRALKQYE IAIESVEATI TTNLRDSLGS   540
ASSAKEMFRI LAKFNKLFSR PRIKGAIQEY QSQLLKTVHK DIQSLQNKFK ETYQKSQNSR   600
LASARDIPLT SGFVIWSKQL QIRLQKYMQK VEQILGPQWA EDTDGKKCKE MGETFERILD   660
SGPALEDWKQ EINHHNKAVS QNEKLFEVVT RRRGLEIRVN YEKKLSQLFK EVRNLSNMKT   720
KVPYSISHIA NDAKASYPFA LSLQESLHTY IQITSQLNAK SAKLVAALRK EVQLQIGQGF   780
NYLWTHKTQL QPYVKKFTDK VFELEQAVNG LNERIGQIES LCEAMKTCPV DSLADKLKDI   840
QEVIDSLCFN NFSNLHIWIQ DIDKQIESIL CDRVTVQMKE WLNQFINYQK IQERGLVNQT   900
VVHELKLQDQ IIYVDPPVEY AKYFWFQEFH KMIGQICSLP RLVANRFDNT IQQNTGPWGT   960
QRDLDYSTTI NKINQQLIKD AYSQIGQLLE DMEQYVQTWL NYQSLWELDI KQVEQILQDD  1020
IEKWQQMLTD IKQGRATFDN STTEEHFGAI IIDYRMVQVK INHKYDAWHK ELLNHFGNKF  1080
GEQLRVFNKN VTTEKEKLLK INFQDLTSDI IESITIIQEQ DKKFPGWSAD IESFKNGQKV  1140
LDRQRYQYPG DWLSFEQVEM QWNQFKQIRS KKLQSQESEM NNIQSKIQQD ERYLNQQIQE  1200
IEEQWKTSKP DSGDCSPNEA EQILKSLNEQ LISVQEKYEK CSQAKEILKM DPPTHQQKLN  1260
VLLESISDLQ DVWQELGKIW KVMQSIKEQL ISALQNKKIK DTCDEAQKQL NGVSTKTRNY  1320
DAFEKMKEKV KNYIKMNKLI MDLKDESMKE RHWRQLLSKL KINESLNQLQ MQHLWNANLL  1380
NYENLAKDIM TVARGEQVLE TMISQVKDFW NSFELELVKY QTKCKLIRGW DELFQKLDED  1440
LNNLASMKIS PFYKTFEAEI SQWDDKLQKV KLTMDIWIDV QRRWVYLEGI FFGSSDIKTQ  1500
LQNEYNKFKD IDSQFTNLMK KVAQKPQLMD VQGIPNLAKT LERLSDFLQK IQKALGDYLE  1560
TQRQAFARFY FVGDDDLLDI IGNSKDVTNV QRHFPKMYAG IVQLQSRKDG NDDVVLGMSS  1620
KEGEVVPFSK EVKIAEDPRI NIWLGKVDNE MMNSLALDLE KSVLDIQANQ QNRMKVIEEH  1680
PAQIILLALQ VGWCFSVESS FNNEQQMKQT LQYVLEFLSE LAESVLKDHP KQLRQKFEQI  1740
ITDFVHQRDV IRLLMNNKIN SKNDFGWQYH MRFNWNSKEA DPGKRLLIQM GNAQFHYGFE  1800
YLGVAEKLVQ TPLTDKCFLT LTQALHLRMG GSPFGPAGTG KTESVKALGA QLGRFVLVFN  1860
CDETFDFNAM GRIFVGLCQV GAWGCFDEFN RLEERMLSAC SQQILLIQTG LREKQKQIEL  1920
MGKDVKLSSQ MGVFVTMNPG YAGRSNLPEN LKQLFRQMAM VKPDRELIAQ VMLFSQGFRT  1980
AEKLAGKIVS LFELCDNQLS SQPHYDFGLR ALKSVLNSAG NMKRQEMIDR KQEPVPQSEI  2040
EEFEQTILLR SVCDTVVPKL IKDDIKLLET LLQGVFPGSC IPEIKEEQLR KELALACQRK  2100
NLQSSKNFIE KVLQLYQIQR LQHGLMLVGP CGCGKSAAWR VLLEAMYKCD KVKGEFYIVD  2160
PKAISKDELY GRLDNTTLEW TDGVFTSILR KIISNQRQES TRRHWIIFDG DVDPEWAENL  2220
NSVLDDNKLL TLPNGERLAI PPNVRMIFEV ETLKYATLAT VSRCGMVWFS EETINDENIF  2280
YHFLERLKQD DYDQQKSEDD NNKQVNSQES ELRTKCVKAL ESIIKFLSQF LQIAQKPEYK  2340
HVMEFTRIRV LESTFALVRR SISNIIEYNE NNSEVPLEDD QINDFMVKQF LIAVMWGVAG  2400
SMNLYQRTQY SKEICQLLPH NVILPQFNDS APSLIDFEVT LPEAQWSQYK KKVPQIEIDP  2460
QRVTDADLII ETVDTLRHKD VLCGWLNEHR PFLLCGPPGS GKTMTLMSTL KALTDFEMIF  2520
INFSSSTMPQ LIIKQFDHYC EYKKTTNGVF LQPKNQKWLV VFCDEINLPD QDKYGTMAII  2580
TFLRQLTEQH GFWRSSDRQW ISLDRIQFVG ACNPPTDVGR KPLTPRFLRH CPLILVDFPG  2640
PESLKQIYGT FNKAMLRRTV NLKQYSEQLT NAMVEFYTKS QQHFTADQQA HYIYSPRELT  2700
RWKYALNEAL EPLESVEDLV RLWAHEGLRL FQDRLVHEHE KEWCNKLIDQ VAYNNFNNLK  2760
DEALQRPILF SNYLHKVYQS VDREELRKYI QGRLKQFNEE ELSVPLVVFD DVLDHILRID  2820
RVLKQPLGHL LLVGSSGVGK TTLTRFVSWI NNLTVFQIKA GRDYQLADFD NDLREVMKRA  2880
GAKGEKITFI FDESNVLGPS FLEKMNALLA SGEIPGLFEN DEYLALINLL KENSNQNKQF  2940
DSSEEQLFKN FTYQVQRNLH VVFTMNPKNP DFSNRTASSP ALFNRCVIDW FGDWTNEALF  3000
QVGKAFTMYI DPPENAFSKK IKDETQRQHI LVSTLVYIQN TIIELNNKLQ KGAKRFNYIT  3060
PRDYLDFLKH FEKLHNEKKS QLEDQQLHLN VGLDKLKETE QQVLEMQKSL DQKKVELLTK  3120
ERQAGEKLQT IIEEKKIAEK KKEDSTRLSS DAEKKAKEME VRQSQVNKEL NEALPALENA  3180
KQCVNSIKKD DLNQIRALGS PPALVKLTME AVVCAINSLE KSPEWKDVQK SMANMNFINN  3240
VINFNTETMP PKVKKFILTK YLSAQEWNID RINFASKAAG PLAMWLDSQL KYADILQKVD  3300
PLRQEVAKLL QESDELNTQK KIYDDEVAAA EAKIHNLQQE YSELISQKES IKSEMLKVQE  3360
KVTRSQALLS DLSGERVRWE EASQNFKSQL ATMIGDVLLS SAFLSYIGFF DHFYRKVVIN  3420
TWKDYLSGQA NISYRQDLSL IEFLSRPSDR LNWQSHTLPS DDLCMENAII LYRFQRYPLV  3480
IDPSGQALSY ISSLYKDKKL ARTSFTDESF LKTLETCLRF GCPLLVQDVE KVDPILNSVL  3540
NNETYKTGGR VLIRVGNQEI DFSQGFTMFM ITRDSTARFT PDLCSRVTFV NFTVTQSSLQ  3600
EQCLNIFLRN ESPETEEKRL NLMKLQGEYI VKLRELEDQL LDSLNNSRGS ILEDEKVIQT  3660
LEKLKKEAAV IVQEMKQADT IMNEVMNTTH SYVPLANTTS KIFFSLTSLA NIHYLYQFSL  3720
QFFMDTIYNV LNKNEQLQKI PKQDLIKRRI LIFNEMFKEI YKRMNFSLLQ EDKLVFAITL  3780
AQVKLGDNTL GQEFLNVFKP PTVMETTFSN TFLQGKLSIQ QLKQLEGITQ QNQTFNRLID  3840
NLNKNEDRWL NFLNDEAPEN DIPTQWYNEV QRDDIRQLDD LHILRIFRAD RFQIIARKLI  3900
NQILGEGFMD EQTVDMKLVV EKEASNKIPI LLCSAPGFDP SFKVEQLSRE MGIKLTSVAI  3960
GSAEGFDQAE QAITQSVKSG SWVMLKNVHL ATSWLNDLEK KLFRLTPNAN FRIFLTMEFN  4020
PKIPTTLIRQ SYKLVFEPPD GIKASLIRTF KTVLSQQRTD RQPVERARLH FLLAWLHAVI  4080
LERLRFTPIG WSKTYEFNEA DQRCSLDLID EYVDALGIRQ NIDPSKLPWD AFRTILTQNL  4140
YGGKVDNEYD QKILQSLVEQ FFTEQSFNHN HPLFFTLEGK EAITVPEGRT YLDFMQWIEQ  4200
LPKTESPEWS GLPSNVERVQ RDQLTQKLIT KVQNLQQEGE EEITQIEDNK KSDQVQWLQD  4260
LLEKVEKFKA ILPNKISPLE RTADSINDPL FRFLDREITV ASKLLKAVRQ NIEELIQLAQ  4320
GKILATNILR QLAKDVFNNI VPAQWNKYNV ITMPLNDWVG DFKRRIDQFD LLGKTKDFQK  4380
GQVWFGGLLF PEAYLTATRQ YVAQANKWSL EELELQMIPE DQGIDEDSFV IEGVSMEGGH  4440
LDSKTLQVRI VNEISVALKP ITLKWCKTSQ KGVVGDDEIV LPVYLNKTRK NLIFSLKVKM  4500
GKLNRYTLYQ KGLSFILFN                                               4519

Coding Sequence

Download: Fasta
ATGGAAGAAA GTGAGACANN NCTGAATGTG AAAGTACAAG AANNNGGCAA ATTGTACAGT    60
GCTAATGAGA TTGAAAATTT CAATNNNTAT TTAAGCGCAA TTTGCTTGAG TTTATTGATC   120
ATCGATAAGG ATNNNTGGAA TGTAGCATGT CACGAGGATG TGAATNNNNN NAATATTTGT   180
CAATTTTTAT CGGATTCCNN NATAAAGGCT CTGATAGTTA GTAAGACTGT GGAGAATGAG   240
AAATTCAATA TTNNNATTCG CTCAGAATAT GAGGCATCCA ACAATTATGC TCATACAATT   300
TGCTTTTTAA AGAGACACAC CTTTCAATAT GACAATNNNT TACAACCANN NNNNTTTAGT   360
AACCATGTTN NNGTGATAAA CGTTGGATAT GCAGAATCAC AAGGAGGAGC AAATCCCTTC   420
ACTTTGTCAC ACAATTACGT ANNNAATTGT TTCATTCCAA TCTTCACCCA ATACAAGGGC   480
GAGATTGACA AAAAGAGGAT TGTAGATCAA TCGAGTTATA ACGATTTGAT TAAGAAGCTG   540
AATGAAGTTA ATTTGGCATT CATTAAATGC AGANNNAATG TGGAAGTGCC TGAAATCATT   600
TTACAATTCG ATCCTCGTAT CAAGGAAGCT GTCAAANNNA GAGGAGGCAA GCCAACAATT   660
GAAGATGCAG CTCAATTGAA TAAGCCTGAC ATAGTTNNNT CAATATCTCA GACAGTGACA   720
AGATGGATTT CAGACATAAA TNNNATATCA AACACAAAGT TGGAGTTGAC TAATGCAAGT   780
ATAGTGGATG AAATAAACTA TTGGATGAGC ATGGAGAGGT CGTTGTTTTT TATTGAGAAT   840
CAATTGAAAC AACCTGAAGT TGATTTCACA ATCGAAGTGT TGACTCAAGC TAAGAAGATG   900
AACATCACAG CACAATTTAA GGAGATAGCC TTGAAGNNNA GCTTANNNAA ATGTCAATCA   960
TGCAACNNNT TCATGAAAGA ATTCCCAATC AACAATTTAT TGATAGCAAC AAATTTGGTG  1020
GAAATCAAAG ATGCTATGAT CNNNATATTC NNNCATATGA AGAAATTGTC AAACATTCAA  1080
GAGACATACA CCATTCCTAG ATCACTTCAA TTGGCAGAAT CCTTTTCAAG AGAATTGACA  1140
AATGAAATGA TCAAATATTT TAAGGGATTT NNNATACTCC ACATTAAGTA TGTTGACTTC  1200
AAGGGCTTAA TCATCAAGAC TCAGGAGATC TTCAGTCAAT GGGATGAGGA ATATAAGATA  1260
TTCAAANNNA GTATTGTCAA GAAATCTGTG CATCAGAAGG ATNNNTATGG TNNNTTTGAG  1320
CACATCAAAT TGCAAAAGCA GATCCAACAT ATTNNNAGAC TTAGGGAAAT GCATGAAAAT  1380
CTCAAAGAAG TTATTGAGNN NATCATTCAA AATGATNNNG AAGAGNNNAA GGAGAATGTC  1440
CAANNNTTTG CCACCTTANN NGAAATTCAA NNNGCATATG ACATTTTCAA AAATGTTGAA  1500
GTGTTTGATT TGAGTAGAGA TGGCGAAGAT NNNTTTTTCA GAGCATTGAA ANNNTACGAA  1560
ATTGCTATTG AATCCGTTGA AGCAACTATT ACTACAAATT TGAGAGATTC ACTTGGTTCA  1620
GCATCGTCTG CCAAAGAAAT GTTTAGAATA TTGGCTAAAT TCAATAAGTT GTTTTCAAGA  1680
CCAAGAATCA AGGGGGCCAT CCAAGAGTAT CAATCACAAT TGTTAAAGAC TGTCCACAAG  1740
GATATTCAAT CCTTACAGAA TAAGTTCAAG GAAACTTATN NNAAGAGTNN NAATTCGAGA  1800
TTGGCATCAG CAAGAGATAT TCCATTGACT TCAGGATTTG TTATCTGGTC CAAANNNTTA  1860
CAAATTAGAT TGNNNAAATA TATGCAAAAG GTAGAACAGA TCTTAGGTCC TNNNTGGGCA  1920
GAAGATACTG ATGGCAAGAA ATGTAAAGAA ATGGGTGAGA CATTTGAAAG GATTCTGGAT  1980
TCAGGACCAG CTTTGGAGGA TTGGAAACAA GAAATTAATC ATCATAATAA GGCAGTCAGT  2040
CAAAATGAAA AGTTATTTGA AGTAGTTACA AGAAGGAGAG GATTAGAAAT TAGAGTCAAT  2100
TATGAGAAAA AGCTAAGCNN NTTATTCAAA GAAGTGAGAA ATCTAAGTAA TATGAAAACA  2160
AAGGTCCCAT ATTCAATATC TCACATTGCT AATGATGCTA AGGCTTCTTA TCCATTTGCC  2220
TTATCTTTAC AAGAATCATT GCACACTTAC ATCNNNATAA CATCTCAATT GAATGCCAAA  2280
TCTGCCAAAT TGGTAGCTGC TTTAAGAAAG GAAGTGNNNC TANNNATTGG TNNNGGATTC  2340
AACTACTTGT GGACTCACAA AACCNNNTTA CAGCCATACG TGAAGAAATT CACTGACAAG  2400
GTCTTTGAAT TGGAGNNNGC TGTTAATGGA TTGAATGAGA GAATTGGTNN NATAGAATCA  2460
TTATGTGAAG CTATGAAGAC ATGTCCCGTA GATTCTTTGG CTGATAAATT AAAGGATATC  2520
CAAGAAGTTA TAGATTCACT ATGCTTCAAT AATTTCTCAA ATTTACACAT TTGGATCNNN  2580
GATATTGATA AANNNATTGA AAGCATTTTA TGTGATAGAG TGACTGTGCA AATGAAGGAA  2640
TGGTTAAATN NNTTCATCAA TTATCAGAAA ATTCAAGAAA GAGGACTTGT AAATNNNACA  2700
GTTGTGCATG AATTGAAACT ACAAGATNNN ATCATTTATG TGGACCCTCC TGTTGAGTAT  2760
GCAAAGTACT TCTGGTTCNN NGAGTTTCAT AAAATGATTG GANNNATTTG CAGCTTACCC  2820
AGGTTAGTTG CCAATAGATT CGACAATACA ATANNNNNNA ATACAGGCCC ATGGGGTACT  2880
NNNAGAGACT TAGACTATTC AACTACTATA AATAAGATAA ATNNNCAGTT GATAAAGGAT  2940
GCTTATTCAN NNATTGGGNN NTTATTGGAA GACATGGAAN NNTACGTGNN NACATGGTTA  3000
AATTATCAAT CACTATGGGA ATTAGATATT AAANNNGTAG AANNNATACT ANNNGATGAT  3060
ATTGAAAAGT GGNNNNNNAT GTTAACAGAT ATTAAANNNG GAAGAGCTAC ATTCGACAAT  3120
TCAACAACAG AGGAACATTT TGGAGCAATT ATAATTGATT ATAGAATGGT TNNNGTCAAG  3180
ATTAATCATA AGTATGATGC TTGGCATAAG GAGTTGTTAA ATCATTTTGG TAATAAGTTT  3240
GGAGAGNNNT TGAGAGTGTT TAATAAGAAT GTTACAACAG AAAAGGAGAA ACTACTCAAA  3300
ATCAATTTCN NNGATTTAAC ATCAGACATC ATTGAATCCA TTACAATAAT ANNNGAGNNN  3360
GATAAGAAAT TCCCAGGATG GTCTGCAGAT ATAGAATCCT TTAAGAATGG TCAAAAGGTT  3420
TTGGATAGAC AAAGATATCA ATATCCAGGA GATTGGCTGA GTTTTGAANN NGTAGAGATG  3480
CAGTGGAATC AATTTAAANN NATTCGTAGT AAGAAACTAC AATCACAAGA GAGTGAAATG  3540
AATAACATTC AGTCTAAGAT TNNNNNNGAT GAGAGATATT TGAATNNNNN NATACAAGAG  3600
ATTGAAGAAC AATGGAAGAC ATCTAAACCT GATTCTGGAG ATTGCTCTCC AAATGAGGCT  3660
GAANNNATAC TCAAAAGTTT GAATGAACAA CTGATATCAG TTNNNGAGAA GTATGAGAAG  3720
TGCAGTNNNG CTAAAGAGAT TTTGAAGATG GATCCACCTA CTCATNNNNN NAAGTTGAAT  3780
GTTTTGTTGG AATCAATTTC AGATCTCNNN GATGTTTGGN NNGAATTGGG CAAGATTTGG  3840
AAGGTGATGC AATCGATTAA GGAANNNTTG ATATCAGCTC TANNNAATAA GAAGATCAAG  3900
GATACATGTG ATGAGGCTNN NAAACAATTG AATGGAGTAT CTACTAAGAC AAGGAATTAC  3960
GATGCATTTG AAAAGATGAA GGAGAAGGTA AAGAACTATA TTAAGATGAA TAAACTCATT  4020
ATGGATTTGA AGGATGAATC AATGAAGGAG AGACATTGGA GACAATTATT GTCAAAATTG  4080
AAAATCAATG AATCTTTAAA TNNNTTGNNN ATGCAACATT TGTGGAATGC TAATCTTTTG  4140
AATTATGAGA ATCTAGCTAA GGACATTATG ACAGTTGCAA GAGGAGAACA GGTGTTAGAG  4200
ACAATGATTT CANNNGTGAA GGACTTTTGG AATTCATTTG AATTAGAATT AGTCAAATAT  4260
NNNACTAAAT GTAAATTGAT CAGAGGTTGG GATGAATTGT TTNNNAAGTT AGATGAGGAT  4320
CTTAATAACT TAGCCTCAAT GAAGATTTCT CCTTTCTATA AAACCTTTGA GGCAGAAATC  4380
TCTNNNTGGG ATGATAAATT GCAGAAAGTG AAATTGACCA TGGATATTTG GATTGATGTT  4440
CAGAGGAGAT GGGTTTATTT GGAAGGTATC TTCTTTGGTT CTTCTGATAT CAAAACANNN  4500
TTGNNNAATG AATACAATAA ATTCAAAGAT ATTGACAGTC AATTCACTAA TTTGATGAAG  4560
AAAGTGGCTC AAAAACCANN NTTAATGGAT GTCNNNGGAA TCCCCAATTT GGCTAAGACC  4620
TTAGAAAGAT TAAGTGATTT CCTANNNAAG ATCCAAAAGG CATTGGGTGA TTACTTGGAG  4680
ACTNNNAGAN NNGCATTCGC TAGATTTTAT TTTGTGGGAG ATGACGATCT ATTGGATATC  4740
ATTGGTAATT CAAAAGATGT GACAAATGTG NNNAGACATT TCCCTAAGAT GTATGCAGGA  4800
ATTGTGCAAT TGNNNAGTAG AAAGGATGGA AATGATGATG TTGTATTGGG AATGTCAAGT  4860
AAAGAAGGAG AGGTGGTTCC ATTTAGTAAG GAGGTGAAGA TTGCAGAGGA TCCCAGAATC  4920
AACATATGGT TGGGTAAGGT TGACAATGAG ATGATGAATT CATTGGCTCT GGATTTAGAG  4980
AAATCAGTCT TAGATATCCA AGCTAATCAA NNNAACAGGA TGAAAGTAAT TGAGGAACAT  5040
CCAGCTCAAA TCATTTTATT GGCCTTGNNN GTGGGATGGT GTTTTTCAGT TGAATCATCA  5100
TTCAATAATG AANNNNNNAT GAAACAAACA TTGNNNTATG TGTTGGAATT CTTGTCTGAA  5160
TTGGCAGAGA GTGTATTGAA GGATCATCCT AAANNNTTGA GANNNAAGTT TGAGNNNATT  5220
ATTACAGACT TTGTTCATNN NAGAGATGTG ATCAGATTGT TGATGAACAA TAAAATAAAC  5280
AGTAAGAATG ACTTCGGTTG GNNNTACCAC ATGAGATTCA ATTGGAATTC TAAGGAAGCA  5340
GATCCTGGAA AAAGGTTACT GATTNNNATG GGTAATGCAC AATTCCATTA TGGATTTGAA  5400
TATTTGGGAG TGGCAGAGAA GTTAGTCNNN ACTCCATTGA CTGATAAATG CTTTTTGACA  5460
TTGACACAAG CTTTGCATTT GAGAATGGGA GGATCTCCTT TTGGACCAGC TGGTACAGGT  5520
AAGACAGAGA GTGTTAAGGC ATTGGGTGCA NNNTTGGGTA GATTCGTTTT GGTGTTTAAT  5580
TGTGATGAGA CATTCGATTT TAATGCCATG GGTAGAATCT TCGTAGGATT GTGTNNNGTA  5640
GGAGCTTGGG GTTGTTTCGA TGAGTTCAAT AGATTGGAGG AACGTATGTT GTCTGCTTGT  5700
TCANNNCAAA TATTATTGAT TNNNACAGGA TTGAGAGAGA AGNNNAAGCA AATAGAATTG  5760
ATGGGCAAGG ATGTGAAATT GAGTTCANNN ATGGGAGTGT TTGTTACTAT GAATCCTGGA  5820
TATGCAGGGA GATCAAATCT ACCAGAGAAT TTGAAANNNT TGTTCAGANN NATGGCCATG  5880
GTTAAACCAG ATAGAGAATT GATTGCTNNN GTCATGTTAT TCAGTNNNGG ATTCAGAACA  5940
GCAGAAAAAT TAGCAGGAAA GATAGTTTCA CTATTTGAAT TATGTGACAA TCAATTATCA  6000
TCACAACCAC ATTACGATTT TGGTTTGAGA GCCTTGAAAT CAGTATTGAA CTCTGCTGGT  6060
AACATGAAGA GACAAGAAAT GATAGACAGA AAANNNGAAC CTGTTCCANN NTCTGAAATT  6120
GAAGAGTTCG AANNNACCAT TTTATTAAGG AGTGTTTGTG ATACTGTGGT ACCCAAATTA  6180
ATCAAAGACG ATATCAAATT GTTGGAAACA TTGTTANNNG GTGTATTTCC AGGATCTTGC  6240
ATTCCAGAAA TTAAGGAGGA ACAATTAAGA AAGGAATTAG CCTTGGCTTG TCAGAGAAAG  6300
AATCTACAAT CGAGTAAGAA CTTCATTGAA AAAGTGTTGN NNTTGTATNN NATTCAGAGA  6360
TTACAACACG GTTTGATGTT AGTTGGTCCA TGTGGTTGTG GCAAGAGTGC TGCCTGGAGA  6420
GTCCTATTGG AAGCGATGTA TAAATGTGAT AAAGTGAAAG GAGAATTTTA TATTGTTGAT  6480
CCAAAGGCCA TATCCAAAGA TGAATTATAT GGTAGATTAG ACAATACCAC TTTGGAATGG  6540
ACTGATGGAG TCTTCACTTC TATACTCAGG AAAATCATAT CTAATCAAAG ACAAGAGAGC  6600
ACCAGAAGAC ATTGGATTAT ATTTGATGGA GACGTTGATC CTGAATGGGC TGAAAATCTC  6660
AACTCAGTGC TTGATGACAA TAAACTATTA ACCTTACCTA ACGGTGAAAG ATTGGCTATT  6720
CCCCCTAATG TTAGAATGAT CTTTGAAGTC GAAACTCTTA AATATGCCAC CTTAGCTACT  6780
GTTTCCAGAT GTGGTATGGT TTGGTTTAGC GAAGAAACCA TCAATGATGA AAATATCTTC  6840
TATCACTTCT TGGAACGATT GAAGNNNGAT GATTATGATC AANNNAAATC TGAAGATGAC  6900
AACAACAAGN NNGTTAATTC TCAAGAAAGT GAACTAAGAA CTAAATGTGT TAAAGCCCTA  6960
GAATCAATCA TCAAGTTTTT ATCCNNNTTC CTCNNNATTG CCNNNAAACC AGAATATAAA  7020
CATGTCATGG AATTCACCAG AATCAGAGTC TTAGAAAGCA CTTTCGCTTT AGTTAGAAGA  7080
AGCATTTCAA ATATCATCGA ATACAATGAA AATAACTCTG AAGTTCCATT GGAAGATGAT  7140
CAAATTAATG ATTTCATGGT TAAACAATTT CTTATTGCTG TCATGTGGGG TGTCGCTGGT  7200
TCCATGAATC TTTATNNNAG AACACAATAC TCCAAGGAAA TCTGTNNNTT ATTACCACAC  7260
AATGTTATCC TACCANNNTT CAATGATAGC GCACCTTCAT TGATTGACTT TGAAGTCACT  7320
CTCCCAGAAG CTNNNTGGAG CCAATATAAG AAGAAGGTAC CTCAAATTGA AATTGATCCA  7380
CAAAGAGTCA CAGATGCTGA TCTCATCATC GAAACAGTTG ATACTCTCAG ACATAAAGAT  7440
GTTCTCTGTG GATGGCTTAA TGAACATAGA CCCTTCTTAT TATGTGGACC TCCTGGTAGT  7500
GGTAAGACCA TGACTTTAAT GAGCACACTA AAGGCCTTAA CTGATTTCGA AATGATCTTC  7560
ATCAATTTCA GTAGTTCCAC TATGCCTCAA TTAATCATTA AANNNTTTGA TCATTATTGC  7620
GAATATAAGA AAACTACAAA TGGAGTCTTC TTACAACCAA AGAATNNNAA ATGGCTCGTT  7680
GTATTTTGTG ATGAAATCAA TCTACCTGAT CAAGATAAAT ATGGCACCAT GGCTATTATC  7740
ACTTTCTTAC GTNNNTTAAC AGAANNNCAC GGATTTTGGA GATCTTCAGA TAGACAATGG  7800
ATTTCCCTTG ATAGAATCNN NTTCGTAGGA GCATGTAATC CTCCTACTGA TGTGGGTAGA  7860
AAACCACTTA CTCCAAGATT TCTTAGACAT TGTCCTTTAA TCCTCGTCGA TTTCCCAGGA  7920
CCTGAATCAC TCAAANNNAT TTATGGCACT TTCAATAAGG CAATGCTTAG AAGAACAGTT  7980
AACCTTAAAC AATATTCTGA ANNNCTCACT AATGCTATGG TTGAATTTTA CACCAAATCA  8040
NNNNNNCATT TCACAGCTGA TNNNNNNGCT CATTATATTT ATTCACCCAG AGAATTAACT  8100
AGATGGAAGT ATGCTTTGAA TGAAGCCTTA GAACCACTTG AATCTGTTGA AGATTTAGTT  8160
AGACTATGGG CACATGAAGG TTTAAGATTA TTCCAAGATA GATTGGTACA TGAACATGAA  8220
AAAGAGTGGT GCAATAAACT TATTGATCAA GTTGCTTATA ATAATTTCAA CAATTTGAAA  8280
GATGAAGCTT TACAAAGACC AATTCTATTC AGTAACTATC TCCATAAGGT TTATCAAAGT  8340
GTTGATAGAG AAGAGTTGAG AAAGTACATC CAAGGAAGAC TCAAANNNTT TAATGAAGAA  8400
GAATTATCTG TTCCCTTGGT TGTTTTTGAT GATGTACTTG ATCACATATT GAGAATCGAT  8460
AGAGTTCTGA AANNNCCTTT AGGACATCTA CTATTGGTTG GATCTTCTGG AGTTGGTAAA  8520
ACAACACTCA CAAGATTTGT AAGTTGGATT AATAATCTTA CAGTCTTCNN NATTAAGGCA  8580
GGTAGAGATT ATNNNTTAGC TGACTTTGAT AATGATCTGA GGGAGGTAAT GAAGAGGGCT  8640
GGTGCTAAAG GCGAGAAGAT TACATTCATT TTTGATGAAT CAAATGTATT AGGTCCATCT  8700
TTCTTAGAGA AGATGAATGC ATTGTTGGCA TCAGGTGAAA TTCCAGGATT GTTTGAAAAT  8760
GACGAATATT TGGCATTGAT TAACTTGCTT AAAGAAAACT CAAATCAAAA CAAANNNTTT  8820
GATTCTTCAG AAGAANNNTT ATTCAAGAAT TTCACTTACN NNGTGNNNAG AAATTTACAC  8880
GTTGTATTTA CAATGAATCC TAAGAATCCA GATTTTTCAA ATAGAACAGC TTCTTCCCCT  8940
GCATTGTTTA ATAGATGTGT TATAGATTGG TTTGGAGATT GGACCAATGA AGCTCTCTTT  9000
NNNGTTGGTA AAGCATTTAC TATGTACATT GATCCTCCTG AAAATGCATT CAGCAAAAAG  9060
ATTAAAGATG AAACTNNNAG ANNNCATATT TTAGTATCCA CTTTAGTCTA TATCNNNAAT  9120
ACTATCATTG AATTGAATAA CAAATTACAG AAGGGAGCTA AGAGATTCAA TTATATAACT  9180
CCAAGAGACT ATTTAGATTT CTTGAAGCAT TTCGAGAAAT TGCATAATGA AAAGAAATCG  9240
CAATTGGAAG ATNNNNNNTT GCATCTGAAT GTTGGTTTGG ATAAGTTGAA GGAGACAGAA  9300
CAACAAGTAT TGGAGATGCA GAAAAGCTTA GATCAGAAGA AGGTTGAGTT ATTGACAAAG  9360
GAGAGACAGG CAGGTGAGAA GTTGCAGACA ATTATTGAAG AGAAGAAGAT AGCTGAGAAG  9420
AAGAAGGAAG ACAGTACAAG ATTATCAAGT GATGCTGAGA AGAAGGCTAA GGAGATGGAA  9480
GTAAGGNNNT CTNNNGTCAA TAAGGAATTG AATGAAGCAT TGCCTGCTTT GGAAAATGCA  9540
AAACAATGTG TGAATAGCAT TAAGAAGGAT GATTTGAATN NNATCAGAGC ATTGGGATCA  9600
CCTCCTGCCT TGGTTAAGTT GACAATGGAA GCCGTTGTTT GTGCAATTAA TTCATTGGAA  9660
AAGAGTCCAG AATGGAAGGA CGTANNNAAA TCAATGGCCA ATATGAATTT CATCAATAAT  9720
GTTATTAACT TCAATACAGA AACAATGCCA CCCAAAGTGA AGAAATTCAT TTTAACAAAA  9780
TATTTATCAG CCCAAGAGTG GAATATTGAC AGAATTAACT TTGCATCAAA AGCAGCAGGT  9840
CCATTGGCTA TGTGGTTGGA TTCACAATTG AAATATGCCG ATATTTTANN NAAAGTGGAT  9900
CCACTTAGAN NNGAAGTGGC TAAATTATTG NNNGAGAGTG ACGAACTCAA CACTNNNAAG  9960
AAGATCTATG ATGATGAAGT AGCAGCTGCT GAGGCTAAGA TTCACAACTT ACAANNNGAG 10020
TATTCGGAAT TGATTAGTNN NAAGGAATCA ATTAAGTCAG AGATGTTGAA AGTGNNNGAG 10080
AAAGTAACAA GGTCCNNNGC ATTATTGAGT GATTTGAGTG GAGAAAGAGT ACGTTGGGAA 10140
GAGGCATCAN NNAACTTTAA AAGTNNNTTA GCTACAATGA TAGGAGATGT GTTATTGTCA 10200
TCAGCATTCC TGTCTTATAT TGGGTTCTTT GATCATTTTT ACAGGAAGGT GGTAATAAAC 10260
ACTTGGAAAG ATTATCTCTC GGGCNNNGCG AATATCTCTT ACAGACAAGA TCTTTCATTG 10320
ATTGAATTCT TATCTAGACC TTCAGACAGA TTGAATTGGC AATCACATAC ATTGCCTTCG 10380
GATGATTTGT GCATGGAGAA TGCCATCATA TTATATAGAT TCNNNAGATA TCCTCTAGTT 10440
ATTGATCCAT CTGGACAGGC ATTGTCTTAT ATTTCCTCAT TGTACAAAGA TAAGAAATTA 10500
GCAAGGACAT CATTCACAGA TGAATCATTC TTGAAGACCT TGGAAACATG CCTAAGATTT 10560
GGTTGTCCAT TGTTGGTACA GGATGTGGAA AAGGTGGATC CAATTTTGAA TTCAGTTTTG 10620
AACAATGAAA CATATAAAAC AGGAGGAAGA GTGTTAATAA GAGTAGGCAA TNNNGAAATC 10680
GATTTTTCAC AAGGCTTTAC TATGTTCATG ATCACCAGAG ATAGCACAGC TAGATTCACT 10740
CCTGATTTAT GCAGCAGAGT TACATTTGTC AATTTTACAG TCACTCAAAG CAGTTTGNNN 10800
GAACAATGTT TGAATATCTT CTTGAGAAAT GAATCCCCAG AAACTGAGGA GAAGAGATTA 10860
AATTTAATGA AATTGCAAGG AGAATATATA GTTAAGTTGA GAGAATTGGA AGATCAATTA 10920
TTGGATTCAT TGAATAATAG TAGAGGATCC ATATTGGAAG ACGAGAAGGT TATCNNNACA 10980
TTAGAGAAGT TGAAGAAGGA GGCTGCTGTT ATTGTTNNNG AAATGAAANN NGCTGACACT 11040
ATTATGAATG AGGTCATGAA TACAACACAT TCATATGTTC CTTTGGCTAA TACTACCTCT 11100
AAGATCTTCT TCAGTTTGAC ATCTTTGGCC AATATTCATT ATTTGTACNN NTTTTCATTG 11160
NNNTTCTTCA TGGATACCAT TTATAATGTG TTAAATAAGA ATGAANNNTT GCAGAAGATT 11220
CCCAAACAAG ATTTGATAAA GAGAAGAATT CTAATTTTCA ATGAAATGTT CAAAGAAATA 11280
TACAAACGAA TGAATTTCTC TTTGTTANNN GAGGATAAAT TGGTGTTTGC AATTACCTTG 11340
GCCNNNGTGA AATTGGGAGA CAACACCTTG GGANNNGAAT TCTTGAATGT ATTCAAACCA 11400
CCAACAGTGA TGGAGACTAC ATTTTCAAAT ACATTCTTGN NNGGGAAGTT GAGTATCCAA 11460
NNNTTGAAAN NNTTAGAAGG AATAACTCAA NNNAATNNNA CTTTCAATAG ACTCATAGAC 11520
AACTTAAATA AGAATGAAGA TCGTTGGCTC AACTTTTTAA ATGATGAAGC TCCAGAAAAT 11580
GACATTCCAA CANNNTGGTA TAATGAAGTT NNNAGAGATG ACATTAGANN NTTGGATGAT 11640
TTACACATCC TAAGAATATT CAGAGCTGAT AGATTCNNNA TAATAGCTAG AAAATTAATC 11700
AATCAAATTT TGGGAGAGGG TTTCATGGAT GAACAAACAG TGGATATGAA ATTGGTAGTT 11760
GAGAAGGAAG CATCGAACAA GATTCCTATT CTTTTATGTT CAGCCCCAGG ATTTGATCCA 11820
TCCTTTAAAG TCGAANNNCT CAGTAGAGAA ATGGGAATCA AATTGACTAG CGTGGCTATA 11880
GGAAGTGCTG AAGGTTTTGA TNNNGCCGAA NNNGCAATTA CTCAAAGTGT CAAATCCGGA 11940
TCTTGGGTGA TGTTGAAGAA TGTTCATTTA GCTACCAGTT GGCTTAATGA TTTAGAGAAG 12000
AAATTATTTA GATTGACTCC AAATGCCAAT TTCAGAATAT TCTTAACCAT GGAATTCAAT 12060
CCAAAGATAC CAACTACTTT GATTAGANNN TCCTATAAAT TGGTCTTCGA ACCCCCTGAT 12120
GGTATCAAAG CATCTCTGAT TAGAACATTC AAAACTGTCT TGTCTCAACA AAGAACAGAC 12180
AGANNNCCAG TGGAAAGAGC AAGATTACAT TTCCTGTTAG CTTGGTTACA TGCTGTTATT 12240
CTGGAGAGAT TGAGATTCAC TCCTATTGGA TGGAGCAAAA CCTATGAATT TAATGAGGCA 12300
GATNNNAGAT GCTCATTGGA TCTCATTGAT GAATATGTAG ATGCCTTAGG TATAAGANNN 12360
AATATAGATC CATCTAAATT ACCATGGGAT GCCTTTAGAA CTATATTGAC ANNNAATTTA 12420
TATGGTGGTA AAGTAGATAA TGAATATGAC CAAAAGATTC TTCAATCCTT GGTTGAACAA 12480
TTCTTCACTG AACAAAGTTT CAATCATAAT CATCCTCTAT TCTTCACCTT AGAAGGCAAA 12540
GAAGCCATCA CAGTTCCCGA AGGCAGAACT TATCTAGATT TCATGNNNTG GATTGAACAA 12600
TTACCAAAGA CTGAAAGTCC AGAGTGGTCA GGTTTACCTT CCAACGTAGA AAGAGTCCAA 12660
AGAGATCAAT TAACTCAAAA GTTGATCACC AAAGTCNNNA ATCTCNNNCA AGAAGGCGAA 12720
GAAGAAATCA CTNNNATTGA AGATAACAAA AAGTCTGACC AAGTTNNNTG GCTANNNGAT 12780
CTTCTAGAGA AGGTTGAGAA ATTCAAAGCC ATACTCCCCA ATAAAATCTC TCCCTTGGAA 12840
AGAACAGCCG ATTCTATCAA TGACCCCTTA TTTAGATTCT TAGATAGAGA AATTACTGTG 12900
GCCTCTAAAT TATTAAAAGC TGTAAGANNN AATATTGAGG AATTAATTNN NTTGGCTNNN 12960
GGAAAAATCT TAGCCACAAA CATACTAAGA NNNTTGGCTA AGGATGTTTT CAACAATATT 13020
GTCCCAGCAN NNTGGAATAA ATACAATGTA ATCACTATGC CTTTGAATGA TTGGGTTGGT 13080
GACTTTAAGA GAAGAATTGA TNNNTTCGAT CTCTTAGGAA AAACTAAAGA CTTCNNNAAA 13140
GGCCAAGTCT GGTTTGGTGG ATTGCTATTT CCTGAAGCAT ATCTTACAGC CACCAGANNN 13200
TACGTGGCCC AAGCCAACAA ATGGTCATTG GAAGAATTAG AACTANNNAT GATACCAGAA 13260
GATCAAGGAA TTGATGAGGA TTCCTTTGTT ATTGAAGGAG TATCAATGGA AGGAGGACAT 13320
TTAGACTCCA AGACTTTACA AGTAAGAATT GTCAATGAAA TCTCAGTCGC ATTAAAACCA 13380
ATAACCTTAA AATGGTGCAA AACTTCCCAA AAAGGAGTTG TAGGAGATGA TGAAATTGTT 13440
TTACCTGTTT ATTTGAACAA AACAAGGAAG AATCTTATAT TCTCCCTTAA AGTCAAGATG 13500
GGTAAACTTA ATAGATACAC ACTCTATNNN AAAGGTTTGT CATTTATCCT CTTTAATTGA 13560