Entry ENTNP03434 (K2G686)

E Entamoeba nuttalli (strain P19)


General Information

Description
CPSF A subunit region protein, putative
Organism
ENTNP - Entamoeba nuttalli (strain P19) (Taxon-ID: 1076696)
Locus
asmbl_8251join(complement(4903..5351), complement(4709..4834), complement(3920..4647), complement(1770..2977))
Number of exons
4

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MTWLLHETVV NPQGVEHSII AKFTGKYEQL ITSIRNQFVC YDITPQGLQV RCKEVTEANI    60
IQIGNVEVNH VMMLVLLFKE AKVSVLRYDE TNNKFVIHSL HCFELPLKRM QEGLTPTTYT   120
DPRLLIDKRG RCISLICYDR LMWVIPLGLD KTSYSINLEK FGINRIIDCI VLDGYDLPSV   180
AFLHMKIPTW EGRIVNTGET TNEIIILSLE PDVIHERQDI VATISYQFSY VPYNALQIVD   240
CYPTNGLLIL TVNSIIYLST TSFESFILPF GKFFVIPKNI NGPLSSFQIL QMQTKIMNSV   300
KSIFKLTNHL YIIFSMNGES YYVHLLSIAN RICDVIITNS PYKYHPTTFT ISSNHLFIGS   360
TVHDSYIYNY EIVEYGKGKQ HEISQHINQE IRSKNLLFRH NEMEEEYPFD EPQPVSIPQE   420
LTVEFPQLDY IVSILSYKNK LFVTVSFTTV LDTPDCPPQL PHSSNNQQEE PPVLEAQPLN   480
SDVLLERYRI INLTTNDYYE FKKGQFICTA KLLKFKVGKQ LKNYLVVGVN KQTTEDNPVK   540
GKTYIFNIEN QIQLINKIGD GKKSVHAVNE IGGFLAVASG NELELIERVD ETRWIKKCFS   600
DISILINSIE YLPLKVMERG NEKECYLILL SDFYRSVVLL LFKPYDYTVI PLGKDARNIH   660
CIDSTFIITK DYFSVLEFDS EQNLSLLNYS SAATEQLSIF EIDATFNLGM NLLKFTRLWN   720
GKGYIYMYVT VEGSVGYISV VEEKIYQVLR QINIKMNREP WHFAGTNAEE YRFEKGYGMG   780
FGTRKHVFLD GDMLKQFRLL NEEQQKRVCL RNTSINDVFK LLSTGLQYNT FLQYNN       836

Coding Sequence

Download: Fasta
ATGACTTGGT TGCTTCATGA AACAGTTGTC AATCCACAAG GAGTTGAACA TTCTATAATT    60
GCTAAATTTA CTGGTAAATA TGAACAATTA ATCACAAGTA TTCGTAATCA ATTTGTGTGT   120
TATGATATTA CTCCTCAAGG ACTTCAAGTT CGTTGTAAAG AAGTTACTGA AGCAAATATT   180
ATTCAAATAG GAAATGTAGA AGTGAACCAT GTAATGATGC TAGTTTTATT ATTTAAAGAA   240
GCTAAAGTGT CAGTACTAAG ATATGATGAA ACAAATAATA AATTTGTTAT TCATTCTTTA   300
CATTGTTTTG AACTTCCTCT TAAAAGAATG CAAGAAGGAT TAACGCCAAC AACATATACT   360
GATCCAAGGC TCTTAATTGA TAAACGAGGG AGATGTATTA GTTTAATTTG TTATGATAGA   420
TTAATGTGGG TTATTCCATT AGGACTTGAT AAAACATCAT ATTCTATTAA CTTAGAGAAA   480
TTTGGGATTA ATAGAATTAT TGACTGTATT GTTTTAGATG GGTATGATTT ACCGTCAGTA   540
GCTTTTCTTC ATATGAAAAT ACCAACATGG GAAGGACGAA TAGTAAATAC AGGAGAAACA   600
ACTAATGAGA TTATTATATT ATCATTAGAA CCAGATGTCA TTCATGAAAG GCAAGATATT   660
GTTGCAACAA TATCATATCA ATTTTCATAT GTTCCTTATA ATGCATTACA AATTGTAGAT   720
TGTTATCCAA CAAATGGACT TCTTATATTA ACAGTAAATA GTATTATTTA TTTATCAACA   780
ACATCGTTTG AGTCATTCAT TCTTCCTTTT GGAAAGTTTT TTGTAATACC AAAAAATATA   840
AATGGACCAT TATCTTCATT CCAAATTCTT CAAATGCAAA CAAAAATTAT GAACTCAGTT   900
AAATCAATAT TTAAATTAAC AAATCATCTT TATATTATAT TTTCTATGAA TGGAGAAAGT   960
TATTATGTTC ATTTACTTTC TATTGCAAAT AGAATTTGTG ATGTAATTAT TACTAATTCT  1020
CCATATAAAT ATCATCCAAC TACTTTTACT ATTTCTTCAA ATCATTTATT TATTGGAAGT  1080
ACAGTTCATG ATAGTTATAT TTATAATTAT GAAATTGTTG AATACGGAAA AGGTAAACAA  1140
CATGAAATAT CTCAACATAT TAACCAAGAA ATTCGTTCAA AAAATTTATT ATTTAGACAT  1200
AATGAAATGG AAGAAGAATA TCCTTTCGAT GAACCACAAC CAGTTTCTAT TCCACAAGAA  1260
TTAACAGTTG AATTCCCACA ATTAGATTAT ATTGTTTCTA TTTTATCATA TAAAAATAAA  1320
TTATTTGTTA CTGTATCATT TACTACTGTA TTAGATACAC CTGATTGTCC ACCTCAATTA  1380
CCTCATTCAA GTAATAATCA ACAAGAAGAA CCACCAGTAT TAGAAGCACA ACCATTAAAT  1440
TCAGATGTAT TATTAGAACG TTATAGAATA ATAAATTTAA CAACCAATGA TTATTATGAA  1500
TTTAAAAAAG GTCAATTCAT TTGTACAGCA AAATTATTAA AATTTAAAGT TGGAAAACAA  1560
CTTAAAAATT ATTTAGTTGT TGGAGTTAAT AAACAAACAA CAGAAGATAA TCCTGTTAAA  1620
GGAAAAACAT ATATTTTTAA TATAGAAAAT CAAATTCAAT TAATTAATAA AATTGGAGAT  1680
GGAAAAAAGT CAGTTCATGC CGTTAATGAA ATAGGTGGAT TTCTTGCCGT AGCATCAGGA  1740
AATGAATTAG AATTAATTGA ACGTGTAGAT GAAACAAGAT GGATAAAGAA ATGTTTTAGT  1800
GATATTTCTA TATTAATTAA TTCAATTGAA TATCTTCCAT TAAAGGTAAT GGAAAGAGGA  1860
AATGAAAAAG AATGTTATTT AATATTATTA AGTGATTTTT ATAGATCAGT AGTATTATTA  1920
TTATTTAAAC CATATGACTA TACTGTTATA CCCCTTGGTA AAGATGCAAG AAATATTCAT  1980
TGCATTGATT CAACTTTTAT AATAACAAAA GATTATTTCT CAGTACTTGA ATTTGATTCT  2040
GAACAAAATT TAAGTTTATT AAATTATAGT TCAGCTGCAA CAGAACAATT GTCAATATTT  2100
GAAATTGATG CTACATTTAA TCTTGGAATG AATTTATTAA AATTTACAAG ACTATGGAAT  2160
GGAAAAGGTT ATATTTATAT GTATGTAACT GTTGAAGGAA GTGTTGGTTA TATTTCAGTT  2220
GTTGAAGAAA AAATATATCA AGTATTAAGA CAAATTAATA TAAAAATGAA TAGAGAACCA  2280
TGGCATTTTG CAGGAACAAA TGCAGAAGAG TATAGATTTG AAAAAGGATA TGGAATGGGA  2340
TTTGGAACAA GAAAGCATGT ATTTTTAGAT GGAGATATGT TAAAACAATT TAGATTATTA  2400
AATGAAGAAC AACAAAAAAG AGTCTGTTTA AGAAATACAT CAATTAATGA TGTATTTAAA  2460
TTACTTTCAA CAGGACTACA ATACAATACA TTTTTACAAT ATAATAATTG A           2511