Entry CAEEL07490 (SET2_CAEEL)

E Caenorhabditis elegans


General Information

Description
Probable histone-lysine N-methyltransferase set-2 [Source:UniProtKB/Swiss-Prot;Acc:Q18221]
Organism
CAEEL - Caenorhabditis elegans (Taxon-ID: 6239)
Locus
IIIjoin(complement(4930072..4930131), complement(4929875..4930012), complement(4929582..4929823), complement(4929359..4929531), complement(4928933..4929181), complement(4928673..4928861), complement(4927689..4928077), complement(4926800..4927603), complement(4926497..4926745), complement(4925687..4925746), complement(4924233..4925240), complement(4923942..4924169), complement(4923583..4923894), complement(4923425..4923535), complement(4923211..4923375), complement(4923002..4923157))
Number of exons
16

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MSTHDMNHHP PRKSHSKRDK PSSSNSGPKI ENHKCKWAWQ KVFETGKSFL RRDGFPQDCK    60
SKEDFERIKR TGVRKTSENM LEDPRKNFES LQQSSVYQTN SFRNPRYLCR AHLRVDSYYC   120
TIPPKREVSL FNMDDNCTEV LLRDFAKDCG KVEKAYVCIH PETKRHMKMA YVKFATVKEA   180
HNFYSMYHAQ NLLATKCTPR IDPFLSILNE EYEVATNGQV LPILPDDLAS IDPSVLRDLR   240
ANFLRDQNEK YELAMRNTYE DEGGMLSGVI MDTSDHYERD YTMDHDVGPS SMKMSPIPPP   300
PIKEESPPPP PPPPVASVSN LAPVPSVQLP YYNNIQPSSS TMHMPEFRPT EPPPSYSRED   360
PYRSTSRSSL SRHRNRSRSP SDGMDRSGRS SSRRTHRRPE SRNGSKNANG DVVKYETYKM   420
EKRKIKYEGG NKKYEQVHIK ERTAVIRGKN QLENVSSESA SGSSSVDTYP DFSDEERKKK   480
KRPKSPNRSK KDSRAFGWDS TDESDEDTRR RRSGRSQNRS SERKFQTTSS SSTRRELSST   540
HTNSVPNLKS HETPPPPPPK GHPSVHLQTP YQHVQPQMIP ATYYNLPPQH MAPPPITTSL   600
PPFCDFSQPP PGFTPTFKPI TNAPLPTPYQ ASNIPQPGLV QIAALSAAPE PFSSIPGPPP   660
GPAPIQEDVG RAESPEKPSL SERFSGIFGP TQREEPAQVE VEYDYPLKHS ESHDDRHSLE   720
DMDVEVSSDG ETVSNVEKIE CMEEKKRQDL ERIAIARTPI VKKCKKRMMD ELSRKVAEDI   780
RQQIMRQCFA ALDEKLHLKA IADEEKRKKE REEKARQEAE KPSNHLIADM MPSQTLYNNQ   840
SFASSSRGFY RKQKPIPKSH PKHQEHHHHA KASVSTPVHS SSTSRNSSVA PTPQRTVSTS   900
SSSSSAATSA RVSEDESDSD STPGEVQRRK TSVLSNDKRR RRASFSSTSI QSSPERQRDV   960
SSSSRTSSSS STSSMKQEET ADEKSRKRKL IMSSDESSTT GSTATSVVSS RQSSLEPQQE  1020
KTDGEPPKKK SQTDFISERV SKIEGEERPL PEPVETSGPI IGDSSYLPYK IVHWEKAGII  1080
EMNLPANSIR AHEYHPFTTE HCYFGIDDPR QPKIQIFDHS PCKSEPGSEP LKITPAPWGP  1140
IDNVAETGPL IYMDVVTAPK TVQKKQKPRK QVFEKDPYEY YEPPPTKRPA PPPRFKKTFK  1200
PRSEEEKKKI IGDCEDLPDL EDQWYLRAAL NEMQSEVKSA DELPWKKMLT FKEMLRSEDP  1260
LLRLNPIRSK KGLPDAFYED EELDGVIPVA AGCSRARPYE KMTMKQKRSL VRRPDNESHP  1320
TAIFSERDET AIRHQHLASK DMRLLQRRLL TSLGDANNDF FKINQLKFRK KMIKFARSRI  1380
HGWGLYAMES IAPDEMIVEY IGQTIRSLVA EEREKAYERR GIGSSYLFRI DLHHVIDATK  1440
RGNFARFINH SCQPNCYAKV LTIEGEKRIV IYSRTIIKKG EEITYDYKFP IEDDKIDCLC  1500
GAKTCRGYLN                                                         1510

Coding Sequence

Download: Fasta
ATGTCCACAC ATGATATGAA CCATCATCCT CCGCGGAAAT CGCATTCAAA GCGAGATAAG    60
CCCAGTTCAA GCAACAGCGG CCCAAAAATT GAAAATCATA AGTGCAAATG GGCTTGGCAG   120
AAGGTTTTCG AAACAGGAAA GTCGTTTTTG CGGCGTGATG GTTTTCCACA GGATTGCAAA   180
TCGAAAGAGG ATTTTGAGCG AATAAAACGC ACTGGCGTGA GAAAAACAAG TGAGAATATG   240
CTTGAAGATC CTAGAAAGAA TTTTGAAAGT CTTCAACAAT CATCGGTATA TCAAACCAAC   300
TCCTTCCGGA ATCCACGATA CTTATGTCGA GCCCATCTTC GCGTAGATTC GTATTATTGT   360
ACAATTCCAC CAAAACGAGA AGTTTCTTTG TTTAATATGG ACGACAATTG TACAGAAGTT   420
TTGCTCCGGG ATTTTGCAAA GGATTGTGGA AAGGTTGAGA AAGCTTACGT GTGTATTCAT   480
CCCGAAACAA AGCGGCATAT GAAAATGGCA TATGTTAAGT TTGCGACCGT CAAGGAGGCT   540
CACAATTTTT ACAGCATGTA TCATGCTCAA AATTTACTTG CCACAAAATG TACACCTCGT   600
ATTGATCCTT TCCTGTCAAT TCTCAATGAA GAATACGAAG TTGCAACTAA TGGACAAGTC   660
CTCCCAATTC TTCCTGACGA TCTTGCCTCA ATCGATCCGA GTGTTCTTCG AGATCTTCGA   720
GCCAATTTCC TGCGAGATCA GAATGAGAAA TATGAGCTAG CTATGAGAAA TACGTATGAA   780
GATGAAGGAG GAATGTTATC TGGTGTTATA ATGGATACTT CTGATCATTA TGAGCGTGAT   840
TATACAATGG ATCATGATGT TGGACCTTCT TCAATGAAAA TGTCTCCTAT ACCACCACCT   900
CCGATCAAAG AAGAATCACC TCCACCACCG CCACCTCCTC CAGTGGCTTC CGTTTCGAAT   960
CTTGCTCCAG TTCCATCAGT ACAGCTTCCG TATTACAACA ACATTCAGCC AAGTTCAAGT  1020
ACTATGCACA TGCCAGAGTT TCGACCAACT GAACCTCCAC CATCGTATTC CCGTGAGGAT  1080
CCTTACCGAA GCACAAGTCG AAGCTCGCTT TCTCGCCACA GAAATAGATC GAGAAGTCCA  1140
TCCGATGGTA TGGATCGTTC TGGACGTAGT TCCAGCCGAC GGACTCATCG AAGGCCTGAA  1200
TCAAGAAATG GATCGAAAAA TGCAAATGGG GACGTTGTGA AGTACGAAAC TTATAAGATG  1260
GAAAAACGGA AAATTAAGTA CGAGGGAGGA AATAAGAAAT ATGAGCAAGT TCATATAAAG  1320
GAACGGACTG CTGTGATACG AGGAAAAAAT CAGTTGGAGA ATGTTTCTTC CGAATCTGCA  1380
TCGGGAAGCT CTTCTGTCGA TACTTATCCA GATTTTTCAG ATGAAGAAAG AAAGAAGAAG  1440
AAACGTCCAA AATCTCCTAA TCGATCGAAA AAAGATTCCC GAGCTTTTGG ATGGGATTCA  1500
ACGGACGAAT CTGACGAGGA TACTCGAAGA CGACGATCCG GAAGATCTCA GAATCGAAGC  1560
TCGGAACGAA AATTTCAAAC TACTTCCTCC TCCTCAACAC GGCGTGAACT TTCAAGCACT  1620
CACACAAATA GTGTCCCAAA TTTGAAATCT CATGAAACTC CGCCGCCACC ACCACCAAAA  1680
GGTCATCCAT CAGTGCATCT TCAAACACCA TATCAACACG TACAACCGCA AATGATCCCT  1740
GCTACTTATT ATAATTTGCC CCCACAGCAT ATGGCTCCAC CACCGATAAC CACGAGTTTA  1800
CCACCGTTTT GCGATTTTTC TCAACCACCA CCAGGATTTA CCCCCACTTT CAAGCCCATC  1860
ACAAATGCTC CTTTGCCAAC TCCCTATCAA GCATCCAACA TTCCACAGCC GGGATTGGTT  1920
CAAATAGCAG CACTATCAGC AGCTCCAGAA CCTTTCAGTT CAATACCAGG ACCACCACCA  1980
GGGCCTGCGC CTATTCAAGA AGATGTTGGT AGAGCAGAGT CTCCTGAGAA ACCGTCTCTT  2040
TCTGAGAGAT TTTCTGGGAT TTTTGGACCA ACCCAACGCG AAGAACCTGC ACAAGTCGAA  2100
GTTGAATATG ACTATCCCCT CAAACACTCG GAATCTCATG ATGATCGTCA TTCGTTAGAA  2160
GATATGGATG TCGAGGTTTC CAGTGATGGA GAAACTGTAT CCAATGTTGA GAAAATCGAG  2220
TGTATGGAGG AAAAGAAACG ACAGGATCTC GAACGTATTG CAATTGCTCG AACACCGATT  2280
GTGAAAAAGT GCAAGAAGAG AATGATGGAT GAGCTCAGCA GAAAAGTGGC TGAAGATATT  2340
CGCCAGCAAA TTATGCGACA GTGCTTTGCT GCATTGGACG AAAAACTTCA TTTGAAAGCT  2400
ATTGCTGACG AAGAAAAACG GAAAAAGGAA CGAGAGGAGA AAGCAAGGCA AGAAGCTGAA  2460
AAACCGAGCA ATCATTTGAT TGCTGATATG ATGCCTTCTC AGACTCTCTA CAACAATCAA  2520
TCGTTTGCCA GCTCGTCACG TGGATTCTAC AGAAAACAAA AGCCGATTCC AAAAAGCCAT  2580
CCCAAACACC AAGAGCACCA TCATCATGCG AAGGCATCAG TTTCGACCCC AGTCCATTCG  2640
TCAAGCACGT CAAGGAATTC ATCCGTAGCT CCAACACCTC AACGTACAGT TTCCACGTCG  2700
TCATCGTCTT CATCGGCCGC AACATCAGCT CGAGTATCAG AAGATGAATC GGATTCCGAC  2760
AGCACGCCGG GTGAAGTTCA AAGACGCAAA ACATCAGTTT TGAGCAATGA CAAGCGACGA  2820
AGACGTGCTT CGTTTAGTTC AACAAGTATT CAAAGTAGTC CAGAAAGGCA AAGAGATGTC  2880
TCATCGAGCT CTCGAACTTC TTCATCTTCG TCTACAAGCT CAATGAAGCA GGAAGAAACT  2940
GCTGATGAAA AGTCACGAAA ACGCAAGTTG ATTATGTCAA GCGATGAGTC GTCTACAACA  3000
GGATCAACTG CAACATCAGT TGTGTCAAGT AGGCAAAGTA GTTTGGAGCC TCAACAAGAG  3060
AAGACGGACG GAGAACCACC GAAAAAGAAA TCACAGACAG ATTTTATTTC TGAACGAGTT  3120
TCGAAGATTG AAGGTGAAGA GAGACCACTA CCAGAGCCAG TGGAAACTTC TGGTCCAATA  3180
ATCGGTGATT CTTCGTATCT ACCATATAAA ATAGTACATT GGGAGAAAGC AGGTATAATT  3240
GAAATGAATC TTCCGGCAAA CTCCATACGA GCCCACGAGT ATCATCCTTT CACAACGGAA  3300
CACTGCTATT TCGGAATAGA CGATCCAAGG CAACCGAAAA TTCAAATATT TGATCATTCA  3360
CCGTGCAAAT CGGAGCCAGG AAGTGAACCT CTAAAAATAA CACCAGCTCC ATGGGGACCT  3420
ATAGACAATG TTGCGGAAAC CGGACCCTTG ATTTACATGG ATGTTGTTAC TGCACCCAAA  3480
ACAGTCCAGA AGAAGCAGAA ACCGAGAAAA CAAGTTTTTG AAAAGGATCC CTACGAGTAT  3540
TATGAACCAC CACCAACCAA GCGCCCAGCT CCTCCGCCAC GCTTTAAGAA AACATTCAAA  3600
CCACGATCTG AAGAGGAAAA GAAGAAAATC ATCGGAGACT GCGAAGATCT GCCCGATTTG  3660
GAAGATCAAT GGTATTTGAG GGCAGCACTT AATGAAATGC AAAGTGAAGT TAAAAGTGCT  3720
GATGAATTGC CCTGGAAGAA AATGTTAACC TTCAAAGAAA TGTTGAGATC AGAAGATCCT  3780
CTGTTAAGAC TCAACCCGAT TCGTTCGAAA AAAGGACTGC CAGATGCTTT CTACGAGGAT  3840
GAAGAACTCG ATGGAGTAAT TCCGGTAGCT GCTGGATGCT CTCGTGCTCG ACCATACGAG  3900
AAGATGACAA TGAAGCAGAA AAGGAGTCTT GTGAGACGTC CTGATAATGA ATCGCATCCG  3960
ACTGCAATCT TCAGTGAAAG AGATGAAACC GCGATTCGGC ATCAGCATCT GGCCAGTAAA  4020
GATATGAGAT TGCTCCAACG TCGATTGCTT ACGTCACTTG GTGATGCTAA CAATGATTTC  4080
TTCAAGATAA ATCAGCTCAA ATTCCGCAAA AAGATGATCA AGTTTGCACG TTCTCGTATT  4140
CACGGATGGG GTCTCTACGC GATGGAGTCG ATTGCACCAG ATGAGATGAT TGTGGAGTAT  4200
ATAGGACAGA CGATCCGATC TCTGGTGGCC GAAGAACGAG AAAAGGCTTA TGAACGTCGT  4260
GGAATCGGAA GTTCATATCT GTTCCGAATT GATCTGCATC ATGTGATCGA TGCAACAAAG  4320
CGTGGAAATT TTGCTCGATT TATTAATCAC TCGTGCCAAC CTAATTGCTA CGCGAAGGTT  4380
CTTACAATCG AAGGCGAGAA GCGGATAGTC ATCTACAGCC GAACAATTAT CAAGAAGGGT  4440
GAAGAAATCA CGTATGACTA CAAGTTCCCA ATTGAAGACG ACAAAATAGA TTGTCTGTGT  4500
GGTGCGAAGA CGTGTCGTGG ATATCTTAAT TGA                               4533