Entry MOUSE49375 (CTCF_MOUSE)

E Mus musculus


General Information

Description
transcript_id=ENSMUST00000005841.15
Organism
MOUSE - Mus musculus (Taxon-ID: 10090)
Locus
8join(105663763..105664543, 105664856..105665026, 105666659..105666792, 105670231..105670351, 105671291..105671440, 105674869..105675029, 105675996..105676178, 105677215..105677350, 105680071..105680259, 105681435..105681619)
Number of exons
10

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MEGEAVEAIV EESETFIKGK ERKTYQRRRE GGQEEDACHL PQNQTDGGEV VQDVNSSVQM    60
VMMEQLDPTL LQMKTEVMEG TVAPEAEAAV DDTQIITLQV VNMEEQPINI GELQLVQVPV   120
PVTVPVATTS VEELQGAYEN EVSKEGLAES EPMICHTLPL PEGFQVVKVG ANGEVETLEQ   180
GELPPQEDSS WQKDPDYQPP AKKTKKTKKS KLRYTEEGKD VDVSVYDFEE EQQEGLLSEV   240
NAEKVVGNMK PPKPTKIKKK GVKKTFQCEL CSYTCPRRSN LDRHMKSHTD ERPHKCHLCG   300
RAFRTVTLLR NHLNTHTGTR PHKCPDCDMA FVTSGELVRH RRYKHTHEKP FKCSMCDYAS   360
VEVSKLKRHI RSHTGERPFQ CSLCSYASRD TYKLKRHMRT HSGEKPYECY ICHARFTQSG   420
TMKMHILQKH TENVAKFHCP HCDTVIARKS DLGVHLRKQH SYIEQGKKCR YCDAVFHERY   480
ALIQHQKSHK NEKRFKCDQC DYACRQERHM IMHKRTHTGE KPYACSHCDK TFRQKQLLDM   540
HFKRYHDPNF VPAAFVCSKC GKTFTRRNTM ARHADNCAGP DGVEGENGGE TKKSKRGRKR   600
KMRSKKEDSS DSEENAEPDL DDNEEEEEPA VEIEPEPEPQ PQPPPPPQPV APAPPPAKKR   660
RGRPPGRTNQ PKQNQPTAII QVEDQNTGAI ENIIVEVKKE PDAEPAEGEE EEAQAATTDA   720
PNGDLTPEMI LSMMDR                                                   736

Coding Sequence

Download: Fasta
ATGGAAGGTG AGGCGGTTGA AGCCATTGTG GAGGAGTCTG AAACTTTCAT TAAAGGAAAA    60
GAAAGAAAGA CTTACCAGAG ACGCCGGGAA GGGGGCCAGG AAGAAGATGC TTGCCACCTG   120
CCCCAGAACC AGACAGATGG GGGTGAGGTG GTCCAGGATG TCAACAGCAG TGTACAGATG   180
GTAATGATGG AACAGCTGGA TCCTACCCTT CTCCAGATGA AGACTGAAGT CATGGAGGGT   240
ACAGTGGCTC CGGAAGCAGA GGCTGCAGTG GACGATACCC AGATCATAAC CTTGCAGGTT   300
GTAAATATGG AGGAACAGCC CATTAACATA GGAGAGCTTC AGCTTGTCCA AGTACCTGTT   360
CCTGTGACGG TACCTGTTGC TACTACTTCA GTAGAAGAAC TTCAGGGGGC TTATGAGAAT   420
GAAGTGTCTA AAGAGGGCCT TGCAGAAAGT GAACCGATGA TATGTCACAC CTTACCTTTG   480
CCTGAAGGAT TTCAGGTGGT GAAAGTGGGG GCCAATGGAG AAGTGGAGAC ACTAGAGCAG   540
GGCGAGCTTC CTCCTCAGGA AGACTCTAGC TGGCAAAAAG ACCCAGACTA TCAGCCACCA   600
GCCAAAAAAA CAAAGAAAAC CAAAAAGAGC AAACTTCGTT ACACAGAAGA GGGCAAAGAC   660
GTGGATGTGT CTGTGTATGA TTTTGAGGAA GAACAGCAGG AAGGACTGCT GTCTGAGGTT   720
AATGCAGAGA AAGTAGTTGG TAATATGAAG CCTCCGAAGC CAACAAAAAT TAAAAAAAAA   780
GGTGTAAAGA AAACATTCCA GTGTGAGCTT TGCAGTTACA CATGTCCCCG GCGTTCAAAT   840
TTGGATCGTC ACATGAAAAG CCACACTGAT GAGAGACCAC ACAAATGCCA TCTGTGTGGC   900
AGAGCATTCA GAACAGTGAC CCTCCTGAGG AATCATCTGA ACACACACAC AGGTACTCGT   960
CCTCACAAGT GCCCAGACTG CGATATGGCC TTTGTGACCA GTGGAGAATT GGTGCGGCAT  1020
CGTCGTTATA AACACACTCA TGAGAAACCA TTTAAGTGTT CCATGTGTGA TTATGCCAGT  1080
GTAGAAGTCA GCAAATTAAA ACGACACATT CGCTCTCATA CTGGAGAGCG CCCGTTCCAG  1140
TGCAGTTTGT GCAGTTATGC CAGCAGGGAC ACATACAAGC TGAAAAGGCA TATGAGAACC  1200
CATTCAGGGG AAAAACCTTA TGAATGTTAT ATTTGTCACG CTCGGTTTAC CCAGAGTGGT  1260
ACCATGAAGA TGCACATTTT ACAGAAGCAC ACAGAAAATG TGGCCAAATT TCATTGTCCC  1320
CATTGTGACA CTGTCATAGC CCGAAAAAGT GATTTGGGTG TCCACTTGCG AAAGCAGCAT  1380
TCCTATATTG AACAGGGCAA AAAATGTCGC TACTGTGATG CTGTGTTTCA TGAGCGATAT  1440
GCTCTCATCC AGCATCAAAA ATCACACAAA AATGAGAAGC GCTTCAAGTG TGACCAGTGT  1500
GATTATGCTT GTAGACAGGA GCGGCACATG ATCATGCACA AGCGCACTCA CACGGGGGAG  1560
AAGCCTTATG CCTGCAGCCA CTGCGACAAG ACCTTCCGCC AGAAACAGCT CCTCGACATG  1620
CATTTCAAGC GCTATCATGA TCCCAACTTT GTCCCTGCTG CCTTTGTCTG TTCCAAGTGT  1680
GGGAAAACAT TCACCCGCCG GAACACAATG GCAAGACATG CAGATAACTG TGCTGGTCCA  1740
GATGGCGTAG AGGGGGAAAA TGGAGGGGAG ACAAAGAAGA GCAAACGAGG AAGAAAAAGA  1800
AAGATGCGAT CTAAAAAAGA AGACTCCTCT GACAGTGAAG AAAATGCTGA GCCGGATCTC  1860
GATGACAATG AGGAGGAGGA GGAGCCTGCT GTAGAAATTG AACCTGAGCC AGAGCCTCAG  1920
CCCCAGCCTC CGCCTCCACC TCAGCCTGTG GCCCCGGCCC CCCCACCTGC CAAGAAGAGA  1980
AGGGGGCGAC CTCCTGGAAG AACCAACCAG CCCAAACAGA ACCAGCCAAC AGCCATCATT  2040
CAGGTCGAAG ATCAGAATAC AGGTGCAATT GAGAACATTA TAGTTGAAGT CAAAAAGGAG  2100
CCAGATGCCG AGCCTGCGGA GGGGGAAGAA GAGGAGGCTC AGGCAGCCAC CACAGACGCC  2160
CCCAACGGAG ACCTCACGCC TGAGATGATC CTCAGCATGA TGGACCGGTG A           2211