Entry LATCH20069 (H3BIG3)

E Latimeria chalumnae


General Information

Description
WD repeat and HMG-box DNA binding protein 1 [Source:HGNC Symbol;Acc:23170]
Organism
LATCH - Latimeria chalumnae (Taxon-ID: 7897)
Locus
JH126564.1join(6806859..6806935, 6809851..6809962, 6816210..6816361, 6817291..6817402, 6819930..6819961, 6820288..6820321, 6820673..6820762, 6823889..6823981, 6825650..6825756, 6829240..6829381, 6831000..6831192, 6831525..6831727, 6832968..6833152, 6836229..6836470, 6838713..6838850, 6840615..6840771, 6843492..6843606, 6844869..6845000, 6846612..6846817, 6849946..6850057, 6853677..6853687, 6854050..6854096, 6855287..6855339, 6856497..6856667, 6856732..6856760, 6857102..6857124, 6857420..6857428, 6858667..6858678, 6859216..6859231, 6859552..6859565, 6860737..6860776, 6860920..6861055, 6865445..6865653, 6865843..6865849)
Number of exons
34

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MPAEKKPMRY GHPEGHTDVC FDDSGSYIVT SGSDGDVRIW ESLEDDDPKS ISVGEKVYSV    60
ALKNGKLVSA VSSNTVQIHT FPEGAPDGIL TRFTTNANHV TFNDDGSRVA AGSSDFMVKV   120
VEVNDSSQQK SFRGHEAPVL SVAFDPQDIF LSCVSCTRSA LKLWKISDYV LRIVTSWPLL   180
QKCNDVSSAK SICRLAWQPK TGKMLAVPVD KAVKLYERDS WDSVHSLTDS FITQPLNVVT   240
WSPCGQYLAA GSIGGCIVVW NTETKECLER VKHEKGYTVC GLAWHPKGGQ IAYTDSEGNL   300
GLLEDVCDGT GKSSVPRVSI ALTKDYDALF DGEDDELLSE NLEEAQSPPR KAVLDDEDDD   360
PMPTLSRPRH RSTILDDDRS LDLLVDVASP RPKGELDKEE DDDDDPGNSL QNDTAAVPQQ   420
PVYDGPVPTP QQKPFQSAST PVYLMHRFMV WNSVGIVRCY NDEQDNAIDI EFHDTAVHHA   480
MHLTNTLNHT MADLSQEAVL LACVGTEELP SKLQCLHFSS WDTNKEWTVE LPKEEDIEAV   540
CLGQGWIAAA TTALMVRIFT VGGVQKELFG LPGPVVAMAA HGEQLLIVFQ RGTGFGGEQC   600
LGTQLIELRK EKRQILSGEP LPLTRKSYLA WLGFSAEGTP CYVDSEGIVR MMNRAFGNTW   660
TPVCNTREHC KGRSDHYWVV GIHENPQQLR CIPCKGSRFP PTLPRPAVAI LPFKLPYCQT   720
ATEKGQMEEQ FWRSMLFQNH LDYLAEGNYE FDESVKNRAL KEQQELLMKM FALSCKLERE   780
FRCIEISEFM MPNVINLAIK YASRSKRLML AQRLSELALE RAAELAEEEE EEDFRTKLNV   840
GYSAPSTEWN LTRGRSHQAE DVEDNEVTEE AEDEMEAELH RSEGCNSPEK STMNLKPVLG   900
TAAITGSQGR ANPFKVSSNR SSPSVGGQAR CVNILDNMTK LTKKPTTTNN QAMNKHKSPV   960
LKPLVPKPKS RQLLLCSFFS LKRSNDNNKK ERQKKKMLGT LRSQTPQKIV EGEGTKNYKG  1020
PKTGFQLWLE ENRSRILADK PDLEEADIIK DGMNRFRILP PEERMTWTEK AKGGAVAMVT  1080
CHAMVVKKRK HPEKENKDVQ ESRREQLDDD TTSAKKQKPS DTATNSKLSM FAFKNQK     1137

Coding Sequence

Download: Fasta
ATGCCAGCAG AAAAGAAACC TATGAGATAT GGCCACCCAG AGGGTCACAC TGATGTCTGT    60
TTTGATGACT CAGGAAGTTA CATCGTAACC AGCGGCAGTG ATGGAGATGT GAGGATCTGG   120
GAGAGCCTGG AAGACGACGA TCCCAAATCC ATTAGCGTGG GAGAAAAGGT CTATTCTGTA   180
GCTTTAAAGA ATGGGAAGTT AGTTTCTGCT GTTTCTAGCA ATACTGTTCA AATTCACACG   240
TTCCCTGAGG GAGCTCCAGA TGGAATTCTC ACCAGATTTA CCACCAATGC AAACCATGTC   300
ACCTTTAACG ATGATGGTTC CAGAGTTGCC GCTGGATCCA GTGATTTTAT GGTGAAAGTT   360
GTAGAAGTTA ATGACAGCAG CCAGCAGAAA TCATTCCGAG GACACGAAGC TCCTGTCTTA   420
AGTGTTGCTT TTGATCCACA AGACATTTTC TTATCTTGCG TCAGCTGTAC TAGATCTGCG   480
CTGAAACTTT GGAAGATCTC TGACTACGTC TTACGCATAG TGACCAGCTG GCCATTGTTA   540
CAAAAATGTA ATGATGTAAG CAGTGCGAAA TCCATCTGCA GACTTGCATG GCAGCCTAAA   600
ACAGGAAAGA TGTTGGCGGT GCCAGTGGAT AAGGCCGTAA AACTGTATGA AAGAGATTCC   660
TGGGATAGTG TACATAGCCT GACCGATAGC TTCATTACTC AGCCTTTAAA CGTTGTAACC   720
TGGTCTCCTT GTGGACAGTA CCTCGCTGCT GGAAGCATTG GCGGCTGCAT TGTGGTGTGG   780
AACACAGAGA CCAAAGAATG CTTGGAAAGG GTGAAACATG AAAAGGGTTA CACAGTTTGT   840
GGCCTTGCGT GGCACCCCAA AGGAGGACAG ATTGCATATA CAGACAGTGA AGGAAATCTG   900
GGACTCCTGG AAGATGTTTG TGATGGAACG GGAAAAAGCT CTGTGCCCAG GGTTTCCATT   960
GCCCTTACTA AAGACTATGA TGCCTTATTT GATGGAGAGG ATGATGAGTT GTTAAGTGAA  1020
AATCTAGAGG AAGCTCAGTC GCCCCCTAGA AAAGCGGTTC TTGACGATGA GGACGATGAC  1080
CCGATGCCTA CCCTCAGCCG CCCCAGACAT CGTAGCACTA TATTGGACGA CGATCGCTCG  1140
TTAGATTTGC TTGTAGATGT TGCATCCCCT AGGCCCAAGG GAGAGCTTGA TAAAGAGGAA  1200
GATGATGACG ATGATCCAGG CAATAGCTTA CAGAACGATA CAGCAGCCGT GCCACAACAG  1260
CCTGTGTATG ATGGACCTGT GCCAACCCCT CAACAAAAGC CCTTCCAGTC TGCTTCCACC  1320
CCTGTTTATC TCATGCATCG CTTTATGGTT TGGAACTCTG TAGGCATTGT CCGCTGCTAC  1380
AATGATGAGC AGGATAACGC TATAGATATA GAGTTCCATG ATACCGCCGT GCATCATGCA  1440
ATGCACTTGA CCAACACTTT GAATCACACA ATGGCTGACC TGTCCCAAGA AGCTGTTCTG  1500
TTGGCCTGTG TGGGCACTGA GGAACTCCCC AGCAAACTGC AGTGCTTACA TTTTAGTTCC  1560
TGGGATACAA ACAAGGAATG GACAGTGGAG CTGCCAAAGG AGGAGGATAT TGAAGCCGTT  1620
TGCTTGGGTC AGGGTTGGAT CGCCGCTGCT ACCACCGCTC TGATGGTTCG TATCTTTACA  1680
GTAGGAGGGG TACAGAAGGA GCTCTTCGGC CTTCCTGGAC CTGTGGTTGC AATGGCGGCG  1740
CACGGAGAAC AGCTTCTTAT TGTGTTTCAA AGAGGCACTG GATTTGGTGG GGAACAGTGC  1800
CTCGGAACGC AGCTGATCGA GCTGAGAAAG GAAAAAAGGC AAATTCTGTC CGGAGAGCCT  1860
CTTCCCCTTA CTAGAAAGTC TTATCTAGCA TGGCTAGGAT TTTCGGCCGA AGGCACACCT  1920
TGCTATGTGG ATTCCGAAGG AATAGTACGG ATGATGAACA GAGCGTTTGG AAACACTTGG  1980
ACTCCTGTAT GTAACACTAG GGAGCACTGC AAGGGAAGAT CTGACCATTA CTGGGTTGTT  2040
GGCATCCACG AAAACCCCCA GCAACTCAGG TGTATCCCTT GCAAAGGATC CCGGTTCCCG  2100
CCCACACTGC CTCGTCCGGC TGTTGCAATT TTACCATTTA AACTTCCTTA CTGCCAGACC  2160
GCTACAGAAA AAGGCCAAAT GGAGGAACAG TTCTGGCGTT CTATGCTGTT CCAGAATCAT  2220
TTGGATTACT TAGCAGAAGG CAACTATGAA TTTGATGAGA GTGTTAAAAA CCGGGCACTT  2280
AAGGAGCAAC AGGAGCTTCT AATGAAAATG TTTGCTCTTT CGTGTAAGTT GGAGCGTGAG  2340
TTCCGTTGCA TAGAAATTTC TGAGTTCATG ATGCCAAATG TTATCAATCT GGCCATCAAG  2400
TATGCTTCCC GCTCAAAGCG GCTAATGCTA GCCCAGCGTC TTAGTGAGTT GGCTCTTGAG  2460
AGAGCAGCTG AACTTGCGGA AGAGGAAGAA GAGGAAGACT TCAGAACTAA ACTGAATGTT  2520
GGTTACAGCG CGCCGAGCAC TGAATGGAAC CTGACAAGAG GTAGAAGCCA TCAGGCTGAA  2580
GATGTTGAGG ATAACGAAGT AACAGAAGAA GCTGAAGATG AAATGGAAGC CGAGTTGCAT  2640
AGAAGTGAAG GCTGTAACTC TCCAGAAAAG TCTACCATGA ATCTGAAACC TGTCCTGGGA  2700
ACTGCAGCTA TAACCGGTAG TCAAGGACGT GCCAACCCTT TCAAGGTGTC TAGTAATAGA  2760
TCATCACCGT CTGTCGGAGG TCAAGCCCGC TGTGTTAATA TTCTGGATAA CATGACGAAG  2820
CTTACCAAGA AACCCACCAC CACTAACAAT CAGGCCATGA ACAAACACAA ATCGCCAGTA  2880
TTAAAGCCAT TGGTACCAAA ACCAAAGTCC AGGCAGCTTT TACTCTGTTC TTTTTTTTCC  2940
CTCAAAAGGA GTAACGACAA CAACAAAAAG GAAAGACAAA AAAAAAAAAT GCTTGGAACA  3000
TTGAGATCTC AAACCCCCCA AAAAATTGTA GAGGGGGAGG GGACAAAAAA TTATAAAGGG  3060
CCTAAGACTG GATTTCAGCT CTGGCTAGAA GAGAATAGAT CTCGTATACT GGCAGACAAA  3120
CCAGACCTAG AAGAAGCAGA TATAATAAAA GATGGCATGA ACCGATTTCG AATACTTCCG  3180
CCTGAGGAGA GGATGACCTG GACAGAAAAG GCTAAAGGGG GAGCAGTCGC TATGGTAACG  3240
TGCCATGCAA TGGTTGTGAA GAAACGAAAA CACCCTGAGA AGGAAAACAA AGATGTGCAG  3300
GAAAGCAGGA GAGAGCAGCT TGATGATGAC ACCACCTCTG CCAAAAAACA GAAACCTTCA  3360
GATACAGCTA CCAACTCAAA ACTGTCTATG TTTGCCTTTA AAAACCAAAA AXXX        3414