Entry ORENI04141 (I3K7E7)

E Oreochromis niloticus


General Information

Description
GATA binding protein 4 [Source:HGNC Symbol;Acc:4173]
Organism
ORENI - Oreochromis niloticus (Taxon-ID: 8128)
Locus
GL831215.1join(complement(2790510..2791032), complement(2786273..2786439), complement(2786035..2786160), complement(2785561..2785642), complement(2785164..2785309), complement(2784839..2784964))
Number of exons
6

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MYQGIMTSNH GPSSYEAGFL HNASTGTSPV YVPTTRVVTP MIPALPYLQT PGASQQSSPA    60
SSHSAWAQPG ADSVPSYNHS PVSPRFTFST SPPLSSGVAT ARETAVSYTS PLNISANGRD   120
HYSARGLSGS YHSPYSAYMS PNVGGTWPAS PFDSPVLHGL QAGAPPGTAR HPNIDLFDDF   180
AEGRECVNCG AMSTPLWRRD GTGHYLCNAC GLYHKMNGIN RPLIKPQRRL SASRRVGLSC   240
TNCHTTTTTL WRRNAEGEPV CNACGLYMKL HGVPRPLAMK KEGIQTRKRK PKNLNKSQTG   300
TPGNEGAPIT PSSTPRSSTT STTAEETRQI KTEPENHSLY THHSSHAQIS ALPAYMATQG   360
GPIPLKMSPG GHNGSSGSKA EPWNSLILA                                     389

Coding Sequence

Download: Fasta
ATGTACCAGG GCATAATGAC AAGCAACCAC GGACCCTCTT CCTATGAGGC CGGATTTCTT    60
CACAACGCCT CGACGGGCAC CTCTCCGGTT TATGTCCCCA CCACCCGGGT CGTCACCCCC   120
ATGATTCCTG CGCTTCCTTA CCTTCAGACG CCCGGTGCGT CGCAGCAGAG CAGCCCGGCG   180
TCGAGCCACT CTGCGTGGGC ACAGCCGGGC GCAGACTCGG TGCCCTCATA CAACCACTCC   240
CCGGTGTCTC CGCGTTTCAC CTTCTCCACC AGCCCGCCTC TGTCCTCCGG GGTGGCCACG   300
GCCCGGGAGA CGGCAGTTTC TTACACAAGC CCGCTGAACA TCTCCGCTAA CGGTAGGGAT   360
CACTACAGCG CTCGTGGTCT GAGCGGGTCG TATCACAGCC CTTATTCGGC CTACATGAGC   420
CCTAACGTGG GCGGAACGTG GCCGGCGTCC CCCTTCGACA GTCCGGTCCT CCACGGTCTC   480
CAAGCTGGCG CTCCTCCTGG GACAGCAAGA CACCCCAACA TAGATTTGTT TGATGACTTT   540
GCTGAGGGCC GAGAGTGCGT GAACTGCGGA GCAATGTCAA CCCCACTCTG GAGGCGAGAC   600
GGTACCGGTC ACTACCTGTG TAACGCCTGC GGGCTGTACC ATAAGATGAA CGGCATCAAC   660
AGACCTCTTA TCAAACCCCA GAGACGGCTG TCTGCCTCGA GGAGGGTGGG CCTGTCCTGC   720
ACCAACTGTC ACACCACCAC CACCACCCTG TGGCGACGAA ACGCAGAGGG AGAGCCGGTC   780
TGCAACGCCT GTGGGCTCTA CATGAAACTC CACGGGGTCC CTCGACCTTT GGCTATGAAG   840
AAAGAGGGGA TCCAGACCCG CAAACGCAAA CCTAAGAACC TCAATAAATC CCAAACTGGG   900
ACACCAGGCA ACGAGGGGGC TCCAATTACA CCCAGCAGCA CTCCCCGCTC CTCCACCACC   960
TCCACCACTG CAGAGGAGAC TCGCCAGATA AAGACAGAGC CAGAAAATCA CAGCTTGTAC  1020
ACACATCACA GCTCACATGC ACAGATTTCT GCCCTGCCAG CCTACATGGC AACTCAAGGA  1080
GGCCCCATTC CTCTCAAGAT GTCCCCTGGG GGGCACAACG GGTCCTCTGG CTCCAAAGCT  1140
GAACCCTGGA ACAGCCTAAT CCTGGCTTAG                                   1170