Entry SCOMX25608 (ENSSMAG00000013472.1)

E Scophthalmus maximus


General Information

Description
transcript_id=ENSSMAT00000022371.1
Organism
SCOMX - Scophthalmus maximus (Taxon-ID: 52904)
Locus
5join(7972998..7973034, 7974770..7974804, 7984440..7984485, 7984583..7984719, 7985166..7985286, 7986015..7986074, 7986916..7987006, 7988975..7989077, 7989942..7990028, 7990420..7990500, 7990642..7990817, 7991484..7992334, 7992915..7993255, 7994850..7994967, 7995912..7996047, 7996797..7996902, 7997783..7997944, 7998872..7999138, 7999515..7999922)
Number of exons
19
Alternative splicing
SCOMX25607
SCOMX25609
SCOMX25610

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MNKFEVLGIV GEGAYGVVLK CRHKETNELV AIKKFKDSEE NEEVKETTLR ELKMLRTLKQ    60
DNIVELKEAF RRRGKLYLVF EYVERNMLEL LEELPNGAPP DKVRSYIYQL IKAINWCHKN   120
EIVHRDIKPE NLLIGSEDVL KLCDFGFARN LSEGTDANYT EYVATRWYRS PELLLGAPYG   180
KAVDMWSVGC ILGELSDGQP LFPGESEIDQ LFTIQKVLGS LPAEQMKLFY NNPRFHGIRF   240
PSVTHPQTLE RRYQGILSGL MLDLMKNLLL LNPTERFLTE QSLNHPAFQP LRQAERERPP   300
PASPNPPRSS KRKTHHHGEN TVPTRSHSKS TGRRSNSKEC SSLPRHGDLH HLGNESFLNG   360
NKPAPSSLSP TLHPKSQYMS QTLNRSASSS KDLANNNLPH LLSPKEPKGK TEFDFSLGPS   420
PKLPDQGHGA KYGHSKPSSS RSQQQQQQQQ QQQQPPGRHT FLEGKTNTLQ SGAEKQHGRH   480
SHSMADSAHG SMSSSSKSSA SYLSLSKSHS ALSDAKSVGN LSDGRLHPDD PNSNTAAGVG   540
PGARFFPASC LDLNAPSGPQ GPPGSPSARH SDRSGHSPAS RSSGNVRMES STLDSSSRHK   600
SRHKPLTPED VGAPELLDPG GAGMPSTHTL PSPHESYHYG LGYTSPFSSQ QRPHRHSMYV   660
RRERHRPHGV EGGMAGLPPP GQVIPTRTSS LQILSPQLQH RTALTGHSVS SSREDCTDDM   720
TRCVPQVGMY HDPHGEDGGS SKENRMIFTE SMPRRVGSFY RVPSPRPDNS SSFHDAIGQS   780
RGPVLPIVPG DPVAMANHSK RQTAFDWSAA EAMVMNPPEP VKEKGKQGFF RAIKKKKKKT   840
QITDMEDGRN PSIKKSLFPL FSSKNSLKHN SAVKVLPVVA SPMVQHQPPA PYPASPVIGI   900
GQEHLSLQRS SKSSSHHGSR RKNRERSRDR DREREQSRDR DRDRERERER ERERERERER   960
ERERGRERER VSDWPPEKPV DSHSQSQPLK SLRRLLHLSP SSSNQGQPPP APPPDLRFQA  1020
PLSNPPQPSA KAGYSEGRGH PESRGHSGVS STSQAKSRKP SYPLPGQIES SWHVSALQRA  1080
EGAQFTPEQL GIKPGQNGPT FTRAARSRMP NLNDLKETAL                        1120

Coding Sequence

Download: Fasta
ATGAATAAGT TTGAAGTCCT TGGCATTGTG GGAGAAGGAG CATATGGAGT GGTGCTTAAG    60
TGCAGGCATA AGGAAACCAA TGAGCTGGTG GCCATTAAGA AGTTCAAAGA CAGCGAAGAG   120
AATGAGGAGG TCAAGGAGAC GACGCTTCGG GAGCTGAAGA TGCTTCGGAC TCTGAAGCAG   180
GACAACATCG TCGAGCTGAA GGAGGCCTTC CGCAGGAGGG GGAAACTCTA CCTTGTCTTT   240
GAATATGTCG AGCGGAACAT GCTGGAGCTT CTGGAGGAAC TGCCCAACGG AGCACCGCCC   300
GATAAAGTCC GCAGCTACAT CTACCAGCTC ATCAAGGCCA TCAACTGGTG CCACAAGAAT   360
GAGATCGTGC ACAGGGATAT CAAACCTGAA AACCTCCTCA TCGGCTCCGA GGATGTTCTC   420
AAACTCTGTG ACTTTGGTTT TGCTCGTAAC CTGTCTGAGG GAACAGACGC CAACTACACT   480
GAATATGTGG CCACGAGGTG GTACCGCTCG CCCGAGCTGC TGCTGGGGGC TCCCTATGGA   540
AAGGCCGTGG ACATGTGGTC GGTGGGGTGC ATCCTCGGGG AGCTGAGTGA CGGGCAGCCC   600
TTGTTTCCGG GGGAAAGTGA GATTGATCAG CTCTTCACCA TTCAGAAGGT TTTGGGTTCA   660
CTGCCTGCAG AGCAGATGAA ACTCTTCTAC AACAACCCTC GATTCCATGG AATCAGGTTT   720
CCCTCGGTTA CTCATCCTCA GACTCTGGAG AGAAGGTACC AAGGCATCTT GAGTGGTCTG   780
ATGTTGGATC TGATGAAGAA CCTGCTGCTG CTAAACCCGA CTGAGCGCTT TCTGACGGAG   840
CAGAGTTTGA ACCATCCTGC CTTCCAGCCT CTGAGGCAGG CGGAGAGAGA GAGGCCTCCA   900
CCTGCCTCTC CCAACCCACC GCGGTCCTCC AAGAGGAAAA CACACCATCA CGGGGAAAAC   960
ACCGTTCCTA CAAGGAGTCA CAGTAAGAGC ACAGGCCGTC GCTCCAACAG TAAAGAGTGT  1020
TCCAGCCTCC CTCGCCATGG GGACCTCCAT CACCTCGGCA ATGAGAGTTT CCTGAATGGT  1080
AACAAACCGG CCCCGTCCAG TTTAAGTCCT ACACTCCACC CCAAGAGCCA GTACATGTCC  1140
CAGACCCTCA ATCGCTCCGC CTCATCCAGC AAAGACCTGG CTAACAACAA CCTGCCGCAC  1200
CTCCTCAGTC CCAAAGAACC CAAAGGCAAG ACAGAGTTTG ATTTCAGCTT GGGACCCTCC  1260
CCAAAGCTGC CAGACCAGGG CCACGGGGCA AAGTATGGCC ACAGCAAACC CAGCTCCTCT  1320
CGCTCTCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCCGCCCGG CCGCCACACC  1380
TTCCTGGAAG GCAAGACCAA CACACTGCAG TCCGGAGCGG AAAAACAACA TGGCCGACAT  1440
TCCCACAGTA TGGCCGACTC TGCCCACGGG TCCATGTCCT CCTCTTCTAA GAGCTCCGCC  1500
TCCTATCTCA GCCTGTCCAA GAGCCACAGC GCACTGAGTG ATGCCAAATC CGTCGGGAAC  1560
CTCAGCGACG GTCGACTCCA CCCAGATGAC CCAAACTCCA ACACAGCAGC GGGAGTGGGA  1620
CCCGGTGCCC GCTTCTTCCC TGCCAGCTGT TTAGACCTCA ACGCCCCTTC GGGTCCCCAG  1680
GGGCCGCCTG GCAGCCCCTC TGCCCGACAC AGTGACCGAT CGGGCCACAG CCCGGCCTCT  1740
CGTAGCAGCG GCAATGTCCG CATGGAGAGC AGCACACTGG ACTCTTCCTC CAGACACAAA  1800
TCCAGACATA AACCTTTAAC ACCGGAGGAT GTCGGCGCAC CGGAGCTCTT AGACCCTGGA  1860
GGAGCAGGGA TGCCCTCCAC TCACACTCTG CCTTCCCCGC ACGAGTCTTA TCATTACGGC  1920
CTGGGCTACA CCTCGCCCTT CTCCTCCCAG CAACGGCCTC ATCGCCACTC CATGTACGTG  1980
AGGAGGGAGC GCCACCGACC CCACGGCGTC GAGGGTGGGA TGGCGGGCTT GCCGCCGCCA  2040
GGGCAGGTGA TACCAACACG AACCAGCAGC CTGCAGATAC TCTCCCCCCA GCTGCAGCAC  2100
CGGACCGCCC TCACCGGACA CTCCGTGAGC TCCTCAAGGG AGGACTGCAC CGATGACATG  2160
ACGAGGTGTG TCCCTCAGGT CGGCATGTAC CACGACCCCC ACGGCGAGGA TGGAGGTTCC  2220
TCCAAGGAGA ACCGCATGAT CTTCACTGAG TCCATGCCGA GGAGAGTCGG CAGCTTCTAC  2280
CGGGTCCCTT CCCCACGACC AGACAACTCC TCCTCTTTCC ATGACGCCAT TGGGCAGAGT  2340
CGTGGGCCGG TCTTACCCAT CGTACCCGGA GACCCTGTTG CCATGGCCAA CCACTCAAAA  2400
CGCCAGACGG CTTTTGACTG GTCCGCCGCA GAGGCGATGG TCATGAACCC CCCAGAGCCC  2460
GTCAAGGAGA AGGGAAAACA AGGCTTCTTC AGAGCCATTA AAAAGAAAAA GAAGAAGACA  2520
CAAATAACGG ACATGGAGGA CGGGAGGAAC CCCAGCATAA AGAAAAGTCT TTTCCCCCTC  2580
TTTAGCTCTA AGAATAGCTT AAAGCACAAC TCAGCTGTCA AAGTCCTACC TGTAGTCGCC  2640
TCGCCCATGG TACAACACCA GCCTCCTGCG CCCTACCCTG CCTCACCAGT CATTGGCATC  2700
GGACAGGAAC ACCTGTCGCT GCAGAGGAGC TCCAAGTCCT CCTCTCACCA CGGCAGCCGG  2760
CGGAAAAACC GCGAGCGCTC CCGAGACAGG GACCGGGAAC GAGAACAGAG CCGCGACCGC  2820
GACAGGGACC GAGAGAGGGA GAGAGAACGA GAGAGGGAGA GGGAGAGGGA GAGAGAGAGA  2880
GAAAGAGAAA GAGGGAGGGA GAGGGAGAGA GTCAGCGACT GGCCACCAGA AAAACCAGTG  2940
GACTCACACT CTCAGAGCCA ACCACTCAAG TCACTCCGCA GGCTCCTTCA CCTCTCCCCC  3000
TCCTCCTCCA ATCAAGGACA GCCTCCTCCC GCTCCTCCTC CAGACCTGCG CTTCCAAGCC  3060
CCCCTCTCCA ACCCGCCTCA GCCCTCCGCC AAAGCGGGCT ACTCTGAGGG TCGAGGGCAT  3120
CCTGAGAGCA GGGGGCACTC CGGGGTGAGC AGCACCTCCC AGGCCAAGAG CCGCAAGCCC  3180
AGCTACCCCC TCCCCGGACA GATCGAGTCC AGCTGGCACG TGTCTGCTTT ACAGCGGGCA  3240
GAGGGCGCTC AGTTCACCCC AGAGCAGCTG GGCATCAAAC CGGGCCAGAA CGGACCCACA  3300
TTCACTCGAG CCGCCCGCAG CAGGATGCCG AACCTCAACG ACCTGAAGGA GACTGCGCTT  3360
TAA                                                                3363