Entry PELSI16690 (ENSPSIG00000008700)

E Pelodiscus sinensis


General Information

Organism
PELSI - Pelodiscus sinensis (Taxon-ID: 13735)
Locus
JH212625.1join(2114479..2114649, 2123480..2123642, 2125340..2125476, 2128049..2128199, 2129437..2129554, 2132723..2132898, 2150701..2150884, 2157017..2157131, 2170937..2171045, 2182354..2182647, 2186149..2186295, 2188695..2188820, 2189576..2189682, 2191071..2191206, 2192912..2193016, 2197265..2197378, 2201642..2201730, 2203766..2203931, 2204603..2204700, 2205725..2205847, 2205954..2206065, 2214340..2214439, 2215055..2215160, 2217026..2217150, 2218429..2218507, 2220129..2220229, 2235291..2235501)
Number of exons
27

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MGQCCKPMPF CSASCRQXLD HPRVIYVLNV TKEVVVPCRV TAPDIKPKLL QHPSSSEFKY    60
ERMVWNPKRG FVIPSPSFHF FKGMLMCSTN VNGITSKSYY MLQKVETRIQ NLDLKANPKK   120
LLVGETLNLE CSAETFYNGR IGLKWQYPKG RNHKPTAGLT RNNDTNKVFS YVTIQNVTMK   180
DSGLYICNAT GLETNTTITV YSKPFLRVSY KKGPYYEGVE GQKLLKLAVK VDAFPRPNVT   240
WYKDGKPIME NQTCYEPDLQ KYKLSIRDLR PKHAGNYTIV LSNTKHGLYE NLTIQLVVTE   300
RPKIYEKETD FDKVVQVNLG SKHSLECTVS AIPDPKITWE WEPCTPAETL EVHIPGTRFS   360
SCRQGRQRIV IQQNISGAGM STGNKIESIE EKKYEDKKKM ISKLTIEHSN TSGIYYCVAS   420
NKIGKEERSI KFYVSDVPFG LQTDPQITAI VGSDVQLTCR ASRYIYNHLA WYYPSSQVAP   480
GDFLKEQLDR YSISLTLTIS NVTKEHSGLY KCEAQNWHNS TDVLEQHAQL DIKAKEVPYI   540
VQNLTDREVN ISGKIILECK VNGTPLPHIV WRKNGYPILP ASGISMENNT LVIERVKKDD   600
EGLYECVASN EMGHDSTSAF IKIQGSEEKS NIEVIILVCT GLAATLFWLL LTLFIRKLRK   660
PDATDIKTGY LSIIMDPEEM PLDEQCDRLP YDSSKWEFPR DRLRLGKTLG HGAFGKVVEA   720
SAFGIDKSST CKTVAVKMLK ECATTNECKA LMSELKILIH IGHHLNVVNL LGACTKAGGP   780
LMVIVEYCKY GNLSNYLRGK RGDFIAYKSQ ENSDQAEKSL DESNSDLTEL IKRRLESVAS   840
TGSSASSGFI EDKSYSDSED EEEDGEDLQK RPLTLEDLIC YSFQVAKGME FLASRKCIHR   900
DLAARNILLS ENNVVKICDF GLARDIYKDP DYVRKGDARL PLKWMAPEAI FDKIYTTQSD   960
VWSFGVLLWE IFSLGASPYP GVQIDEDFCR RLKEGTRMRS PEYSTPEIYQ TMLDCWHSIA  1020
TERPTFTELV ERLGDLLQAN VQQDGKDYIP LNITLCPDGE SNPKTCPVEE NLNKCINRWS  1080
ALGMGNNSKK RPLSVKTFDE VPVERQKVMN EESESDSGMV LTSDEMKSLK RLEIRSWPYG  1140
IMALAHRAIT KSKESMLSEH ERETAKYQPA VQIDEDTLDF TLEDSVLLPM DPNLECHSPP  1200
PDYNSVIHYS APPV                                                    1214

Coding Sequence

Download: Fasta
ATGGGACAAT GTTGCAAACC AATGCCATTC TGTTCTGCCT CCTGCAGACA GXXXCTTGAC    60
CATCCCAGAG TGATCTACGT ATTGAATGTG ACCAAAGAGG TGGTTGTACC CTGCCGGGTC   120
ACCGCACCGG ATATCAAACC CAAGCTTCTA CAGCATCCTT CTTCTTCGGA GTTCAAGTAT   180
GAGAGGATGG TGTGGAACCC CAAAAGAGGA TTTGTCATCC CCTCTCCTTC TTTCCACTTC   240
TTCAAAGGCA TGCTCATGTG TAGCACCAAT GTAAACGGAA TTACCTCCAA ATCCTACTAC   300
ATGCTACAGA AAGTGGAGAC TAGAATACAA AACCTGGATC TGAAAGCAAA TCCCAAAAAG   360
CTGCTAGTTG GAGAGACCCT CAACCTGGAA TGCAGCGCAG AGACCTTTTA CAATGGACGG   420
ATTGGATTGA AATGGCAGTA TCCCAAGGGG AGAAATCATA AGCCTACAGC AGGTCTTACC   480
CGAAATAATG ACACGAATAA AGTTTTCAGT TATGTCACTA TCCAGAATGT TACCATGAAA   540
GACAGTGGCT TGTATATATG CAATGCAACA GGACTGGAGA CCAACACTAC AATCACAGTA   600
TACAGCAAAC CTTTCCTGCG AGTTTCCTAT AAGAAAGGAC CCTACTATGA GGGGGTAGAA   660
GGACAAAAGT TACTGAAACT AGCTGTGAAG GTGGATGCTT TTCCCCGCCC CAATGTTACC   720
TGGTATAAAG ATGGCAAGCC AATCATGGAG AACCAGACTT GCTACGAGCC AGATCTACAG   780
AAGTACAAGC TGAGCATAAG GGACTTGAGA CCAAAGCATG CAGGGAACTA CACCATAGTC   840
CTGAGCAACA CTAAACATGG CCTATATGAG AACCTCACCA TCCAGCTCGT AGTGACAGAG   900
AGACCCAAAA TCTACGAGAA GGAGACGGAT TTTGACAAGG TTGTGCAAGT GAACCTTGGA   960
AGCAAACACA GTCTTGAGTG CACTGTGAGT GCGATTCCTG ACCCCAAGAT CACATGGGAA  1020
TGGGAGCCAT GCACCCCTGC AGAGACGCTG GAAGTGCACA TCCCTGGGAC TCGCTTTTCC  1080
AGCTGCAGAC AAGGTAGGCA AAGAATTGTC ATTCAGCAAA ACATCAGTGG GGCAGGCATG  1140
TCCACAGGCA ACAAAATCGA GAGCATCGAG GAAAAAAAAT ATGAAGATAA AAAAAAGATG  1200
ATAAGCAAGC TGACTATAGA ACATAGCAAT ACATCAGGGA TTTATTACTG TGTGGCTTCA  1260
AACAAGATTG GCAAAGAGGA GAGAAGCATC AAGTTCTATG TCTCAGATGT GCCATTTGGT  1320
CTGCAGACAG ACCCACAGAT CACCGCCATC GTGGGGAGCG ATGTTCAGCT AACCTGCAGA  1380
GCGTCCAGAT ACATTTACAA TCACCTAGCG TGGTACTACC CCTCCTCACA GGTGGCGCCC  1440
GGTGACTTCC TGAAGGAACA ATTGGACAGA TACTCCATTT CCTTGACACT GACCATTAGT  1500
AATGTCACAA AGGAACACTC AGGGCTTTAC AAGTGTGAAG CTCAGAATTG GCACAACAGT  1560
ACAGATGTCC TAGAACAGCA TGCCCAGCTT GACATTAAAG CAAAGGAAGT GCCCTATATT  1620
GTGCAGAACC TCACAGATCG AGAGGTGAAC ATCAGTGGCA AGATCATTCT GGAGTGCAAA  1680
GTAAATGGAA CCCCACTACC TCATATTGTG TGGCGGAAAA ATGGCTATCC CATTTTACCT  1740
GCTTCAGGAA TTTCCATGGA AAATAACACC CTGGTCATCG AACGAGTGAA AAAAGATGAT  1800
GAAGGCCTCT ATGAGTGTGT GGCTTCCAAT GAAATGGGCC ACGACAGCAC ATCAGCATTT  1860
ATTAAAATAC AAGGCTCTGA AGAAAAATCC AACATTGAAG TTATCATTTT GGTGTGCACT  1920
GGCTTAGCAG CTACCCTTTT CTGGCTTCTT CTGACCCTTT TCATTCGGAA ACTGAGAAAA  1980
CCTGATGCCA CAGATATTAA AACGGGATAC CTGTCAATCA TAATGGATCC AGAGGAAATG  2040
CCCCTTGATG AGCAGTGTGA CCGTCTGCCA TACGACAGTA GTAAATGGGA GTTCCCAAGA  2100
GACAGGCTGC GGCTTGGTAA AACACTGGGT CATGGTGCTT TTGGGAAGGT CGTGGAGGCG  2160
TCTGCGTTTG GTATTGATAA ATCTTCAACC TGCAAAACGG TCGCCGTTAA AATGCTCAAA  2220
GAATGTGCAA CTACAAATGA GTGCAAGGCT TTAATGTCAG AGCTGAAGAT CCTCATCCAC  2280
ATAGGACACC ATCTGAACGT GGTCAACCTG CTGGGAGCCT GCACCAAAGC TGGAGGTCCA  2340
TTGATGGTTA TTGTAGAATA TTGCAAATAT GGCAATCTGT CTAATTACCT GAGAGGAAAA  2400
CGAGGAGACT TTATTGCATA TAAGTCCCAG GAGAACTCTG ACCAGGCAGA GAAAAGTCTG  2460
GATGAATCCA ACAGTGATTT AACAGAGCTG ATCAAGAGGC GCCTTGAGAG TGTGGCCAGC  2520
ACCGGCAGCT CAGCCAGCTC AGGATTCATT GAGGATAAGA GTTACAGTGA CTCGGAAGAT  2580
GAAGAAGAAG ATGGTGAGGA TCTGCAAAAG CGACCCCTGA CCTTGGAAGA TCTGATTTGC  2640
TATAGCTTCC AGGTGGCAAA AGGCATGGAG TTCCTCGCCT CCAGAAAATG TATTCATCGG  2700
GATTTGGCAG CAAGGAACAT CCTTCTGTCA GAAAACAACG TGGTTAAGAT CTGTGACTTT  2760
GGACTGGCTA GGGATATTTA CAAAGATCCT GACTATGTAC GGAAAGGAGA TGCAAGACTT  2820
CCACTTAAAT GGATGGCTCC AGAGGCTATT TTTGATAAAA TTTATACTAC GCAGAGTGAC  2880
GTGTGGTCTT TTGGAGTGTT ACTATGGGAA ATATTTTCTC TGGGTGCCTC ACCATACCCG  2940
GGAGTGCAAA TTGATGAGGA CTTTTGCCGC CGCCTCAAAG AAGGCACACG GATGAGATCA  3000
CCAGAATATT CCACTCCAGA AATCTACCAG ACTATGCTGG ATTGTTGGCA TAGCATTGCT  3060
ACGGAGAGGC CCACCTTCAC AGAGCTGGTT GAGCGCTTAG GAGACCTCCT CCAGGCAAAC  3120
GTGCAACAGG ATGGTAAAGA TTATATTCCT CTGAACATCA CATTATGCCC TGATGGAGAA  3180
TCTAATCCAA AGACCTGCCC AGTGGAAGAA AACTTGAATA AGTGCATTAA TCGCTGGAGT  3240
GCATTGGGAA TGGGTAATAA CAGTAAGAAG CGACCACTGA GTGTGAAGAC ATTTGATGAG  3300
GTGCCCGTGG AGAGGCAGAA AGTGATGAAT GAGGAGAGTG AGTCGGACAG CGGAATGGTG  3360
CTGACATCTG ATGAGATGAA AAGCCTGAAG AGACTGGAAA TCCGCTCCTG GCCTTATGGG  3420
ATTATGGCTC TCGCGCACAG AGCCATCACC AAAAGCAAAG AATCCATGTT ATCAGAACAT  3480
GAACGTGAGA CTGCGAAATA CCAGCCAGCA GTACAGATTG ATGAGGACAC CCTGGACTTC  3540
ACACTGGAAG ACTCAGTCCT CCTGCCCATG GATCCAAACC TGGAATGCCA CAGCCCGCCT  3600
CCCGACTACA ACTCTGTTAT CCACTACTCT GCCCCTCCAG TGTAA                  3645