Entry PELSI13975 (K7FH47)


General Information

Organism
PELSI - Pelodiscus sinensis (Taxon-ID: 13735)
Locus
JH206105.1 join(2533555..2533603, 2535249..2535563, 2541717..2541980, 2542139..2542269, 2543343..2543514, 2543618..2543807, 2545680..2545795, 2550692..2550818, 2551790..2551983, 2552431..2552525, 2552633..2552765, 2559610..2559714, 2562023..2562133, 2562401..2562554, 2562899..2563065, 2564405..2564520, 2564917..2565039, 2565593..2565704, 2569008..2569111, 2569215..2569319, 2571470..2571705, 2572945..2573092)
Number of exons
22
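
The Locus field can be checked against the exon count with a short script. The following is a minimal sketch, not part of the original entry: the helper name `parse_join` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, as in the usual feature-location notation.

```python
import re

# Location string copied from the Locus field of this entry.
LOCUS = (
    "JH206105.1 join(2533555..2533603, 2535249..2535563, 2541717..2541980, "
    "2542139..2542269, 2543343..2543514, 2543618..2543807, 2545680..2545795, "
    "2550692..2550818, 2551790..2551983, 2552431..2552525, 2552633..2552765, "
    "2559610..2559714, 2562023..2562133, 2562401..2562554, 2562899..2563065, "
    "2564405..2564520, 2564917..2565039, 2565593..2565704, 2569008..2569111, "
    "2569215..2569319, 2571470..2571705, 2572945..2573092)"
)

def parse_join(location):
    """Return (scaffold, [(start, end), ...]) from a 'join(a..b, c..d)' string.

    Hypothetical helper for illustration only.
    """
    scaffold, _, rest = location.partition("join(")
    pairs = re.findall(r"(\d+)\.\.(\d+)", rest)
    return scaffold.strip(), [(int(s), int(e)) for s, e in pairs]

scaffold, exons = parse_join(LOCUS)

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
total_length = sum(end - start + 1 for start, end in exons)

print(scaffold, len(exons), total_length)  # prints: JH206105.1 22 3267
```

The summed exon length of 3267 nt agrees with the final position counter of the Coding Sequence block, and 3267 / 3 = 1089 codons corresponds to the 1088-residue protein plus one stop codon.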

Protein Sequence

MGTSPRTFLF LGCLLAGPWL TFCQLPLPTV LPNRDEMVVQ LNSSFTLKCS GDSEVTWQYP    60
MAEGSHRIDI RNEENNSGLF VTLLEVGNAS AAHTGLYTCY YNHTQEEDGE VEGKDIYIYV   120
PDPDMPFVPL LPEDQFILIE DGDSTIIPCR TSDPNAQVVL INSEAKVVYA YYDSKQGFLG   180
NFPAGSYTCK TTVKGVEFKS DQFFIYILKA TSQLPVEIEA LQTVHVKGET IVVTCVVFDN   240
EMVNLQWNYP GKVKEKGVTK VDDIKVPSQK LVYTLTIPEA SVKDTGDYEC TAQHATKEVK   300
ENKRVAITVY DKGFIHLDPQ FSPLEAVNLH EVKNFIVDVQ AYPAPKMIWL KDNVTLTENL   360
TEIVTSSNKI HETRFQSILK LIRAKEEDSG DYTLVAENGE ETKSYTFSLI IQVAASILDL   420
VDDHHSSGGG QTVQCMAKGT PFPDVDWLIC KDIKKCSNDT SWMLLTNNIS DIHMEAHRVE   480
KDQVESQVTF QKVEETLAVR CMARNTLGAV TRELKLVAPT LNSKLTVAAA VLVLLVFVII   540
LLIVLVIVWK QKPRYEIRWR VIESISPDGH EYIYVDPMQL PYDSRWEFPR DGLVLGRILG   600
SGAFGKVVEG TAYGLSRSQP VMKVAVKMLK PTARSSEKQA LMSELKIMTH LGPHLNIVNL   660
LGACTKSGPI YIITEYCFYG DLVNYLHKNR DNFLSRHPEK SKKDFDIFGM NTADESTRSY   720
VILSFENNGE YMDMKQADTT QYVPMLERKQ GSKYSDIQRS LYDRPASYKK KSVSEPEVKN   780
LLSDDNSDSL SLLDLLSFTY QVARGMEFLA SKNCVHRDLA ARNVLLAQGK IVKICDFGLA   840
RDIMHDSNYV SKGSTFLPVK WMAPESIFDN LYTTLSDVWS YGILLWEIFS LGGTPYPGMM   900
VDSSFYNKIK SGYRMAKPDH ATSEVYYEIM VKCWNSEPEK RPSFYHLSEI VENLLPGEYK   960
KSYERIHLDF LKSDHPAVTR MRVDCDNTYI GVTYKNEDKL KDRESGFDEQ RLSADSGYII  1020
PLPDIDPISE DELGKRNRHS SQTSEESAIE TGSSSSTFIK REDETIEDID MMDDIGIDSS  1080
DLVEDSFL                                                           1088

Coding Sequence

ATGGGTACTT CCCCAAGGAC ATTCCTGTTC CTGGGATGTC TCCTGGCAGG ACCCTGGCTA    60
ACTTTCTGCC AGCTTCCATT GCCGACTGTT CTTCCGAATA GAGATGAAAT GGTTGTACAG   120
CTGAATTCCT CCTTTACCCT GAAGTGCTCT GGAGACAGCG AAGTGACCTG GCAGTATCCA   180
ATGGCTGAAG GAAGCCACAG AATAGACATT AGAAATGAAG AGAACAACAG TGGTCTCTTT   240
GTGACTTTGC TGGAAGTGGG GAATGCCTCA GCTGCTCATA CCGGATTATA TACTTGTTAC   300
TACAATCATA CCCAAGAGGA GGACGGAGAG GTGGAGGGGA AGGATATCTA CATCTATGTG   360
CCTGATCCAG ATATGCCCTT TGTTCCTCTA CTTCCAGAGG ATCAATTCAT CCTAATAGAA   420
GACGGAGATT CGACCATTAT CCCTTGTCGG ACAAGTGACC CTAATGCTCA AGTAGTTTTA   480
ATTAACAGTG AAGCCAAGGT CGTATATGCC TATTATGACA GCAAACAAGG GTTCCTAGGA   540
AATTTTCCTG CAGGCTCATA CACATGCAAA ACAACTGTTA AAGGAGTGGA GTTCAAGTCA   600
GATCAGTTTT TCATCTACAT TTTGAAAGCT ACTTCACAGC TGCCAGTTGA GATTGAGGCC   660
CTTCAAACTG TACATGTAAA AGGAGAAACA ATTGTAGTAA CTTGTGTGGT CTTTGACAAC   720
GAGATGGTTA ATTTACAGTG GAATTATCCA GGGAAAGTGA AAGAAAAAGG TGTGACAAAA   780
GTTGATGACA TTAAAGTTCC ATCTCAGAAG CTGGTTTACA CTTTGACCAT TCCTGAGGCT   840
TCAGTGAAAG ACACCGGGGA TTATGAATGT ACTGCCCAAC ATGCAACCAA GGAGGTTAAA   900
GAAAATAAGA GAGTAGCAAT TACAGTTTAT GATAAAGGAT TCATTCATCT GGACCCCCAG   960
TTCAGTCCTT TGGAAGCAGT CAATTTACAT GAGGTCAAAA ATTTCATAGT GGATGTGCAG  1020
GCATACCCAG CTCCAAAAAT GATCTGGCTG AAAGACAATG TGACTCTGAC TGAGAACCTT  1080
ACAGAGATTG TTACTAGTTC AAACAAGATC CACGAGACAA GATTCCAAAG TATATTAAAA  1140
CTGATTCGGG CCAAGGAAGA AGATAGTGGA GATTATACTT TGGTTGCTGA AAACGGAGAA  1200
GAGACTAAAA GCTATACTTT CTCATTGATA ATACAAGTTG CAGCGTCAAT TCTAGACCTA  1260
GTGGATGATC ACCACAGCTC TGGAGGGGGA CAGACAGTCC AATGCATGGC AAAAGGGACA  1320
CCTTTCCCTG ACGTGGATTG GCTGATTTGC AAGGACATTA AAAAATGCAG TAATGATACC  1380
TCATGGATGC TTCTGACTAA CAACATCTCA GATATACACA TGGAGGCTCA TCGGGTTGAG  1440
AAGGACCAGG TGGAAAGCCA GGTGACCTTC CAAAAGGTAG AAGAGACCCT GGCAGTACGG  1500
TGCATGGCGA GGAACACTCT TGGTGCTGTT ACTCGGGAAC TGAAGCTTGT GGCTCCTACA  1560
TTGAATTCAA AATTAACCGT GGCTGCTGCA GTCCTAGTGT TATTAGTGTT TGTGATTATT  1620
TTACTGATTG TCCTGGTCAT CGTATGGAAA CAGAAACCAA GGTATGAGAT AAGATGGAGA  1680
GTCATTGAGT CTATCAGCCC TGATGGCCAT GAGTACATTT ATGTGGACCC AATGCAATTG  1740
CCCTATGACT CAAGATGGGA GTTTCCTAGA GATGGATTAG TACTTGGGCG AATCCTGGGT  1800
TCAGGCGCGT TTGGGAAAGT GGTTGAAGGA ACTGCATATG GATTAAGTCG CTCTCAACCG  1860
GTGATGAAGG TAGCTGTGAA AATGCTAAAA CCCACAGCTA GATCCAGTGA AAAACAGGCA  1920
CTGATGTCTG AATTGAAGAT AATGACACAT CTTGGGCCCC ATTTGAACAT TGTGAATCTC  1980
TTAGGAGCTT GTACCAAATC AGGTCCAATT TACATCATTA CTGAATACTG CTTTTATGGT  2040
GACTTGGTAA ACTACCTGCA TAAAAACAGG GATAATTTCC TGAGCCGACA TCCAGAAAAG  2100
TCAAAGAAGG ACTTCGACAT TTTTGGGATG AACACAGCTG ATGAAAGCAC AAGAAGTTAT  2160
GTCATTTTAT CTTTTGAAAA CAATGGAGAA TACATGGACA TGAAACAAGC TGATACAACA  2220
CAGTATGTGC CAATGCTGGA AAGGAAGCAG GGTTCTAAGT ATTCGGATAT CCAGAGATCG  2280
TTGTATGATC GGCCTGCATC ATATAAGAAA AAATCTGTGT CAGAACCAGA AGTCAAAAAC  2340
CTGCTTTCCG ATGACAATTC TGACAGCCTC AGTCTGCTGG ATTTACTAAG CTTCACGTAC  2400
CAAGTTGCAC GAGGAATGGA GTTCTTGGCT TCTAAAAATT GTGTGCACCG TGACTTGGCA  2460
GCTCGTAATG TCCTTCTGGC TCAAGGAAAA ATTGTGAAGA TCTGTGACTT TGGTCTGGCT  2520
AGGGACATCA TGCACGATTC CAACTATGTC TCCAAAGGCA GCACTTTTCT CCCAGTGAAA  2580
TGGATGGCAC CTGAAAGCAT TTTTGACAAC CTGTACACCA CATTAAGCGA TGTCTGGTCT  2640
TATGGCATTC TGCTGTGGGA GATATTTTCT CTTGGTGGCA CCCCATACCC TGGCATGATG  2700
GTTGATTCTT CTTTCTACAA CAAGATAAAA AGTGGCTACC GAATGGCAAA ACCTGACCAT  2760
GCTACCAGTG AAGTGTACTA TGAGATCATG GTGAAATGTT GGAACAGCGA ACCAGAGAAA  2820
AGACCTTCAT TTTACCATTT GAGTGAAATC GTTGAGAATC TGTTGCCTGG CGAGTACAAG  2880
AAGAGCTATG AAAGGATTCA TCTGGACTTC CTGAAAAGTG ACCACCCGGC TGTCACACGC  2940
ATGAGAGTGG ACTGTGACAA CACCTACATC GGTGTCACCT ACAAGAATGA AGACAAGTTA  3000
AAGGACAGGG AGAGTGGATT TGATGAGCAG CGATTGAGTG CTGACAGTGG CTACATCATT  3060
CCTCTGCCTG ACATTGATCC CATCTCCGAA GATGAACTTG GCAAAAGGAA CAGGCACAGT  3120
TCCCAGACAT CCGAAGAAAG TGCCATTGAG ACTGGTTCCA GCAGCTCTAC CTTTATCAAG  3180
CGAGAGGATG AGACCATTGA GGACATTGAC ATGATGGACG ACATTGGAAT AGATTCCTCA  3240
GACCTGGTGG AAGACAGTTT CCTGTAA                                      3267
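
Both sequence blocks are displayed in groups of 10 characters with a running position counter at the end of each line, so they need cleaning before reuse. The sketch below is an illustration under stated assumptions, not part of the original entry: the helper names `clean_block` and `translate` are hypothetical, and the standard genetic code (NCBI translation table 1) is assumed for this nuclear gene. It uses only the last display line of each block as sample input.

```python
from itertools import product

# Last display line of each sequence block in this entry.
PROTEIN_TAIL = "DLVEDSFL                                                           1088"
CDS_TAIL = "GACCTGGTGG AAGACAGTTT CCTGTAA                                      3267"

def clean_block(text):
    """Join the 10-character groups and drop the running position counters."""
    return "".join(tok for tok in text.split() if not tok.isdigit())

# Assumption: standard genetic code, with codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO)
}

def translate(cds):
    """Translate complete codons in reading frame 0; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

protein_tail = clean_block(PROTEIN_TAIL)
cds_tail = clean_block(CDS_TAIL)

print(protein_tail)         # prints: DLVEDSFL
print(translate(cds_tail))  # prints: DLVEDSFL*
```

The same two helpers can be applied to the full blocks to confirm that the coding sequence translates to the listed protein followed by a single stop codon.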