E Chrysemys picta bellii | Arylsulfatase G [ARSG]
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
CHRPI03625 reference isoform | 535 | 11 | ![]() |
MEILFLKMLL LLAGGLVGLF YHLSDLWVGE KRTVQNKPNF IVILADDVGW GDLGANWAET 60 KDTPNLDQMA AEGMRFVDFH SAASTCSPSR ASLLTGRLGI RNGVTHNFAV TSVGGLPLNE 120 TTLAEVLKEA GYITAVIGKW HLGHSGLYHP NFRGFDCYFG IPYSHDMGCT DTPGYNLPPC 180 PACPRHSAAT SSHGRDCYTK VALPLFENIT IIQQPVNLAG LAAEYAEKAT QFIQRASESG 240 RPFFLYLALA HMHVPLVLAQ PPSSSSFPEP YRASLREMDA LVGRIKDKVD GCGKENTFLW 300 FTGDNGPWAQ KCELAGSVGP FSGAWQRQRG GSAAKQTTWE GGHRVPALAY WPGRIPANMT 360 SAALLSTTDI FPTLVSLAKA SLPPNRRFDG SDMSEILFGQ SHQGHKTLFH PNSGAAGKDG 420 EIKALRLAQY KAFYTTGGAK ACDGSIGLEE HHQPPLIFNL DQDIQEQVPL DVKAGEYQAV 480 LPAITRALTD FLQDIATDNV STADYSHDPA ATPCCNPQHI VCRCQVSCSS CNREP 535
ATGGAGATTC TGTTCCTGAA AATGCTTTTG CTGCTGGCTG GCGGTTTAGT TGGGCTCTTC 60 TATCACCTCT CAGATCTTTG GGTGGGTGAG AAGAGAACAG TTCAGAACAA GCCGAATTTC 120 ATAGTCATTT TGGCAGATGA TGTTGGATGG GGGGACCTTG GGGCCAACTG GGCAGAAACC 180 AAGGACACTC CTAATTTGGA TCAGATGGCT GCAGAAGGAA TGAGATTTGT GGATTTTCAT 240 TCTGCTGCAT CCACTTGCTC CCCATCCCGT GCCTCCCTGC TCACAGGAAG GCTCGGCATT 300 CGCAATGGAG TGACCCACAA CTTTGCTGTC ACGTCAGTGG GTGGCCTCCC CCTCAATGAG 360 ACTACGTTAG CTGAGGTTCT GAAGGAAGCA GGATACATCA CTGCGGTAAT AGGTAAATGG 420 CACCTCGGGC ACAGCGGCCT CTATCATCCC AACTTTCGTG GTTTCGACTG TTACTTCGGG 480 ATCCCCTACA GTCACGACAT GGGTTGTACT GACACCCCAG GCTACAACCT CCCTCCCTGT 540 CCGGCATGCC CTCGACACTC TGCAGCCACA AGCAGCCATG GGAGAGATTG TTACACCAAG 600 GTGGCTCTTC CTCTCTTTGA AAACATCACC ATCATCCAAC AGCCTGTGAA CCTTGCTGGC 660 TTGGCTGCAG AGTATGCAGA AAAAGCAACT CAGTTCATCC AGCGCGCAAG TGAAAGTGGA 720 CGCCCGTTCT TCCTATACCT AGCGCTGGCC CATATGCACG TGCCCTTGGT CCTCGCTCAG 780 CCTCCCTCCA GCTCCTCGTT CCCTGAGCCA TACCGAGCCA GCCTGAGGGA GATGGATGCC 840 CTGGTGGGAC GGATAAAGGA CAAAGTTGAC GGCTGCGGGA AGGAAAACAC ATTCCTTTGG 900 TTCACAGGAG ACAACGGCCC TTGGGCTCAG AAGTGCGAGC TTGCAGGGAG CGTGGGCCCG 960 TTCTCCGGGG CATGGCAAAG GCAACGAGGT GGGAGTGCAG CCAAGCAAAC GACCTGGGAA 1020 GGGGGACATC GTGTTCCGGC GCTGGCTTAT TGGCCTGGCA GGATCCCTGC AAACATGACG 1080 AGCGCTGCAC TGTTGAGCAC CACAGACATT TTCCCCACCT TGGTGTCCCT GGCAAAGGCA 1140 AGCCTGCCTC CCAACAGGCG CTTTGATGGT TCAGACATGT CTGAGATTCT CTTCGGGCAG 1200 TCGCACCAAG GGCACAAGAC GTTGTTTCAC CCCAATAGTG GAGCAGCTGG AAAGGACGGA 1260 GAGATAAAAG CGCTACGGCT GGCACAATAC AAGGCATTTT ACACTACAGG CGGGGCGAAA 1320 GCCTGTGACG GAAGCATTGG GCTAGAAGAG CATCACCAAC CGCCGCTCAT TTTTAATCTG 1380 GATCAAGACA TTCAAGAGCA GGTGCCTCTG GACGTGAAGG CTGGCGAGTA TCAGGCTGTG 1440 CTACCTGCGA TAACCAGGGC TCTGACAGAC TTTCTTCAGG ATATTGCAAC AGACAACGTC 1500 TCAACAGCGG ATTACTCCCA TGATCCAGCA GCGACACCCT GCTGCAATCC ACAGCACATA 1560 GTTTGCCGAT GTCAGGTTAG CTGCTCTAGC TGTAACAGGG AGCCGTAG 1608