E Macaca nemestrina | Arylsulfatase G [ARSG]
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
MACNE15042 | 525 | 11 | ![]() | |||
MACNE15041 reference isoform | 525 | 11 | ![]() |
MGWLFLKVLL AGVSFSGFLY PLVDFCISGK TRGRKPNFVI ILADDMGWGD LGANWAETKD 60 TANLDKMASE GMRFVDFHAA ASTCSPSRAS LLTGRLGLRN GVTHNFAVTS VGGLPLNETT 120 LAEVLQQAGY ITGMIGKWHL GHHGSYHPNF RGFDYYFGIP YSHDMGCTDT PGYNHPPCPA 180 CPQGDGPSSN LQRDCYTDVA LPLYENLHIV EQPVNLSSLA RKYAEKATQF IQQASTSGRP 240 FLLYVALAHM HVPLPVTQLP AAPRDRRLYG AGLREMDGLV GQIKDKVDHT AKDNTFLWFT 300 GDNGPWAEKC ELAGSVGAFT GVWQARRGGS PAKQTTWEGG HRVPALAYWP GRVPVNVTST 360 ALLSVLDIFP TVVALAQASL PQGRRFDGVD VSEVLFGRSQ PGHRVLFHPN SGAAGEFGAL 420 QTVRLERYKA FYVTGGARAC DGSIGPELQH KLPLIFNLED DIAEAVPLER GGAEYQAVLP 480 KVRKVLADVL QDIANDNISS ADYTQDPSVT PCCNPYQVAC RCQAP 525
ATGGGCTGGC TTTTTCTAAA GGTTTTGTTG GCGGGAGTGA GTTTCTCAGG ATTTCTTTAT 60 CCTCTTGTGG ATTTTTGCAT CAGTGGGAAA ACAAGAGGCC GGAAGCCAAA CTTTGTGATT 120 ATTTTGGCCG ATGACATGGG GTGGGGTGAC CTGGGAGCAA ACTGGGCAGA AACAAAGGAC 180 ACTGCCAACC TTGATAAGAT GGCTTCGGAG GGAATGAGGT TTGTGGATTT CCATGCAGCT 240 GCCTCCACCT GCTCACCCTC CCGGGCTTCC TTGCTCACTG GCCGACTTGG CCTTCGCAAT 300 GGAGTCACAC ACAACTTTGC AGTCACCTCT GTGGGAGGCC TTCCGCTCAA CGAAACCACC 360 TTGGCAGAGG TGCTGCAGCA GGCGGGTTAC ATCACTGGGA TGATAGGCAA ATGGCATCTT 420 GGACACCACG GCTCGTATCA CCCCAACTTC CGTGGTTTTG ATTACTACTT TGGAATCCCA 480 TATAGCCATG ATATGGGCTG TACTGATACT CCAGGCTACA ACCACCCTCC TTGTCCAGCA 540 TGTCCACAGG GTGATGGACC ATCAAGCAAC CTTCAAAGAG ACTGTTACAC TGACGTGGCC 600 CTCCCTCTCT ATGAAAACCT CCACATTGTG GAACAGCCGG TGAACTTGAG CAGCCTTGCC 660 CGGAAGTATG CGGAGAAAGC AACCCAGTTC ATCCAGCAGG CAAGCACCAG CGGGAGGCCC 720 TTCCTGCTAT ATGTGGCTCT GGCCCACATG CACGTGCCCT TACCTGTGAC TCAGCTACCA 780 GCAGCCCCAC GGGACAGAAG GCTGTATGGT GCAGGGCTCC GGGAGATGGA CGGTCTGGTT 840 GGCCAGATCA AGGACAAAGT TGACCACACA GCGAAGGACA ACACATTCCT CTGGTTTACA 900 GGAGACAATG GCCCATGGGC TGAGAAGTGT GAGCTAGCGG GCAGTGTGGG TGCCTTCACT 960 GGAGTGTGGC AAGCTCGTCG AGGGGGAAGT CCAGCCAAGC AGACGACCTG GGAAGGCGGG 1020 CACCGGGTCC CGGCACTGGC TTACTGGCCT GGCAGAGTTC CAGTTAATGT CACCAGCACT 1080 GCCTTGTTAA GTGTGCTGGA CATTTTTCCG ACTGTGGTAG CACTGGCCCA GGCCAGCTTA 1140 CCTCAAGGAC GGCGCTTTGA TGGTGTGGAC GTCTCCGAGG TGCTCTTTGG CCGGTCACAG 1200 CCTGGGCACA GGGTGCTGTT CCACCCCAAC AGCGGGGCAG CTGGAGAGTT TGGAGCCCTG 1260 CAGACTGTCC GCCTGGAGCG TTACAAAGCC TTCTACGTTA CCGGTGGAGC CAGAGCATGT 1320 GATGGGAGCA TCGGGCCTGA GCTGCAGCAT AAGCTTCCTC TGATTTTCAA CCTGGAAGAC 1380 GATATCGCAG AAGCTGTGCC TCTAGAAAGA GGTGGTGCGG AGTACCAGGC CGTGCTGCCC 1440 AAGGTCAGAA AGGTTCTTGC AGACGTCCTT CAAGACATTG CCAATGACAA CATCTCCAGT 1500 GCAGATTACA CCCAGGACCC TTCAGTCACT CCCTGCTGCA ATCCCTACCA AGTTGCCTGC 1560 CGCTGTCAAG CCCCATAA 1578