E Pan troglodytes | Arylsulfatase [SULF2]
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
PANTR25074 | 870 | 20 | ![]() | |||
PANTR25075 | 851 | 18 | ![]() | |||
PANTR25076 reference isoform | 845 | 19 | ![]() |
MGPPSLMLCL LSATVFSLLG GSSAFLSHHR LKGRFQRDRR NIRPNIILVL TDDQDVELGS 60 MQVMNKTRRI MEQGGAHFIN AFVTTPMCCP SRSSILTGKY VHNHNTYTNN ENCSSPSWQA 120 QHESRTFAVY LNSTGYRTAF FGKYLNEYNG SYVPPGWKEW VGLLKNSRFY NYTLCRNGVK 180 EKHGSDYSKD YLTDLITNDS VSFFRTSKKM YPHRPVLMVI SHAAPHGPED SAPQYSRLFP 240 NASQHITPSY NYAPNPDKHW IMRYTGPMKP IHMEFTNMLQ RKRLQTLMSV DDSMETIYNM 300 LVETGELDNT YIVYTADHGY HIGQFGLVKG KSMPYEFDIR VPFYVRGPNV EAGSLNPHIV 360 LNIDLAPTIL DIAGLDIPAD MDGKSILKLL DTERPVNRFH LKKKMRVWRD SFLVERGKLL 420 HKRDNDKVDA QEENFLPKYQ RVKDLCQRAE YQTACEQLGQ KWQCVEDATG KLKLHKCKGP 480 MRLGGSRALS NLVPKYYGQG SEACTCDSGD YKLSLAGRRK KLFKKKYKAS YVRSRSIRSV 540 AIEVDGLGDA AQPRNLTKRH WPGAPEDQDD KDGGDFSGTG GLPDYSAANP IKVTHRCYIL 600 ENDTVQCDLD LYKSLQAWKD HKLHIDHEIE TLQNKIKNLR EVRGHLKKKR PEECDCHKIS 660 YHTQHKGRLK HKGSSLHPFR KGLQEKDKVW LLREQKRKKK LRKLLKRLQN NDTCSMPGLT 720 CFTHDNQHWQ TAPFWTLGPF CACTSANNNT YWCMRTINET HNFLFCEFAT GFLEYFDLNT 780 DPYQLMNAVN TLDRDVLNQL HVQLMELRSC KGYKQCNPRT RNMDLGEQFQ RRKWPEMKRP 840 SSKSL 845
ATGGGCCCCC CGAGCCTCAT GCTGTGCTTG CTGTCCGCAA CTGTGTTCTC CCTGCTGGGT 60 GGAAGCTCGG CCTTCCTGTC GCACCACCGC CTGAAAGGCA GGTTTCAGAG GGACCGCAGG 120 AACATCCGCC CCAACATCAT CCTGGTGCTG ACGGACGACC AGGATGTGGA GCTGGGTTCC 180 ATGCAGGTGA TGAACAAGAC CCGGCGCATC ATGGAGCAGG GCGGGGCGCA CTTCATCAAC 240 GCCTTCGTGA CCACACCCAT GTGCTGCCCC TCACGCTCCT CCATCCTCAC CGGCAAGTAT 300 GTCCACAACC ACAACACCTA CACCAATAAC GAGAACTGCT CCTCGCCCTC CTGGCAGGCA 360 CAGCACGAGA GCCGCACCTT TGCCGTGTAC CTCAATAGCA CCGGCTACCG GACAGCTTTC 420 TTCGGGAAGT ATCTTAATGA ATACAACGGC TCCTACGTGC CGCCCGGCTG GAAGGAGTGG 480 GTCGGACTCC TTAAAAACTC CCGCTTTTAT AACTACACGC TGTGTCGGAA CGGGGTGAAA 540 GAGAAGCACG GCTCCGACTA CTCCAAGGAT TACCTCACAG ACCTCATCAC CAATGACAGC 600 GTGAGCTTCT TCCGCACGTC CAAGAAGATG TACCCGCACA GGCCAGTCCT CATGGTCATC 660 AGCCATGCAG CCCCCCACGG CCCTGAGGAT TCAGCCCCAC AATATTCACG CCTCTTCCCA 720 AACGCATCTC AGCACATCAC GCCGAGCTAC AACTACGCGC CCAACCCGGA CAAACACTGG 780 ATCATGCGCT ACACGGGGCC CATGAAGCCC ATCCACATGG AATTCACCAA CATGCTCCAG 840 CGGAAGCGCT TGCAGACCCT CATGTCGGTG GACGACTCCA TGGAGACGAT TTACAACATG 900 CTGGTTGAGA CGGGCGAGCT GGACAACACG TACATCGTGT ACACCGCCGA CCACGGTTAC 960 CACATCGGCC AGTTTGGCCT GGTGAAAGGG AAATCCATGC CATATGAGTT TGACATCAGG 1020 GTCCCGTTCT ACGTGAGGGG CCCCAACGTG GAAGCCGGCT CTCTGAATCC CCACATCGTC 1080 CTCAACATTG ACCTGGCCCC CACCATCCTG GACATTGCAG GCCTGGACAT ACCTGCGGAT 1140 ATGGACGGGA AATCCATCCT CAAGCTGCTG GACACGGAGC GGCCGGTGAA TCGGTTTCAC 1200 TTGAAAAAGA AGATGAGGGT CTGGCGGGAC TCCTTCTTGG TGGAGAGAGG CAAGCTGCTA 1260 CACAAGAGAG ACAATGACAA GGTGGACGCC CAGGAGGAGA ACTTTCTGCC CAAGTACCAG 1320 CGTGTGAAGG ACCTGTGTCA GCGCGCTGAG TACCAGACGG CGTGTGAGCA GCTGGGACAG 1380 AAGTGGCAGT GTGTGGAGGA CGCCACGGGG AAGCTGAAGC TGCATAAGTG CAAGGGCCCC 1440 ATGCGGCTGG GCGGCAGCAG AGCCCTCTCC AACCTCGTGC CCAAGTACTA CGGGCAGGGC 1500 AGCGAGGCCT GCACCTGTGA CAGCGGGGAC TACAAGCTCA GCCTGGCCGG ACGCCGGAAA 1560 AAACTCTTCA AGAAGAAGTA CAAGGCCAGC TATGTCCGCA GTCGCTCCAT CCGCTCAGTG 1620 GCCATCGAGG TGGACGGCCT GGGTGATGCC GCCCAGCCCC GAAACCTCAC CAAGCGGCAC 1680 TGGCCAGGGG CCCCTGAGGA CCAAGATGAC AAGGATGGTG GGGACTTCAG TGGCACTGGA 1740 GGCCTTCCCG ACTACTCAGC CGCCAACCCC ATTAAAGTGA CACATCGGTG CTACATCCTA 1800 GAGAACGACA CAGTCCAGTG TGACCTGGAC CTGTACAAGT CCCTGCAGGC CTGGAAAGAC 1860 CACAAGCTGC ACATCGACCA CGAGATTGAA ACCCTGCAGA ACAAAATTAA GAACCTGAGG 1920 GAAGTCCGAG GTCACCTGAA GAAAAAGCGG CCAGAAGAAT GTGACTGTCA CAAAATCAGC 1980 TACCACACCC AGCACAAAGG CCGCCTCAAG CACAAAGGCT CCAGTCTGCA TCCTTTCAGG 2040 AAGGGCCTGC AAGAGAAGGA CAAGGTGTGG CTGTTGCGGG AGCAGAAGCG CAAGAAGAAA 2100 CTCCGCAAGC TGCTCAAGCG CCTGCAGAAC AACGACACGT GCAGCATGCC AGGCCTCACG 2160 TGCTTCACCC ACGACAACCA GCACTGGCAG ACAGCGCCTT TCTGGACACT GGGGCCTTTC 2220 TGTGCCTGCA CCAGCGCCAA CAATAACACG TACTGGTGCA TGAGGACCAT CAATGAGACT 2280 CACAATTTCC TCTTCTGTGA ATTTGCAACT GGCTTCCTAG AGTACTTTGA TCTCAACACA 2340 GACCCCTACC AGCTGATGAA CGCAGTGAAC ACACTGGACA GGGATGTCCT CAACCAGCTA 2400 CACGTACAGC TCATGGAGCT GAGGAGCTGC AAGGGTTACA AGCAGTGTAA CCCCCGGACT 2460 CGAAACATGG ACCTGGGTGA GCAGTTTCAG CGTCGAAAGT GGCCAGAAAT GAAGAGACCT 2520 TCTTCCAAAT CACTGTGA 2538