Gene PANTR25076 (A0A2I3RJN5)

E Pan troglodytes | Arylsulfatase [SULF2]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
PANTR25074 870 20 UniProtKB/TrEMBL A0A6D2VVM9
PANTR25075 851 18 UniProtKB/TrEMBL A0A2I3TJ35
PANTR25076 reference isoform 845 19 UniProtKB/TrEMBL A0A2I3RJN5


Protein Sequence

Download: Fasta
MGPPSLMLCL LSATVFSLLG GSSAFLSHHR LKGRFQRDRR NIRPNIILVL TDDQDVELGS    60
MQVMNKTRRI MEQGGAHFIN AFVTTPMCCP SRSSILTGKY VHNHNTYTNN ENCSSPSWQA   120
QHESRTFAVY LNSTGYRTAF FGKYLNEYNG SYVPPGWKEW VGLLKNSRFY NYTLCRNGVK   180
EKHGSDYSKD YLTDLITNDS VSFFRTSKKM YPHRPVLMVI SHAAPHGPED SAPQYSRLFP   240
NASQHITPSY NYAPNPDKHW IMRYTGPMKP IHMEFTNMLQ RKRLQTLMSV DDSMETIYNM   300
LVETGELDNT YIVYTADHGY HIGQFGLVKG KSMPYEFDIR VPFYVRGPNV EAGSLNPHIV   360
LNIDLAPTIL DIAGLDIPAD MDGKSILKLL DTERPVNRFH LKKKMRVWRD SFLVERGKLL   420
HKRDNDKVDA QEENFLPKYQ RVKDLCQRAE YQTACEQLGQ KWQCVEDATG KLKLHKCKGP   480
MRLGGSRALS NLVPKYYGQG SEACTCDSGD YKLSLAGRRK KLFKKKYKAS YVRSRSIRSV   540
AIEVDGLGDA AQPRNLTKRH WPGAPEDQDD KDGGDFSGTG GLPDYSAANP IKVTHRCYIL   600
ENDTVQCDLD LYKSLQAWKD HKLHIDHEIE TLQNKIKNLR EVRGHLKKKR PEECDCHKIS   660
YHTQHKGRLK HKGSSLHPFR KGLQEKDKVW LLREQKRKKK LRKLLKRLQN NDTCSMPGLT   720
CFTHDNQHWQ TAPFWTLGPF CACTSANNNT YWCMRTINET HNFLFCEFAT GFLEYFDLNT   780
DPYQLMNAVN TLDRDVLNQL HVQLMELRSC KGYKQCNPRT RNMDLGEQFQ RRKWPEMKRP   840
SSKSL                                                               845

cDNA Sequence

Download: Fasta
ATGGGCCCCC CGAGCCTCAT GCTGTGCTTG CTGTCCGCAA CTGTGTTCTC CCTGCTGGGT    60
GGAAGCTCGG CCTTCCTGTC GCACCACCGC CTGAAAGGCA GGTTTCAGAG GGACCGCAGG   120
AACATCCGCC CCAACATCAT CCTGGTGCTG ACGGACGACC AGGATGTGGA GCTGGGTTCC   180
ATGCAGGTGA TGAACAAGAC CCGGCGCATC ATGGAGCAGG GCGGGGCGCA CTTCATCAAC   240
GCCTTCGTGA CCACACCCAT GTGCTGCCCC TCACGCTCCT CCATCCTCAC CGGCAAGTAT   300
GTCCACAACC ACAACACCTA CACCAATAAC GAGAACTGCT CCTCGCCCTC CTGGCAGGCA   360
CAGCACGAGA GCCGCACCTT TGCCGTGTAC CTCAATAGCA CCGGCTACCG GACAGCTTTC   420
TTCGGGAAGT ATCTTAATGA ATACAACGGC TCCTACGTGC CGCCCGGCTG GAAGGAGTGG   480
GTCGGACTCC TTAAAAACTC CCGCTTTTAT AACTACACGC TGTGTCGGAA CGGGGTGAAA   540
GAGAAGCACG GCTCCGACTA CTCCAAGGAT TACCTCACAG ACCTCATCAC CAATGACAGC   600
GTGAGCTTCT TCCGCACGTC CAAGAAGATG TACCCGCACA GGCCAGTCCT CATGGTCATC   660
AGCCATGCAG CCCCCCACGG CCCTGAGGAT TCAGCCCCAC AATATTCACG CCTCTTCCCA   720
AACGCATCTC AGCACATCAC GCCGAGCTAC AACTACGCGC CCAACCCGGA CAAACACTGG   780
ATCATGCGCT ACACGGGGCC CATGAAGCCC ATCCACATGG AATTCACCAA CATGCTCCAG   840
CGGAAGCGCT TGCAGACCCT CATGTCGGTG GACGACTCCA TGGAGACGAT TTACAACATG   900
CTGGTTGAGA CGGGCGAGCT GGACAACACG TACATCGTGT ACACCGCCGA CCACGGTTAC   960
CACATCGGCC AGTTTGGCCT GGTGAAAGGG AAATCCATGC CATATGAGTT TGACATCAGG  1020
GTCCCGTTCT ACGTGAGGGG CCCCAACGTG GAAGCCGGCT CTCTGAATCC CCACATCGTC  1080
CTCAACATTG ACCTGGCCCC CACCATCCTG GACATTGCAG GCCTGGACAT ACCTGCGGAT  1140
ATGGACGGGA AATCCATCCT CAAGCTGCTG GACACGGAGC GGCCGGTGAA TCGGTTTCAC  1200
TTGAAAAAGA AGATGAGGGT CTGGCGGGAC TCCTTCTTGG TGGAGAGAGG CAAGCTGCTA  1260
CACAAGAGAG ACAATGACAA GGTGGACGCC CAGGAGGAGA ACTTTCTGCC CAAGTACCAG  1320
CGTGTGAAGG ACCTGTGTCA GCGCGCTGAG TACCAGACGG CGTGTGAGCA GCTGGGACAG  1380
AAGTGGCAGT GTGTGGAGGA CGCCACGGGG AAGCTGAAGC TGCATAAGTG CAAGGGCCCC  1440
ATGCGGCTGG GCGGCAGCAG AGCCCTCTCC AACCTCGTGC CCAAGTACTA CGGGCAGGGC  1500
AGCGAGGCCT GCACCTGTGA CAGCGGGGAC TACAAGCTCA GCCTGGCCGG ACGCCGGAAA  1560
AAACTCTTCA AGAAGAAGTA CAAGGCCAGC TATGTCCGCA GTCGCTCCAT CCGCTCAGTG  1620
GCCATCGAGG TGGACGGCCT GGGTGATGCC GCCCAGCCCC GAAACCTCAC CAAGCGGCAC  1680
TGGCCAGGGG CCCCTGAGGA CCAAGATGAC AAGGATGGTG GGGACTTCAG TGGCACTGGA  1740
GGCCTTCCCG ACTACTCAGC CGCCAACCCC ATTAAAGTGA CACATCGGTG CTACATCCTA  1800
GAGAACGACA CAGTCCAGTG TGACCTGGAC CTGTACAAGT CCCTGCAGGC CTGGAAAGAC  1860
CACAAGCTGC ACATCGACCA CGAGATTGAA ACCCTGCAGA ACAAAATTAA GAACCTGAGG  1920
GAAGTCCGAG GTCACCTGAA GAAAAAGCGG CCAGAAGAAT GTGACTGTCA CAAAATCAGC  1980
TACCACACCC AGCACAAAGG CCGCCTCAAG CACAAAGGCT CCAGTCTGCA TCCTTTCAGG  2040
AAGGGCCTGC AAGAGAAGGA CAAGGTGTGG CTGTTGCGGG AGCAGAAGCG CAAGAAGAAA  2100
CTCCGCAAGC TGCTCAAGCG CCTGCAGAAC AACGACACGT GCAGCATGCC AGGCCTCACG  2160
TGCTTCACCC ACGACAACCA GCACTGGCAG ACAGCGCCTT TCTGGACACT GGGGCCTTTC  2220
TGTGCCTGCA CCAGCGCCAA CAATAACACG TACTGGTGCA TGAGGACCAT CAATGAGACT  2280
CACAATTTCC TCTTCTGTGA ATTTGCAACT GGCTTCCTAG AGTACTTTGA TCTCAACACA  2340
GACCCCTACC AGCTGATGAA CGCAGTGAAC ACACTGGACA GGGATGTCCT CAACCAGCTA  2400
CACGTACAGC TCATGGAGCT GAGGAGCTGC AAGGGTTACA AGCAGTGTAA CCCCCGGACT  2460
CGAAACATGG ACCTGGGTGA GCAGTTTCAG CGTCGAAAGT GGCCAGAAAT GAAGAGACCT  2520
TCTTCCAAAT CACTGTGA                                                2538