Gene NOMLE06989 (G1R5I0)

E Nomascus leucogenys | Arylsulfatase [SULF2]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
NOMLE06990 845 19 UniProtKB/TrEMBL A0A2I3H4R8
NOMLE06989 reference isoform 852 19 UniProtKB/TrEMBL G1R5I0


Protein Sequence

Download: Fasta
MGPPSLVLCL LSATVFSLLG GSSAFLSHHR LKGRFQRDRR NIRPNIILVL TDDQDVELGS    60
MQVMNKTRRI MEQGGAHFIN AFVTTPMCCP SRSSILTGKY VHNHNTYTNN ENCSSPSWQA   120
QHESRTFAVY LNSTGYRTAF FGKYLNEYNG SYVPPGWKEW VGLLKNSRFY NYTLCRNGVK   180
EKHGSDYSKD YLTDLITNDS VSFFRTSKKM YPHRPVLMVI SHAAPHGPED SAPQYSRLFP   240
NASQHITPSY NYAPNPDKHW IMRYTGPMKP IHMEFTNMLQ RKRLQTLMSV DDSMETIYNM   300
LVETGELDNT YIVYTADHGY HIGQFGLVKG KSMPYEFDIR VPFYVRGPNV EAGSLNPHIV   360
LNIDLAPTIL DIAGLDIPAD MDGKSILKLL DTERPVNRFH LKKKMRVWRD SFLVERGKLL   420
HKRDNDKVDA QEENFLPKYQ RVKDLCQRAE YQTACEQLGQ KWQCVEDATG KLKLHKCKGP   480
MRLGGGRALS NLVPKYYRQG SEACTCDSGD YKLSLAGRRK KLFKKKYKAS YVRSRSIRSV   540
AIEVDGRVYH VGLGDATQPR NLTKRHWPGA PEDQDDKDGD FSGTGGLPDY SAANPIKVTH   600
RRCYILENDT VQCDLDLYKS LQAWKDHKLH IDHEIETLQN KIKNLREVRG HLKKKRPEEC   660
DCHKISYHTQ HKVRLKHKGS SLHPFRKGLQ EKDKVWLLRE QKRKKKLRKL LKRLQNNDTC   720
SMPGLTCFTH DNQHWQTAPF WTLGPFCACT SANNNTYWCM RTINETHNFL FCEFATGFLE   780
YFDLNTDPYQ LMNAVNTLDR DVLNQLHVQL MELRSCKGYK QCNPRTRNMD LGEKGLRRWC   840
LPQGQLWEGW EG                                                       852

cDNA Sequence

Download: Fasta
ATGGGCCCCC CGAGCCTCGT GCTGTGCTTG CTGTCCGCAA CTGTGTTCTC CCTGCTGGGT    60
GGAAGCTCGG CCTTCCTGTC GCACCACCGC CTGAAAGGCA GGTTTCAGAG GGACCGCAGG   120
AACATCCGCC CCAACATCAT CCTGGTGCTG ACGGACGACC AGGACGTGGA GCTGGGTTCC   180
ATGCAGGTGA TGAACAAGAC CCGGCGCATC ATGGAGCAGG GTGGGGCGCA CTTCATCAAC   240
GCCTTCGTGA CCACGCCCAT GTGCTGCCCC TCGCGCTCCT CCATCCTCAC CGGCAAGTAC   300
GTCCACAACC ACAACACCTA CACCAACAAC GAGAACTGCT CCTCGCCCTC CTGGCAGGCA   360
CAGCACGAGA GCCGCACCTT CGCCGTGTAC CTCAATAGCA CCGGCTACCG GACAGCTTTC   420
TTCGGGAAGT ATCTTAATGA ATACAACGGC TCCTACGTGC CTCCCGGCTG GAAGGAGTGG   480
GTCGGACTCC TTAAAAACTC CCGCTTTTAT AACTACACGC TGTGTCGGAA TGGGGTGAAA   540
GAGAAGCATG GCTCCGACTA CTCCAAGGAT TACCTCACAG ACCTCATCAC CAATGACAGC   600
GTGAGCTTCT TCCGCACGTC CAAGAAGATG TACCCGCACA GGCCAGTCCT CATGGTCATC   660
AGCCACGCAG CCCCCCACGG CCCTGAAGAT TCAGCCCCGC AATATTCACG CCTCTTCCCA   720
AACGCTTCTC AGCACATCAC GCCAAGCTAC AACTACGCAC CCAACCCGGA CAAACACTGG   780
ATCATGCGCT ACACGGGGCC CATGAAGCCC ATCCACATGG AATTCACCAA CATGCTCCAG   840
CGGAAGCGCC TGCAGACCCT CATGTCGGTG GACGACTCCA TGGAGACGAT TTACAACATG   900
CTGGTTGAGA CGGGCGAGCT GGACAACACG TACATCGTGT ACACCGCCGA CCACGGCTAC   960
CACATCGGCC AGTTTGGCCT GGTGAAAGGG AAATCCATGC CGTATGAGTT TGACATCAGG  1020
GTCCCTTTCT ATGTGAGGGG CCCCAACGTG GAAGCTGGCT CTCTGAATCC CCACATCGTC  1080
CTCAACATTG ACCTGGCCCC CACCATCCTG GACATTGCAG GCCTGGACAT ACCCGCAGAT  1140
ATGGACGGAA AATCCATCCT CAAGCTGCTG GACACGGAGC GGCCCGTGAA TCGGTTTCAC  1200
TTGAAAAAGA AGATGAGGGT CTGGCGGGAC TCCTTCCTGG TGGAGAGAGG CAAACTGCTA  1260
CACAAGAGAG ACAACGACAA GGTGGATGCC CAGGAGGAGA ACTTTTTGCC CAAGTACCAG  1320
CGTGTGAAGG ACCTGTGTCA GCGCGCTGAG TACCAGACGG CGTGTGAGCA GCTGGGACAG  1380
AAGTGGCAGT GTGTGGAGGA CGCCACGGGG AAGCTGAAGC TGCATAAGTG CAAGGGCCCC  1440
ATGCGGCTGG GCGGCGGCAG AGCCCTCTCC AACCTCGTGC CCAAGTACTA CAGGCAGGGC  1500
AGTGAGGCCT GCACCTGTGA CAGCGGGGAC TACAAGCTCA GCCTGGCTGG ACGCCGGAAA  1560
AAGCTCTTCA AGAAGAAGTA CAAGGCCAGC TACGTCCGCA GTCGCTCTAT CCGCTCAGTG  1620
GCCATCGAGG TGGACGGCAG GGTGTATCAC GTAGGCCTGG GTGATGCCAC CCAGCCCCGA  1680
AACCTCACCA AGAGGCACTG GCCAGGGGCC CCTGAGGACC AAGATGACAA GGATGGGGAC  1740
TTCAGTGGCA CTGGAGGCCT TCCCGACTAC TCAGCCGCCA ACCCCATTAA AGTGACACAT  1800
CGCAGGTGCT ACATCCTAGA GAATGACACA GTCCAGTGTG ACCTGGACCT GTACAAGTCC  1860
CTACAGGCCT GGAAAGACCA CAAGCTGCAC ATCGACCACG AGATTGAAAC CCTGCAGAAC  1920
AAAATTAAGA ACCTGAGGGA AGTCCGAGGT CACCTGAAGA AAAAGCGGCC AGAAGAATGT  1980
GACTGTCACA AAATCAGCTA CCACACCCAG CATAAAGTCC GCCTCAAGCA CAAAGGCTCC  2040
AGTCTGCATC CTTTCAGGAA GGGCCTGCAA GAGAAGGACA AGGTGTGGCT GTTGCGGGAG  2100
CAGAAGCGCA AGAAGAAACT CCGCAAGCTG CTCAAGCGCC TGCAGAACAA CGACACGTGC  2160
AGCATGCCAG GCCTCACGTG CTTCACCCAC GACAACCAGC ACTGGCAGAC GGCGCCTTTC  2220
TGGACACTGG GGCCTTTCTG TGCCTGCACC AGCGCCAACA ATAACACGTA CTGGTGCATG  2280
AGGACCATCA ATGAGACTCA CAATTTCCTC TTCTGTGAAT TTGCAACTGG CTTCCTAGAG  2340
TACTTTGATC TCAACACAGA CCCCTACCAG CTGATGAACG CAGTGAACAC ACTGGACAGG  2400
GATGTCCTCA ACCAGCTACA CGTACAGCTC ATGGAGCTGA GGAGCTGCAA GGGTTACAAG  2460
CAGTGTAACC CCCGGACTCG AAACATGGAC CTGGGTGAGA AGGGACTCAG GAGGTGGTGT  2520
CTACCCCAGG GACAACTGTG GGAAGGCTGG GAAGGTTAA                         2559