Entry DROSI06085 (B4QR27)

Drosophila simulans


General Information

Organism
DROSI - Drosophila simulans (Taxon-ID: 7240)
Locus
3L: join(complement(4351164..4352274), complement(4348996..4351104), complement(4347712..4347910), complement(4347209..4347326), complement(4343204..4346802), complement(4342979..4343144))
Number of exons
6
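
The locus string can be cross-checked against the exon count and the sequence lengths reported further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for `join`/`complement` location strings); the helper name `exon_lengths` is hypothetical and not part of any database API.

```python
import re

# Location string copied from the Locus field of this entry.
LOCUS = (
    "join(complement(4351164..4352274), complement(4348996..4351104), "
    "complement(4347712..4347910), complement(4347209..4347326), "
    "complement(4343204..4346802), complement(4342979..4343144))"
)

def exon_lengths(location):
    """Return the length of every start..end span in a location string.

    Assumes 1-based, inclusive coordinates, so a span covers end - start + 1 bases.
    """
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in spans]

lengths = exon_lengths(LOCUS)
cds_length = sum(lengths)
# The final codon is a stop codon, so the protein has one residue fewer
# than the number of codons in the coding sequence.
protein_length = cds_length // 3 - 1

print(len(lengths), cds_length, protein_length)
```

Under that assumption, the six spans sum to 7302 bases and imply a 2433-residue protein, which agrees with the final position counters of the Coding Sequence and Protein Sequence blocks.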

IDs and Cross-references

Domain Architecture

Gene Ontology

Protein Sequence

MMVGPSGSGK STAWKTLLKA LERFEGVEGV AHVIDPKAIS KEALYGVLDP NTREWTDGLF    60
THILRKIIDN VRGEINKRQW IIFDGDVDPE WVENLNSVLD DNKLLTLPNG ERLSLPPNVR   120
VMFEVQDLKF ATLATVSRCG MVWFSEDVLS TEMIFENYLS RLRSIPLEDG DEDFVGVIKP   180
AKDKEEEVSP SLQVQRDIAL LLLPFFSADG IVVRTLEYAM DQEHIMDFTR LRALSSLFSM   240
LNQAARNVLT FNAQHPDFPC SADQLEHYIP KALVYSVLWS FAGDAKLKVR IDLGDFVRSV   300
TTVPLPGAAG APIIDYEVNM SGDWVPWSNK VPVIEVETHK VASPDIVVPT LDTVRHESLL   360
YTWLAEHKPL VLCGPPGSGK TMTLFSALRA LPDMEVVGLN FSSATTPELL LKTFDHYCEY   420
RKTPNGVVLS PVQIGKWLVL FCDEINLPDM DSYGTQRVIS FLRQLVEHKG FYRASDQAWV   480
SLERIQFVGA CNPPTDPGRK PLSHRFLRHV PIIYVDYPGE TSLKQIYGTF SRAMLRLMPA   540
LRGYAEPLTN AMVEFYLASQ DRFTQDMQPH YVYSPREMTR WVRGICEAIR PLDSLPVEGL   600
VRLWAHEALR LFQDRLVDDS ERRWTNENID LVGQKHFPGI NQEEALQRPI LYSNWLSKDY   660
MPVNREELRE YVHARLKVFY EEELDVPLVL FDEVLDHVLR IDRIFRQPQG HLLLIGVSGA   720
GKTTLSRFVA WMNGLSIFQI KVHNKYTSED FDEDLRCVLR RSGCKDEKIA FILDESNVLD   780
SGFLERMNTL LANGEVPGLF EGDEYTTLMT QCKEGAQREG LMLDSSDELY KWFTQQVMRN   840
LHVVFTMNPS TDGLKDRAAT SPALFNRCVL NWFGDWSDSA LFQVGKEFTT RVDLEKPNWH   900
APDFFPSVCP LVPANPTHRD AVINSCVYVH QTLHQANARL AKRGGRTMAV TPRHYLDFIH   960
HFVKLYNEKR SDLEEQQLHL NVGLNKIAET VEQVEEMQKS LAVKKQELQA KNEAANAKLK  1020
QMFQDQQEAE KKKIQSQEIQ IRLADQTVKI EEKRKYVMAD LAQVEPAVID AQAAVKSIRK  1080
QQLVEVRTMA NPPSVVKLAL ESICLLLGEN ATDWKSIRAV IMRENFINSI VSNFGTENIT  1140
DDVREKMKSK YLSNPDYNFE KVNRASMACG PMVKWAIAQI EYADMLKRVE PLREELRSLE  1200
EQADVNLASA KETKDLVEQL ERSIAAYKEE YAQLISQAQA IKTDLENVQA KVDRSIALLK  1260
SLNIERERWE STSETFKSQM STIIGDVLLS AAFIAYGGYF DQHYRLNLFT TWSQHLQAAS  1320
IQYRADIART EYLSNPDERL RWQANALPTD DLCTENAIML KRFNRYPLII DPSGQATTFL  1380
LNEYAGKKIT KTSFLDDSFR KNLESALRFG NPLLVQDVEN YDPILNPVLN RELRRTGGRV  1440
LITLGDQDID LSPSFVIFLS TRDPTVEFPP DICSRVTFVN FTVTRSSLQS QCLNQVLKAE  1500
RPDIDEKRSD LLKLQGEFRL RLRQLEKSLL QALNDAKGKI LDDDSVITTL ETLKKEAYDI  1560
NQKVDETDKV IAEIETVSQQ YLPLSVACSN IYFTMDSLNQ VHFLYHYSLK MFLDIFSTVL  1620
YNNPKLEGRT DHSERLGIVT RDLFQVCYER VARGMIHNDR LTFALLMCKI HLKGTSESNL  1680
DAEFNFFLRS REGLLANPTP VEGLSAEQIE SVNRLALRLP IFRKLLEKVR SIPELGAWLQ  1740
QSSPEQVVPQ LWDESKALSP IASSVHQLLL IQAFRPDRVI AAAHNVVNTV LGEDFMPNAE  1800
QELDFTSVVD KQLNCNTPAL LCSVPGFDAS GRVDDLAAEQ NKQISSIAIG SAEGFNQAER  1860
AINMACKTGR WVLLKNVHLA PQWLVQLEKK MHSLQPHSGF RLFLTMEINP KVPVNLLRAG  1920
RIFVFEPPPG IRANLLRTFS TVPAARMMKT PSERARLYFL LAWFHAIVQE RLRYVPLGWA  1980
KKYEFNESDL RVACDTLDTW IDTTAMGRTN LPPEKVPWDA LVTLLSQSIY GGKIDNDFDQ  2040
RLLTSFLKKL FTARSFEADF ALVANVDGAS GGLRHITMPD GTRRDHFLKW IENLTDRQTP  2100
SWLGLPNNAE KVLLTTRGTD LVSKLLKMQQ LEDDDELAYS VEDQSEQSAV GRGEDGRPSW  2160
MKTLHNSATA WLELLPKNLQ VLKRTVENIK DPLYRYFERE VTSGSRLLQT VILDLQDVVL  2220
ICQGEKKQTN HHRSMLSELV RGIIPKGWKR YTVPAGCTVI QWITDFSNRV QQLQKVSQLV  2280
SQAGAKELQG FPVWLGGLLN PEAYITATRQ CVAQANSWSL EELALDVTIT DAGLKNDQKD  2340
CCFGVTGLKL QGAQCKNNEL LLASTIMMDL PVTILKWIKI SSEPRISKLT LPVYLNSTRT  2400
ELLFTVDLAV AAGQDSHSFY ERGVAVLTST ALN                               2433

Coding Sequence

ATGATGGTCG GTCCATCCGG ATCCGGCAAG TCCACCGCTT GGAAGACTCT TCTGAAGGCT    60
TTGGAACGCT TCGAAGGCGT TGAGGGCGTG GCTCATGTAA TCGATCCCAA GGCCATTTCT   120
AAGGAAGCCC TTTATGGTGT CCTGGACCCG AATACCCGCG AATGGACCGA TGGTTTGTTC   180
ACCCACATTC TGCGCAAGAT AATCGACAAT GTGCGCGGTG AGATCAACAA GCGGCAATGG   240
ATCATCTTCG ACGGTGATGT GGATCCCGAG TGGGTAGAGA ACTTGAACTC TGTGCTGGAT   300
GATAACAAAC TCTTGACTCT GCCTAATGGA GAGCGTCTTT CTCTGCCTCC TAACGTGCGG   360
GTGATGTTCG AGGTGCAGGA CTTGAAGTTC GCCACTTTGG CTACAGTTTC CCGTTGCGGC   420
ATGGTCTGGT TCTCAGAGGA TGTGCTCTCG ACAGAGATGA TATTTGAGAA CTATCTGTCC   480
CGTTTGCGTA GCATTCCTTT GGAGGATGGA GACGAAGACT TCGTGGGCGT CATTAAGCCG   540
GCAAAGGACA AGGAGGAGGA GGTGTCACCA TCCCTCCAGG TGCAGCGGGA TATTGCTTTG   600
CTTCTGCTAC CATTCTTCTC AGCTGATGGA ATTGTGGTCC GCACATTGGA GTACGCTATG   660
GACCAGGAGC ACATCATGGA CTTCACTCGT TTGCGGGCCC TAAGCTCCCT CTTCTCCATG   720
CTCAACCAGG CTGCTCGAAA TGTTCTTACA TTCAATGCTC AGCATCCAGA TTTCCCCTGT   780
TCCGCTGATC AGTTGGAGCA CTACATTCCC AAGGCGTTGG TATACTCTGT TCTTTGGTCA   840
TTTGCCGGAG ATGCAAAGTT GAAGGTGCGC ATTGATTTGG GAGACTTCGT GCGCAGTGTG   900
ACTACCGTTC CACTGCCGGG AGCAGCCGGT GCTCCAATTA TCGACTACGA GGTCAACATG   960
AGTGGTGACT GGGTTCCGTG GAGCAACAAG GTACCAGTCA TCGAGGTGGA AACGCACAAG  1020
GTGGCGTCTC CGGACATTGT TGTGCCCACA TTGGACACCG TTCGTCACGA GTCGCTGCTG  1080
TATACTTGGT TGGCTGAGCA TAAACCATTG GTGCTCTGCG GCCCACCTGG CTCTGGTAAG  1140
ACTATGACCC TGTTCTCGGC CCTCCGTGCT CTCCCCGATA TGGAAGTGGT AGGCCTGAAT  1200
TTCTCATCGG CTACCACGCC GGAGCTGCTG CTTAAGACGT TTGATCACTA CTGCGAGTAC  1260
CGCAAGACAC CAAACGGAGT CGTGCTTTCC CCAGTGCAAA TTGGAAAGTG GCTGGTGCTG  1320
TTCTGTGATG AAATCAATTT GCCAGACATG GACAGCTATG GCACGCAGCG CGTAATCTCG  1380
TTCTTGCGTC AACTGGTGGA GCACAAGGGC TTCTATAGGG CCAGCGATCA GGCTTGGGTT  1440
TCTCTGGAAC GCATTCAGTT TGTGGGTGCT TGTAATCCAC CCACTGATCC AGGCCGTAAG  1500
CCGCTCTCGC ATCGGTTCTT GAGACATGTG CCCATCATCT ACGTGGATTA TCCTGGAGAG  1560
ACATCTCTGA AGCAGATCTA CGGCACATTC TCGCGTGCCA TGCTAAGATT GATGCCCGCT  1620
CTTCGTGGTT ATGCAGAGCC TTTGACCAAC GCCATGGTGG AGTTCTATCT GGCATCACAG  1680
GATCGCTTTA CGCAGGACAT GCAGCCGCAT TATGTCTATT CGCCACGTGA GATGACCCGT  1740
TGGGTGCGTG GTATCTGCGA GGCTATCCGT CCATTGGATT CCCTTCCTGT TGAAGGTTTG  1800
GTGCGTCTCT GGGCCCATGA AGCTCTGCGC CTGTTCCAGG ATCGACTGGT GGACGATTCG  1860
GAGCGCCGAT GGACAAATGA GAATATCGAT TTGGTGGGCC AGAAGCACTT CCCTGGAATC  1920
AACCAAGAAG AGGCATTGCA GCGTCCTATC CTTTACAGCA ACTGGCTAAG CAAGGATTAC  1980
ATGCCGGTGA ACCGCGAGGA GCTGCGTGAA TATGTTCATG CCCGACTTAA GGTGTTCTAC  2040
GAGGAAGAGC TCGATGTGCC ACTCGTACTG TTCGACGAAG TTCTCGACCA CGTGCTGCGT  2100
ATTGATCGTA TCTTCCGCCA GCCACAAGGT CACTTGCTGC TGATTGGAGT TTCGGGAGCC  2160
GGAAAGACTA CGCTTTCGCG CTTTGTAGCC TGGATGAATG GCTTGTCCAT ATTCCAGATC  2220
AAGGTGCACA ACAAGTACAC CAGCGAGGAC TTTGATGAGG ATTTGCGTTG CGTGCTGCGC  2280
CGCTCTGGCT GCAAAGATGA AAAGATTGCT TTCATTTTGG ATGAGTCGAA CGTTTTGGAC  2340
TCTGGTTTCC TGGAGCGTAT GAACACACTG TTGGCCAACG GAGAGGTGCC TGGATTGTTC  2400
GAGGGTGACG AGTACACCAC TCTGATGACT CAGTGCAAGG AGGGTGCCCA GCGCGAGGGT  2460
CTTATGTTGG ACTCCAGTGA CGAACTGTAC AAGTGGTTCA CCCAGCAGGT GATGCGCAAT  2520
CTGCACGTGG TCTTCACCAT GAATCCTTCC ACCGATGGAC TCAAGGATCG TGCTGCCACT  2580
TCGCCAGCTC TGTTCAATCG TTGTGTGTTA AATTGGTTTG GCGACTGGTC GGACTCGGCT  2640
CTGTTCCAGG TGGGCAAGGA GTTCACCACT CGTGTGGACT TGGAGAAACC CAACTGGCAT  2700
GCGCCGGACT TCTTCCCGTC CGTTTGTCCA CTGGTGCCAG CCAATCCCAC TCATCGCGAT  2760
GCGGTCATCA ACTCGTGCGT GTATGTTCAC CAGACACTCC ACCAGGCCAA CGCTCGTCTG  2820
GCCAAGCGCG GTGGGCGCAC CATGGCGGTG ACTCCACGTC ACTATCTGGA CTTTATTCAC  2880
CACTTTGTCA AGCTGTACAA TGAGAAGCGC AGCGATCTTG AAGAACAGCA GCTACATCTG  2940
AATGTGGGTC TCAACAAGAT CGCCGAAACT GTGGAGCAGG TCGAGGAGAT GCAAAAGTCG  3000
CTGGCTGTGA AGAAGCAAGA GTTGCAGGCC AAGAACGAGG CTGCCAACGC CAAGCTGAAG  3060
CAGATGTTCC AGGATCAGCA GGAGGCCGAG AAGAAGAAGA TTCAGTCGCA GGAAATTCAG  3120
ATACGCTTGG CTGACCAAAC CGTCAAGATC GAAGAGAAAC GCAAATACGT AATGGCCGAT  3180
TTGGCCCAGG TGGAACCGGC TGTCATTGAT GCGCAAGCAG CGGTCAAGTC GATCCGCAAG  3240
CAACAGCTCG TTGAGGTGCG AACCATGGCT AATCCGCCAT CGGTGGTCAA ATTGGCTCTT  3300
GAATCGATCT GTCTGCTGCT GGGCGAAAAT GCGACCGATT GGAAATCGAT CCGCGCCGTG  3360
ATCATGCGCG AAAATTTCAT AAATTCAATT GTATCTAACT TCGGTACAGA GAACATAACC  3420
GATGATGTTC GCGAGAAGAT GAAGTCCAAG TATCTGAGCA ATCCGGACTA TAACTTCGAG  3480
AAGGTTAATC GCGCCAGTAT GGCTTGTGGT CCTATGGTCA AATGGGCCAT TGCTCAGATC  3540
GAGTACGCTG ATATGTTGAA GCGTGTGGAG CCTCTTCGCG AAGAGCTGCG TTCCCTGGAG  3600
GAGCAGGCCG ATGTGAATCT GGCCAGTGCC AAGGAAACCA AGGACCTGGT TGAGCAACTG  3660
GAGCGCAGTA TTGCTGCCTA CAAGGAGGAG TATGCCCAGC TTATTTCCCA GGCTCAGGCC  3720
ATCAAAACGG ATCTGGAGAA TGTCCAAGCC AAGGTGGATC GCTCCATTGC ACTGCTGAAG  3780
AGTTTGAACA TCGAACGCGA GCGCTGGGAG TCCACTAGCG AGACTTTCAA GTCGCAAATG  3840
TCCACCATTA TTGGCGATGT GTTGCTCTCC GCTGCCTTTA TCGCCTATGG TGGTTACTTC  3900
GATCAGCATT ACCGGTTGAA CTTATTCACC ACCTGGTCAC AGCATCTGCA GGCTGCCAGC  3960
ATTCAGTACC GCGCGGATAT CGCCCGCACT GAATATCTTT CCAATCCCGA TGAGCGACTC  4020
CGCTGGCAAG CAAATGCGCT GCCCACGGAT GACCTGTGCA CGGAGAATGC TATTATGTTG  4080
AAGCGCTTCA ACCGCTATCC TTTGATCATT GATCCTTCGG GTCAGGCTAC CACTTTCCTG  4140
CTCAATGAAT ATGCAGGTAA GAAGATCACA AAAACCTCGT TCTTGGATGA TTCGTTCCGC  4200
AAGAATTTGG AGTCCGCACT GCGTTTCGGT AATCCATTGC TGGTCCAGGA TGTGGAGAAC  4260
TATGATCCTA TTCTTAACCC GGTCTTGAAC CGTGAGCTGC GTCGTACTGG TGGTCGTGTG  4320
TTGATTACCC TCGGTGACCA GGACATCGAT TTGTCGCCGT CCTTCGTCAT CTTCCTGTCC  4380
ACTCGCGATC CCACTGTCGA ATTCCCGCCC GATATTTGCT CTAGGGTAAC ATTCGTCAAC  4440
TTTACGGTCA CTCGTAGCTC TCTGCAGTCG CAGTGCCTCA ACCAGGTGCT GAAGGCTGAA  4500
CGTCCGGATA TTGACGAGAA GCGCTCAGAT TTGTTGAAGC TACAGGGTGA ATTCCGTCTG  4560
CGCTTGCGCC AGTTGGAAAA GAGTCTGTTG CAGGCGCTCA ACGACGCAAA GGGCAAGATC  4620
CTTGACGATG ATTCGGTTAT TACAACACTG GAGACGCTTA AGAAGGAGGC CTACGACATT  4680
AATCAGAAGG TGGACGAGAC AGACAAGGTT ATAGCCGAAA TCGAGACTGT GTCCCAGCAG  4740
TATCTTCCAC TTTCTGTGGC GTGCAGCAAC ATCTACTTCA CCATGGACAG TCTGAACCAG  4800
GTGCACTTCC TCTACCATTA CTCGCTGAAA ATGTTCCTCG ATATCTTCTC CACGGTGCTG  4860
TATAATAACC CCAAGCTGGA AGGAAGAACC GATCACTCGG AGCGTCTGGG CATAGTCACC  4920
AGGGACCTCT TCCAGGTGTG CTATGAGCGA GTAGCTCGAG GAATGATCCA CAACGATCGC  4980
TTGACCTTTG CCCTGCTCAT GTGCAAGATA CACCTTAAGG GCACCTCGGA GTCTAATTTG  5040
GATGCCGAGT TCAACTTCTT CCTGCGAAGC CGCGAGGGTC TGCTAGCAAA TCCCACGCCC  5100
GTAGAAGGAC TTTCTGCCGA GCAAATTGAG AGTGTCAACC GGCTGGCCCT TCGTCTTCCC  5160
ATCTTCCGAA AGCTTCTCGA GAAGGTACGA TCCATTCCCG AACTTGGTGC CTGGTTGCAG  5220
CAGAGCTCGC CTGAACAGGT TGTACCCCAG CTGTGGGACG AATCCAAGGC TCTCAGTCCC  5280
ATTGCCAGCT CCGTCCACCA GCTGCTGCTT ATTCAGGCTT TCCGACCGGA TCGCGTCATC  5340
GCCGCCGCTC ACAATGTGGT CAACACTGTG CTCGGAGAAG ACTTTATGCC CAACGCCGAG  5400
CAGGAACTGG ACTTCACCTC TGTGGTGGAC AAGCAGTTGA ACTGCAACAC TCCAGCGCTG  5460
CTTTGCTCGG TGCCCGGTTT CGATGCCTCC GGGCGAGTGG ATGACCTGGC AGCGGAGCAA  5520
AATAAGCAAA TTTCCAGCAT TGCTATCGGT TCAGCCGAAG GATTCAACCA AGCTGAGCGG  5580
GCTATTAACA TGGCCTGCAA GACTGGTCGC TGGGTGCTCT TGAAGAACGT CCACTTAGCT  5640
CCCCAATGGC TTGTGCAGCT GGAGAAGAAG ATGCACTCCC TGCAGCCTCA TTCCGGCTTC  5700
CGGCTGTTCC TCACGATGGA GATTAATCCA AAGGTGCCGG TTAACCTACT GCGTGCTGGC  5760
CGCATCTTCG TGTTCGAGCC ACCACCAGGC ATCAGAGCTA ATCTGCTGCG CACCTTCTCT  5820
ACGGTGCCAG CGGCACGCAT GATGAAGACT CCAAGTGAGC GAGCTCGGCT TTACTTCCTG  5880
CTGGCCTGGT TCCACGCCAT CGTCCAGGAG CGACTCCGGT ATGTGCCTCT TGGCTGGGCC  5940
AAGAAGTACG AGTTCAACGA ATCGGATCTG CGCGTGGCTT GCGACACGTT GGACACCTGG  6000
ATCGACACCA CGGCCATGGG CCGCACCAAT CTGCCACCAG AGAAGGTTCC ATGGGACGCC  6060
CTGGTCACTC TGCTCTCGCA GTCCATTTAC GGAGGAAAGA TCGATAACGA TTTCGATCAG  6120
CGCCTCTTGA CTTCCTTCTT GAAGAAGCTC TTTACCGCAC GCAGCTTCGA GGCGGACTTT  6180
GCCCTGGTTG CTAACGTTGA TGGCGCCTCC GGAGGACTCC GTCACATCAC CATGCCTGAT  6240
GGCACCCGTC GCGATCATTT CTTGAAGTGG ATTGAAAACT TGACTGACCG CCAGACTCCT  6300
TCGTGGCTTG GATTGCCCAA CAATGCGGAG AAGGTGTTGC TTACCACCCG TGGTACTGAT  6360
TTGGTCAGTA AGCTGCTCAA AATGCAGCAG CTGGAGGACG ACGATGAGTT GGCTTACAGC  6420
GTGGAGGATC AGTCGGAGCA ATCTGCAGTG GGTCGCGGCG AGGATGGACG TCCATCGTGG  6480
ATGAAGACAC TTCACAACTC AGCAACTGCC TGGTTGGAAC TGCTCCCGAA GAATCTGCAA  6540
GTGCTCAAGC GTACTGTGGA GAACATCAAG GATCCGTTGT ACCGATACTT CGAGCGCGAG  6600
GTGACGAGTG GTTCCCGTCT GTTGCAAACC GTGATCCTGG ACCTGCAGGA TGTGGTTCTA  6660
ATTTGTCAGG GCGAGAAGAA ACAAACCAAC CACCATCGTT CCATGTTGTC GGAGCTGGTG  6720
CGCGGTATAA TTCCGAAGGG TTGGAAGCGG TATACTGTTC CTGCAGGATG CACTGTGATC  6780
CAATGGATCA CGGACTTCAG CAACCGTGTC CAGCAGCTGC AGAAGGTGTC GCAACTTGTT  6840
TCGCAGGCTG GCGCCAAGGA GCTGCAGGGC TTCCCCGTCT GGCTCGGTGG TTTGCTCAAT  6900
CCGGAGGCCT ACATCACGGC CACCAGGCAG TGTGTGGCCC AGGCGAACAG TTGGTCGCTA  6960
GAGGAGCTCG CCCTGGATGT GACAATTACG GATGCTGGGC TAAAGAACGA TCAAAAGGAT  7020
TGTTGCTTCG GGGTCACTGG ACTCAAATTA CAGGGCGCCC AGTGCAAGAA CAATGAACTG  7080
CTGCTTGCTT CCACTATCAT GATGGATCTA CCTGTCACCA TTCTCAAGTG GATAAAGATT  7140
TCCTCGGAGC CACGCATCAG CAAGTTGACA TTGCCGGTTT ACTTGAACTC CACCCGTACG  7200
GAGCTACTCT TCACTGTTGA CTTGGCCGTT GCAGCTGGGC AGGATTCGCA TAGCTTCTAC  7260
GAAAGAGGCG TGGCAGTTTT GACCTCTACT GCTTTGAACT AA                     7302
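
The sequence blocks above are laid out in groups of ten residues with a running position counter at the end of each line. A small sketch of how such lines might be turned back into a plain sequence string is given below; `strip_counters` is a hypothetical helper, and it assumes that the last whitespace-separated token on every line is the counter. Only the first and last coding-sequence lines are used as sample input.

```python
def strip_counters(lines):
    """Join formatted sequence lines into one string.

    Drops the spacing between ten-residue groups and the trailing
    position counter (assumed to be the last token on each line).
    """
    chunks = []
    for line in lines:
        tokens = line.split()
        # Discard the running position counter if present.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        chunks.extend(tokens)
    return "".join(chunks)

# First and last lines of the Coding Sequence block, copied verbatim.
first_line = "ATGATGGTCG GTCCATCCGG ATCCGGCAAG TCCACCGCTT GGAAGACTCT TCTGAAGGCT    60"
last_line = "GAAAGAGGCG TGGCAGTTTT GACCTCTACT GCTTTGAACT AA                     7302"

seq = strip_counters([first_line, last_line])
print(len(seq), seq[:3], seq[-3:])
```

For these two lines the result starts with the `ATG` start codon and ends with the `TAA` stop codon. Passing every line of a block in order would rebuild the full sequence the same way.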