Entry LINUN04088 (g29698)

E Lingula unguis


General Information

Organism
LINUN - Lingula unguis (Taxon-ID: 7574)
Locus
LFEI01001404join(52161..52182, 54660..54787, 57884..58061, 58332..58622, 58877..59025, 59299..59488, 59848..60013, 60397..60521, 60729..61040, 61316..61484, 62055..62185, 62536..62722, 63129..63232, 64264..64427, 64905..65028, 65273..65377, 65637..65756, 65910..66343, 66779..66849, 67002..67142, 67746..67851, 71630..72433)
Number of exons
22

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MGIPITLXNN LLLPPKLDIG TTEVVFQSEK NFTLKCIGEA EMEWVYPSHS FKDFSIENRI    60
SIVHSTDSSG NVSELTVGPA KYYDTGEYTC RYKLSPDNKD ESKVYVYVSD EINILLPHEL   120
HPDPVALFLV QNRPGIIPCR VSSPNATVIL KDGDKIIGPT EAVTYDPREG FKVLFPNVFF   180
AYKQLVCEAS YAGNGPDTMS VLTFYSEAPT NPPTPVIQSN QTRHMTGKDI QLRCIVLTSV   240
GSTGNTLEWI YPNMDERSSR IAQVSEQHTI ENGAFQYTRL RNNLTISKAI PTDAGVYTCR   300
VTTLINQLTA DDTISIDVFD SAFANLRPEK SVVEADLNQD LVSLNVIVDA YPPPQIYWKK   360
NGQTIGRRLP KYEPRQNEDK ASLLIFGVRR EDAGVYTAHA PVGGEEEATS VTLQVIETPE   420
TSILKQHTET FYKPGKDYIL ECAVQGFPVP EIVWWWAGCK SLGCPETAYM IANSSRNYRG   480
ITKESHRSFL SLNAVESGKF SCRASNKKGT SESTEIVIVS DAENGFEVVR SSEVHIEGDM   540
LTVSCRAAKY WYSDLAVEQE NGHPLLSDAR RRVSNSSTKY SNEITVTIDS LSLDDTGVIK   600
CVALNSSRIR HTVDADIIVL GLKAPVFTNP LAGVNTVEAS GVYKLECKAS GLPIPVITWF   660
KDGKNFSSDL KRPGVKFSSE NTKITIERVS RHDAGLYECR ATNRAGSIIS NLTLQVEGDP   720
IPDGIIKRGG LLSAQQTGMI VGIVAAVLLL VIIILVICVM RRKKKQRKEL PLLMDYLQDQ   780
PRHDFNPDIP IDEQTECLAY DPKWEFPKER LKLGMVIGQG AFGRVVKAEA MGINDTEGPT   840
PVAIKMVKDC YDSSQIKSLI SELKVMIHLG HHLNILNVLG AVTKNIRQGE FYVIMEFCTF   900
GNLRSYLIQN RSRFVDTMNQ LTLANEGYLE PVSITGPSST GSAQNTLTGI NSRDSVRGGA   960
SLTDPGAASV TAITETTGGT VAAENYSNMK KEAAADGEDS AQSIKKEPIL TSKDLVCFAF  1020
QIARGMDYLH SKKAPEPIKW LALESLLDKI FTPKSDVWSF GVLLWEIFTM GSTPYPGLEL  1080
NETFVDKLKA GYRMQRPPKA SAEIYEWMLD CWHPEAEERP SAAELAERFG DLLQHDTKQY  1140
YLDLNNSYAA ANDPYLKMNT NGYLSMQVDS DNPKCKELAD DEENQVSMGS DGNHYVDNVR  1200
WKKPTSKAKD AKDASEMEPL TAASESPPRD SQLKDLKEDL MYENSGKGSA AVDVHQQQSP  1260
DVEEHPLISR KSKTKPAPPP KPPGLSPKSS PRTSPEKEVH KPQPRPRSSS PWLPRSGSSS  1320
PSGSSRNISS PTGDRSPLVS RQPEREISPP PKDYVSIFKN REKGSGGSNA SSGFHEDIDS  1380
DIPLERAPRP PSQEPEFSNG LNESVA                                       1406

Coding Sequence

Download: Fasta
ATGGGTATAC CCATCACACT CXXXAATAAT TTGCTGCTGC CTCCTAAGCT GGACATCGGC    60
ACCACAGAGG TAGTGTTCCA GTCAGAGAAG AACTTCACAC TAAAGTGTAT TGGGGAGGCT   120
GAGATGGAAT GGGTATACCC ATCACACTCT TTCAAAGATT TCTCCATTGA AAACAGAATA   180
AGCATCGTCC ACAGCACAGA TAGTTCTGGT AATGTTAGTG AACTGACAGT GGGTCCTGCC   240
AAGTACTACG ACACTGGAGA GTATACCTGC CGCTACAAAC TGTCTCCAGA CAACAAGGAC   300
GAGTCTAAAG TTTATGTCTA TGTCTCAGAT GAAATCAACA TTTTGCTCCC CCATGAACTG   360
CATCCAGACC CTGTAGCTCT TTTCTTGGTT CAGAACAGGC CCGGGATCAT CCCGTGTAGG   420
GTATCTAGCC CCAATGCTAC GGTCATATTG AAAGACGGTG ACAAAATTAT TGGTCCCACT   480
GAGGCAGTGA CATATGATCC CAGAGAGGGC TTCAAAGTAC TTTTTCCAAA TGTGTTCTTT   540
GCTTATAAAC AACTTGTCTG TGAGGCTTCA TATGCTGGCA ATGGCCCAGA TACCATGTCA   600
GTTTTGACTT TCTACAGTGA GGCACCCACC AACCCTCCTA CCCCAGTAAT CCAAAGCAAC   660
CAGACCCGCC ACATGACTGG CAAAGACATC CAGCTTAGGT GTATAGTGCT GACATCTGTT   720
GGCAGTACTG GAAACACCCT AGAATGGATC TATCCTAATA TGGATGAGAG ATCCAGTCGG   780
ATAGCACAAG TATCAGAGCA GCATACAATA GAGAATGGAG CATTCCAGTA CACCAGGCTA   840
AGGAATAACT TGACAATCAG TAAAGCCATA CCCACAGATG CTGGAGTCTA CACATGTCGT   900
GTCACCACAT TGATTAACCA GCTGACTGCT GACGACACTA TCAGCATAGA TGTTTTTGAC   960
TCCGCTTTTG CAAACCTTCG TCCAGAGAAA TCAGTAGTAG AAGCAGACCT GAATCAGGAT  1020
CTTGTATCCT TGAATGTAAT AGTGGATGCT TACCCTCCAC CACAGATATA TTGGAAGAAA  1080
AATGGGCAAA CGATTGGTAG ACGTTTACCT AAGTATGAAC CAAGGCAGAA TGAAGATAAG  1140
GCCAGCCTTC TGATCTTTGG TGTAAGGAGA GAGGATGCTG GAGTCTATAC GGCACATGCC  1200
CCAGTAGGTG GGGAAGAGGA GGCCACCTCT GTCACCCTGC AAGTTATAGA GACTCCAGAG  1260
ACATCCATCT TGAAGCAGCA CACAGAAACC TTCTACAAAC CAGGGAAGGA TTACATTCTG  1320
GAGTGTGCAG TACAGGGGTT CCCAGTGCCA GAGATCGTGT GGTGGTGGGC AGGGTGTAAG  1380
AGTTTGGGAT GCCCAGAGAC TGCTTATATG ATAGCTAACA GCAGCAGAAA CTACAGGGGA  1440
ATAACCAAAG AATCACACAG GAGTTTCCTG TCCCTGAATG CTGTGGAGTC TGGAAAGTTT  1500
TCATGCAGAG CCAGTAACAA AAAAGGAACG AGTGAGAGCA CTGAAATAGT TATTGTGTCA  1560
GATGCAGAGA ATGGATTTGA AGTAGTCCGG AGTTCAGAAG TCCACATAGA GGGAGATATG  1620
CTTACTGTCA GTTGCCGTGC CGCCAAGTAC TGGTATAGTG ACCTAGCAGT GGAGCAGGAG  1680
AACGGGCATC CATTGTTGAG TGATGCTAGG AGAAGAGTTA GTAACTCAAG CACCAAGTAT  1740
TCCAATGAAA TCACTGTTAC TATAGATTCC CTCTCTCTGG ATGATACTGG TGTCATAAAG  1800
TGTGTGGCCT TAAACAGCTC TCGTATCAGG CACACCGTGG ATGCAGATAT CATTGTACTA  1860
GGTCTGAAGG CCCCTGTTTT CACCAATCCT CTGGCTGGTG TGAACACTGT GGAAGCCAGT  1920
GGTGTCTATA AGCTGGAGTG CAAGGCAAGC GGATTACCAA TCCCTGTCAT CACTTGGTTC  1980
AAAGATGGAA AAAACTTCAG CTCTGACCTA AAAAGACCCG GAGTGAAGTT TTCTAGTGAA  2040
AATACAAAAA TTACCATAGA GAGAGTCAGT AGGCATGATG CTGGCTTGTA TGAGTGCAGG  2100
GCAACCAATA GAGCTGGAAG TATCATCAGT AACCTGACAC TCCAAGTGGA GGGCGACCCA  2160
ATTCCAGATG GTATTATCAA AAGAGGCGGA TTGTTGTCAG CCCAGCAGAC AGGAATGATT  2220
GTGGGCATTG TTGCCGCTGT GTTGCTCCTG GTCATCATCA TCCTGGTGAT CTGTGTGATG  2280
AGGAGGAAGA AGAAACAGAG GAAGGAGCTA CCACTGTTGA TGGACTATCT CCAAGATCAA  2340
CCAAGACACG ATTTTAATCC TGACATTCCG ATAGATGAAC AGACAGAGTG CCTGGCATAT  2400
GACCCAAAGT GGGAATTCCC CAAAGAGAGG CTAAAGTTAG GAATGGTGAT AGGTCAAGGT  2460
GCCTTTGGTC GAGTGGTCAA AGCAGAAGCC ATGGGTATTA ATGACACTGA GGGCCCCACA  2520
CCAGTGGCCA TTAAAATGGT CAAAGATTGT TATGACTCCT CCCAGATCAA ATCTCTGATA  2580
TCAGAACTGA AGGTGATGAT CCACCTGGGC CACCATCTTA ATATACTGAA TGTCCTGGGG  2640
GCTGTCACCA AGAATATTAG ACAAGGAGAA TTTTACGTCA TTATGGAGTT TTGCACTTTT  2700
GGCAACCTGC GCTCATATTT GATACAGAAT AGGTCCCGTT TCGTAGACAC CATGAACCAA  2760
TTAACACTGG CCAATGAGGG CTACTTAGAA CCGGTCAGTA TAACTGGCCC ATCATCTACG  2820
GGATCAGCAC AAAACACGCT GACAGGGATC AATTCAAGGG ACAGTGTAAG AGGAGGAGCT  2880
TCATTGACAG ATCCAGGTGC TGCCTCTGTG ACAGCCATTA CAGAGACTAC AGGGGGAACT  2940
GTGGCAGCAG AAAATTATTC AAATATGAAA AAAGAAGCTG CAGCAGACGG AGAAGACAGT  3000
GCGCAAAGCA TTAAGAAGGA ACCTATTCTA ACTTCTAAGG ATTTAGTTTG TTTTGCTTTT  3060
CAAATTGCCA GAGGGATGGA TTATTTGCAC TCTAAAAAGG CACCAGAACC AATAAAATGG  3120
CTTGCCTTAG AATCTTTACT TGACAAAATC TTCACTCCTA AAAGTGATGT ATGGTCATTT  3180
GGGGTGCTGT TATGGGAAAT CTTTACCATG GGAAGCACTC CGTACCCTGG CCTCGAGTTA  3240
AATGAAACTT TTGTAGATAA GTTAAAAGCA GGTTACAGAA TGCAGAGGCC GCCTAAAGCA  3300
TCTGCTGAAA TTTACGAGTG GATGTTGGAC TGTTGGCATC CTGAGGCGGA GGAAAGACCG  3360
TCTGCTGCTG AGTTAGCTGA AAGATTTGGA GATCTGTTAC AGCATGATAC GAAGCAGTAT  3420
TACCTGGACC TAAACAACTC CTATGCAGCA GCCAATGACC CCTACTTGAA GATGAATACC  3480
AATGGCTACC TGAGCATGCA GGTAGACAGC GACAATCCCA AGTGTAAAGA ACTTGCAGAT  3540
GATGAGGAAA ACCAGGTGTC AATGGGTTCA GACGGTAACC ACTATGTGGA CAATGTACGG  3600
TGGAAGAAGC CAACTTCCAA GGCCAAGGAC GCAAAAGATG CCTCAGAGAT GGAGCCACTT  3660
ACAGCAGCCT CTGAATCCCC TCCTAGGGAT AGCCAATTAA AGGACTTGAA GGAAGATCTT  3720
ATGTATGAGA ACAGTGGTAA GGGTTCAGCG GCAGTGGACG TACATCAACA ACAGAGCCCT  3780
GATGTAGAAG AACATCCACT TATCTCACGG AAATCTAAAA CCAAACCTGC ACCTCCTCCC  3840
AAACCACCAG GCCTGTCTCC CAAATCCTCC CCTAGAACCT CACCAGAAAA AGAAGTTCAC  3900
AAACCACAAC CAAGGCCAAG GTCTTCATCA CCATGGTTAC CAAGGTCTGG CAGCAGTAGT  3960
CCTAGTGGTA GTAGTAGAAA TATATCGTCT CCCACTGGAG ATAGGTCCCC ATTGGTATCT  4020
AGGCAACCAG AGAGAGAGAT ATCTCCTCCA CCCAAAGATT ATGTCAGTAT ATTCAAGAAC  4080
AGGGAGAAGG GGTCAGGAGG GTCAAATGCC TCCAGTGGCT TCCATGAGGA CATTGATTCT  4140
GATATTCCCC TGGAGAGGGC GCCGCGACCA CCATCACAAG AACCAGAATT TTCAAATGGC  4200
TTGAATGAAT CAGTGGCATA G                                            4221