Entry PENCH03959 (e_gw1.2.4799.1)

E Penicillium chrysogenum


General Information

Description
jgi|Pench1|71710|e_gw1.2.4799.1
Organism
PENCH - Penicillium chrysogenum (Taxon-ID: 5076)
Locus
scaffold_2join(complement(3102930..3105859), complement(3102572..3102875), complement(3101673..3102521))
Number of exons
3

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MASGRPPGSH PAAGRDDDLL LDSGPMYSTG QGPPVNDEHL LERYNIDDSE QPYAQTQPRP    60
SVSYDNFVGS SAHQGATQHN VASGPAHPPV NPGLHSNDPY SSGTRDRTYS QTSGLDNYRR   120
YSLDDFDDGH SGYYDLDADE DRIASSHHVR KANERNSVLG LGGGFMGKAK YMFGMGSSQY   180
SEMDLPLTES GARRATVGSD ASPAPPPKQR KKFRASDLNI FSRKVDPSTL GPRMIQLNNP   240
PANATHKFVS NFVSTAKYNI FTFIPKFLFE QFSKYANLFF LFTAVLQQIP NVSPTNRYTT   300
IVPLAIVLAV SAIKELVEDY KRRMSDRGLN YSKTQVLKGS SFHEAKWVDV VVGDIVRVES   360
EQPFPADLVL LASSEPEGLC YIETANLDGE TNLKVKQAIP ETAHLVSPSD LSRLSGRVRS   420
EQPNSSLYTY EATLTMNAGG GEKELPLAPD QLLLRGATLR NTPWIHGIVV FSGHETKLMR   480
NATATPIKRT AVERTVNIQI LMLVSILIVL SVISSVGDLA IRKTRSSTLA YLGYGGSVKL   540
VKQFFMDIFT YWVLYSNLVP ISLFVTIEIV KYFQAFLINS DLDIYYDKTD TPAICRTSSL   600
VEELGQIEYI FSDKTGTLTC NMMEFKQVSI AGVQYGDDVP EDRRATVEDG AEIGIHDFKT   660
LKKNLQSHPS QNAIREFLTL LATCHTVIPE RNSEDPNVIK YQAASPDEGA LVDGAASLGF   720
RFTNRRPRSV IFEVGGQELE YELLAVCEFN STRKRMSTIF RCPDGKVRVY CKGADTVILE   780
RLHPDNPTVE ATLQHLEEYA SDGLRTLCLA MREVPENEFQ QWHQIYDKAS TTVDGNRADE   840
LDKAAELIEK DFYLLGATAI EDRLQDGVPD TIHTLQTAGI KIWVLTGDRQ ETAINIGMSC   900
KLISEDMTLL IINEETSEAT RDSLQKKMDA VQSQISAGDS EPLALVIDGR SLTFALEKDM   960
EKLFLDLAVI CKAVVCCRVS PLQKALVVKL VKRHKKALLL AIGDGANDVS MIQAAHVGVG  1020
ISGVEGLQAA RSADVAIGQF RFLRKLLLVH GAWSYSRISR VILYSYYKNI TLYMTQFWYS  1080
FQNAFSGEVI YESWTLSFYN VLFTVLPPFA MGIFDQFISA RLLDRYPQLY QLGQRGIFFK  1140
KHSFWAWILN GFFHSLILYI VSELLYYWDL PMENGHVAGH WVWGESLYTA VLGTVLGKAA  1200
LITNVWTKYT FIAIPGSMAL WLIFLPAYGY AAPALGFSRE YYGTIPVLFK SPIFYLMAIV  1260
LPCICLLRDY AWKYAKRMYY PQQYHHVQEI QKYNVQDYRP RMEQFQKAIR KVRQVQRMRK  1320
QRGYAFSQAD DGGQMRVLNA YDTTRSRGRY GEMASSRPMA                        1360

Coding Sequence

Download: Fasta
ATGGCCAGCG GTAGGCCTCC TGGGTCGCAT CCAGCCGCTG GCCGCGATGA CGACCTGCTG    60
CTGGATTCCG GACCTATGTA TAGCACCGGT CAAGGCCCCC CTGTGAATGA CGAACACCTA   120
TTAGAACGCT ACAACATTGA CGACTCCGAA CAACCGTACG CCCAAACACA ACCACGCCCT   180
TCCGTCTCCT ACGATAACTT TGTCGGGTCT AGCGCGCATC AAGGCGCAAC GCAGCACAAT   240
GTTGCTTCGG GACCGGCGCA TCCGCCTGTC AACCCCGGTC TCCATTCAAA TGACCCCTAC   300
TCCAGTGGCA CGCGAGATCG AACCTACTCC CAGACGTCTG GATTGGACAA TTATCGGCGG   360
TACTCTTTGG ATGACTTCGA CGATGGGCAT TCAGGTTATT ATGATCTTGA TGCGGATGAA   420
GACCGAATTG CCAGTTCACA CCATGTCCGC AAGGCAAATG AGCGCAATAG CGTCCTCGGA   480
CTCGGGGGTG GATTTATGGG AAAGGCGAAA TACATGTTTG GCATGGGGTC ATCACAATAT   540
TCGGAAATGG ATCTTCCGTT GACTGAATCG GGTGCAAGAA GGGCGACAGT TGGTAGCGAT   600
GCTTCTCCGG CGCCGCCCCC GAAGCAACGC AAGAAGTTCC GCGCGTCTGA CCTTAACATC   660
TTTTCACGCA AGGTTGATCC CTCGACATTG GGTCCCCGCA TGATCCAACT CAACAACCCA   720
CCAGCCAACG CCACCCACAA GTTCGTCAGC AACTTCGTAT CGACGGCCAA GTACAACATC   780
TTTACCTTTA TCCCCAAATT CCTCTTCGAG CAATTCTCCA AATACGCCAA CCTGTTCTTC   840
TTGTTTACCG CTGTCCTGCA ACAAATCCCA AATGTGTCGC CGACGAATAG GTACACAACT   900
ATCGTTCCAC TGGCTATCGT CTTGGCTGTA TCGGCGATCA AAGAGTTGGT AGAAGACTAC   960
AAACGGAGAA TGTCGGACAG AGGACTGAAT TACTCAAAAA CCCAAGTTCT TAAGGGATCT  1020
TCGTTCCACG AAGCAAAATG GGTGGATGTT GTGGTCGGGG ATATAGTTCG GGTTGAATCC  1080
GAGCAGCCTT TTCCAGCCGA CTTGGTATTA CTGGCGTCGT CTGAGCCTGA AGGTCTATGC  1140
TATATTGAGA CGGCAAACTT GGATGGCGAG ACCAATCTCA AAGTCAAACA AGCCATCCCG  1200
GAGACTGCGC ATCTGGTCAG CCCTAGTGAT CTGAGTCGAC TGAGTGGACG TGTTCGATCT  1260
GAACAGCCAA ATAGCAGCTT GTATACTTAC GAAGCCACCT TGACTATGAA CGCTGGAGGT  1320
GGCGAGAAAG AGTTGCCCTT GGCTCCGGAT CAACTCCTGC TTCGTGGTGC CACGCTACGC  1380
AACACTCCTT GGATCCATGG CATTGTGGTT TTTAGTGGCC ACGAGACAAA ACTGATGCGA  1440
AATGCGACAG CAACCCCGAT CAAGAGAACA GCTGTGGAAA GGACAGTGAA CATTCAGATT  1500
CTGATGCTTG TCAGCATTCT CATCGTTTTG AGTGTCATCA GTTCTGTCGG TGATCTCGCT  1560
ATTCGGAAAA CTAGGTCTTC GACGCTTGCA TACCTCGGCT ATGGGGGGTC GGTCAAATTG  1620
GTAAAGCAGT TCTTTATGGA CATTTTCACG TACTGGGTGC TCTATTCAAA TTTGGTCCCT  1680
ATCTCCCTCT TCGTCACCAT CGAAATCGTC AAATACTTCC AAGCCTTCCT CATCAACTCC  1740
GACCTGGACA TTTACTATGA CAAAACCGAT ACCCCAGCCA TCTGCCGCAC ATCCTCCCTT  1800
GTCGAGGAGC TTGGTCAAAT CGAGTACATC TTCTCCGACA AAACAGGTAC TCTTACCTGC  1860
AACATGATGG AGTTTAAGCA AGTTAGCATT GCCGGTGTCC AGTATGGCGA TGATGTCCCT  1920
GAAGATCGAC GTGCCACCGT GGAGGATGGG GCTGAAATTG GTATTCATGA CTTCAAAACA  1980
CTCAAAAAGA ACCTTCAATC GCACCCAAGC CAGAACGCTA TTCGCGAATT CCTCACTCTC  2040
CTTGCCACCT GTCACACTGT TATTCCTGAG CGAAATAGCG AGGACCCGAA TGTGATCAAG  2100
TACCAAGCAG CATCGCCCGA CGAGGGAGCC TTGGTGGACG GAGCGGCCTC GCTGGGTTTC  2160
CGATTCACTA ACCGAAGACC GAGATCGGTG ATCTTTGAGG TTGGTGGGCA GGAGCTTGAA  2220
TACGAACTGC TAGCAGTCTG CGAGTTCAAC TCTACTAGAA AACGAATGTC CACTATCTTC  2280
CGGTGTCCTG ATGGAAAGGT CCGCGTCTAC TGTAAGGGTG CTGACACGGT CATTCTCGAG  2340
CGGTTACATC CCGATAACCC TACCGTGGAA GCAACTCTTC AGCACCTAGA GGAGTATGCC  2400
TCGGATGGTC TACGGACGTT GTGTCTGGCT ATGCGGGAAG TCCCAGAAAA CGAGTTCCAG  2460
CAATGGCACC AGATTTATGA CAAGGCTTCG ACCACTGTCG ATGGCAATCG TGCTGATGAA  2520
CTGGACAAAG CGGCCGAACT GATCGAAAAG GATTTCTATC TTCTTGGCGC CACGGCTATC  2580
GAGGATCGCT TGCAGGATGG TGTTCCGGAT ACCATCCACA CCCTCCAGAC TGCAGGCATC  2640
AAGATTTGGG TCTTGACTGG TGATCGGCAG GAGACTGCAA TCAACATTGG CATGTCCTGC  2700
AAACTCATCT CGGAAGACAT GACGCTTCTC ATCATCAACG AGGAAACGTC AGAAGCCACT  2760
CGTGACAGCC TGCAAAAGAA AATGGATGCC GTTCAGAGCC AAATTTCCGC TGGGGACTCT  2820
GAACCTTTGG CTTTGGTAAT TGACGGCCGG TCGTTGACTT TTGCTTTGGA GAAAGATATG  2880
GAGAAGCTGT TCTTGGATCT GGCTGTCATT TGCAAAGCCG TTGTTTGCTG TCGTGTGTCT  2940
CCCCTTCAAA AGGCCTTGGT CGTCAAGCTT GTCAAGCGCC ACAAGAAGGC TCTCCTCTTG  3000
GCCATTGGTG ACGGCGCCAA TGATGTGTCC ATGATTCAAG CCGCTCATGT CGGAGTCGGT  3060
ATCAGCGGCG TTGAAGGTCT TCAAGCCGCG CGATCTGCTG ATGTCGCAAT TGGCCAGTTC  3120
AGGTTCCTTC GAAAACTGCT TCTGGTTCAC GGAGCATGGA GCTATTCTCG CATCAGTCGA  3180
GTCATTCTAT ATTCGTACTA TAAGAACATT ACGCTGTACA TGACTCAATT CTGGTATTCT  3240
TTCCAAAACG CATTCTCAGG AGAGGTTATC TACGAATCTT GGACGCTCTC CTTCTATAAC  3300
GTGCTCTTCA CTGTTCTGCC TCCATTCGCG ATGGGAATCT TTGACCAGTT CATCTCCGCC  3360
CGATTGCTCG ACCGCTATCC ACAGCTGTAC CAGCTGGGCC AGAGAGGCAT CTTCTTCAAG  3420
AAGCACAGCT TTTGGGCTTG GATTCTGAAT GGATTCTTCC ACTCTCTCAT CCTTTACATT  3480
GTCTCTGAGC TGCTCTACTA CTGGGACCTT CCTATGGAAA ACGGCCATGT CGCCGGCCAT  3540
TGGGTCTGGG GTGAATCTCT GTACACCGCC GTGTTAGGTA CCGTTCTCGG CAAAGCCGCT  3600
CTGATCACCA ACGTCTGGAC CAAGTACACC TTCATTGCCA TCCCAGGCTC AATGGCCCTA  3660
TGGCTCATCT TCCTCCCGGC ATATGGTTAC GCCGCCCCGG CTTTGGGCTT CTCACGTGAA  3720
TACTACGGCA CCATTCCCGT GCTCTTCAAG TCCCCCATCT TCTACCTCAT GGCCATCGTC  3780
CTACCCTGTA TATGTCTCCT ACGTGATTAT GCTTGGAAGT ACGCCAAGCG CATGTACTAC  3840
CCACAACAAT ACCACCACGT CCAGGAAATC CAGAAATACA ATGTCCAGGA TTACCGTCCA  3900
CGCATGGAGC AGTTCCAAAA AGCCATCCGC AAGGTTCGCC AGGTCCAGCG CATGCGCAAG  3960
CAGCGCGGAT ACGCCTTCTC CCAGGCCGAC GATGGCGGCC AGATGCGTGT TCTCAATGCT  4020
TACGATACTA CCCGGAGTCG TGGGCGGTAT GGTGAGATGG CTAGCTCTAG GCCTATGGCG  4080
TGA                                                                4083