Entry VICPA01054 (ENSVPAG00000002176)

E Vicugna pacos


General Information

Description
PRP40 pre-mRNA processing factor 40 homolog A (S. cerevisiae) [Source:HGNC Symbol;Acc:16463]
Organism
VICPA - Vicugna pacos (Taxon-ID: 30538)
Locus
GeneScaffold_1154join(complement(1855825..1855989), complement(1854131..1854262), complement(1842634..1842714), complement(1841409..1841456), complement(1829548..1829658), complement(1828019..1828139), complement(1827724..1827777), complement(1825707..1825725), complement(1825147..1825527), complement(1822982..1823090), complement(1822317..1822381), complement(1821835..1821999), complement(1820897..1821005), complement(1819950..1820100), complement(1818757..1818842), complement(1818710..1818754), complement(1816189..1816303), complement(1815756..1815847), complement(1815457..1815498), complement(1815383..1815454), complement(1814935..1815072), complement(1814559..1814654), complement(1812417..1812503), complement(1812225..1812321), complement(1812077..1812138), complement(1811306..1811437), complement(1810134..1810145))
Number of exons
27

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MRPGTGAERG GLMVSEMESH PPSRGPGDGE RRLSGSSLCS SSWVSADGFL RRRPSMGHPG    60
MHYAPMGMHP MGQRANMPPV PHGMMPQMMP PMGGPPMGQM PGMMSSVMPG MMMSHMSQAS   120
MQPALPPGVN NMDVAAGMYT IYKSMWTEHK SPDGRTYYYN TETKQSTWEK PDDLKTPAEQ   180
LLSKCPWKEY KSDSGKPYYY NSQTKESRWA KPKELEDLEG YQNTIVAGSL ITKSNLHEMF   240
AELKHSKQEE CTTTSAAPVP TTEIPTTMST MAAAEAAAAV VAAAAAAAAA AAAANAGAST   300
SASSTVSGTV PVVPEPEVTS IVATVVDNEN TVTISTEEQA QLTSTPAVQD QGVEASGNTG   360
EETSKQETVA DFTPKKEEEE SQPAKKTYTW NTKEEAKQAF KELLKEKRVP SNASWEQAMK   420
MIINDPRYSA LAKLSEKKQA FNAYKVQTEK EEKEEARSKY KEAKESFQRF LENHEKMTST   480
TRYKKAEQMF GEMEVWNAIS ERDRLEIYED VLFFLSKKEK EQAKQLRKRN WEALKNILDN   540
MANVTYSTTW SEAQQYLMDN PTFAEDEELQ XNQKSLRERR RQRKNRESFQ IFLDELHEHG   600
QLHSMSSWME LYPTISSDIR FTNMLGQPXD KGFVVEVNTT FEDVAIISST KRSTTLDAGN   660
IKLAFNSLLE KAEARERERE KEEARKMKRK ESAFKSMLKQ ATPPIELDAV WEDIRERFVK   720
EPAFEDITLE SERKRIFKDF MHVLEHECQH HHSKNKKHSK KSKKHHRKRS RSRSGSDSDD   780
DDSHSKKKRQ RSESRSASEH SSSAESERSY KKSKKHKKKS KKRRHKSDSP ESDAEREKDK   840
KEKDRESEKD RTRQRSESKH KSPKKKTGKD SGNWD                              875

Coding Sequence

Download: Fasta
ATGAGGCCGG GGACGGGAGC TGAGCGCGGA GGCCTCATGG TGAGTGAAAT GGAGAGCCAT    60
CCTCCCTCGC GGGGTCCCGG GGACGGGGAG CGGAGATTGT CCGGCTCAAG CCTCTGCTCC   120
AGCTCTTGGG TCTCTGCTGA CGGCTTCCTG AGGAGACGGC CCTCGATGGG GCATCCTGGC   180
ATGCATTATG CCCCAATGGG CATGCACCCT ATGGGTCAGA GAGCGAATAT GCCGCCTGTA   240
CCTCATGGAA TGATGCCACA AATGATGCCC CCTATGGGAG GGCCACCAAT GGGACAAATG   300
CCTGGAATGA TGTCTTCAGT AATGCCTGGA ATGATGATGT CTCATATGTC TCAGGCTTCC   360
ATGCAGCCTG CCTTACCGCC AGGAGTAAAT AATATGGATG TAGCAGCAGG TATGTATACT   420
ATATATAAAT CAATGTGGAC TGAACATAAA TCACCTGATG GACGGACCTA CTACTACAAT   480
ACTGAGACTA AACAGTCTAC TTGGGAGAAA CCAGATGATC TTAAAACTCC TGCTGAGCAA   540
CTTTTATCTA AATGCCCTTG GAAGGAGTAC AAATCTGACT CTGGAAAGCC TTACTATTAC   600
AATTCTCAAA CAAAAGAATC CCGCTGGGCC AAACCTAAAG AACTTGAGGA TCTTGAAGGA   660
TACCAGAATA CCATTGTTGC TGGAAGTCTT ATTACAAAAT CAAACCTGCA TGAAATGTTT   720
GCTGAATTAA AACATAGTAA GCAAGAAGAG TGCACCACAA CATCAGCAGC CCCCGTTCCT   780
ACAACAGAGA TCCCTACCAC GATGAGCACC ATGGCAGCCG CAGAAGCGGC GGCTGCCGTT   840
GTTGCAGCGG CGGCGGCAGC GGCGGCGGCT GCTGCTGCGG CCAATGCCGG TGCTTCCACC   900
TCTGCTTCTA GTACTGTCAG TGGAACTGTT CCAGTCGTTC CTGAGCCTGA GGTTACTTCG   960
ATTGTTGCTA CTGTTGTAGA TAATGAAAAC ACAGTAACAA TTTCAACTGA AGAACAAGCA  1020
CAACTTACTA GTACCCCTGC TGTTCAGGAT CAGGGTGTTG AAGCGTCCGG TAATACTGGA  1080
GAAGAAACGT CTAAGCAGGA AACTGTAGCT GATTTTACTC CGAAAAAAGA AGAGGAGGAA  1140
AGCCAACCAG CAAAAAAAAC ATACACTTGG AATACAAAGG AGGAGGCAAA GCAAGCATTT  1200
AAAGAATTAT TGAAAGAAAA GCGGGTACCA TCTAATGCTT CATGGGAGCA AGCTATGAAA  1260
ATGATCATTA ATGATCCACG ATACAGTGCT TTAGCAAAGT TGAGTGAAAA AAAGCAGGCC  1320
TTTAATGCTT ATAAAGTCCA GACAGAGAAA GAAGAAAAAG AAGAAGCAAG ATCTAAATAC  1380
AAAGAGGCTA AGGAATCCTT TCAGCGTTTT CTTGAAAATC ATGAAAAAAT GACTTCAACA  1440
ACTAGATACA AAAAAGCAGA GCAAATGTTT GGAGAGATGG AAGTCTGGAA TGCAATATCA  1500
GAACGTGATC GTCTTGAAAT CTACGAAGAT GTTTTATTCT TTCTTTCAAA AAAAGAAAAG  1560
GAGCAAGCAA AGCAGTTGCG AAAAAGGAAT TGGGAAGCCT TGAAAAACAT ACTTGACAAC  1620
ATGGCTAATG TAACATACTC TACTACTTGG TCTGAAGCCC AGCAGTATCT GATGGATAAT  1680
CCAACATTTG CAGAAGATGA GGAATTACAG NNNAACCAGA AGAGTTTGAG GGAAAGGAGA  1740
CGACAGCGTA AAAATAGAGA ATCTTTTCAG ATATTTTTAG ATGAACTGCA TGAACATGGA  1800
CAACTGCATT CTATGTCGTC TTGGATGGAG TTGTATCCAA CAATTAGTTC TGATATTAGA  1860
TTCACTAATA TGCTTGGTCA GCCGNNNGAT AAAGGATTTG TAGTTGAAGT AAATACTACT  1920
TTTGAAGACG TGGCAATAAT CAGCTCAACT AAGAGATCAA CCACGTTAGA TGCTGGAAAT  1980
ATTAAGTTGG CTTTCAATAG TTTACTAGAA AAGGCAGAAG CCCGTGAACG TGAAAGAGAA  2040
AAAGAGGAGG CTCGGAAGAT GAAACGAAAA GAATCTGCAT TTAAGAGTAT GTTGAAACAA  2100
GCTACTCCTC CAATAGAATT GGATGCTGTC TGGGAAGATA TCCGGGAGAG ATTTGTAAAA  2160
GAGCCAGCAT TCGAGGACAT AACTCTAGAG TCTGAAAGGA AACGAATATT TAAAGATTTT  2220
ATGCATGTGC TTGAGCATGA ATGTCAACAT CATCATTCAA AGAACAAGAA ACATTCTAAA  2280
AAATCTAAAA AACATCACAG GAAACGTTCC CGTTCTCGAT CGGGGTCAGA TTCAGATGAT  2340
GATGACAGCC ACTCAAAGAA AAAAAGACAG CGATCAGAGT CTCGTTCTGC TTCAGAACAT  2400
TCTTCTAGCG CTGAATCTGA GAGAAGTTAT AAGAAGTCAA AAAAACATAA GAAGAAAAGC  2460
AAGAAGAGAA GACATAAATC TGACTCTCCA GAGTCAGATG CTGAACGAGA GAAGGATAAA  2520
AAAGAAAAAG ACCGGGAAAG CGAAAAAGAT AGAACTAGAC AAAGATCAGA ATCAAAACAC  2580
AAATCACCCA AGAAAAAGAC TGGAAAGGAT TCTGGTAATT GGGATNNN               2628