E Sus scrofa | Complement C4-A isoform 1 preproprotein
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
PIGXX31217 reference isoform | 1741 | 41 | ![]() |
MRLLWVLIWA SSFFALSLQK PRLLLFSPSV VHLGVPLSVG VQLQDAPPGQ VVTGSVFLRN 60 PSSTRDRCSP KVEFSLSSER DFVLLSLQIS VAGAKECGLH LLRRAPDVQL VAQSSWLKDY 120 LSKKTNIQGV NLFFSSRRGH IFLQTDQPVY NPGQRVRYRV FALDQKMRPT EETLTVMVEN 180 SHGLLVRKKE VYVPSSIFQD DLVIPDIAEP GTWKISARFS DGLDSNSSTQ FEVKKYVLPN 240 FEVTITPEQP YILTAPGFLN EIQMVIQARY VYGKPVQGVA YVRFGLLDED DKKIFLRGLE 300 NQTKLVEGQC HISLPEAKVQ GALQKLNITI NDLPGKRLYV VAAVIESPGG EMEEAELTSW 360 RFVSSPFSLD LSKTKRHLIP GAPFLLQALV RDVSGSPAAG IPVKVSAKLF SGSAPKNQDF 420 QQNTDERGHV TVPIGIPKTI SEMQLSVSAG SPHPATGTII VRAPPPRSPG FLSIEQLDTR 480 PPKVGDTLNL NLRAMGLVGS FSHFYYMILS RGQIVSVHRE LRRDLTSVSV FVDHHLVPSF 540 HFVAFYYQGG LPVANSLRVD IQAGACEGKL ELNVDSGKAY HPGETLKIRL QTDSPALVAL 600 GAVDTALYAV GSKSHKPLNM AKVFEAINHY DLGCGPGGGD SATQVFEAAG LAFSDGDQLT 660 PTRKSLGCPK KTKIRRKRNV NFQTAINEIL GRYSSPLAKR CCQDGLTQLP MARTCAERVA 720 RVKNPACQEP FLSCCQFAEA LRKKTRRGQG GFARAMELLQ EEELIEEDDI PVRSFFPENW 780 LWRVEEVPHS LQLSLLLPDS LTTWEIHGVS LSKSTGLCVA TPARVRVFRE FHLHLRLPVS 840 IHRFEQLELR PVLYNYLDKD VPVSVHVSPV EGLCLAGGGG LAQQVLVPAG SARPVGFSVV 900 PISAAAVSLK VVARGSFDFP VGDAVSKILQ IANEGAIHQE ELVYALNPQN VLGRNLEIPG 960 HSDPNVIPDG DFRSFVRLTA SDPLDTLGSE GALSPGGLAS LLRLPRGCAE QTMFYLAPTL 1020 AASRYLDKTE QWSTLPPETK DHAVDLIQKG YTRIQEFRKN DGSYGAWLHR ESSTWLTAFV 1080 LKVLSLAQEQ VGGSPEKLQE TAAWLLLQQK EDGSFHDPCP VIHRDMQGGL VGNDEKVALT 1140 AFVVIALHHG LAVFQDRNAE QFKRVENAIS TANDFLGEKV SSGLLGSHAA AISAYALSLS 1200 GAPEQLQDIA HNNLMAMAQK IGDHLFWGTV PSSQSNTLSP TPAPQRPTDP MPQAPALWIE 1260 TTAYALLHLL IREGKAEMAD QTASWLTRQG SYKGGFRSTQ DTVIALDALS AYWILSHTTE 1320 EKELNVTLSS MSRGGFKSHV VRLTNHQVKG LEEELQFSLG SKINVKVGGN SKGTLKILRA 1380 YNVIDLKNTT CQDLQIEVTV KGHVEYMLEA NEDYEEYEYE DLPAQDDPGA HSQPVTPLQL 1440 FDGRRNRRRR EAPKVAEEQE SRVQYTVCIW RNGQVGLSGM AIADITLLSG FYAERADLEK 1500 LTSLSDRYVS HFETEGPHVL LYFDSVPTSR ECVGFGAVQE VAIGLVQPAS AVLYDYYNPE 1560 HKCSVFYGAP SKSKFLSTLC SADVCQCAEG KCPRQRRALE RGLQDLDGYR MKFACYSPRV 1620 DYAFQVKVLR EDSRAAFRLF ETSITQVLHF TKDVKAAAAQ ARNFLVRASC RLHLEPGKEY 1680 LIMGLDGTTH DLKGDPQYLL DSNCWIEEMP SERLCRSTRH REPCAQLRDF IQEYSTQGCQ 1740 V 1741
ATGAGGCTCC TCTGGGTGCT GATCTGGGCA TCCAGCTTCT TTGCCCTGTC TCTGCAGAAG 60 CCCAGGTTGC TCCTGTTCTC TCCTTCTGTG GTTCACCTGG GGGTCCCCCT GTCAGTGGGG 120 GTGCAGCTCC AGGATGCTCC TCCAGGACAG GTGGTGACAG GATCGGTGTT CCTGAGAAAC 180 CCATCCAGCA CTCGTGACCG TTGCTCCCCA AAAGTGGAGT TCAGCCTCAG CTCAGAAAGA 240 GACTTTGTAC TCCTCAGCCT CCAGATCTCC GTGGCAGGCG CCAAGGAGTG TGGGCTCCAT 300 CTTCTCCGCA GAGCCCCCGA CGTGCAACTG GTGGCCCAGT CATCATGGCT CAAGGACTAT 360 CTGTCCAAAA AGACCAACAT TCAGGGAGTC AACCTGTTCT TCTCCTCTCG CCGGGGGCAC 420 ATCTTTCTGC AGACTGACCA GCCCGTTTAT AACCCGGGCC AGCGGGTTCG GTACCGGGTC 480 TTTGCTCTGG ATCAGAAGAT GCGCCCGACG GAAGAAACCC TCACGGTCAT GGTGGAGAAC 540 TCTCATGGCC TCCTCGTGCG GAAGAAGGAG GTGTATGTCC CCTCGTCCAT CTTCCAGGAT 600 GACTTAGTGA TCCCAGACAT CGCCGAGCCA GGAACTTGGA AGATCTCAGC CCGATTCTCA 660 GATGGCCTGG ATTCCAACAG CAGCACCCAG TTCGAGGTGA AGAAATATGT CCTTCCCAAC 720 TTTGAGGTGA CGATCACTCC CGAACAGCCC TACATCCTGA CAGCGCCTGG CTTTCTTAAT 780 GAAATCCAAA TGGTCATCCA AGCCAGGTAC GTCTACGGGA AGCCAGTGCA GGGGGTGGCA 840 TATGTACGCT TTGGGCTCCT GGATGAGGAT GATAAAAAGA TTTTCCTTCG GGGGCTGGAG 900 AATCAGACCA AGCTGGTAGA GGGCCAGTGT CACATTTCCC TCCCAGAGGC CAAGGTGCAG 960 GGTGCGCTGC AAAAGCTTAA TATTACAATT AATGACCTCC CAGGGAAGCG CCTCTATGTT 1020 GTTGCAGCCG TCATTGAGTC TCCAGGCGGG GAGATGGAGG AGGCAGAGCT CACATCCTGG 1080 CGTTTCGTGT CATCTCCCTT CTCCTTGGAT CTAAGCAAAA CCAAGCGACA CCTCATACCT 1140 GGGGCCCCCT TTCTGCTGCA GGCCCTGGTT CGAGATGTGT CAGGCTCCCC AGCCGCTGGC 1200 ATTCCCGTCA AGGTTTCTGC CAAGTTGTTT TCTGGATCTG CTCCTAAAAA CCAGGATTTT 1260 CAACAGAACA CAGACGAGAG GGGCCACGTC ACTGTTCCCA TTGGCATCCC CAAGACCATC 1320 TCGGAAATGC AGCTCTCGGT GTCTGCAGGC TCCCCCCATC CCGCCACAGG CACGATCATC 1380 GTGAGAGCCC CCCCGCCAAG AAGCCCCGGC TTTCTGTCCA TTGAGCAGCT GGACACTCGA 1440 CCCCCTAAAG TCGGGGACAC ACTTAACCTA AACCTGCGAG CCATGGGCCT CGTCGGGAGC 1500 TTCTCTCACT TCTACTACAT GATCCTCTCC CGGGGCCAGA TCGTGTCTGT GCATCGAGAG 1560 CTCAGGAGGG ACCTGACCTC TGTCTCTGTG TTTGTGGACC ATCACCTGGT GCCCTCGTTC 1620 CACTTCGTGG CCTTCTACTA CCAAGGGGGC CTCCCGGTGG CCAACTCCCT GCGAGTGGAC 1680 ATCCAGGCTG GGGCCTGCGA GGGCAAGCTG GAGCTGAACG TGGACAGTGG CAAGGCGTAC 1740 CATCCTGGCG AGACTCTAAA GATCCGCCTG CAAACGGATT CTCCAGCCCT GGTGGCGCTG 1800 GGAGCTGTGG ACACGGCTCT GTATGCCGTG GGCAGCAAGT CCCACAAGCC CCTCAACATG 1860 GCCAAGGTCT TTGAAGCTAT AAATCACTAT GACCTTGGCT GTGGTCCTGG AGGTGGGGAC 1920 AGTGCCACTC AGGTGTTTGA GGCAGCCGGT CTGGCCTTTT CTGATGGAGA CCAATTGACC 1980 CCAACCAGAA AGAGTCTGGG CTGTCCCAAG AAGACAAAGA TCCGGAGAAA GAGAAACGTG 2040 AACTTCCAAA CGGCGATTAA TGAGATTCTG GGCCGGTACT CTTCCCCCTT GGCCAAGCGC 2100 TGCTGCCAGG ATGGGCTGAC CCAGCTGCCC ATGGCACGCA CCTGTGCCGA GCGGGTGGCC 2160 CGTGTGAAAA ACCCGGCCTG CCAGGAGCCC TTCCTGTCCT GCTGCCAGTT TGCTGAAGCC 2220 CTGCGCAAGA AGACACGCAG GGGCCAGGGG GGCTTTGCCC GAGCCATGGA GCTCCTGCAG 2280 GAGGAGGAAC TGATCGAGGA GGATGACATC CCCGTGCGCA GCTTCTTCCC CGAGAACTGG 2340 CTGTGGAGGG TGGAGGAAGT GCCCCATTCC CTCCAATTGT CACTGCTGCT CCCGGACTCT 2400 CTGACCACGT GGGAGATCCA CGGCGTGAGC CTGTCCAAAA GCACAGGCTT GTGTGTGGCG 2460 ACCCCGGCTC GGGTCCGAGT GTTCCGCGAA TTCCACCTGC ACCTCCGCCT GCCTGTCTCC 2520 ATCCACCGCT TTGAGCAGCT CGAGCTGCGG CCTGTGCTTT ACAACTACCT GGATAAGGAT 2580 GTGCCCGTGA GCGTCCACGT CTCCCCAGTG GAGGGGCTGT GCCTGGCTGG GGGCGGAGGG 2640 CTCGCCCAGC AGGTGCTGGT GCCTGCAGGT TCTGCCCGGC CCGTTGGCTT CTCTGTGGTG 2700 CCCATATCAG CCGCTGCTGT GTCCCTGAAG GTGGTGGCTC GAGGATCCTT TGATTTCCCT 2760 GTCGGGGACG CAGTTTCTAA GATTCTACAA ATTGCAAATG AAGGGGCCAT TCACCAGGAG 2820 GAGCTTGTCT ATGCACTCAA CCCCCAGAAC GTCCTAGGCC GGAACTTGGA AATTCCTGGC 2880 CACTCGGATC CCAATGTTAT CCCCGATGGA GACTTCAGGA GCTTTGTCCG TCTCACAGCC 2940 TCAGATCCAC TGGACACTTT GGGCTCTGAG GGGGCCTTGT CACCAGGAGG CCTGGCCTCC 3000 CTCCTGAGGC TTCCTCGGGG CTGCGCGGAA CAAACCATGT TCTACTTGGC TCCAACGCTG 3060 GCTGCTTCCC GCTACCTGGA CAAGACAGAG CAATGGAGCA CACTGCCCCC TGAGACCAAG 3120 GACCACGCCG TGGATCTGAT CCAGAAAGGC TACACGCGGA TCCAGGAATT TCGAAAAAAT 3180 GATGGTTCCT ATGGGGCCTG GTTGCATCGG GAAAGCAGCA CCTGGCTCAC GGCCTTTGTG 3240 CTGAAGGTAC TGAGTTTGGC CCAGGAACAG GTGGGCGGCT CACCCGAAAA GTTGCAGGAG 3300 ACGGCCGCGT GGCTGCTGTT GCAGCAGAAG GAGGACGGGT CATTCCACGA CCCCTGTCCT 3360 GTTATCCACA GGGACATGCA GGGGGGCTTG GTGGGAAATG ACGAGAAGGT GGCGCTCACA 3420 GCCTTCGTGG TCATCGCCCT TCATCACGGG CTGGCCGTCT TCCAGGACAG GAATGCGGAG 3480 CAGTTTAAGA GGGTGGAAAA CGCTATCTCG ACAGCAAATG ATTTCTTGGG GGAGAAAGTG 3540 TCTTCCGGGC TCCTGGGCTC CCACGCAGCT GCCATCTCGG CCTATGCGCT GTCGCTGTCC 3600 GGGGCCCCCG AGCAACTGCA GGATATTGCC CACAACAACC TCATGGCCAT GGCCCAGAAG 3660 ATTGGCGATC ATCTGTTCTG GGGCACAGTC CCCAGTTCTC AGAGCAACAC CTTGTCACCC 3720 ACGCCGGCTC CTCAGAGACC AACAGACCCC ATGCCCCAGG CCCCGGCCCT GTGGATTGAA 3780 ACCACAGCCT ATGCCCTGCT GCACCTGCTG ATCCGAGAGG GCAAGGCCGA GATGGCCGAC 3840 CAGACCGCAT CCTGGCTGAC CCGCCAGGGC AGCTACAAAG GGGGATTCCG CAGCACCCAG 3900 GACACAGTGA TCGCCCTGGA CGCCCTGTCT GCGTACTGGA TCTTGTCGCA CACCACCGAG 3960 GAGAAGGAGC TCAACGTGAC CCTCAGCTCC ATGTCCCGCG GGGGGTTCAA GTCCCACGTG 4020 GTGCGGCTGA CCAACCACCA AGTGAAAGGC CTGGAGGAGG AGCTGCAGTT TTCCTTGGGC 4080 AGCAAGATTA ATGTGAAGGT GGGAGGAAAC AGCAAAGGAA CCTTGAAGAT CCTCCGTGCC 4140 TACAATGTCA TAGACCTGAA GAACACCACC TGCCAGGACC TTCAGATCGA AGTGACGGTC 4200 AAGGGCCACG TCGAGTACAT GTTGGAGGCG AACGAGGACT ACGAGGAATA CGAGTACGAG 4260 GACCTTCCTG CCCAGGATGA CCCTGGGGCC CACTCCCAGC CCGTGACGCC CCTGCAGCTG 4320 TTTGACGGCC GGAGGAACCG CCGCAGGAGG GAGGCACCCA AGGTGGCCGA AGAGCAGGAA 4380 TCCAGGGTGC AGTACACCGT GTGCATCTGG CGGAACGGCC AGGTGGGGCT GTCGGGCATG 4440 GCCATCGCGG ACATCACCCT CCTGAGTGGA TTCTATGCCG AGCGGGCTGA CCTGGAGAAG 4500 CTGACCTCCC TCTCTGACCG GTACGTCAGT CACTTTGAGA CCGAGGGGCC CCATGTCCTG 4560 CTGTACTTCG ACTCGGTCCC TACCTCCCGG GAGTGTGTGG GCTTTGGAGC CGTGCAGGAG 4620 GTGGCCATCG GGCTGGTGCA GCCGGCCAGC GCCGTCCTGT ACGACTACTA CAACCCCGAG 4680 CACAAATGTT CTGTGTTTTA CGGGGCGCCA AGTAAGAGCA AATTCTTGTC CACATTGTGC 4740 TCGGCTGATG TCTGCCAGTG CGCCGAGGGC AAGTGCCCTA GACAGCGCCG GGCCCTGGAG 4800 CGGGGGCTGC AGGACCTGGA TGGTTACAGG ATGAAGTTTG CCTGCTACTC GCCCCGCGTG 4860 GATTACGCCT TCCAGGTGAA GGTTCTCCGA GAAGACAGCA GAGCTGCTTT CCGCCTCTTT 4920 GAGACGAGCA TCACCCAAGT CCTACATTTC ACTAAGGATG TCAAGGCCGC TGCTGCTCAG 4980 GCCCGAAACT TCCTGGTGCG AGCCTCTTGC CGCCTTCACT TAGAACCTGG GAAAGAATAT 5040 CTGATCATGG GCCTGGACGG GACCACCCAC GACCTCAAGG GAGACCCCCA GTACCTGCTG 5100 GACTCGAACT GCTGGATCGA GGAGATGCCC TCTGAGCGCC TGTGCCGGAG CACCCGCCAT 5160 CGGGAGCCCT GTGCCCAGCT CAGAGACTTC ATCCAGGAGT ACAGCACGCA GGGCTGCCAG 5220 GTGTAA 5226