E Gorilla gorilla gorilla | THO complex 5 [THOC5]
Protein ID | Sequence length | Exons | Domain Architectures | Number exons | Cross reference | |
---|---|---|---|---|---|---|
GORGO24026 | 635 | 18 | ![]() | |||
GORGO24027 | 342 | 10 | ![]() | |||
GORGO24025 reference isoform | 683 | 19 | ![]() |
MSSESSKKRK PKVIRSDGAP AEGKRNRSDT EQEGKYYSEE AEVDLRDPGR DYELYKYTCQ 60 ELQRLMAEIQ DLKSRGGKDV AIEIEERRIQ SCVHFMTLKK LNRLAHIRLK KGRDQTHEAK 120 QKVDAYHLQL QNLLYEVMHL QKEITKCLEF KSKHEEIDLV SLEEFYKEAP PDISKAEVTM 180 GDPHQQTLAR LDWELEQRKR LAEKYRECLS NKEKILKEIE VKKEYLSSLQ PRLNSIMQAS 240 LPVQEYLFMP FDQAHKQYET ARHLPPPLYV LFVQATAYGQ ACDKTLSVAI EGSVDEAKAL 300 FKPPEDSQDD ESDSDAEEEQ TTKRRRPTLG VQLDDKRKEM LKRHPLSVML DLKCKDDSVL 360 HLTFYYLMNL NIMTVKAKVT TAMELITPIS AGDLLSPDSV LSCLYPGDHG KKTPNPANQY 420 QFDKVGILTL SDYVLELGHP YLWVQKLGGL HFPKEQPQQT VIADHSLSAS HMETTMKLLK 480 TRVQSRLALH KQFASLEHGI VPVTSDCQYL FPAKVVSRLV KWVTIAHEDY MELHFTKDIV 540 DAGLAGDTNL YYMALIERGT AKLQAAVVLN PGYSSIPPIF QLCLNWKGEK TNSNDDNIRA 600 MEGEVNVCYK ELCGPWPSHQ LLTNQLQRLC VLLDVYLETE SHDDSVEGPK EFPQEKMCLR 660 LFRGPSRMKP FKYNHPQGFF SHR 683
ATGTCATCAG AATCGAGCAA AAAACGGAAG CCCAAAGTGA TCCGAAGCGA TGGAGCCCCA 60 GCTGAAGGAA AGCGGAATCG ATCTGACACC GAGCAGGAAG GTAAATACTA CAGTGAGGAG 120 GCCGAGGTGG ATCTGCGGGA CCCTGGCAGA GACTATGAGT TATACAAGTA CACCTGCCAG 180 GAGCTACAGA GGCTGATGGC TGAGATCCAA GACCTGAAGA GCAGGGGTGG CAAGGATGTG 240 GCAATAGAAA TAGAAGAACG GAGGATCCAG AGCTGTGTGC ATTTCATGAC TCTAAAGAAG 300 CTTAACCGAT TAGCCCACAT CAGGTTGAAG AAAGGAAGAG ATCAGACCCA CGAGGCTAAG 360 CAGAAAGTAG ATGCTTATCA CCTGCAGCTC CAGAACCTGT TGTATGAGGT GATGCACCTA 420 CAGAAGGAGA TCACCAAATG TTTGGAGTTT AAGTCAAAGC ATGAAGAAAT TGATCTGGTC 480 AGTTTAGAGG AGTTTTATAA GGAGGCTCCA CCAGATATCA GCAAGGCCGA AGTCACCATG 540 GGAGACCCTC ACCAGCAAAC ACTGGCACGT CTGGACTGGG AGCTGGAGCA GCGGAAAAGG 600 CTGGCAGAGA AGTACCGAGA GTGCCTATCT AACAAGGAGA AGATTCTCAA GGAGATTGAG 660 GTGAAGAAGG AGTACCTGAG CAGCCTCCAG CCCCGCCTCA ACAGCATCAT GCAGGCTTCC 720 CTTCCGGTGC AGGAGTACCT GTTTATGCCA TTCGACCAGG CTCACAAGCA GTATGAGACA 780 GCCAGACACC TGCCGCCTCC CCTCTATGTC CTCTTTGTTC AGGCCACTGC GTATGGGCAG 840 GCCTGTGATA AGACGTTATC TGTGGCAATC GAAGGCAGTG TGGATGAAGC CAAGGCTCTG 900 TTCAAACCTC CAGAGGACTC CCAAGATGAC GAGAGTGACT CGGATGCCGA GGAGGAGCAG 960 ACTACGAAAC GCCGGAGACC CACACTGGGG GTTCAGTTGG ACGACAAACG CAAGGAGATG 1020 CTGAAGAGGC ACCCACTGTC TGTCATGCTC GACCTGAAAT GCAAAGATGA CAGTGTGCTT 1080 CACCTGACTT TCTACTACCT CATGAACCTC AACATCATGA CAGTAAAAGC CAAAGTGACA 1140 ACTGCCATGG AGCTGATCAC CCCCATCAGT GCAGGTGACT TGCTGTCTCC TGACTCAGTC 1200 CTGAGTTGCT TGTATCCTGG GGATCATGGA AAGAAAACTC CGAATCCAGC CAATCAGTAT 1260 CAGTTTGATA AAGTTGGCAT CCTGACTTTG AGCGACTATG TACTTGAGCT AGGTCACCCC 1320 TATTTGTGGG TGCAGAAGCT GGGTGGCCTC CACTTCCCCA AAGAGCAGCC CCAGCAAACA 1380 GTGATTGCTG ACCACTCGCT GAGCGCCAGC CACATGGAGA CCACCATGAA ACTTCTGAAG 1440 ACCAGGGTGC AGTCCCGCCT GGCCCTCCAC AAACAGTTTG CGTCCCTAGA ACACGGCATT 1500 GTGCCAGTTA CCAGTGATTG CCAGTACCTC TTCCCTGCCA AGGTTGTCTC TCGCCTGGTG 1560 AAATGGGTGA CAATTGCCCA TGAGGATTAC ATGGAGCTGC ACTTCACCAA AGACATTGTG 1620 GATGCGGGAC TGGCTGGGGA CACCAATCTC TACTACATGG CGCTCATCGA AAGGGGCACA 1680 GCCAAACTGC AGGCCGCTGT GGTGTTGAAC CCTGGCTACT CCTCCATCCC ACCTATTTTC 1740 CAGCTCTGTT TGAACTGGAA AGGGGAGAAA ACCAACAGCA ACGATGACAA CATTCGGGCC 1800 ATGGAGGGCG AAGTCAATGT GTGCTACAAG GAGCTGTGTG GCCCTTGGCC CAGCCACCAG 1860 CTGTTGACCA ACCAGCTGCA GCGGCTGTGT GTGCTGCTGG ATGTTTACCT GGAGACCGAG 1920 AGCCATGACG ACAGTGTGGA GGGGCCCAAG GAATTTCCCC AGGAGAAGAT GTGTCTGCGG 1980 CTCTTCAGGG GTCCCAGCAG GATGAAGCCA TTTAAATACA ACCATCCTCA GGGATTCTTC 2040 AGCCATCGCT GA 2052