Gene HAPBU24865 (A0A3Q2UW80)

E Haplochromis burtoni | procollagen galactosyltransferase [COLGALT1]

Select a TabArrow
Protein sequences and cDNA of the selected isoforms are displayed below the tables
Protein ID
Sequence length
Exons
Domain Architectures
Number exons
Cross reference
HAPBU24865 reference isoform 610 12 UniProtKB/TrEMBL A0A3Q2UW80


Protein Sequence

Download: Fasta
MPRLAGLTDA LLLLLLSCCS PVRGYFAEER WNPESSLLAP RVLLALICRN SEHSLPYFLG    60
TIERLNYPKD RMALWVATDH NEDNTTAILH DWLVKVQKLY HYVEWRPKEE PRNYEDEEGP   120
KDWTDPRYEH VMKLRQAALE SAREMWADYF MLADCDNLLT NSNVLRGLMK QNKTIIAPML   180
DSRAAYSNFW CGMTSQGYYK RTPAYIPLRK QIRKGCFAVP MVHSTFLIDL RKEASRQLAF   240
HPPHPEYSWA FDDIIVFAYS AKMAEVQMFV CNKETYGYFP VPLRSHNTLQ DEVDSFLHTV   300
LEVNVRNAPV MPSKYISVPR KQPDKMGFDE VFMINLQRRT DRRERMLRAL HEQEIACKVV   360
NAVDGKAMNV SEIHALGIHM LPGYSDPYHG RPLTKGELGC FLSHYNIWKE IVERSLHTSL   420
VIEDDLRFEV FFKRRLMNLM SEVDHEGLDW DLIYIGRKRM QVAHPEKAVP NIHNLVEADY   480
SYWTLGYMIS LQGAQKLLKA EPLKRILPVD EFLPLMYNKH PVSDYMEQFE TRDLLAFSAE   540
PLLVYPTHYT GDPGYVSDTE TSTVWDNEKI RTDWDRARSG KSHEQAEIST EAQNSDVLQS   600
PLDSTARDEL                                                          610

cDNA Sequence

Download: Fasta
ATGCCCAGGC TCGCTGGCCT CACCGACGCC CTGCTCCTCC TGCTCCTGTC CTGCTGCAGT    60
CCTGTCCGGG GATATTTTGC GGAGGAGCGC TGGAACCCCG AATCTTCACT CCTCGCACCC   120
CGGGTTCTTC TTGCCCTCAT TTGCAGAAAC TCCGAGCACT CATTGCCGTA TTTCCTGGGT   180
ACTATTGAGC GCCTCAACTA CCCGAAGGAC CGTATGGCAC TGTGGGTAGC AACTGATCAT   240
AATGAGGACA ACACCACAGC CATTCTGCAT GACTGGCTTG TAAAAGTACA GAAGCTCTAC   300
CATTATGTAG AGTGGAGGCC AAAAGAGGAG CCCAGAAATT ATGAAGATGA GGAAGGTCCA   360
AAAGACTGGA CAGATCCTCG TTATGAGCAT GTTATGAAGC TTCGGCAAGC AGCGCTGGAG   420
TCTGCTCGTG AGATGTGGGC AGACTATTTT ATGCTGGCAG ATTGTGACAA CCTCCTCACC   480
AATTCAAATG TGCTCCGGGG GCTGATGAAA CAGAACAAGA CTATCATCGC TCCGATGCTT   540
GACTCTCGTG CAGCCTACTC AAATTTTTGG TGTGGAATGA CTTCCCAGGG TTATTATAAG   600
AGGACACCTG CCTACATACC TCTTAGGAAG CAGATACGAA AGGGCTGTTT TGCCGTTCCT   660
ATGGTACACT CCACGTTCCT AATAGACCTC AGGAAAGAGG CCTCCAGGCA GCTAGCCTTT   720
CACCCACCAC ACCCAGAATA CAGCTGGGCA TTTGATGACA TTATTGTGTT TGCTTACTCT   780
GCTAAGATGG CAGAGGTTCA AATGTTTGTA TGTAACAAGG AGACCTATGG ATACTTCCCT   840
GTGCCACTAC GTTCCCACAA TACTTTGCAA GATGAAGTGG ACAGTTTCTT ACACACTGTG   900
TTAGAAGTTA ATGTGCGAAA TGCCCCAGTG ATGCCATCCA AATACATAAG TGTTCCTAGA   960
AAACAACCTG ACAAAATGGG CTTTGATGAG GTGTTCATGA TAAACTTGCA GAGGCGGACT  1020
GACCGCAGAG AACGTATGCT GAGGGCACTG CATGAGCAGG AGATTGCTTG TAAAGTTGTT  1080
AATGCCGTGG ATGGAAAAGC AATGAATGTC AGTGAAATTC ATGCTTTGGG TATCCATATG  1140
CTTCCTGGAT ACAGTGACCC TTATCATGGT CGTCCACTAA CAAAGGGAGA GCTGGGATGC  1200
TTTCTTTCCC ACTATAACAT CTGGAAAGAG ATTGTGGAAC GAAGTCTCCA CACATCTTTA  1260
GTGATCGAAG ATGACCTGCG CTTTGAGGTG TTCTTCAAAC GTCGCTTGAT GAACTTAATG  1320
AGTGAGGTGG ACCATGAAGG TCTGGATTGG GATCTCATTT ATATTGGTCG AAAGAGAATG  1380
CAAGTGGCTC ACCCGGAGAA AGCAGTGCCA AATATACACA ACTTGGTGGA GGCAGACTAT  1440
TCATATTGGA CACTAGGATA TATGATATCA TTACAGGGTG CCCAGAAGCT TTTAAAAGCA  1500
GAACCATTAA AGAGGATTTT GCCAGTGGAT GAGTTTCTTC CTCTCATGTA CAATAAACAC  1560
CCTGTATCTG ATTATATGGA ACAGTTTGAA ACCAGGGACC TGCTGGCATT TTCCGCAGAG  1620
CCTCTCCTTG TGTATCCAAC TCACTATACA GGTGACCCAG GATACGTCAG TGACACTGAG  1680
ACCTCCACTG TGTGGGACAA TGAAAAAATC CGCACAGACT GGGACAGAGC ACGCTCAGGA  1740
AAAAGTCATG AGCAGGCTGA GATCAGCACA GAGGCACAGA ACTCAGATGT GCTCCAGTCT  1800
CCTTTGGACA GTACAGCACG GGATGAGCTA TGA                               1833