Entry HUMAN23501 (C43BP_HUMAN)

E Homo sapiens


General Information

Description
collagen type IV alpha 3 binding protein [Source:HGNC Symbol;Acc:HGNC:2205]
Organism
HUMAN - Homo sapiens (Taxon-ID: 9606)
Locus
5join(complement(75511759..75511844), complement(75511112..75511505), complement(75505982..75506116), complement(75459065..75459181), complement(75426371..75426478), complement(75425361..75425499), complement(75419341..75419424), complement(75416876..75417033), complement(75411011..75411103), complement(75402972..75403058), complement(75400205..75400297), complement(75399310..75399387), complement(75389592..75389687), complement(75385902..75386034), complement(75384642..75384712), complement(75381949..75382077), complement(75381072..75381201), complement(75379346..75379473))
Number of exons
18

IDs and Cross-references

loading xrefs...

Domain Architecture

Gene Ontology

loading GO annotations...

Protein Sequence

Download: Fasta
MQHSCIPTPP SPFSAPPAFL PVVTRESRRG LSSGGSAGRN AGVTATAAAA DGWKGRLPSP    60
LVLLPRSARC QARRRRGGRT SSLLLLPPTP ERALFASPSP DPSPRGLGAS SGAAEGAGAG   120
LLLGCRASMS DNQSWNSSGS EEDPETESGP PVERCGVLSK WTNYIHGWQD RWVVLKNNAL   180
SYYKSEDETE YGCRGSICLS KAVITPHDFD ECRFDISVND SVWYLRAQDP DHRQQWIDAI   240
EQHKTESGYG SESSLRRHGS MVSLVSGASG YSATSTSSFK KGHSLREKLA EMETFRDILC   300
RQVDTLQKYF DACADAVSKD ELQRDKVVED DEDDFPTTRS DGDFLHSTNG NKEKLFPHVT   360
PKGINGIDFK GEAITFKATT AGILATLSHC IELMVKREDS WQKRLDKETE KKRRTEEAYK   420
NAMTELKKKS HFGGPDYEEG PNSLINEEEF FDAVEAALDR QDKIEEQSQS EKVRLHWPTS   480
LPSGDAFSSV GTHRFVQKPY SRSSSMSSID LVSASDDVHR FSSQVEEMVQ NHMTYSLQDV   540
GGDANWQLVV EEGEMKVYRR EVEENGIVLD PLKATHAVKG VTGHEVCNYF WNVDVRNDWE   600
TTIENFHVVE TLADNAIIIY QTHKRVWPAS QRDVLYLSVI RKIPALTEND PETWIVCNFS   660
VDHDSAPLNN RCVRAKINVA MICQTLVSPP EGNQEISRDN ILCKITYVAN VNPGGWAPAS   720
VLRAVAKREY PKFLKRFTSY VQEKTAGKPI LF                                 752

Coding Sequence

Download: Fasta
ATGCAGCACA GCTGCATCCC TACCCCGCCC TCTCCTTTCT CCGCTCCTCC TGCTTTTCTA    60
CCCGTCGTCA CCCGGGAGAG CCGGAGGGGG CTAAGTTCGG GTGGCAGCGC CGGGCGCAAC   120
GCAGGGGTCA CGGCGACGGC GGCGGCGGCT GACGGCTGGA AGGGTAGGCT TCCTTCACCG   180
CTCGTCCTCC TTCCTCGCTC CGCTCGGTGT CAGGCGCGGC GGCGGCGCGG CGGGCGGACT   240
TCGTCCCTCC TCCTGCTCCC CCCCACACCG GAGCGGGCAC TCTTCGCTTC GCCATCCCCC   300
GACCCTTCAC CCCGAGGACT GGGCGCCTCC TCCGGCGCAG CTGAGGGAGC GGGGGCCGGT   360
CTCCTGCTCG GTTGTCGAGC CTCCATGTCG GATAATCAGA GCTGGAACTC GTCGGGCTCG   420
GAGGAGGATC CAGAGACGGA GTCTGGGCCG CCTGTGGAGC GCTGCGGGGT CCTCAGTAAG   480
TGGACAAACT ACATTCATGG GTGGCAGGAT CGTTGGGTAG TTTTGAAAAA TAATGCTCTG   540
AGTTACTACA AATCTGAAGA TGAAACAGAG TATGGCTGCA GAGGATCCAT CTGTCTTAGC   600
AAGGCTGTCA TCACACCTCA CGATTTTGAT GAATGTCGAT TTGATATTAG TGTAAATGAT   660
AGTGTTTGGT ATCTTCGTGC TCAGGATCCA GATCATAGAC AGCAATGGAT AGATGCCATT   720
GAACAGCACA AGACTGAATC TGGATATGGA TCTGAATCCA GCTTGCGTCG ACATGGCTCA   780
ATGGTGTCCC TGGTGTCTGG AGCAAGTGGC TACTCTGCAA CATCCACCTC TTCATTCAAG   840
AAAGGCCACA GTTTACGTGA GAAGTTGGCT GAAATGGAAA CATTTAGAGA CATCTTATGT   900
AGACAAGTTG ACACGCTACA GAAGTACTTT GATGCCTGTG CTGATGCTGT CTCTAAGGAT   960
GAACTTCAAA GGGATAAAGT GGTAGAAGAT GATGAAGATG ACTTTCCTAC AACGCGTTCT  1020
GATGGTGACT TCTTGCATAG TACCAACGGC AATAAAGAAA AGTTATTTCC ACATGTGACA  1080
CCAAAAGGAA TTAATGGTAT AGACTTTAAA GGGGAAGCGA TAACTTTTAA AGCAACTACT  1140
GCTGGAATCC TTGCAACACT TTCTCATTGT ATTGAACTAA TGGTTAAACG TGAGGACAGC  1200
TGGCAGAAGA GACTGGATAA GGAAACTGAG AAGAAAAGAA GAACAGAGGA AGCATATAAA  1260
AATGCAATGA CAGAACTTAA GAAAAAATCC CACTTTGGAG GACCAGATTA TGAAGAAGGC  1320
CCTAACAGTC TGATTAATGA AGAAGAGTTC TTTGATGCTG TTGAAGCTGC TCTTGACAGA  1380
CAAGATAAAA TAGAAGAACA GTCACAGAGT GAAAAGGTGA GATTACATTG GCCTACATCC  1440
TTGCCCTCTG GAGATGCCTT TTCTTCTGTG GGGACACATA GATTTGTCCA AAAGCCCTAT  1500
AGTCGCTCTT CCTCCATGTC TTCCATTGAT CTAGTCAGTG CCTCTGATGA TGTTCACAGA  1560
TTCAGCTCCC AGGTTGAAGA GATGGTGCAG AACCACATGA CTTACTCATT ACAGGATGTA  1620
GGCGGAGATG CCAATTGGCA GTTGGTTGTA GAAGAAGGAG AAATGAAGGT ATACAGAAGA  1680
GAAGTAGAAG AAAATGGGAT TGTTCTGGAT CCTTTAAAAG CTACCCATGC AGTTAAAGGC  1740
GTCACAGGAC ATGAAGTCTG CAATTATTTC TGGAATGTTG ACGTTCGCAA TGACTGGGAA  1800
ACAACTATAG AAAACTTTCA TGTGGTGGAA ACATTAGCTG ATAATGCAAT CATCATTTAT  1860
CAAACACACA AGAGGGTGTG GCCTGCTTCT CAGCGAGACG TATTATATCT TTCTGTCATT  1920
CGAAAGATAC CAGCCTTGAC TGAAAATGAC CCTGAAACTT GGATAGTTTG TAATTTTTCT  1980
GTGGATCATG ACAGTGCTCC TCTAAACAAC CGATGTGTCC GTGCCAAAAT AAATGTTGCT  2040
ATGATTTGTC AAACCTTGGT AAGCCCACCA GAGGGAAACC AGGAAATTAG CAGGGACAAC  2100
ATTCTATGCA AGATTACATA TGTAGCTAAT GTGAACCCTG GAGGATGGGC ACCAGCCTCA  2160
GTGTTAAGGG CAGTGGCAAA GCGAGAGTAT CCTAAATTTC TAAAACGTTT TACTTCTTAC  2220
GTCCAAGAAA AAACTGCAGG AAAGCCTATT TTGTTCTAG                         2259