CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-001964
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(IV) chain 
Protein Synonyms/Alias
 Arresten 
Gene Name
 COL4A1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1496TAGSCLRKFSTMPFLglycation[1]
Reference
 [1] Glucose autoxidation induces functional damage to proteins via modification of critical arginine residues.
 Chetyrkin S, Mathis M, Pedchenko V, Sanchez OA, McDonald WH, Hachey DL, Madu H, Stec D, Hudson B, Voziyan P.
 Biochemistry. 2011 Jul 12;50(27):6102-12. [PMID: 21661747
Functional Description
 Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. 
Sequence Annotation
 DOMAIN 1445 1669 Collagen IV NC1.
 REGION 173 1440 Triple-helical region.
 CARBOHYD 126 126 N-linked (GlcNAc...).
 DISULFID 1460 1551
 DISULFID 1493 1548
 DISULFID 1505 1511
 DISULFID 1570 1665
 DISULFID 1604 1662
 DISULFID 1616 1622
 CROSSLNK 1533 1533 S-Lysyl-methionine sulfilimine (Met-Lys)
 CROSSLNK 1651 1651 S-Lysyl-methionine sulfilimine (Lys-Met)  
Keyword
 3D-structure; Alternative splicing; Angiogenesis; Basement membrane; Collagen; Complete proteome; Direct protein sequencing; Disease mutation; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Polymorphism; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1669 AA 
Protein Sequence
MGPRLSVWLL LLPAALLLHE EHSRAAAKGG CAGSGCGKCD CHGVKGQKGE RGLPGLQGVI 60
GFPGMQGPEG PQGPPGQKGD TGEPGLPGTK GTRGPPGASG YPGNPGLPGI PGQDGPPGPP 120
GIPGCNGTKG ERGPLGPPGL PGFAGNPGPP GLPGMKGDPG EILGHVPGML LKGERGFPGI 180
PGTPGPPGLP GLQGPVGPPG FTGPPGPPGP PGPPGEKGQM GLSFQGPKGD KGDQGVSGPP 240
GVPGQAQVQE KGDFATKGEK GQKGEPGFQG MPGVGEKGEP GKPGPRGKPG KDGDKGEKGS 300
PGFPGEPGYP GLIGRQGPQG EKGEAGPPGP PGIVIGTGPL GEKGERGYPG TPGPRGEPGP 360
KGFPGLPGQP GPPGLPVPGQ AGAPGFPGER GEKGDRGFPG TSLPGPSGRD GLPGPPGSPG 420
PPGQPGYTNG IVECQPGPPG DQGPPGIPGQ PGFIGEIGEK GQKGESCLIC DIDGYRGPPG 480
PQGPPGEIGF PGQPGAKGDR GLPGRDGVAG VPGPQGTPGL IGQPGAKGEP GEFYFDLRLK 540
GDKGDPGFPG QPGMTGRAGS PGRDGHPGLP GPKGSPGSVG LKGERGPPGG VGFPGSRGDT 600
GPPGPPGYGP AGPIGDKGQA GFPGGPGSPG LPGPKGEPGK IVPLPGPPGA EGLPGSPGFP 660
GPQGDRGFPG TPGRPGLPGE KGAVGQPGIG FPGPPGPKGV DGLPGDMGPP GTPGRPGFNG 720
LPGNPGVQGQ KGEPGVGLPG LKGLPGLPGI PGTPGEKGSI GVPGVPGEHG AIGPPGLQGI 780
RGEPGPPGLP GSVGSPGVPG IGPPGARGPP GGQGPPGLSG PPGIKGEKGF PGFPGLDMPG 840
PKGDKGAQGL PGITGQSGLP GLPGQQGAPG IPGFPGSKGE MGVMGTPGQP GSPGPVGAPG 900
LPGEKGDHGF PGSSGPRGDP GLKGDKGDVG LPGKPGSMDK VDMGSMKGQK GDQGEKGQIG 960
PIGEKGSRGD PGTPGVPGKD GQAGQPGQPG PKGDPGISGT PGAPGLPGPK GSVGGMGLPG 1020
TPGEKGVPGI PGPQGSPGLP GDKGAKGEKG QAGPPGIGIP GLRGEKGDQG IAGFPGSPGE 1080
KGEKGSIGIP GMPGSPGLKG SPGSVGYPGS PGLPGEKGDK GLPGLDGIPG VKGEAGLPGT 1140
PGPTGPAGQK GEPGSDGIPG SAGEKGEPGL PGRGFPGFPG AKGDKGSKGE VGFPGLAGSP 1200
GIPGSKGEQG FMGPPGPQGQ PGLPGSPGHA TEGPKGDRGP QGQPGLPGLP GPMGPPGLPG 1260
IDGVKGDKGN PGWPGAPGVP GPKGDPGFQG MPGIGGSPGI TGSKGDMGPP GVPGFQGPKG 1320
LPGLQGIKGD QGDQGVPGAK GLPGPPGPPG PYDIIKGEPG LPGPEGPPGL KGLQGLPGPK 1380
GQQGVTGLVG IPGPPGIPGF DGAPGQKGEM GPAGPTGPRG FPGPPGPDGL PGSMGPPGTP 1440
SVDHGFLVTR HSQTIDDPQC PSGTKILYHG YSLLYVQGNE RAHGQDLGTA GSCLRKFSTM 1500
PFLFCNINNV CNFASRNDYS YWLSTPEPMP MSMAPITGEN IRPFISRCAV CEAPAMVMAV 1560
HSQTIQIPPC PSGWSSLWIG YSFVMHTSAG AEGSGQALAS PGSCLEEFRS APFIECHGRG 1620
TCNYYANAYS FWLATIERSE MFKKPTPSTL KAGELRTHVS RCQVCMRRT 1669 
Gene Ontology
 GO:0005587; C:collagen type IV; IEA:Compara.
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0005576; C:extracellular region; NAS:UniProtKB.
 GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
 GO:0048407; F:platelet-derived growth factor binding; IDA:MGI.
 GO:0001525; P:angiogenesis; IEA:UniProtKB-KW.
 GO:0007411; P:axon guidance; TAS:Reactome.
 GO:0071230; P:cellular response to amino acid stimulus; IEA:Compara.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome.
 GO:0007528; P:neuromuscular junction development; IEA:Compara. 
Interpro
 IPR016187; C-type_lectin_fold.
 IPR008160; Collagen.
 IPR001442; Collagen_VI_NC. 
Pfam
 PF01413; C4
 PF01391; Collagen 
SMART
 SM00111; C4 
PROSITE
 PS51403; NC1_IV 
PRINTS