CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-001961
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(I) chain 
Protein Synonyms/Alias
 Alpha-1 type I collagen 
Gene Name
 COL1A1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Bos taurus (Bovine) 
NCBI Taxa ID
 9913 
Lysine Modification
Position
Peptide
Type
References
504ADGVAGPKGPAGERGglycation[1]
519APGPAGPKGSPGEAGglycation[1]
750DRGDAGPKGADGAPGglycation[1]
861NVGAPGPKGARGSAGglycation[1]
1032RDGSPGAKGDRGETGglycation[1]
Reference
 [1] Study of posttranslational non-enzymatic modifications of collagen using capillary electrophoresis/mass spectrometry and high performance liquid chromatography/mass spectrometry.
 Mikulíková K, Eckhardt A, Pataridis S, Miksík I.
 J Chromatogr A. 2007 Jul 6;1155(2):125-33. [PMID: 17324437
Functional Description
 Type I collagen is a member of group I collagen (fibrillar forming collagen). 
Sequence Annotation
 DOMAIN 38 96 VWFC.
 DOMAIN 1228 1463 Fibrillar collagen NC1.
 REGION 162 177 Nonhelical region (N-terminal).
 REGION 178 1191 Triple-helical region.
 REGION 1192 1215 Nonhelical region (C-terminal).
 MOTIF 744 746 Cell attachment site (Potential).
 MOTIF 1092 1094 Cell attachment site (Potential).
 METAL 1276 1276 Calcium (By similarity).
 METAL 1278 1278 Calcium (By similarity).
 METAL 1279 1279 Calcium; via carbonyl oxygen (By
 METAL 1281 1281 Calcium; via carbonyl oxygen (By
 METAL 1284 1284 Calcium (By similarity).
 MOD_RES 162 162 Pyrrolidone carboxylic acid.
 MOD_RES 170 170 Allysine.
 MOD_RES 264 264 5-hydroxylysine.
 MOD_RES 276 276 5-hydroxylysine (Potential).
 MOD_RES 285 285 5-hydroxylysine (Potential).
 MOD_RES 708 708 5-hydroxylysine (Potential).
 MOD_RES 780 780 5-hydroxylysine (Potential).
 MOD_RES 861 861 5-hydroxylysine (Potential).
 MOD_RES 933 933 5-hydroxylysine (Potential).
 MOD_RES 1095 1095 5-hydroxylysine (Potential).
 MOD_RES 1107 1107 5-hydroxylysine (Potential).
 MOD_RES 1163 1163 3-hydroxyproline.
 CARBOHYD 264 264 O-linked (Gal...).
 DISULFID 1258 1290 By similarity.
 DISULFID 1264 1264 Interchain (with C-1281) (By similarity).
 DISULFID 1281 1281 Interchain (with C-1264) (By similarity).
 DISULFID 1298 1461 By similarity.
 DISULFID 1369 1414 By similarity.  
Keyword
 Calcium; Collagen; Complete proteome; Direct protein sequencing; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Pyrrolidone carboxylic acid; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1463 AA 
Protein Sequence
MFSFVDLRLL LLLAATALLT HGQEEGQEEG QEEDIPPVTC VQNGLRYHDR DVWKPVPCQI 60
CVCDNGNVLC DDVICDELKD CPNAKVPTDE CCPVCPEGQE SPTDQETTGV EGPKGDTGPR 120
GPRGPAGPPG RDGIPGQPGL PGPPGPPGPP GPPGLGGNFA PQLSYGYDEK STGISVPGPM 180
GPSGPRGLPG PPGAPGPQGF QGPPGEPGEP GASGPMGPRG PPGPPGKNGD DGEAGKPGRP 240
GERGPPGPQG ARGLPGTAGL PGMKGHRGFS GLDGAKGDAG PAGPKGEPGS PGENGAPGQM 300
GPRGLPGERG RPGAPGPAGA RGNDGATGAA GPPGPTGPAG PPGFPGAVGA KGEGGPQGPR 360
GSEGPQGVRG EPGPPGPAGA AGPAGNPGAD GQPGAKGANG APGIAGAPGF PGARGPSGPQ 420
GPSGPPGPKG NSGEPGAPGS KGDTGAKGEP GPTGIQGPPG PAGEEGKRGA RGEPGPAGLP 480
GPPGERGGPG SRGFPGADGV AGPKGPAGER GAPGPAGPKG SPGEAGRPGE AGLPGAKGLT 540
GSPGSPGPDG KTGPPGPAGQ DGRPGPPGPP GARGQAGVMG FPGPKGAAGE PGKAGERGVP 600
GPPGAVGPAG KDGEAGAQGP PGPAGPAGER GEQGPAGSPG FQGLPGPAGP PGEAGKPGEQ 660
GVPGDLGAPG PSGARGERGF PGERGVQGPP GPAGPRGANG APGNDGAKGD AGAPGAPGSQ 720
GAPGLQGMPG ERGAAGLPGP KGDRGDAGPK GADGAPGKDG VRGLTGPIGP PGPAGAPGDK 780
GEAGPSGPAG PTGARGAPGD RGEPGPPGPA GFAGPPGADG QPGAKGEPGD AGAKGDAGPP 840
GPAGPAGPPG PIGNVGAPGP KGARGSAGPP GATGFPGAAG RVGPPGPSGN AGPPGPPGPA 900
GKEGSKGPRG ETGPAGRPGE VGPPGPPGPA GEKGAPGADG PAGAPGTPGP QGIAGQRGVV 960
GLPGQRGERG FPGLPGPSGE PGKQGPSGAS GERGPPGPMG PPGLAGPPGE SGREGAPGAE 1020
GSPGRDGSPG AKGDRGETGP AGPPGAPGAP GAPGPVGPAG KSGDRGETGP AGPAGPIGPV 1080
GARGPAGPQG PRGDKGETGE QGDRGIKGHR GFSGLQGPPG PPGSPGEQGP SGASGPAGPR 1140
GPPGSAGSPG KDGLNGLPGP IGPPGPRGRT GDAGPAGPPG PPGPPGPPGP PSGGYDLSFL 1200
PQPPQEKAHD GGRYYRADDA NVVRDRDLEV DTTLKSLSQQ IENIRSPEGS RKNPARTCRD 1260
LKMCHSDWKS GEYWIDPNQG CNLDAIKVFC NMETGETCVY PTQPSVAQKN WYISKNPKEK 1320
RHVWYGESMT GGFQFEYGGQ GSDPADVAIQ LTFLRLMSTE ASQNITYHCK NSVAYMDQQT 1380
GNLKKALLLQ GSNEIEIRAE GNSRFTYSVT YDGCTSHTGA WGKTVIEYKT TKTSRLPIID 1440
VAPLDVGAPD QEFGFDVGPA CFL 1463 
Gene Ontology
 GO:0005584; C:collagen type I; IEA:Compara.
 GO:0005737; C:cytoplasm; IEA:Compara.
 GO:0005615; C:extracellular space; IEA:Compara.
 GO:0005201; F:extracellular matrix structural constituent; IEA:Compara.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0001568; P:blood vessel development; IEA:Compara.
 GO:0060346; P:bone trabecula formation; IEA:Compara.
 GO:0060351; P:cartilage development involved in endochondral bone morphogenesis; IEA:Compara.
 GO:0071230; P:cellular response to amino acid stimulus; IEA:Compara.
 GO:0071260; P:cellular response to mechanical stimulus; IEA:Compara.
 GO:0032964; P:collagen biosynthetic process; IEA:Compara.
 GO:0030199; P:collagen fibril organization; IEA:Compara.
 GO:0048706; P:embryonic skeletal system development; IEA:Compara.
 GO:0001958; P:endochondral ossification; IEA:Compara.
 GO:0060325; P:face morphogenesis; IEA:Compara.
 GO:0001957; P:intramembranous ossification; IEA:Compara.
 GO:0010812; P:negative regulation of cell-substrate adhesion; IEA:Compara.
 GO:0001649; P:osteoblast differentiation; IEA:Compara.
 GO:0090263; P:positive regulation of canonical Wnt receptor signaling pathway; IEA:Compara.
 GO:0030335; P:positive regulation of cell migration; IEA:Compara.
 GO:0010718; P:positive regulation of epithelial to mesenchymal transition; IEA:Compara.
 GO:0045893; P:positive regulation of transcription, DNA-dependent; IEA:Compara.
 GO:0070208; P:protein heterotrimerization; IEA:Compara.
 GO:0034504; P:protein localization to nucleus; IEA:Compara.
 GO:0015031; P:protein transport; IEA:Compara.
 GO:0007605; P:sensory perception of sound; IEA:Compara.
 GO:0043589; P:skin morphogenesis; IEA:Compara.
 GO:0034505; P:tooth mineralization; IEA:Compara.
 GO:0007601; P:visual perception; IEA:Compara. 
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C.
 IPR001007; VWF_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen
 PF00093; VWC 
SMART
 SM00038; COLFI
 SM00214; VWC 
PROSITE
 PS51461; NC1_FIB
 PS01208; VWFC_1
 PS50184; VWFC_2 
PRINTS