CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-002603
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-2(I) chain 
Protein Synonyms/Alias
 Alpha-2 type I collagen 
Gene Name
 COL1A2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
198QPGAPGVKGEPGAPGacetylation[1]
Reference
 [1] Regulation of cellular metabolism by protein lysine acetylation.
 Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, Guan KL.
 Science. 2010 Feb 19;327(5968):1000-4. [PMID: 20167786
Functional Description
 Type I collagen is a member of group I collagen (fibrillar forming collagen). 
Sequence Annotation
 DOMAIN 1133 1366 Fibrillar collagen NC1.
 METAL 1181 1181 Calcium (By similarity).
 METAL 1183 1183 Calcium (By similarity).
 METAL 1184 1184 Calcium; via carbonyl oxygen (By
 METAL 1186 1186 Calcium; via carbonyl oxygen (By
 METAL 1189 1189 Calcium (By similarity).
 MOD_RES 80 80 Pyrrolidone carboxylic acid (By
 MOD_RES 84 84 Allysine.
 MOD_RES 420 420 Hydroxyproline.
 MOD_RES 441 441 Hydroxyproline.
 MOD_RES 444 444 Hydroxyproline.
 CARBOHYD 1267 1267 N-linked (GlcNAc...) (Potential).
 DISULFID 1163 1195 By similarity.
 DISULFID 1203 1364 By similarity.
 DISULFID 1272 1317 By similarity.  
Keyword
 Calcium; Chromosomal rearrangement; Collagen; Complete proteome; Direct protein sequencing; Disease mutation; Disulfide bond; Dwarfism; Ehlers-Danlos syndrome; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Osteogenesis imperfecta; Polymorphism; Pyrrolidone carboxylic acid; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1366 AA 
Protein Sequence
MLSFVDTRTL LLLAVTLCLA TCQSLQEETV RKGPAGDRGP RGERGPPGPP GRDGEDGPTG 60
PPGPPGPPGP PGLGGNFAAQ YDGKGVGLGP GPMGLMGPRG PPGAAGAPGP QGFQGPAGEP 120
GEPGQTGPAG ARGPAGPPGK AGEDGHPGKP GRPGERGVVG PQGARGFPGT PGLPGFKGIR 180
GHNGLDGLKG QPGAPGVKGE PGAPGENGTP GQTGARGLPG ERGRVGAPGP AGARGSDGSV 240
GPVGPAGPIG SAGPPGFPGA PGPKGEIGAV GNAGPAGPAG PRGEVGLPGL SGPVGPPGNP 300
GANGLTGAKG AAGLPGVAGA PGLPGPRGIP GPVGAAGATG ARGLVGEPGP AGSKGESGNK 360
GEPGSAGPQG PPGPSGEEGK RGPNGEAGSA GPPGPPGLRG SPGSRGLPGA DGRAGVMGPP 420
GSRGASGPAG VRGPNGDAGR PGEPGLMGPR GLPGSPGNIG PAGKEGPVGL PGIDGRPGPI 480
GPAGARGEPG NIGFPGPKGP TGDPGKNGDK GHAGLAGARG APGPDGNNGA QGPPGPQGVQ 540
GGKGEQGPPG PPGFQGLPGP SGPAGEVGKP GERGLHGEFG LPGPAGPRGE RGPPGESGAA 600
GPTGPIGSRG PSGPPGPDGN KGEPGVVGAV GTAGPSGPSG LPGERGAAGI PGGKGEKGEP 660
GLRGEIGNPG RDGARGAPGA VGAPGPAGAT GDRGEAGAAG PAGPAGPRGS PGERGEVGPA 720
GPNGFAGPAG AAGQPGAKGE RGAKGPKGEN GVVGPTGPVG AAGPAGPNGP PGPAGSRGDG 780
GPPGMTGFPG AAGRTGPPGP SGISGPPGPP GPAGKEGLRG PRGDQGPVGR TGEVGAVGPP 840
GFAGEKGPSG EAGTAGPPGT PGPQGLLGAP GILGLPGSRG ERGLPGVAGA VGEPGPLGIA 900
GPPGARGPPG AVGSPGVNGA PGEAGRDGNP GNDGPPGRDG QPGHKGERGY PGNIGPVGAA 960
GAPGPHGPVG PAGKHGNRGE TGPSGPVGPA GAVGPRGPSG PQGIRGDKGE PGEKGPRGLP 1020
GLKGHNGLQG LPGIAGHHGD QGAPGSVGPA GPRGPAGPSG PAGKDGRTGH PGTVGPAGIR 1080
GPQGHQGPAG PPGPPGPPGP PGVSGGGYDF GYDGDFYRAD QPRSAPSLRP KDYEVDATLK 1140
SLNNQIETLL TPEGSRKNPA RTCRDLRLSH PEWSSGYYWI DPNQGCTMDA IKVYCDFSTG 1200
ETCIRAQPEN IPAKNWYRSS KDKKHVWLGE TINAGSQFEY NVEGVTSKEM ATQLAFMRLL 1260
ANYASQNITY HCKNSIAYMD EETGNLKKAV ILQGSNDVEL VAEGNSRFTY TVLVDGCSKK 1320
TNEWGKTIIE YKTNKPSRLP FLDIAPLDIG GADQEFFVDI GPVCFK 1366 
Gene Ontology
 GO:0005584; C:collagen type I; IDA:UniProtKB.
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0005615; C:extracellular space; IDA:UniProtKB.
 GO:0070062; C:extracellular vesicular exosome; IDA:UniProtKB.
 GO:0005201; F:extracellular matrix structural constituent; NAS:UniProtKB.
 GO:0042802; F:identical protein binding; IDA:UniProtKB.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0048407; F:platelet-derived growth factor binding; IDA:MGI.
 GO:0030674; F:protein binding, bridging; IMP:UniProtKB.
 GO:0001568; P:blood vessel development; IMP:UniProtKB.
 GO:0071230; P:cellular response to amino acid stimulus; IEA:Compara.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0030199; P:collagen fibril organization; IMP:UniProtKB.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome.
 GO:0050900; P:leukocyte migration; TAS:Reactome.
 GO:0042476; P:odontogenesis; NAS:UniProtKB.
 GO:0030168; P:platelet activation; TAS:Reactome.
 GO:0070208; P:protein heterotrimerization; IEA:Compara.
 GO:0008217; P:regulation of blood pressure; IMP:UniProtKB.
 GO:0007266; P:Rho protein signal transduction; IDA:UniProtKB.
 GO:0001501; P:skeletal system development; IMP:UniProtKB.
 GO:0043589; P:skin morphogenesis; IMP:UniProtKB.
 GO:0007179; P:transforming growth factor beta receptor signaling pathway; IDA:UniProtKB. 
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen 
SMART
 SM00038; COLFI 
PROSITE
 PS51461; NC1_FIB 
PRINTS