CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-001965
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-2(I) chain 
Protein Synonyms/Alias
 Alpha-2 type I collagen 
Gene Name
 Col1a2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
549PQGVQGGKGEQGPAGglycation[1]
575GTAGEVGKPGERGLPglycation[1]
1020DKGEPGDKGARGLPGglycation[1]
1070GPSGPIGKDGRSGHPglycation[1]
Reference
 [1] Nonenzymatic glycation of type I collagen. The effects of aging on preferential glycation sites.
 Reiser KM, Amigable MA, Last JA.
 J Biol Chem. 1992 Dec 5;267(34):24207-16. [PMID: 1447170
Functional Description
 Type I collagen is a member of group I collagen (fibrillar forming collagen). 
Sequence Annotation
 DOMAIN 1139 1372 Fibrillar collagen NC1.
 MOTIF 783 785 Cell attachment site (Potential).
 MOTIF 828 830 Cell attachment site (Potential).
 MOTIF 1011 1013 Cell attachment site (Potential).
 METAL 1187 1187 Calcium (By similarity).
 METAL 1189 1189 Calcium (By similarity).
 METAL 1190 1190 Calcium; via carbonyl oxygen (By
 METAL 1192 1192 Calcium; via carbonyl oxygen (By
 METAL 1195 1195 Calcium (By similarity).
 MOD_RES 86 86 Pyrrolidone carboxylic acid (Probable).
 MOD_RES 90 90 Allysine.
 CARBOHYD 1273 1273 N-linked (GlcNAc...) (Potential).
 DISULFID 1169 1201 By similarity.
 DISULFID 1209 1370 By similarity.
 DISULFID 1278 1323 By similarity.  
Keyword
 3D-structure; Calcium; Collagen; Complete proteome; Direct protein sequencing; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Pyrrolidone carboxylic acid; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1372 AA 
Protein Sequence
MLSFVDTRTL LLLAVTSCLA TCQSLQMGSV RKGPTGDRGP RGQRGPAGPR GRDGVDGPVG 60
PPGPPGAPGP PGPPGPPGLT GNFAAQYSDK GVSAGPGPMG LMGPRGPPGA VGAPGPQGFQ 120
GPAGEPGEPG QTGPAGSRGP AGPPGKAGED GHPGKPGRPG ERGVVGPQGA RGFPGTPGLP 180
GFKGIRGHNG LDGLKGQPGA QGVKGEPGAP GENGTPGQAG ARGLPGERGR VGAPGPAGAR 240
GSDGSVGPVG PAGPIGSAGP PGFPGAPGPK GELGPVGNPG PAGPAGPRGE AGLPGLSGPV 300
GPPGNPGANG LTGAKGATGL PGVAGAPGLP GPRGIPGPVG AAGATGPRGL VGEPGPAGSK 360
GETGNKGEPG SAGAQGPPGP SGEEGKRGSP GEPGSAGPAG PPGLRGSPGS RGLPGADGRA 420
GVMGPPGNRG STGPAGVRGP NGDAGRPGEP GLMGPRGLPG SPGNVGPAGK EGPVGLPGID 480
GRPGPIGPAG PRGEAGNIGF PGPKGPSGDP GKPGEKGHPG LAGARGAPGP DGNNGAQGPP 540
GPQGVQGGKG EQGPAGPPGF QGLPGPSGTA GEVGKPGERG LPGEFGLPGP AGPRGERGPP 600
GESGAAGPSG PIGIRGPSGA PGPDGNKGEA GAVGAPGSAG ASGPGGLPGE RGAAGIPGGK 660
GEKGETGLRG EIGNPGRDGA RGAPGAIGAP GPAGASGDRG EAGAAGPSGP AGPRGSPGER 720
GEVGPAGPNG FAGPAGSAGQ PGAKGEKGTK GPKGENGIVG PTGPVGAAGP SGPNGPPGPA 780
GSRGDGGPPG MTGFPGAAGR TGPPGPSGIT GPPGPPGAAG KEGIRGPRGD QGPVGRTGEI 840
GASGPPGFAG EKGPSGEPGT TGPPGTAGPQ GLLGAPGILG LPGSRGERGQ PGIAGALGEP 900
GPLGIAGPPG ARGPPGAVGS PGVNGAPGEA GRDGNPGSDG PPGRDGQPGH KGERGYPGNI 960
GPTGAAGAPG PHGSVGPAGK HGNRGEPGPA GSVGPVGAVG PRGPSGPQGI RGDKGEPGDK 1020
GARGLPGLKG HNGLQGLPGL AGLHGDQGAP GPVGPAGPRG PAGPSGPIGK DGRSGHPGPV 1080
GPAGVRGSQG SQGPAGPPGP PGPPGPPGVS GGGYDFGFEG GFYRADQPRS QPSLRPKDYE 1140
VDATLKSLNN QIETLLTPEG SRKNPARTCR DLRLSHPEWK SDYYWIDPNQ GCTMDAIKVY 1200
CDFSTGETCI QAQPVNTPAK NAYSRAQANK HVWLGETING GSQFEYNAEG VSSKEMATQL 1260
AFMRLLANRA SQNITYHCKN SIAYLDEETG RLNKAVILQG SNDVELVAEG NSRFTYTVLV 1320
DGCSKKTNEW DKTVIEYKTN KPSRLPFLDI APLDIGGTNQ EFRVEVGPVC FK 1372 
Gene Ontology
 GO:0005581; C:collagen; IEA:UniProtKB-KW.
 GO:0005576; C:extracellular region; TAS:Reactome.
 GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. 
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen 
SMART
 SM00038; COLFI 
PROSITE
 PS51461; NC1_FIB 
PRINTS