CPLM 1.0 - Compendium of Protein Lysine Modification
Tag	Content
CPLM ID CPLM-039048
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Collagen alpha-2(I) chain 
Protein Synonyms/Alias
  
Gene Name
 Col1a2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position	Peptide	Type	References
1070	GPSGPIGKDGRSGHP	acetylation	[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1372 AA 
Protein Sequence
MLSFVDTRTL LLLAVTSCLA TCQWKTLGFM RYKVSGFMVP QGYSGPAGPR GRDGVDGPVG 60
PPGPPGAPGP PGPPGPPGLT GNFAAQYSDK GVSAGPGPMG LMGPRGPPGA VGAPGPQGFQ 120
GPAGEPGEPG QTGPAGSRGP AGPPGKAGED GHPGKPGRPG ERGVVGPQGA RGFPGTPGLP 180
GFKGIRGHNG LDGLKGQPGA QGVKGEPGAP GENGTPGQAG ARGLPGERGR VGAPGPAGAR 240
GSDGSVGPVG PAGPIGSAGP PGFPGAPGPK GELGPVGNPG PAGPAGPRGE AGLPGLSGPV 300
GPPGNPGANG LTGAKGATGL PGVAGAPGLP GPRGIPGPVG AAGATGPRGL VGEPGPAGSK 360
GETGNKGEPG SAGAQGPPGP SGEEGKRGSP GEPGSAGPAG PPGLRGSPGS RGLPGADGRA 420
GVMGPPGNRG STGPAGVRGP NGDAGRPGEP GLMGPRGLPG SPGNVGPAGK EGPVGLPGID 480
GRPGPIGPAG PRGEAGNIGF PGPKGPSGDP GKPGEKGHPG LAGARGAPGP DGNNGAQGPP 540
GPQGVQGGKG EQGPAGPPGF QGLPGPSGTA GEVGKPGERG LPGEFGLPGP AGPRGERGPP 600
GESGAAGPSG PIGSRGPSGA PGPDGNKGEA GAVGAPGSAG ASGPGGLPGE RGAAGIPGGK 660
GEKGETGLRG EIGNPGRDGA RGAPGAIGAP GPAGASGDRG EAGAAGPSGP AGPRGSPGER 720
GEVGPAGPNG FAGPAGSAGQ PGAKGEKGTK GPKGENGIVG PTGPVGAAGP SGPNGPPGPA 780
GSRGDGGPPG MTGFPGAAGR TGPPGPSGIT GPPGPPGAAG KEGIRGPRGD QGPVGRTGEI 840
GASGPPGFAG EKGPSGEPGT TGPPGTAGPQ GLLGAPGILG LPGSRGERGL PGIAGALGEP 900
GPLGIAGPPG ARGPPGAVGS PGVNGAPGEA GRDGNPGSDG PPGRDGQPGH KGERGYPGNI 960
GPTGAAGAPG PHGSVGPAGK HGNRGEPGPA GSVGPVGAVG PRGPSGPQGI RGDKGEPGDK 1020
GARGLPGLKG HNGLQGLPGL AGLHGDQGAP GPVGPAGPRG PAGPSGPIGK DGRSGHPGPV 1080
GPAGVRGSQG SQGPAGPPGP PGPPGPPGVS GGGYDFGFEG DFYRADQPRS QPSLRPKDYE 1140
VDATLKSLNN QIETLLTPEG SRKNPARTCR DLRLSHPEWK SDYYWIDPNQ GCTMDAIKVY 1200
CDFSTGETCI QAQPVNTPAK NAYSRAQANK HVWLGETING GSQFEYNAEG VSSKEMATQL 1260
AFMRLLANRA SQNITYHCKN SIAYLDEETG RLNKAVILQG SNDVELVAEG NSRFTYTVLV 1320
DGCSKKTNEW DKTIIEYKTN KPSRLPFLDI APLDIGGTNQ EFRVEVGPVC FK 1372 
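
The modified lysine listed under Lysine Modification (position 1070) can be cross-checked against the numbered sequence above. The following is a minimal sketch, not part of the CPLM record: the helper names `parse_numbered_line` and `site_window` are hypothetical, and the flank of 7 residues is an assumption inferred from the 15-residue peptide shown in the record. It parses the one sequence row that ends at 1080 and extracts the window centered on position 1070.

```python
def parse_numbered_line(line):
    """Split a numbered sequence row into (residues, last_position).

    Assumes the row layout used in this record: space-separated
    10-residue blocks followed by the position of the last residue.
    """
    *blocks, end = line.split()
    return "".join(blocks), int(end)


def site_window(residues, end, position, flank=7):
    """Return the residues within +/- flank of a 1-based position.

    flank=7 is an assumption (it yields a 15-residue peptide).
    The window is truncated if the site lies near the edge of the row.
    """
    start = end - len(residues) + 1   # 1-based position of the first residue
    idx = position - start            # 0-based index within this row
    if not 0 <= idx < len(residues):
        raise ValueError("position is not on this row")
    return residues[max(0, idx - flank): idx + flank + 1]


# Row copied from the Protein Sequence field (residues 1021 to 1080).
row = "GARGLPGLKG HNGLQGLPGL AGLHGDQGAP GPVGPAGPRG PAGPSGPIGK DGRSGHPGPV 1080"
residues, end = parse_numbered_line(row)
site_residue = residues[1070 - (end - len(residues) + 1)]
peptide = site_window(residues, end, 1070)
print(site_residue, peptide)
```

For this row the residue at position 1070 is a lysine (K) and the extracted window matches the peptide listed in the modification table.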
Gene Ontology
 GO:0005581; C:collagen; IEA:InterPro.
 GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro. 
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen 
SMART
 SM00038; COLFI 
PROSITE
 PS51461; NC1_FIB 
PRINTS