CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-002602
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(III) chain 
Protein Synonyms/Alias
  
Gene Name
 Col3a1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
1399GSNEGEFKAEGNSKFacetylation[1]
1417VLEDGCTKHTGEWSKacetylation[1]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441
Functional Description
 Collagen type III occurs in most soft connective tissues along with type I collagen. Involved in regulation of cortical development. Is the major ligand of Gpr56 in the developing brain and binding to Gpr56 inhibits neuronal migration and activates the RhoA pathway by coupling Gpr56 to Gna13 and possibly Gna12. 
Sequence Annotation
 DOMAIN 31 90 VWFC.
 DOMAIN 1230 1464 Fibrillar collagen NC1.
 REGION 155 169 Nonhelical region (N-terminal).
 REGION 170 1195 Triple-helical region.
 METAL 1278 1278 Calcium (By similarity).
 METAL 1280 1280 Calcium (By similarity).
 METAL 1281 1281 Calcium; via carbonyl oxygen (By
 METAL 1283 1283 Calcium; via carbonyl oxygen (By
 METAL 1286 1286 Calcium (By similarity).
 MOD_RES 262 262 5-hydroxylysine; alternate (By
 MOD_RES 283 283 5-hydroxylysine (By similarity).
 MOD_RES 859 859 5-hydroxylysine (By similarity).
 MOD_RES 976 976 5-hydroxylysine (By similarity).
 MOD_RES 1093 1093 5-hydroxylysine (By similarity).
 MOD_RES 1105 1105 5-hydroxylysine (By similarity).
 CARBOHYD 262 262 O-linked (Gal...); alternate (By
 DISULFID 1195 1195 Interchain (By similarity).
 DISULFID 1196 1196 Interchain (By similarity).
 DISULFID 1260 1292 By similarity.
 DISULFID 1266 1266 Interchain (with C-1283) (By similarity).
 DISULFID 1283 1283 Interchain (with C-1266) (By similarity).
 DISULFID 1300 1462 By similarity.
 DISULFID 1370 1415 By similarity.  
Keyword
 Calcium; Collagen; Complete proteome; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1464 AA 
Protein Sequence
MMSFVQSGTW FLLTLLHPTL ILAQQSNVDE LGCSHLGQSY ESRDVWKPEP CQICVCDSGS 60
VLCDDIICDE EPLDCPNPEI PFGECCAICP QPSTPAPVLP DGHGPQGPKG DPGPPGIPGR 120
NGDPGLPGQP GLPGPPGSPG ICESCPTGGQ NYSPQFDSYD VKSGVGGMGG YPGPAGPPGP 180
PGPPGSSGHP GSPGSPGYQG PPGEPGQAGP AGPPGPPGAL GPAGPAGKDG ESGRPGRPGE 240
RGLPGPPGIK GPAGMPGFPG MKGHRGFDGR NGEKGETGAP GLKGENGLPG DNGAPGPMGP 300
RGAPGERGRP GLPGAAGARG NDGARGSDGQ PGPPGPPGTA GFPGSPGAKG EVGPAGSPGS 360
NGSPGQRGEP GPQGHAGAQG PPGPPGNNGS PGGKGEMGPA GIPGAPGLIG ARGPPGPAGT 420
NGIPGTRGPS GEPGKNGAKG EPGARGERGE AGSPGIPGPK GEDGKDGSPG EPGANGLPGA 480
AGERGPSGFR GPAGPNGIPG EKGPPGERGG PGPAGPRGVA GEPGRDGTPG GPGIRGMPGS 540
PGGPGNDGKP GPPGSQGESG RPGPPGPSGP RGQPGVMGFP GPKGNDGAPG KNGERGGPGG 600
PGLPGPAGKN GETGPQGPPG PTGPAGDKGD SGPPGPQGLQ GIPGTGGPPG ENGKPGEPGP 660
KGEVGAPGAP GGKGDSGAPG ERGPPGTAGI PGARGGAGPP GPEGGKGPAG PPGPPGASGS 720
PGLQGMPGER GGPGSPGPKG EKGEPGGAGA DGVPGKDGPR GPAGPIGPPG PAGQPGDKGE 780
GGSPGLPGIA GPRGGPGERG EHGPPGPAGF PGAPGQNGEP GAKGERGAPG EKGEGGPPGP 840
AGPTGSSGPA GPPGPQGVKG ERGSPGGPGT AGFPGGRGLP GPPGNNGNPG PPGPSGAPGK 900
DGPPGPAGNS GSPGNPGIAG PKGDAGQPGE KGPPGAQGPP GSPGPLGIAG LTGARGLAGP 960
PGMPGPRGSP GPQGIKGESG KPGASGHNGE RGPPGPQGLP GQPGTAGEPG RDGNPGSDGQ 1020
PGRDGSPGGK GDRGENGSPG APGAPGHPGP PGPVGPSGKS GDRGETGPAG PSGAPGPAGA 1080
RGAPGPQGPR GDKGETGERG SNGIKGHRGF PGNPGPPGSP GAAGHQGAIG SPGPAGPRGP 1140
VGPHGPPGKD GTSGHPGPIG PPGPRGNRGE RGSEGSPGHP GQPGPPGPPG APGPCCGGGA 1200
AAIAGVGGEK SGGFSPYYGD DPMDFKINTE EIMSSLKSVN GQIESLISPD GSRKNPARNC 1260
RDLKFCHPEL KSGEYWVDPN QGCKMDAIKV FCNMETGETC INASPMTVPR KHWWTDSGAE 1320
KKHVWFGESM NGGFQFSYGP PDLPEDVVDV QLAFLRLLSS RASQNITYHC KNSIAYMDQA 1380
SGNVKKSLKL MGSNEGEFKA EGNSKFTYTV LEDGCTKHTG EWSKTVFEYQ TRKAMRLPII 1440
DIAPYDIGGP DQEFGVDIGP VCFL 1464 
Gene Ontology
 GO:0005586; C:collagen type III; IDA:MGI.
 GO:0005615; C:extracellular space; IEA:Compara.
 GO:0005201; F:extracellular matrix structural constituent; IEA:Compara.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0001568; P:blood vessel development; IMP:MGI.
 GO:0007160; P:cell-matrix adhesion; IEA:Compara.
 GO:0071230; P:cellular response to amino acid stimulus; IDA:MGI.
 GO:0032964; P:collagen biosynthetic process; IEA:Compara.
 GO:0030199; P:collagen fibril organization; IMP:MGI.
 GO:0048565; P:digestive tract development; IMP:MGI.
 GO:0043206; P:extracellular fibril organization; IEA:Compara.
 GO:0007507; P:heart development; IEA:Compara.
 GO:0007229; P:integrin-mediated signaling pathway; IEA:Compara.
 GO:0050777; P:negative regulation of immune response; IEA:Compara.
 GO:0018149; P:peptide cross-linking; IEA:Compara.
 GO:0034097; P:response to cytokine stimulus; IEA:Compara.
 GO:0009314; P:response to radiation; IEA:Compara.
 GO:0001501; P:skeletal system development; IEA:Compara.
 GO:0043588; P:skin development; IEA:Compara.
 GO:0007179; P:transforming growth factor beta receptor signaling pathway; IEA:Compara.
 GO:0042060; P:wound healing; IEA:Compara. 
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C.
 IPR001007; VWF_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen
 PF00093; VWC 
SMART
 SM00038; COLFI
 SM00214; VWC 
PROSITE
 PS51461; NC1_FIB
 PS01208; VWFC_1
 PS50184; VWFC_2 
PRINTS