CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-011248
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(VI) chain 
Protein Synonyms/Alias
  
Gene Name
 Col6a1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
120KASVDAVKYFGKGTYacetylation[1]
124DAVKYFGKGTYTDCAacetylation[1]
165GHPLEGYKEPCGGLEacetylation[1]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441
Functional Description
 Collagen VI acts as a cell-binding protein. 
Sequence Annotation
 DOMAIN 36 234 VWFA 1.
 DOMAIN 614 802 VWFA 2.
 DOMAIN 826 1018 VWFA 3.
 REGION 20 255 N-terminal globular domain.
 REGION 256 591 Triple-helical region.
 REGION 592 1025 C-terminal globular domain.
 MOTIF 261 263 Cell attachment site.
 MOTIF 441 443 Cell attachment site.
 MOTIF 477 479 Cell attachment site.
 CARBOHYD 211 211 N-linked (GlcNAc...) (Potential).
 CARBOHYD 515 515 N-linked (GlcNAc...) (Potential).
 CARBOHYD 536 536 N-linked (GlcNAc...) (Potential).
 CARBOHYD 801 801 N-linked (GlcNAc...) (Potential).
 CARBOHYD 893 893 N-linked (GlcNAc...) (Potential).  
Keyword
 Cell adhesion; Collagen; Complete proteome; Extracellular matrix; Glycoprotein; Hydroxylation; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1025 AA 
Protein Sequence
MRLAHALLPL LLQACWVATQ DIQGSKAIAF QDCPVDLFFV LDTSESVALR LKPYGALVDK 60
VKSFTKRFID NLRDRYYRCD RNLVWNAGAL HYSDEVEIIR GLTRMPSGRD ELKASVDAVK 120
YFGKGTYTDC AIKKGLEELL IGGSHLKENK YLIVVTDGHP LEGYKEPCGG LEDAVNEAKH 180
LGIKVFSVAI TPDHLEPRLS IIATDHTYRR NFTAADWGHS RDAEEVISQT IDTIVDMIKN 240
NVEQVCCSFE CQAARGPPGP RGDPGYEGER GKPGLPGEKG EAGDPGRPGD LGPVGYQGMK 300
GEKGSRGEKG SRGPKGYKGE KGKRGIDGVD GMKGETGYPG LPGCKGSPGF DGIQGPPGPK 360
GDAGAFGMKG EKGEAGADGE AGRPGNSGSP GDEGDPGEPG PPGEKGEAGD EGNAGPDGAP 420
GERGGPGERG PRGTPGVRGP RGDPGEAGPQ GDQGREGPVG IPGDSGEAGP IGPKGYRGDE 480
GPPGPEGLRG APGPVGPPGD PGLMGERGED GPPGNGTEGF PGFPGYPGNR GPPGLNGTKG 540
YPGLKGDEGE VGDPGEDNND ISPRGVKGAK GYRGPEGPQG PPGHVGPPGP DECEILDIIM 600
KMCSCCECTC GPIDILFVLD SSESIGLQNF EIAKDFIIKV IDRLSKDELV KFEPGQSHAG 660
VVQYSHNQMQ EHVDMRSPNV RNAQDFKEAV KKLQWMAGGT FTGEALQYTR DRLLPPTQNN 720
RIALVITDGR SDTQRDTTPL SVLCGADIQV VSVGIKDVFG FVAGSDQLNV ISCQGLSQGR 780
PGISLVKENY AELLDDGFLK NITAQICIDK KCPDYTCPIT FSSPADITIL LDSSASVGSH 840
NFETTKVFAK RLAERFLSAG RADPSQDVRV AVVQYSGQGQ QQPGRAALQF LQNYTVLASS 900
VDSMDFINDA TDVNDALSYV TRFYREASSG ATKKRVLLFS DGNSQGATAE AIEKAVQEAQ 960
RAGIEIFVVV VGPQVNEPHI RVLVTGKTAE YDVAFGERHL FRVPNYQALL RGVLYQTVSR 1020
KVALG 1025 
Gene Ontology
 GO:0005581; C:collagen; IEA:UniProtKB-KW.
 GO:0031012; C:extracellular matrix; IDA:MGI.
 GO:0005576; C:extracellular region; ISO:MGI.
 GO:0043234; C:protein complex; ISO:MGI.
 GO:0042383; C:sarcolemma; IDA:MGI.
 GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
 GO:0071230; P:cellular response to amino acid stimulus; IDA:MGI.
 GO:0070208; P:protein heterotrimerization; ISO:MGI. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS