CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-004005
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(VI) chain 
Protein Synonyms/Alias
  
Gene Name
 COL6A1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
301PVGYQGMKGEKGSRGubiquitination[1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983
Functional Description
 Collagen VI acts as a cell-binding protein. 
Sequence Annotation
 DOMAIN 37 235 VWFA 1.
 DOMAIN 615 805 VWFA 2.
 DOMAIN 829 1021 VWFA 3.
 REGION 20 256 N-terminal globular domain.
 REGION 257 592 Triple-helical region.
 REGION 593 1028 C-terminal globular domain.
 MOTIF 262 264 Cell attachment site.
 MOTIF 442 444 Cell attachment site.
 MOTIF 478 480 Cell attachment site.
 CARBOHYD 212 212 N-linked (GlcNAc...).
 CARBOHYD 516 516 N-linked (GlcNAc...).
 CARBOHYD 537 537 N-linked (GlcNAc...) (Potential).
 CARBOHYD 804 804 N-linked (GlcNAc...).
 CARBOHYD 896 896 N-linked (GlcNAc...).  
Keyword
 Cell adhesion; Collagen; Complete proteome; Direct protein sequencing; Disease mutation; Extracellular matrix; Glycoprotein; Hydroxylation; Polymorphism; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1028 AA 
Protein Sequence
MRAARALLPL LLQACWTAAQ DEPETPRAVA FQDCPVDLFF VLDTSESVAL RLKPYGALVD 60
KVKSFTKRFI DNLRDRYYRC DRNLVWNAGA LHYSDEVEII QGLTRMPGGR DALKSSVDAV 120
KYFGKGTYTD CAIKKGLEQL LVGGSHLKEN KYLIVVTDGH PLEGYKEPCG GLEDAVNEAK 180
HLGVKVFSVA ITPDHLEPRL SIIATDHTYR RNFTAADWGQ SRDAEEAISQ TIDTIVDMIK 240
NNVEQVCCSF ECQPARGPPG LRGDPGFEGE RGKPGLPGEK GEAGDPGRPG DLGPVGYQGM 300
KGEKGSRGEK GSRGPKGYKG EKGKRGIDGV DGVKGEMGYP GLPGCKGSPG FDGIQGPPGP 360
KGDPGAFGLK GEKGEPGADG EAGRPGSSGP SGDEGQPGEP GPPGEKGEAG DEGNPGPDGA 420
PGERGGPGER GPRGTPGTRG PRGDPGEAGP QGDQGREGPV GVPGDPGEAG PIGPKGYRGD 480
EGPPGSEGAR GAPGPAGPPG DPGLMGERGE DGPAGNGTEG FPGFPGYPGN RGAPGINGTK 540
GYPGLKGDEG EAGDPGDDNN DIAPRGVKGA KGYRGPEGPQ GPPGHQGPPG PDECEILDII 600
MKMCSCCECK CGPIDLLFVL DSSESIGLQN FEIAKDFVVK VIDRLSRDEL VKFEPGQSYA 660
GVVQYSHSQM QEHVSLRSPS IRNVQELKEA IKSLQWMAGG TFTGEALQYT RDQLLPPSPN 720
NRIALVITDG RSDTQRDTTP LNVLCSPGIQ VVSVGIKDVF DFIPGSDQLN VISCQGLAPS 780
QGRPGLSLVK ENYAELLEDA FLKNVTAQIC IDKKCPDYTC PITFSSPADI TILLDGSASV 840
GSHNFDTTKR FAKRLAERFL TAGRTDPAHD VRVAVVQYSG TGQQRPERAS LQFLQNYTAL 900
ASAVDAMDFI NDATDVNDAL GYVTRFYREA SSGAAKKRLL LFSDGNSQGA TPAAIEKAVQ 960
EAQRAGIEIF VVVVGRQVNE PHIRVLVTGK TAEYDVAYGE SHLFRVPSYQ ALLRGVFHQT 1020
VSRKVALG 1028 
Gene Ontology
 GO:0005589; C:collagen type VI; NAS:UniProtKB.
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0043234; C:protein complex; IPI:MGI.
 GO:0042383; C:sarcolemma; IEA:Compara.
 GO:0048407; F:platelet-derived growth factor binding; IDA:MGI.
 GO:0007411; P:axon guidance; TAS:Reactome.
 GO:0007155; P:cell adhesion; NAS:UniProtKB.
 GO:0071230; P:cellular response to amino acid stimulus; IEA:Compara.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome.
 GO:0070208; P:protein heterotrimerization; IPI:MGI. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS