CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-024516
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-6(VI) chain 
Protein Synonyms/Alias
  
Gene Name
 COL6A6 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
2216LKEDVLQKAKFFQDKubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
 Collagen VI acts as a cell-binding protein (By similarity). 
Sequence Annotation
 DOMAIN 27 206 VWFA 1.
 DOMAIN 229 411 VWFA 2.
 DOMAIN 436 606 VWFA 3.
 DOMAIN 622 791 VWFA 4.
 DOMAIN 809 982 VWFA 5.
 DOMAIN 1000 1171 VWFA 6.
 DOMAIN 1187 1371 VWFA 7.
 DOMAIN 1757 1937 VWFA 8.
 DOMAIN 1965 2166 VWFA 9.
 REGION 20 1391 Nonhelical region.
 REGION 1392 1725 Triple-helical region.
 REGION 1726 2263 Nonhelical region.
 MOTIF 1508 1510 Cell attachment site (Potential).
 CARBOHYD 198 198 N-linked (GlcNAc...).
 CARBOHYD 275 275 N-linked (GlcNAc...).
 CARBOHYD 288 288 N-linked (GlcNAc...).
 CARBOHYD 298 298 N-linked (GlcNAc...).
 CARBOHYD 347 347 N-linked (GlcNAc...) (Potential).
 CARBOHYD 520 520 N-linked (GlcNAc...) (Potential).
 CARBOHYD 930 930 N-linked (GlcNAc...).
 CARBOHYD 988 988 N-linked (GlcNAc...).
 CARBOHYD 1290 1290 N-linked (GlcNAc...).  
Keyword
 Alternative splicing; Cell adhesion; Collagen; Complete proteome; Extracellular matrix; Glycoprotein; Hydroxylation; Polymorphism; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2263 AA 
Protein Sequence
MMLLILFLVI ICSHISVNQD SGPEYADVVF LVDSSDRLGS KSFPFVKMFI TKMISSLPIE 60
ADKYRVALAQ YSDKLHSEFH LSTFKGRSPM LNHLRKNFGF IGGSLQIGKA LQEAHRTYFS 120
APANGRDKKQ FPPILVVLAS SESEDNVEEA SKALRKDGVK IISVGVQKAS EENLKAMATS 180
QFHFNLRTVR DLSMFSQNMT HIIKDVIKYK EGAVDDIFVE ACQGPSMADV VFLLDMSING 240
SEENFDYLKG FLEESVSALD IKENCMRVGL VAYSNETKVI NSLSMGINKS EVLQHIQNLS 300
PRTGKAYTGA AIKKLRKEVF SARNGSRKNQ GVPQIAVLVT HRDSEDNVTK AAVNLRREGV 360
TIFTLGIEGA SDTQLEKIAS HPAEQYVSKL KTFADLAAHN QTFLKKLRNQ ITHTVSVFSE 420
RTETLKSGCV DTEEADIYLL IDGSGSTQAT DFHEMKTFLS EVVGMFNIAP HKVRVGAVQY 480
ADSWDLEFEI NKYSNKQDLG KAIENIRQMG GNTNTGAALN FTLSLLQKAK KQRGNKVPCH 540
LVVLTNGMSK DSILEPANRL REEHIRVYAI GIKEANQTQL REIAGEEKRV YYVHDFDALK 600
DIRNQVVQEI CTEEACKEMK ADIMFLVDSS GSIGPENFSK MKTFMKNLVS KSQIGPDRVQ 660
IGVVQFSDIN KEEFQLNRFM SQSDISNAID QMAHIGQTTL TGSALSFVSQ YFSPTKGARP 720
NIRKFLILIT DGEAQDIVKE PAVVLRQEGV IIYSVGVFGS NVTQLEEISG RPEMVFYVEN 780
FDILQRIEDD LVFGICSPRE ECKRIEVLDV VFVIDSSGSI DYDEYNIMKD FMIGLVKKAD 840
VGKNQVRFGA LKYADDPEVL FYLDDFGTKL EVISVLQNDQ AMGGSTYTAE ALGFSDHMFT 900
EARGSRLNKG VPQVLIVITD GESHDADKLN ATAKALRDKG ILVLAVGIDG ANPVELLAMA 960
GSSDKYFFVE TFGGLKGIFS DVTASVCNSS KVDCEIDKVD LVFLMDGSTS IQPNDFKKMK 1020
EFLASVVQDF DVSLNRVRIG AAQFSDTYHP EFPLGTFIGE KEISFQIENI KQIFGNTHIG 1080
AALREVEHYF RPDMGSRINT GTPQVLLVLT DGQSQDEVAQ AAEALRHRGI DIYSVGIGDV 1140
DDQQLIQITG TAEKKLTVHN FDELKKVNKR IVRNICTTAG ESNCFVDVVV GFDVSTQEKG 1200
QTLLEGQPWM ETYLQDILRA ISSLNGVSCE VGTETQVSVA FQVTNAMEKY SPKFEIYSEN 1260
ILNSLKDITV KGPSLLNANL LDSLWDTFQN KSAARGKVVL LFSDGLDDDV EKLEQKSDEL 1320
RKEGLNALIT VALDGPADSS DLADLPYIEF GKGFEYRTQL SIGMRELGSR LSKQLVNVAE 1380
RTCCCLFCKC IGGDGTMGDP GPPGKRGPPG FKGSEGYLGE EGIAGERGAP GPVGEQGTKG 1440
CYGTKGPKGN RGLNGQEGEV GENGIDGLNG EQGDNGLPGR KGEKGDEGSQ GSPGKRGTPG 1500
DRGAKGLRGD PGAPGVDSSI EGPTGLKGER GRQGRRGWPG PPGTPGSRRK TAAHGRRGHT 1560
GPQGTAGIPG PDGLEGSLGL KGPQGPRGEA GVKGEKGGVG SKGPQGPPGP GGEAGNQGRL 1620
GSQGNKGEPG DLGEKGAVGF PGPRGLQGND GSPGYGSVGR KGAKGQEGFP GESGPKGEIG 1680
DPGGPGETGL KGARGKMISA GLPGEMGSPG EPGPPGRKGV KGAKGLASFS TCELIQYVRD 1740
RSPGRHGKPE CPVHPTELVF ALDHSRDVTE QEFERMKEMM AFLVRDIKVR ENSCPVGAHI 1800
AILSYNSHAR HLVRFSDAYK KSQLLREIET IPYERSSASR EIGRAMRFIS RNVFKRTLPG 1860
AHTRKIATFF SSGQSADAHS ITTAAMEFGA LEIIPVVITF SNVPSVRRAF AIDDTGTFQV 1920
IVVPSGADYI PALERLQRCT FCYDVCKPDA SCDQARPPPV QSYMDAAFLL DASRNMGSAE 1980
FEDIRAFLGA LLDHFEITPE PETSVTGDRV ALLSHAPPDF LPNTQKSPVR AEFNLTTYRS 2040
KRLMKRHVHE SVKQLNGDAF IGHALQWTLD NVFLSTPNLR RNKVIFVISA GETSHLDGEI 2100
LKKESLRAKC QGYALFVFSL GPIWDDKELE DLASHPLDHH LVQLGRIHKP DHSYGVKFVK 2160
SFINSIRRAI NKYPPINLKI KCNRLNSIDP KQPPRPFRSF VPGPLKATLK EDVLQKAKFF 2220
QDKKYLSRVA RSGRDDAIQN FMRSTSHTFK NGRMIESAPK QHD 2263 
Gene Ontology
 GO:0005581; C:collagen; IEA:UniProtKB-KW.
 GO:0031012; C:extracellular matrix; IDA:MGI.
 GO:0005576; C:extracellular region; TAS:Reactome.
 GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS