CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-024449
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-5(VI) chain 
Protein Synonyms/Alias
 Collagen alpha-1(XXIX) chain 
Gene Name
 Col6a5 
Gene Synonyms/Alias
 Col29a1; Gm7455 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
98NPMLNHLKKNFGFIGacetylation[1]
Reference
 [1] Quantification of mitochondrial acetylation dynamics highlights prominent sites of metabolic regulation.
 Still AJ, Floyd BJ, Hebert AS, Bingman CA, Carson JJ, Gunderson DR, Dolan BK, Grimsrud PA, Dittenhafer-Reed KE, Stapleton DS, Keller MP, Westphall MS, Denu JM, Attie AD, Coon JJ, Pagliarini DJ.
 J Biol Chem. 2013 Jul 17;. [PMID: 23864654
Functional Description
 Collagen VI acts as a cell-binding protein (By similarity). 
Sequence Annotation
 DOMAIN 30 209 VWFA 1.
 DOMAIN 268 445 VWFA 2.
 DOMAIN 474 644 VWFA 3.
 DOMAIN 660 829 VWFA 4.
 DOMAIN 846 1023 VWFA 5.
 DOMAIN 1037 1214 VWFA 6.
 DOMAIN 1226 1413 VWFA 7.
 DOMAIN 1790 1970 VWFA 8.
 DOMAIN 1996 2186 VWFA 9.
 DOMAIN 2321 2516 VWFA 10.
 REGION 19 1426 Nonhelical region.
 REGION 1427 1760 Triple-helical region.
 REGION 1761 2640 Nonhelical region.
 MOTIF 1649 1651 Cell attachment site (Potential).
 MOTIF 2216 2218 Cell attachment site (Potential).
 MOTIF 2259 2261 Cell attachment site (Potential).
 CARBOHYD 201 201 N-linked (GlcNAc...) (Potential).
 CARBOHYD 292 292 N-linked (GlcNAc...) (Potential).
 CARBOHYD 614 614 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2541 2541 N-linked (GlcNAc...) (Potential).  
Keyword
 Cell adhesion; Collagen; Complete proteome; Extracellular matrix; Glycoprotein; Hydroxylation; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2640 AA 
Protein Sequence
MKLRLIAFVL ILWTETLADQ SPGPGPEYAD VVFLVDSSNY LGIKSFPFVR TFLNRMISSL 60
PIEANKYRVA LAQYSDALHN EFQLGTFKNR NPMLNHLKKN FGFIGGSLKI GNALQEAHRT 120
YFSAPTNGRD KKQFPPILVV LASAESEDDV EEAAKALRED GVKIISVGVQ KASEENLKAM 180
ATSQFHFNLR TARDLGMFAP NMTRIIKDVT QYREGTTVDL ITAVAPTTPA APATPAAPTI 240
PAALTTAANH VDKTVPFPTS CQKDSLADLI FLVDESVGTT QNLRDLQNFL ENVTSSVDVK 300
DNCMRLGLMS FSDRAQTISS LRSSANQSEF QQQIQKLSLQ TGASNVGAAI EQMRKEGFSE 360
SSGSRKAQGV PQIAVLVTHR ASDDMVREAA LDLRLEGVTM FAMGIEGANN TQLEDIVSYP 420
SRQSISTHSS YSHLESYSGN FLKKIRNEIW TQVSTRAEQM ELDKTGCVDT KEADIYFLID 480
GSSSIRKKEF EQIQIFMSSV IDMFPIGPNK VRVGVVQYSH KNEVEFPVSR YTDGIDLKKA 540
VFNIKQLKGL TFTGKALDFI LPLIKKGKTE RTDRAPCYLI VLTDGKSNDS VLEPANRLRA 600
EQITIHAIGI GEANKTQLRQ IAGKDERVNF GQNFDSLKSI KNEIVHRICS EKGCEDMKAD 660
IMFLVDSSGS IGPTNFETMK TFMKNLVGKI QIGADRSQVG VVQFSDYNRE EFQLNKYSTH 720
EEIYAAIDRM SPINRNTLTG GALTFVNEYF DLSKGGRPQV RKFLILLTDG KAQDEVGGPA 780
TALRSKSVTI FSVGVYGANR AQLEEISGDG SLVFHVENFD HLKAIESKLI FRVCALHDCK 840
RIELLDIVFV LDHSGSIGPR EQESMMNLTI HLVKKADVGR DRVQIGALTY SNHPEILFYL 900
NTYSSGSAIA EHLRRPRDTG GETYTAKALQ HSNVLFTEEH GSRLTQNVRQ LMIVITDGVS 960
HDRDKLDEAA RELRDKGITI FAVGVGNANQ DELETMAGKK ENTVHVDNFD KLRDIYLPLQ 1020
ETLCNNSQET CNLPEADVIF LCDGSDMVSD SEFVTMTTFL SDLIDNFDIE SQRMKIGMAQ 1080
YGSRYQEIIE LESSLNKTQW KSQVHSVAQS KGLPRLDFAL KHVSDMFDPS VGGRRNAGVP 1140
QTLVVITSSS PRYDVTDAVK VLKDLGICVL ALGIGDVYKE QLLPITGNSE KIITFRDFNK 1200
LKNVDVKKRM VREICQSCGK ANCFVDVVVG FDISTHRQGQ PLFQGHPRLE SYLPGILEDI 1260
TSIRGVSCGA GAEAQVSLAF KVNSDQEFPA KFQIYQKAAF DSLLHVTVRG PTHLDAPFLQ 1320
SLWDMFEERS ASRGQVLLIF SDGLQGESIT LLERQSDRLR EAGLDALLVV SLNTFGHDEF 1380
SSFEFGKGFD YRTQLTIGML DLGKTLSQYL GNIAERACCC TFCKCPGIPG PHGTRGLQAS 1440
KGSSGPKGSR GHRGEDGDPG RRGEIGLQGD RGVVGCPGTR GQEGVKGFSG AQGEHGEDGL 1500
DGLDGEEGFY GFRGGKGQKG DPGNQGYPGI RGAAGEDGEK GFPGDPGDPG KDSNIKGQKG 1560
EKGERGRQGI TGQKGTHGRP SSKGSRGMEG QRGPQGPSGQ AGNPGPQGTQ GPEGLQGSQG 1620
SSGNRGGKGD KGSQGYQGPQ GSPGPAGPRG DIGRPGFGGR KGEPGVPGGP GPVGPPGQRG 1680
KQGDYGIPGY GQTGRKGVKG PTGFPGDPGQ KGDAGNPGIP GGPGPKGFKG LTLSQGLKGR 1740
SGLQGSQGPP GRRGPKGTAG QPIYSPCELI QFLRDHSPCW KDKCPVYPTE LVFALDQSSG 1800
ITERRFNETR DTITSIVSDL NIRENNCPVG ARVAVVSYDS DTSYLIRGSD YHNKKHLLQL 1860
LSQIKYQVPR KARDIGNAMR FVARNVFKRM SAGTNTRRVA VFFSNGQAAS RASILTATME 1920
LSALDISLAV FAYNERVFLD EAFGFDDTGT FQVIPVPPVG DYEPLEKLRR CTLCYDKCFP 1980
NTCAEEPFFP ENSYMDVAFL LDNSKNIASD DFQAVKALVS SVIDSFHITS NPSASESGDR 2040
VALLSYSPSE SSRRKGRVKT EFAFTTYDNQ SIMKNYIYTS LQQLNGDATI GLALQWAMEG 2100
LFLGTPNPRK HKVIIVISAG ENHEEKEFVK TVALRAKCQG YVVFVISLGS TQRDEMEELA 2160
SYPLDHHLIQ LGRMYKPDLN YIVKFLKPFI YSVRRGFNQY PPPTLKDDCR LVELERGDTL 2220
PHGLRLTAKL REVPESTISL ADQELNAGKD SSFVLEDHRG DHLVYVPSQM LEPHKLVSHY 2280
GNDRESVAMA SLTSEHESHG REELGLAHEP GDASLQEYYM DVAFLIDASQ RVGGRNEFKE 2340
VRTLITSVLD YFHIAPAPLT SVLGDRVAVL TYSPPGYLPN TEECPVYLEF DLVTYNTVHQ 2400
MKHHLQESLQ QLNGDVFIGH ALQWTVDNVF VGTPNLRKNK VIFIVTAGET NPLDKEVLRN 2460
ASLRAKCQGY SIFVFSFGPI HNDMELEELA SHPLDHHLVR LGRVHRPDLD YVIKFIKPFV 2520
HSIRRAINKY PGRDLQAKCD NLTFPGPENA GTEDSALLIP EVYRIEAGEN ELSGDSGSQE 2580
QHFFLLGNSH GNHSESTADL MRQLYLLLSS GELMVNDKEE PCSAETPAPV NSKQDGEDAR 2640 
Gene Ontology
 GO:0005581; C:collagen; IEA:UniProtKB-KW.
 GO:0005576; C:extracellular region; ISO:MGI.
 GO:0007155; P:cell adhesion; IEA:UniProtKB-KW. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS