CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038876
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Procollagen, type VI, alpha 2, isoform CRA_a 
Protein Synonyms/Alias
 Protein Col6a2 
Gene Name
 Col6a2 
Gene Synonyms/Alias
 rCG_61044 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
128SDRASFTKSLQGIRSacetylation[1]
724FAYNQLIKESRRQKTacetylation[1]
1008RAAIFREKDFDSLAQacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Collagen; Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1027 AA 
Protein Sequence
MTTTTKMLQG SFSVLLLGGL LGVLHAQQQE AISPDISTTD RNNNCPEKAD CPVNVYFVLD 60
TSESVAMQSP TDSLLYHMQQ FVPQFISQLQ NEFYLEQVAL SWRYGGLHFS DQVEVFSPPG 120
SDRASFTKSL QGIRSFRRGT FTDCALANMT QQIRQHVGRG VVNFAVVITD GHVTGNPCGG 180
IKMQAERARE EGIRLFAVAP NRNLNEQGLR DIANTPHELY RNNYATMRPD STEIDQDTIN 240
RIIKVMKHEA YGECYKVSCL EIPGPHGPKG YRGQKGAKGN MGEPGEPGQK GRQGDPGIEG 300
PIGFPGPKGV PGFKGEKGEF GSDGRKGAPG LAGKNGTDGQ KGKLGRIGPP GCKGDPGSRG 360
PDGYPGEAGS PGEQGDQGAK GDSGRPGRRG PPGNPGDKGS KGYRGNSGAP GSPGVKGGKG 420
GPGPRGPKGE PGRRGDPGTK GGPGSDGPKG EKGDPGPEGP RGLAGEIGSK GAKGDRGLPG 480
PRGPQGALGE PGKQGSRGDP GDAGPRGDSG QPGPKGDPGR PGFSYPGPRG TPGEKGEPGP 540
PGPEGGRGDF GLKGAPGRKG EKGEPADPGP PGEPGPRGPR GIPGPEGEPG PPGDPGLTEC 600
DVMTYVRETC GCCDCEKRCG ALDVVFVIDS SESIGYTNFT LEKNFVINVV NRLGAIAKDP 660
KSETGTRVGV VQYSHEGTFE AIRLDDERVN SLSSFKEAVK NLEWIAGGTW TPSALKFAYN 720
QLIKESRRQK TRVFAVVITD GRHDPRDDDL NLRALCDRDV TVTAIGIGDM FHETHESENL 780
YSIACDKPQQ VRNMTLFSDL VAEKFIDDME DVLCPDPQIV CPELPCQTEL YVAQCTQRPV 840
DIVFLLDGSE RLGEQNFYKA RRFVEEVSRR LTLARRDDDP LNARMALLQY GSQNQQQVAF 900
PLTYNVTTIH EALERTTYLN SFSHVGTGIV HAINNVVRGA RGGARRHAEL SFVFLTDGVT 960
GNDSLEESVH SMRKQNVVPT VVAVGGDVDM DVLTKISLGD RAAIFREKDF DSLAQPSFFD 1020
RFIRWIC 1027 
Gene Ontology
 GO:0005581; C:collagen; IEA:UniProtKB-KW.
 GO:0005615; C:extracellular space; IEA:Compara.
 GO:0043234; C:protein complex; IEA:Compara.
 GO:0042383; C:sarcolemma; IEA:Compara.
 GO:0070208; P:protein heterotrimerization; IEA:Compara.
 GO:0009749; P:response to glucose stimulus; IEP:RGD. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS