CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-035214
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Col6a1 
Protein Synonyms/Alias
  
Gene Name
 Col6a1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
52ESVALRLKPYGALVDacetylation[1]
60PYGALVDKVKSFTKRacetylation[1]
113PSGRDELKASIDAVKacetylation[1]
120KASIDAVKYFGKGTYacetylation[1]
147LIGGSHLKENKYLIVacetylation[1]
646KVIDRLSKDELVKFEacetylation[1]
687VRNAQDFKEAVKKLQacetylation[1]
933ENSSGATKKRVLLFSacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1025 AA 
Protein Sequence
MRLAHTLLPL LLQACWVAAQ DIQGSRAIAF QDCPVDLFFV LDTSESVALR LKPYGALVDK 60
VKSFTKRFID NLRDRYYRCD RNLVWNAGAL HYSDEVEIIR GLMRMPSGRD ELKASIDAVK 120
YFGKGTYTDC AIKKGLEELL IGGSHLKENK YLIVVTDGHP LEGYKEPCGG LEDAVNEAKH 180
LGIKVFSVAI TPDHLEPRLS IIATDHTYRR NFTAADWGHS RDAEETISQT IDTIVDMIKN 240
NVEQVCCTFE CQAARGPPGP RGDPGYEGER GKPGLPGEKG EAGDPGRPGD LGPVGYQGMK 300
GEKGSRGEKG SRGPKGYKGE KGKRGIDGVD GMKGETGYPG LPGCKGSPGF DGIQGPPGPK 360
GDAGAFGLKG EKGEAGAEGE AGRPGNSGPP GDEGEPGEPG PPGEKGEAGD EGNAGPDGAP 420
GERGGPGERG PRGTPGVRGP RGDPGEAGPQ GDQGREGPVG IPGDPGESGP IGPKGYRGDE 480
GPPGPEGLRG APGPVGPPGD PGLMGERGED GPPGNGTEGF PGFPGYPGNR GPPGINGTKG 540
YPGLKGDEGE AGDPGEDNND VSPRGVKGAK GYRGPEGPQG PPGHVGPPGP DECEILDIIM 600
KMCSCCECTC GPIDILFVLD SSESIGLQNF EIAKDFIIKV IDRLSKDELV KFEPGQSHAG 660
VVQYSHNQMQ EHVDMRSPNV RNAQDFKEAV KKLQWMAGGT FTGEALQYTR DRLLPPTQNN 720
RIALVITDGR SDTQRDTTPL SVLCGSDIQV VSVGIKDVFG FVAGSDQLNV ISCQGLSQSR 780
PGISLVKENY AELLDDGFLK NITAQICIDK KCPDYTCPIT FSSPTDITIL LDSSASVGSH 840
NFETTKVFAK RLAERFLSAG REDPTQVVRV AVVQYSGQGQ QQPGRASLQF QQNYTVLASS 900
VDSMDFINDA TDVNDALSYV TRFYRENSSG ATKKRVLLFS DGNSQGATAE AIEKAVQEAQ 960
RGGIEIFVMV VGPQVNEPHI RVLVTGKTAE YDVAFGERHL FRVPNYQALL RGVLYQTVSR 1020
KVALG 1025 
Gene Ontology
 GO:0031012; C:extracellular matrix; IEA:Compara.
 GO:0005576; C:extracellular region; IEA:Compara.
 GO:0043234; C:protein complex; IEA:Compara.
 GO:0042383; C:sarcolemma; IEA:Compara.
 GO:0071230; P:cellular response to amino acid stimulus; IEA:Compara.
 GO:0070208; P:protein heterotrimerization; IEA:Compara. 
Interpro
 IPR008160; Collagen.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00092; VWA 
SMART
 SM00327; VWA 
PROSITE
 PS50234; VWFA 
PRINTS