CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-008447
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Lumican 
Protein Synonyms/Alias
 Keratan sulfate proteoglycan lumican; KSPG lumican 
Gene Name
 Lum 
Gene Synonyms/Alias
 Lcn; Ldc 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
69PMVPPGIKYLYLRNNacetylation[1]
114IKGKVFSKLKQLKKLacetylation[1]
148DLQLANNKISKLGSFacetylation[1]
181EAVSASLKGLKSLEYacetylation[1]
184SASLKGLKSLEYLDLacetylation[1]
285NYYLEVNKLEKFDVKacetylation[1]
288LEVNKLEKFDVKSFCacetylation[1]
305LGPLSYSKIKHLRLDacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
 DOMAIN 28 66 LRRNT.
 REPEAT 67 88 LRR 1.
 REPEAT 91 114 LRR 2.
 REPEAT 117 137 LRR 3.
 REPEAT 138 159 LRR 4.
 REPEAT 160 181 LRR 5.
 REPEAT 185 205 LRR 6.
 REPEAT 206 227 LRR 7.
 REPEAT 230 250 LRR 8.
 REPEAT 255 276 LRR 9.
 REPEAT 277 296 LRR 10.
 REPEAT 305 326 LRR 11.
 MOD_RES 19 19 Pyrrolidone carboxylic acid (By
 MOD_RES 20 20 Sulfotyrosine (By similarity).
 MOD_RES 21 21 Sulfotyrosine (By similarity).
 MOD_RES 23 23 Sulfotyrosine (By similarity).
 MOD_RES 30 30 Sulfotyrosine (By similarity).
 CARBOHYD 88 88 N-linked (GlcNAc...) (keratan sulfate)
 CARBOHYD 127 127 N-linked (GlcNAc...) (keratan sulfate)
 CARBOHYD 160 160 N-linked (GlcNAc...) (keratan sulfate)
 CARBOHYD 252 252 N-linked (GlcNAc...) (keratan sulfate)
 DISULFID 295 328 By similarity.  
Keyword
 Complete proteome; Disulfide bond; Extracellular matrix; Glycoprotein; Leucine-rich repeat; Proteoglycan; Pyrrolidone carboxylic acid; Reference proteome; Repeat; Secreted; Signal; Sulfation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 338 AA 
Protein Sequence
MNVCTFTLVL ALVGSVSGQY YDYDAPLFMY GELSPNCAPE CNCPHSYPTA MYCDDLKLKS 60
VPMVPPGIKY LYLRNNQIDH IDEKAFENVT DLQWLILDHN LLENSKIKGK VFSKLKQLKK 120
LHINYNNLTE SVGPLPKSLQ DLQLANNKIS KLGSFDGLVN LTFIYLQHNQ LKEEAVSASL 180
KGLKSLEYLD LSFNQMSKLP AGLPTSLLTL YLDNNKITNI PDEYFNRFTG LQYLRLSHNE 240
LADSGVPGNS FNISSLLELD LSYNKLKSIP TVNENLENYY LEVNKLEKFD VKSFCKILGP 300
LSYSKIKHLR LDGNPLTQSS LPPDMYECLR VANEITVN 338 
Gene Ontology
 GO:0005615; C:extracellular space; IEA:Compara.
 GO:0005583; C:fibrillar collagen; IEA:Compara.
 GO:0005578; C:proteinaceous extracellular matrix; IDA:RGD.
 GO:0051216; P:cartilage development; IEP:RGD.
 GO:0030199; P:collagen fibril organization; IEA:InterPro.
 GO:0045944; P:positive regulation of transcription from RNA polymerase II promoter; IEA:Compara.
 GO:0070848; P:response to growth factor stimulus; IEP:RGD.
 GO:0014070; P:response to organic cyclic compound; IEP:RGD.
 GO:0007601; P:visual perception; IEA:InterPro. 
Interpro
 IPR001611; Leu-rich_rpt.
 IPR003591; Leu-rich_rpt_typical-subtyp.
 IPR000372; LRR-contain_N.
 IPR027219; Lumican. 
Pfam
 PF01462; LRRNT 
SMART
 SM00369; LRR_TYP
 SM00013; LRRNT 
PROSITE
 PS51450; LRR 
PRINTS