CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016722
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Centromere protein U 
Protein Synonyms/Alias
 CENP-U; MLF1-interacting protein 
Gene Name
 Mlf1ip 
Gene Synonyms/Alias
 Cenpu 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
292LKEAQMLKALKMKNTacetylation[1]
292LKEAQMLKALKMKNTsuccinylation[1]
Reference
 [1] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337
Functional Description
 Component of the CENPA-NAC (nucleosome-associated) complex, a complex that plays a central role in assembly of kinetochore proteins, mitotic progression and chromosome segregation. The CENPA-NAC complex recruits the CENPA-CAD (nucleosome distal) complex and may be involved in incorporation of newly synthesized CENPA into centromeres. Plays an important role in the correct PLK1 localization to the mitotic kinetochores. A scaffold protein responsible for the initial recruitment and maintenance of the kinetochore PLK1 population until its degradation. Involved in transcriptional repression (By similarity). 
Sequence Annotation
 MOTIF 4 21 Nuclear localization signal (Potential).
 MOTIF 295 312 Nuclear localization signal (Potential).
 MOD_RES 74 74 Phosphothreonine; by PLK1 (By
 MOD_RES 106 106 Phosphoserine (By similarity).
 MOD_RES 131 131 Phosphoserine (By similarity).
 MOD_RES 134 134 Phosphoserine (By similarity).
 MOD_RES 136 136 Phosphoserine (By similarity).
 MOD_RES 186 186 Phosphoserine (By similarity).  
Keyword
 Alternative splicing; Centromere; Chromosome; Coiled coil; Complete proteome; Cytoplasm; Kinetochore; Nucleus; Phosphoprotein; Reference proteome; Repressor; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 410 AA 
Protein Sequence
MAARRSLRYS GNPGAKHSKN TLRSTYSRKQ KAGPKPRPKD VFDFSNNSDA SSIPGALEEE 60
EETYETFDPP LHSTAIYAED ELSKHCVSSS SLATHRGKAS RNLDPSEDEA SGNESIKVST 120
KKPRRKLEPI SGESDSSADD VRRRVASAEG PRSQQRQAAP AAPSPPERPA EPVTPRRTRL 180
HSAQLSPVDE TPATQSQLKT QKKVRPSPGR RKRPRRGHTD TDGSESMHIW CLEGKRQSDI 240
TELDVILSVF EKTFLEYKQR VESESCNQAI NKFYFKMKGE LIRMLKEAQM LKALKMKNTK 300
IIANMEKKRQ RLIEVQDELI RLEPQLKQLQ TKYDDLKERK SSLKKSKHFL SNLKQLCQDY 360
SNVQEKGPKG TGKYDSSSLP ALLFKARSIL GAENHLRTIN YQLGKLLELD 410 
Gene Ontology
 GO:0005813; C:centrosome; IEA:Compara.
 GO:0000777; C:condensed chromosome kinetochore; IEA:UniProtKB-SubCell.
 GO:0005737; C:cytoplasm; IDA:MGI.
 GO:0005634; C:nucleus; IDA:MGI.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR025214; NAC_CENP-U. 
Pfam
 PF13097; CENP-U 
SMART
  
PROSITE
  
PRINTS