CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-034440
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Kmt2a 
Protein Synonyms/Alias
  
Gene Name
 Kmt2a 
Gene Synonyms/Alias
 Mll1 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type         References
240       SETKSADKIKKKDSK  acetylation  [1]
270       KIKITHGKDIAELTQ  acetylation  [2, 3]
280       AELTQGSKEDSLKKV  acetylation  [2]
404       RTDATIAKQLLQRAK  acetylation  [2, 3]
670       FSSAKYAKEGLIRKP  acetylation  [2]
1160      EPLAPPIKPIKPVTR  acetylation  [2]
1163      APPIKPIKPVTRNKA  acetylation  [2]
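Each peptide in this record appears to be a 15-residue window: the modified lysine plus seven residues on either side. That reading is inferred from the entries themselves and is not stated by CPLM. A minimal Python sketch of the window extraction follows, checked against the site at position 240. The function name `site_window` and its `flank` and `offset` parameters are hypothetical, chosen only for illustration.

```python
def site_window(sequence, position, flank=7, offset=1):
    """Return the residues from position - flank to position + flank.

    `position` is a 1-based residue number; `offset` is the residue number
    of the first character in `sequence` (1 for a full-length protein).
    """
    i = position - offset  # 0-based index of the modified residue
    if i - flank < 0 or i + flank >= len(sequence):
        raise ValueError("window extends past the supplied sequence")
    return sequence[i - flank : i + flank + 1]


# Residues 231-250 of the protein sequence listed in this record.
fragment = "NKSETKSADKIKKKDSKSIE"

peptide = site_window(fragment, 240, offset=231)
print(peptide)     # the 15-residue window around position 240
print(peptide[7])  # the central residue, expected to be the modified lysine
```

Windows that would run past either terminus raise an error here; how CPLM handles such sites is not shown in this record.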
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
 [2] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [3] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1438 AA 
Protein Sequence
MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPQAGGGG PGAPPSPPAV 60
AAAAAGSSGA GVPGGAAAAS AASSSSASSS SSSSSSASSG PALLRVGPGF DAALQVSAAI 120
GTNLRRFRAV FGESGGGGGS GELTTQIPGS WRTKGLIHDI KAELVRLLAW SWCLNDEQFL 180
GFGSDEEVRV RSPTRSPSVK ASPRKPRGRP RSGSDRNPAI LSDPSVFSPL NKSETKSADK 240
IKKKDSKSIE KKRGRPPTFP GVKIKITHGK DIAELTQGSK EDSLKKVKRT PSAMFQQATK 300
IKKLRAGKLS PLKSKFKTGK LQIGRKGVQI VRRRGRPPST ERIKTPSGLL INSELEKPQK 360
VRKDKEGTPP LTKEDKTVVR QSPRRIKPVR IIPSCKRTDA TIAKQLLQRA KKGAQKKIEK 420
EAAQLQGRKV KTQVKNIRQF IMPVVSAISS RIIKTPRRFI EDEDYDPPMK IARLESTPNS 480
RFSATSCGSS EKSSAASQHS SQMSSDSSRS SSPSIDTTSD SQASEEIQAL PEERSNTPEV 540
HTPLPISQSP ENESNDRRSR RYSMSERSFG SRATKKLPTL QSAPQQQTSS SPPPPLLTPP 600
PPLQPASGIS DHTPWLMPPT IPLASPFLPA SAAPMQGKRK SILREPTFRW TSLKHSRSEP 660
QYFSSAKYAK EGLIRKPIFD NFRPPPLTPE DVGFASGFSA SGTAASARLF SPLHSGTRFD 720
IHKRSPILRA PRFTPSEAHS RIFESVTLPS NRTSSGASSS GVSNRKRKRK VFSPIRSEPR 780
SPSHSMRTRS GRLSTSELSP LTPPSSVSSS LSIPVSPLAA SALNPTFTFP SHSLTQSGES 840
TEKNQRARKQ TSALAEPFSS NSPALFPWFT PGSQTEKGRK KDTAPEELSK DRDADKSVEK 900
DKSRERDRER EKENKRESRK EKRKKGSDIQ SSSALYPVGR VSKEKVAGED VGTSSSAKKA 960
TGRKKSSSLD SGADVAPVTL GDTTAVKAKI LIKKGRGNLE KNNLDLGPAA PSLEKERTPC 1020
LSAPSSSTVK HSTSSIGSML AQADKLPMTD KRVASLLKKA KAQLCKIEKS KSLKQTDQPK 1080
AQGQESDSSE TSVRGPRIKH VCRRAAVALG RKRAVFPDDM PTLSALPWEE REKILSSMGN 1140
DDKSSVAGSE DAEPLAPPIK PIKPVTRNKA PQEPPVKKGR RSRRCGQCPG CQVPEDCGIC 1200
TNCLDKPKFG GRNIKKQCCK MRKCQNLQWM PSKASLQKQT KAVKKKEKKS KTTEKKESKE 1260
STAVKSPLEP AQKAAPPPRE EPAPKKSSSE PPPRKPVEEK SEEGGAPAPA PAPEPKQVSA 1320
PASRKSSKQV SQPAAVVPPQ PPSTAPQKKE APKAVPSEPK KKQPPPPEPG PEQSKQKKVA 1380
PRPSIPVKQK PKDKEKPPPV SKQENAGTLN ILNPLSNGIS SKQKIPADGV HRIRVDFK 1438 
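The sequence above is laid out in blocks of ten residues, with a running residue count at the end of each line. As a rough sketch of how that layout could be turned back into a plain string, the Python below joins the blocks and checks each running count against the accumulated length. The helper name `join_sequence_lines` is hypothetical, and only the first two lines of the record are used to keep the example short.

```python
def join_sequence_lines(lines):
    """Concatenate block-formatted sequence lines into one string.

    Each line is expected to hold whitespace-separated residue blocks
    followed by the running residue count at the end of that line.
    """
    sequence = ""
    for line in lines:
        *blocks, count = line.split()
        sequence += "".join(blocks)
        if len(sequence) != int(count):
            raise ValueError(
                f"running count {count} does not match length {len(sequence)}"
            )
    return sequence


# First two lines of the protein sequence in this record.
lines = [
    "MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPQAGGGG PGAPPSPPAV 60",
    "AAAAAGSSGA GVPGGAAAAS AASSSSASSS SSSSSSASSG PALLRVGPGF DAALQVSAAI 120",
]

sequence = join_sequence_lines(lines)
print(len(sequence))
```

Applied to all lines of the record, the same check would be expected to end at the stated protein length of 1438 residues.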
Gene Ontology
 GO:0005634; C:nucleus; IDA:MGI.
 GO:0003682; F:chromatin binding; IDA:MGI.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0009952; P:anterior/posterior pattern specification; IMP:MGI.
 GO:0006306; P:DNA methylation; IMP:MGI.
 GO:0035162; P:embryonic hemopoiesis; IMP:MGI.
 GO:0008285; P:negative regulation of cell proliferation; IMP:MGI.
 GO:0045944; P:positive regulation of transcription from RNA polymerase II promoter; IGI:MGI.
 GO:0051569; P:regulation of histone H3-K4 methylation; IMP:MGI. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif.
 IPR002857; Znf_CXXC. 
Pfam
 PF02008; zf-CXXC 
SMART
 SM00384; AT_hook 
PROSITE
 PS51058; ZF_CXXC 
PRINTS