CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-043666
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Histone-lysine N-methyltransferase MLL3 
Protein Synonyms/Alias
  
Gene Name
 KMT2C 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type         References
308       KKKEQENKTLVLSDK  acetylation  [1]
315       KTLVLSDKHSPQKKS  acetylation  [1, 2, 3]
320       SDKHSPQKKSTVTNE  acetylation  [1, 2, 3]
321       DKHSPQKKSTVTNEV  acetylation  [2]
338       EVLSPNSKVESKCET  acetylation  [1, 3, 4]
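Each modification row carries four fields: the lysine position, a peptide window around it, the modification type, and the reference numbers. A minimal Python sketch of splitting one row into those fields follows. It assumes the peptide is written in uppercase letters and the type in lowercase, and it accepts rows with or without whitespace between the fields. The name `parse_site` is hypothetical and is not part of CPLM.

```python
import re

# position, peptide (uppercase), type (lowercase), bracketed reference list
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[([\d,\s]+)\]\s*$")

def parse_site(row):
    """Split one lysine-modification row into its four fields."""
    match = ROW.match(row)
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    position, peptide, mod_type, refs = match.groups()
    return {
        "position": int(position),
        "peptide": peptide,
        "type": mod_type,
        "references": [int(r) for r in refs.split(",")],
    }

site = parse_site("315KTLVLSDKHSPQKKSacetylation[1, 2, 3]")
```

In the rows of this entry the peptide is 15 residues long and the modified lysine is the middle residue, so `site["peptide"][7]` is `"K"`.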
Reference
 [1] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [2] Monoclonal antibody cocktail as an enrichment tool for acetylome analysis.
 Shaw PG, Chaerkady R, Zhang Z, Davidson NE, Pandey A.
 Anal Chem. 2011 May 15;83(10):3623-6. [PMID: 21466224]
 [3] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [4] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2441 AA 
Protein Sequence
XERFLVPPQQ IQGSGVSPQL RRSVSVDMPR PLNNSQMNNP VGLPQHFSPQ SLPVQQHNIL 60
GQAYIELRHR APDGRQRLPF SAPPGSVVEA SSNLRHGNFI PRPDFPGPRH TDPMRRPPQG 120
LPNQLPVHPD LEQVPPSQQE QGHSVHSSSM VMRTLNHPLG GEFSEAPLST SVPSETTSDN 180
LQITTQPSDG LEEKLDSDDP SVKELDVKDL EGVEVKDLDD EDLENLNLDT EDGKVVELDT 240
LDNLETNDPN LDDLLRSGEF DIIAYTDPEL DMGDKKSMFN EELDLPIDDK LDNQCVSVEP 300
KKKEQENKTL VLSDKHSPQK KSTVTNEVKT EVLSPNSKVE SKCETEKNDE NKDNVDTPCS 360
QASAHSDLND GEKTSLHPCD PDLFEKRTNR ETAGPSANVI QASTQLPAQD VINSCGITGS 420
TPVLSSLLAN EKSDNSDIRP SGSPPPPTLP ASPSNHVSSL PPFIAPPGRV LDNAMNSNVT 480
VVSRVNHVFS QGVQVNPGLI PGQSTVNHSL GTGKPATQTG PQTSQSGTSS MSGPQQLMIP 540
QTLAQQNRER PLLLEEQPLL LQDLLDQERQ EQQQQRQMQA MIRQRSEPFF PNIDFDAITD 600
PIMKAKMVAL KGINKVMAQN NLGMPPMVMS RFPFMGQVVT GTQNSEGQNL GPQAIPQDGS 660
ITHQISRPNP PNFGPGFVND SQRKQYEEWL QETQQLLQMQ QKYLEEQIGA HRKSKKALSA 720
KQRTAKKAGR EFPEEDAEQL KHVTEQQSMV QKQLEQIRKQ QKEHAELIED YRIKQQQQCA 780
MAPPTMMPSV QPQPPLIPGA TPPTMSQPTF PMVPQQLQHQ QHTTVISGHT SPVRMPSLPG 840
WQPNSAPAHL PLNPPRIQPP IAQLPIKTCT PAPGTVSNAN PQSGPPPRVE FDDNNPFSES 900
FQERERKERL REQQERQRIQ LMQEVDRQRA LQQRMEMEQH GMVGSEISSS RTSVSQIPFY 960
SSDLPCDFMQ PLGPLQQSPQ HQQQMGQVLQ QQNIQQGSIN SPSTQTFMQT NERRQVGPPS 1020
FVPDSPSIPV GSPNFSSVKQ GHGNLSGTSF QQSPVRPSFT PALPAAPPVA NSSLPCGQDS 1080
TITHGHSYPG STQSLIQLYS DIIPEEKGKK KRTRKKKRDD DAESTKAPST PHSDITAPPT 1140
PGISETTSTP AVSTPSELPQ QADQESVEPV GPSTPNMAAG QLCTELENKL PNSDFSQATP 1200
NQQTYANSEV DKLSMETPAK TEEIKLEKAE TESCPGQEEP KLEEQNGSKV EGNAVACPVS 1260
SAQSPPHSAG APAAKGDSGN ELLKHLLKNK KSSSLLNQKP EGSICSEDDC TKDNKLVEKQ 1320
NPAEGLQTLG AQMQGGFGCG NQLPKTDGGS ETKKQRSKRT QRTGEKAAPR SKKRKKDEEE 1380
KQAMYSSTDT FTHLKQQLSL LPLMEPIIGV NFAHFLPYGS GQFNSGNRLL GTFGSATLEG 1440
VSDYYSQLIY KQNNLSNPPT PPASLPPTPP PMACQKMANG FATTEELAGK AGVLVSHEVT 1500
KTLGPKPFQL PFRPQDDLLA RALAQGPKTV DVPASLPTPP HNNQEELRIQ DHCGDRDTPD 1560
SFVPSSSPES VVGVEVSRYP DLSLVKEEPP EPVPSPIIPI LPSTAGKSSE SRRNDIKTEP 1620
GTLYFASPFG PSPNGPRSGL ISVAITLHPT AAENISSVVA AFSDLLHVRI PNSYEVSSAP 1680
DVPSMGLVSS HRINPGLEYR QHLLLRGPPP GSANPPRLVS SYRLKQPNVP FPPTSNGLSG 1740
YKDSSHGIAE SAALRPQWCC HCKVVILGSG VRKSFKDLTL LNKDSRESTK RVEKDIVFCS 1800
NNCFILYSST AQAKNSENKE SIPSLPQSPM RETPSKAFHQ YSNNISTLDV HCLPQLPEKA 1860
SPPASPPIAF PPAFEAAQVE AKPDELKVTV KLKPRLRAVH GGFEDCRPLN KKWRGMKWKK 1920
WSIHIVIPKG TFKPPCEDEI DEFLKKLGTS LKPDPVPKDY RKCCFCHEEG DGLTDGPARL 1980
LNLDLDLWVH LNCALWSTEV YETQAGALIN VELALRRGLQ MKCVFCHKTG ATSGCHRFRC 2040
TNIYHFTCAI KAQCMFFKDK TMLCPMHKPK GIHEQELSYF AVFRRVYVQR DEVRQIASIV 2100
QRGERDHTFR VGSLIFHTIG QLLPQQMQAF HSPKALFPVG YEASRLYWST RYANRRCRYL 2160
CSIEEKDGRP VFVIRIVEQG HEDLVLSDIS PKGVWDKILE PVACVRKKSE MLQLFPAYLK 2220
GEDLFGLTVS AVARIAESLP GVEACENYTF RYGRNPLMEL PLAVNPTGCA RSEPKMSAHV 2280
KRPHTLNSTS TSKSFQSTVT GELNAPYSKQ FVHSKSSQYR KMKTEWKSNV YLARSRIQGL 2340
GLYAARDIEK HTMVIEYIGT IIRNEVANRK EKLYESQNRG VYMFRMDNDH VIDATLTGGP 2400
ARYINHSCAP NCVAEVVTFE RGHKIIISSS RRIQKGEEVR V 2441 
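The sequence is printed in blocks of ten residues, with the running residue count at the end of each line. A minimal Python sketch of removing that formatting and checking a reported site against the sequence follows. It uses only the line covering residues 301 to 360, copied from this record. The helper names `parse_sequence_line` and `window` are hypothetical, and the 7-residue flank is an assumption based on the 15-residue peptides listed under Lysine Modification.

```python
def parse_sequence_line(line):
    """Return (start_position, residues) for one formatted sequence line."""
    *blocks, end = line.split()
    residues = "".join(blocks)
    # The trailing number is the 1-based position of the last residue.
    start = int(end) - len(residues) + 1
    return start, residues

def window(start, residues, position, flank=7):
    """Return the peptide spanning `flank` residues either side of `position`."""
    i = position - start
    return residues[max(0, i - flank): i + flank + 1]

LINE = "KKKEQENKTL VLSDKHSPQK KSTVTNEVKT EVLSPNSKVE SKCETEKNDE NKDNVDTPCS 360"
start, residues = parse_sequence_line(LINE)

# Every reported site on this line should be a lysine.
sites = [308, 315, 320, 321, 338]
all_lysine = all(residues[p - start] == "K" for p in sites)
```

With this line, `window(start, residues, 315)` reproduces the peptide listed for position 315, so the position and peptide columns can be cross-checked against the sequence.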
Gene Ontology
 GO:0005634; C:nucleus; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR003889; FYrich_C.
 IPR003888; FYrich_N.
 IPR001214; SET_dom.
 IPR001965; Znf_PHD. 
Pfam
 PF05965; FYRC
 PF05964; FYRN
 PF00856; SET 
SMART
 SM00542; FYRC
 SM00541; FYRN
 SM00249; PHD
 SM00317; SET 
PROSITE
 PS51543; FYRC
 PS51542; FYRN
 PS50280; SET 
PRINTS