CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-001709
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Histone-lysine N-methyltransferase NSD2 
Protein Synonyms/Alias
 Multiple myeloma SET domain-containing protein; MMSET; Nuclear SET domain-containing protein 2; NSD2; Protein trithorax-5; Wolf-Hirschhorn syndrome candidate 1 protein; WHSC1 
Gene Name
 WHSC1 
Gene Synonyms/Alias
 KIAA1090; MMSET; NSD2; TRX5 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type         References
 404       MKRRRRAKLCSSAET  acetylation  [1]
 779       PSNPRPSKGKMMRCV  acetylation  [2]
 781       NPRPSKGKMMRCVRC  acetylation  [2]
 1272      LSCLGLGKRPFGKWE  acetylation  [1]
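Each peptide in the lysine modification entries appears to be a 15-residue window: the modified lysine with seven residues on either side. A minimal sketch of that relationship, using only a short stretch of this entry's sequence (residues 391 to 420); the function name `site_window` and its parameters are hypothetical and not part of CPLM:

```python
def site_window(fragment, fragment_start, position, flank=7):
    """Return the residues surrounding a modified lysine.

    fragment       -- a stretch of the protein sequence
    fragment_start -- 1-based position of fragment[0] in the full protein
    position       -- 1-based position of the modified lysine
    flank          -- residues kept on each side (7 is inferred from the
                      15-residue peptides in this record, an assumption)
    """
    idx = position - fragment_start
    if not 0 <= idx < len(fragment):
        raise ValueError("position lies outside the supplied fragment")
    if fragment[idx] != "K":
        raise ValueError("residue at this position is not a lysine")
    # Clip at the fragment start so sites near a terminus still work.
    return fragment[max(0, idx - flank): idx + flank + 1]


# Residues 391-420 of this entry, copied from the Protein Sequence field.
fragment = "REECIPMKRRRRAKLCSSAETLESHPDIGK"
peptide = site_window(fragment, fragment_start=391, position=404)
print(peptide)
```

The result can be compared with the peptide listed for position 404.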
Reference
 [1] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [2] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
Functional Description
 Histone methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. Isoform RE-IIBP may act as a transcription regulator that binds DNA and suppresses IL5 transcription through HDAC recruitment. 
Sequence Annotation
 DOMAIN 222 286 PWWP 1.
 DOMAIN 880 942 PWWP 2.
 DOMAIN 1011 1061 AWS.
 DOMAIN 1063 1180 SET.
 DOMAIN 1187 1203 Post-SET.
 DNA_BIND 453 521 HMG box.
 ZN_FING 667 713 PHD-type 1.
 ZN_FING 714 770 PHD-type 2.
 ZN_FING 831 875 PHD-type 3.
 ZN_FING 1239 1286 PHD-type 4; atypical.
 MOD_RES 110 110 Phosphothreonine.
 MOD_RES 114 114 Phosphothreonine.
 MOD_RES 121 121 Phosphoserine.
 MOD_RES 376 376 Phosphoserine.
 MOD_RES 544 544 Phosphothreonine.  
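The annotated ranges above can be cross-checked against the acetylated lysine positions reported in this record (404, 779, 781, 1272). The sketch below is illustrative only: the `FEATURES` list is transcribed by hand from the DOMAIN, DNA_BIND and ZN_FING lines, and `features_at` is a hypothetical helper, not a CPLM or UniProt API.

```python
# (feature key, start, end, description), transcribed from this record.
FEATURES = [
    ("DOMAIN", 222, 286, "PWWP 1"),
    ("DOMAIN", 880, 942, "PWWP 2"),
    ("DOMAIN", 1011, 1061, "AWS"),
    ("DOMAIN", 1063, 1180, "SET"),
    ("DOMAIN", 1187, 1203, "Post-SET"),
    ("DNA_BIND", 453, 521, "HMG box"),
    ("ZN_FING", 667, 713, "PHD-type 1"),
    ("ZN_FING", 714, 770, "PHD-type 2"),
    ("ZN_FING", 831, 875, "PHD-type 3"),
    ("ZN_FING", 1239, 1286, "PHD-type 4; atypical"),
]

# Acetylated lysine positions listed in this record.
SITES = [404, 779, 781, 1272]


def features_at(position):
    """Return descriptions of all features whose range covers position."""
    return [desc for _key, start, end, desc in FEATURES
            if start <= position <= end]


for site in SITES:
    hits = features_at(site)
    print(site, hits if hits else "no annotated domain or zinc finger")
```

With the ranges as transcribed, only position 1272 falls inside an annotated feature (the atypical PHD-type 4 zinc finger); the other three sites lie between annotated regions.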
Keyword
 Alternative splicing; Chromatin regulator; Chromosomal rearrangement; Chromosome; Complete proteome; Cytoplasm; DNA-binding; Metal-binding; Methyltransferase; Nucleus; Phosphoprotein; Proto-oncogene; Reference proteome; Repeat; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1365 AA 
Protein Sequence
MEFSIKQSPL SVQSVVKCIK MKQAPEILGS ANGKTPSCEV NRECSVFLSK AQLSSSLQEG 60
VMQKFNGHDA LPFIPADKLK DLTSRVFNGE PGAHDAKLRF ESQEMKGIGT PPNTTPIKNG 120
SPEIKLKITK TYMNGKPLFE SSICGDSAAD VSQSEENGQK PENKARRNRK RSIKYDSLLE 180
QGLVEAALVS KISSPSDKKI PAKKESCPNT GRDKDHLLKY NVGDLVWSKV SGYPWWPCMV 240
SADPLLHSYT KLKGQKKSAR QYHVQFFGDA PERAWIFEKS LVAFEGEGQF EKLCQESAKQ 300
APTKAEKIKL LKPISGKLRA QWEMGIVQAE EAASMSVEER KAKFTFLYVG DQLHLNPQVA 360
KEAGIAAESL GEMAESSGVS EEAAENPKSV REECIPMKRR RRAKLCSSAE TLESHPDIGK 420
STPQKTAEAD PRRGVGSPPG RKKTTVSMPR SRKGDAASQF LVFCQKHRDE VVAEHPDASG 480
EEIEELLRSQ WSLLSEKQRA RYNTKFALVA PVQAEEDSGN VNGKKRNHTK RIQDPTEDAE 540
AEDTPRKRLR TDKHSLRKRD TITDKTARTS SYKAMEAASS LKSQAATKNL SDACKPLKKR 600
NRASTAASSA LGFSKSSSPS ASLTENEVSD SPGDEPSESP YESADETQTE VSVSSKKSER 660
GVTAKKEYVC QLCEKPGSLL LCEGPCCGAF HLACLGLSRR PEGRFTCSEC ASGIHSCFVC 720
KESKTDVKRC VVTQCGKFYH EACVKKYPLT VFESRGFRCP LHSCVSCHAS NPSNPRPSKG 780
KMMRCVRCPV AYHSGDACLA AGCSVIASNS IICTAHFTAR KGKRHHAHVN VSWCFVCSKG 840
GSLLCCESCP AAFHPDCLNI EMPDGSWFCN DCRAGKKLHF QDIIWVKLGN YRWWPAEVCH 900
PKNVPPNIQK MKHEIGEFPV FFFGSKDYYW THQARVFPYM EGDRGSRYQG VRGIGRVFKN 960
ALQEAEARFR EIKLQREARE TQESERKPPP YKHIKVNKPY GKVQIYTADI SEIPKCNCKP 1020
TDENPCGFDS ECLNRMLMFE CHPQVCPAGE FCQNQCFTKR QYPETKIIKT DGKGWGLVAK 1080
RDIRKGEFVN EYVGELIDEE ECMARIKHAH ENDITHFYML TIDKDRIIDA GPKGNYSRFM 1140
NHSCQPNCET LKWTVNGDTR VGLFAVCDIP AGTELTFNYN LDCLGNEKTV CRCGASNCSG 1200
FLGDRPKTST TLSSEEKGKK TKKKTRRRRA KGEGKRQSED ECFRCGDGGQ LVLCDRKFCT 1260
KAYHLSCLGL GKRPFGKWEC PWHHCDVCGK PSTSFCHLCP NSFCKEHQDG TAFSCTPDGR 1320
SYCCEHDLGA ASVRSTKTEK PPPEPGKPKG KRRRRRGWRR VTEGK 1365 
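The sequence is laid out in blocks of ten residues, with the running residue count at the end of each line. A minimal sketch of turning that layout back into a plain string, shown on the last two lines of the sequence only; `join_sequence_lines` is a hypothetical helper name:

```python
def join_sequence_lines(lines):
    """Concatenate residue blocks, dropping the trailing position counters."""
    blocks = []
    for line in lines:
        for token in line.split():
            if token.isdigit():
                continue  # running residue count, not sequence
            blocks.append(token)
    return "".join(blocks)


# The final two lines of the Protein Sequence field (residues 1261-1365).
tail_lines = [
    "KAYHLSCLGL GKRPFGKWEC PWHHCDVCGK PSTSFCHLCP NSFCKEHQDG TAFSCTPDGR 1320",
    "SYCCEHDLGA ASVRSTKTEK PPPEPGKPKG KRRRRRGWRR VTEGK 1365",
]
tail = join_sequence_lines(tail_lines)

# The preceding line ends at residue 1260, so the tail should bring the
# total to the stated protein length of 1365 AA.
print(1260 + len(tail))

# Residue 1272 (1-based) sits at offset 1272 - 1261 within this tail.
print(tail[1272 - 1261])
```

The same function applied to all 23 lines would yield the full 1365-residue sequence, which allows the stated protein length and the listed modification positions to be checked.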
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
 GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
 GO:0031965; C:nuclear membrane; IDA:HPA.
 GO:0005730; C:nucleolus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:Compara.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0009653; P:anatomical structure morphogenesis; TAS:ProtInc.
 GO:0003289; P:atrial septum primum morphogenesis; IEA:Compara.
 GO:0003290; P:atrial septum secundum morphogenesis; IEA:Compara.
 GO:0060348; P:bone development; IEA:Compara.
 GO:0003149; P:membranous septum morphogenesis; IEA:Compara.
 GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; IEA:Compara.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR006560; AWS.
 IPR009071; HMG_box_dom.
 IPR003616; Post-SET_dom.
 IPR000313; PWWP.
 IPR001214; SET_dom.
 IPR019786; Zinc_finger_PHD-type_CS.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR001841; Znf_RING.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF00505; HMG_box
 PF00628; PHD
 PF00855; PWWP
 PF00856; SET 
SMART
 SM00570; AWS
 SM00398; HMG
 SM00249; PHD
 SM00508; PostSET
 SM00293; PWWP
 SM00184; RING
 SM00317; SET 
PROSITE
 PS51215; AWS
 PS50118; HMG_BOX_2
 PS50868; POST_SET
 PS50812; PWWP
 PS50280; SET
 PS01359; ZF_PHD_1
 PS50016; ZF_PHD_2 
PRINTS