CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-020391
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Histone-lysine N-methyltransferase NSD3 
Protein Synonyms/Alias
 Nuclear SET domain-containing protein 3; Protein whistle; WHSC1-like 1 isoform 9 with methyltransferase activity to lysine; Wolf-Hirschhorn syndrome candidate 1-like protein 1; WHSC1-like protein 1 
Gene Name
 WHSC1L1 
Gene Synonyms/Alias
 NSD3; DC28 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 554       TPNQRNEKPTQSVSS  ubiquitination  [1]
 790       PTAIFESKGFRCPQH  acetylation     [2]
 1215      RIIDAGPKGNYSRFM  ubiquitination  [1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
Functional Description
 Histone methyltransferase. Preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation, while 'Lys-27' is a mark for transcriptional repression. 
Sequence Annotation
 DOMAIN 270 333 PWWP 1.
 DOMAIN 960 1022 PWWP 2.
 DOMAIN 1093 1143 AWS.
 DOMAIN 1145 1262 SET.
 DOMAIN 1269 1285 Post-SET.
 ZN_FING 701 748 PHD-type 1.
 ZN_FING 749 805 PHD-type 2.
 ZN_FING 862 955 PHD-type 3.
 ZN_FING 1321 1368 PHD-type 4; atypical.
 MOD_RES 150 150 Phosphoserine.
 MOD_RES 790 790 N6-acetyllysine.  
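 The DOMAIN and ZN_FING ranges above can be used to place each modified lysine listed under Lysine Modification within the protein's domain layout. The following is a minimal sketch, not part of the CPLM record: the names FEATURES, MODIFIED_LYSINES and features_at are hypothetical, and the coordinates are simply transcribed from this entry.

```python
# DOMAIN and ZN_FING rows transcribed from the Sequence Annotation of this entry.
# Each tuple is (feature key, start, end, description); positions are 1-based, inclusive.
FEATURES = [
    ("DOMAIN", 270, 333, "PWWP 1"),
    ("DOMAIN", 960, 1022, "PWWP 2"),
    ("DOMAIN", 1093, 1143, "AWS"),
    ("DOMAIN", 1145, 1262, "SET"),
    ("DOMAIN", 1269, 1285, "Post-SET"),
    ("ZN_FING", 701, 748, "PHD-type 1"),
    ("ZN_FING", 749, 805, "PHD-type 2"),
    ("ZN_FING", 862, 955, "PHD-type 3"),
    ("ZN_FING", 1321, 1368, "PHD-type 4; atypical"),
]

# Modified lysines from the Lysine Modification rows of this entry.
MODIFIED_LYSINES = {554: "ubiquitination", 790: "acetylation", 1215: "ubiquitination"}


def features_at(position, features=FEATURES):
    """Return the descriptions of all features whose range covers `position`."""
    return [desc for _key, start, end, desc in features if start <= position <= end]


for pos, kind in sorted(MODIFIED_LYSINES.items()):
    hits = features_at(pos) or ["no annotated domain or zinc finger"]
    print(f"K{pos} ({kind}): {', '.join(hits)}")
```

 With the ranges as listed, K790 falls inside the PHD-type 2 zinc finger, K1215 inside the SET domain, and K554 outside every annotated range.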
Keyword
 3D-structure; Acetylation; Alternative splicing; Chromatin regulator; Chromosomal rearrangement; Chromosome; Coiled coil; Complete proteome; Metal-binding; Methyltransferase; Nucleus; Phosphoprotein; Polymorphism; Proto-oncogene; Reference proteome; Repeat; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1437 AA 
Protein Sequence
MDFSFSFMQG IMGNTIQQPP QLIDSANIRQ EDAFDNNSDI AEDGGQTPYE ATLQQGFQYP 60
ATTEDLPPLT NGYPSSISVY ETQTKYQSYN QYPNGSANGF GAVRNFSPTD YYHSEIPNTR 120
PHEILEKPSP PQPPPPPSVP QTVIPKKTGS PEIKLKITKT IQNGRELFES SLCGDLLNEV 180
QASEHTKSKH ESRKEKRKKS NKHDSSRSEE RKSHKIPKLE PEEQNRPNER VDTVSEKPRE 240
EPVLKEEAPV QPILSSVPTT EVSTGVKFQV GDLVWSKVGT YPWWPCMVSS DPQLEVHTKI 300
NTRGAREYHV QFFSNQPERA WVHEKRVREY KGHKQYEELL AEATKQASNH SEKQKIRKPR 360
PQRERAQWDI GIAHAEKALK MTREERIEQY TFIYIDKQPE EALSQAKKSV ASKTEVKKTR 420
RPRSVLNTQP EQTNAGEVAS SLSSTEIRRH SQRRHTSAEE EEPPPVKIAW KTAAARKSLP 480
ASITMHKGSL DLQKCNMSPV VKIEQVFALQ NATGDGKFID QFVYSTKGIG NKTEISVRGQ 540
DRLIISTPNQ RNEKPTQSVS SPEATSGSTG SVEKKQQRRS IRTRSESEKS TEVVPKKKIK 600
KEQVETVPQA TVKTGLQKGA SEISDSCKPL KKRSRASTDV EMTSSAYRDT SDSDSRGLSD 660
LQVGFGKQVD SPSATADADV SDVQSMDSSL SRRGTGMSKK DTVCQICESS GDSLIPCEGE 720
CCKHFHLECL GLASLPDSKF ICMECKTGQH PCFSCKVSGK DVKRCSVGAC GKFYHEACVR 780
KFPTAIFESK GFRCPQHCCS ACSMEKDIHK ASKGRMMRCL RCPVAYHSGD ACIAAGSMLV 840
SSYILICSNH SKRSSNSSAV NVGFCFVCAR GLIVQDHSDP MFSSYAYKSH YLLNESNRAE 900
LMKLPMIPSS SASKKKCEKG GRLLCCESCP ASFHPECLSI EMPEGCWNCN DCKAGKKLHY 960
KQIVWVKLGN YRWWPAEICN PRSVPLNIQG LKHDLGDFPV FFFGSHDYYW VHQGRVFPYV 1020
EGDKSFAEGQ TSINKTFKKA LEEAAKRFQE LKAQRESKEA LEIEKNSRKP PPYKHIKANK 1080
VIGKVQIQVA DLSEIPRCNC KPADENPCGL ESECLNRMLQ YECHPQVCPA GDRCQNQCFT 1140
KRLYPDAEII KTERRGWGLR TKRSIKKGEF VNEYVGELID EEECRLRIKR AHENSVTNFY 1200
MLTVTKDRII DAGPKGNYSR FMNHSCNPNC ETQKWTVNGD VRVGLFALCD IPAGMELTFN 1260
YNLDCLGNGR TECHCGADNC SGFLGVRPKS ACASTNEEKA KNAKLKQKRR KIKTEPKQMH 1320
EDYCFQCGDG GELVMCDKKD CPKAYHLLCL NLTQPPYGKW ECPWHQCDEC SSAAVSFCEF 1380
CPHSFCKDHE KGALVPSALE GRLCCSEHDP MAPVSPEYWS KIKCKWESQD HGEEVKE 1437 
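 The 15-residue peptides in the Lysine Modification rows are consistent with a window of seven residues on either side of the modified lysine. The sketch below shows how such a window could be cut from the numbered sequence block. It is an illustration under that assumption, not CPLM's own procedure; the helper names parse_numbered_block and lysine_window are hypothetical, and only the sequence line covering residues 541-600 is used to keep the example short.

```python
def parse_numbered_block(lines):
    """Join sequence lines, keeping letters only (drops spaces and trailing counters)."""
    return "".join(ch for line in lines for ch in line if ch.isalpha())


def lysine_window(sequence, position, flank=7, first_residue=1):
    """Return the residues within `flank` of the lysine at 1-based `position`.

    `first_residue` is the protein coordinate of sequence[0], so a fragment
    of the full sequence can be used directly.
    """
    index = position - first_residue
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} is outside the supplied sequence")
    if sequence[index] != "K":
        raise ValueError(f"residue {position} is {sequence[index]}, not lysine")
    return sequence[max(0, index - flank): index + flank + 1]


# Sequence line covering residues 541-600, copied from the Protein Sequence block.
fragment = parse_numbered_block(
    ["DRLIISTPNQ RNEKPTQSVS SPEATSGSTG SVEKKQQRRS IRTRSESEKS TEVVPKKKIK 600"]
)

print(lysine_window(fragment, 554, first_residue=541))  # TPNQRNEKPTQSVSS
```

 The printed window matches the peptide recorded for position 554. Passing the full 1437-residue sequence with the default first_residue=1 would allow the same check for positions 790 and 1215.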
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
 GO:0005634; C:nucleus; IC:UniProtKB.
 GO:0018024; F:histone-lysine N-methyltransferase activity; IDA:UniProtKB.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0030154; P:cell differentiation; NAS:UniProtKB.
 GO:0016049; P:cell growth; NAS:UniProtKB.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR006560; AWS.
 IPR003616; Post-SET_dom.
 IPR000313; PWWP.
 IPR001214; SET_dom.
 IPR019786; Zinc_finger_PHD-type_CS.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR001841; Znf_RING.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF00855; PWWP
 PF00856; SET 
SMART
 SM00570; AWS
 SM00249; PHD
 SM00508; PostSET
 SM00293; PWWP
 SM00184; RING
 SM00317; SET 
PROSITE
 PS51215; AWS
 PS50868; POST_SET
 PS50812; PWWP
 PS50280; SET
 PS01359; ZF_PHD_1
 PS50016; ZF_PHD_2 
PRINTS