CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016828
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Histone-lysine N-methyltransferase SETD1B 
Protein Synonyms/Alias
 SET domain-containing protein 1B 
Gene Name
 Setd1b 
Gene Synonyms/Alias
 Kiaa1076; Set1b 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
88ELELSVPKFKIDEFYubiquitination[1]
1656EEYVDLAKVRGPWRRubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
 Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys- 9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1B suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression (By similarity). 
Sequence Annotation
 DOMAIN 92 180 RRM.
 DOMAIN 1846 1963 SET.
 DOMAIN 1969 1985 Post-SET.
 MOD_RES 1678 1678 Phosphoserine (By similarity).
 MOD_RES 1682 1682 Phosphoserine (By similarity).  
Keyword
 Activator; Alternative splicing; Chromatin regulator; Chromosome; Complete proteome; Methyltransferase; Nucleus; Phosphoprotein; Reference proteome; RNA-binding; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1985 AA 
Protein Sequence
MENSHPHHHH QQPPPQPGPS GERRNHHWRS YKLMIDPALK KGHHKLYRYD GQHFSLAMSS 60
NRPVEIVEDP RVVGIWTKNK ELELSVPKFK IDEFYVGPVP PKQVTFAKLN DNVRENFLRD 120
MCKKYGEVEE VEILYNPKTK KHLGIAKVVF ATVRGAKEAV QHLHSTSVMG NIIHVELDTK 180
GETRMRFYEL LVTGRYTPQT LPVGELDAIS PIVSETLQLS DALKRLKDGS LSAGCGSGSS 240
SVTPNSGGTP FSQDTAYSSC RLDTPNSYGQ GTPITPRLGT PFSQDSSYSS RQPTPSYLFS 300
QDPTATFKAR RHESKFTDAY NRRHEHHYVH NSAVAGATAP FRGSSDLSFG TVGSSGTPFK 360
AQSQDATTFA HTPPPAQTAT ASGFKSAFSP YQTPAPPFPP PPEEPTATAA FGSRDSGEFR 420
RAPAPPPLPP AEPPAKEKPG TPPGPPPPDS NSMELGGRPT FGWSPEPCDS PGTPTLESSP 480
AGPEKPHDSL DSRIEMLLKE QRTKLPFLRE QDSDTEIQME GSPISSSSSQ LSPLSHFGTN 540
SQPGFRGPSP PSSRPSSTGL EDISPTPLPD SDEDEDLGLG LGPRPPPEPG PPDPMGLLGQ 600
TAEVDLDLAG DRTPTSERMD EGQQSSGEDM EISDDEMPSA PITSADCPKP MVVTPGAGAV 660
AAPNVLAPNL PLPPPPGFPP LPPPPPPPPP QPGFPMPPPL PPPPPPPPPA HPAVTVPPPP 720
LPAPPGVPPP PILPPLPPFP PGLFPVMQVD MSHVLGGQWG GMPMSFQMQT QMLSRLMTGQ 780
GACPYPPFMA AAAAAASAGL QFVNLPPYRS PFSLSNSGPG RGQHWPPLPK FDPSVPPPGY 840
IPRQEDPHKA TVDGVLLVVL KELKAIMKRD LNRKMVEVVA FRAFDEWWDK KERMAKASLT 900
PVKSGEHKDE DRPKPKDRIA SCLLESWGKG EGLGYEGLGL GIGLRGAIRL PSFKVKRKEP 960
PDTASSGDQK RLRPSTSVDE EDEESERERD RDIADAPCEL TKRDPKSVGV RRRPGRPLEL 1020
DSGGEEDEKE SLSASSSSSA SSSSGSSTTS PSSSASDKEE EDRESTEEEE EEEEEEAEEE 1080
EEEGPRSRIS SPSSSSSSDK DDEDDNEADS DGQIDSDIDD QGAPLSEASE KDNGDSEEEE 1140
TESITTSKAP AESSSSSSES SGSSEFESSS ESESSSSSSE DEEEMTVPGV EEEEEEEEEE 1200
EKETAMAAAT VVAMAEESMP PAGGQDFEQD RAEVPLGPRG PMRESLGTEE EVDIEAEDEV 1260
PEMQAPELEE PPLPMGARKL EGSPEPPEEP GPNTQGDMLL SPELPARETE EAQLPSPPEH 1320
GPESDLDMEP EPPPMLSLPL QPPLPPPRLL RPPSPPPEPE TPEPPKPPVP LEPPPEDHPP 1380
RTPGLCGSLA KSQSTETVPA TPGGEPPLSG SSSGLSLSSP QVPGSPFSYP SPSPGLSSGG 1440
LPRTPGRDFS FTPTFPEPSG PLLLPVCPLP TGRRDERTGP LASPVLLETG LPLPLPLPLP 1500
LPLALPVPVL RAQPRPPPQL PPLLPATLAP CPTPIKRKPG RPRRSPPSML SLDGPLVRPP 1560
PGPALGRDLL LLPGQPPAPI FPSAHDPRAV TLDFRNTGIP APPPPLPPQP PPPPPPPPVE 1620
STKLPFKELD NQWPSEAIPP GPRRDEVTEE YVDLAKVRGP WRRPPKKRHE DLVAPSASPE 1680
PSPPQPLFRP RSEFEEMTIL YDIWNGGIDE EDIRFLCVTY ERLLQQDNGM DWLNDTLWVY 1740
HPSTSLSSAK KKKREDGIRE HVTGCARSEG FYTIDKKDKL RYLNSSRAST DEPPMDTQGM 1800
SIPAQPHAST RAGSERRSEQ RRLLSSFTGS CDSDLLKFNQ LKFRKKKLKF CKSHIHDWGL 1860
FAMEPIAADE MVIEYVGQNI RQVIADMREK RYEDEGIGSS YMFRVDHDTI IDATKCGNFA 1920
RFINHSCNPN CYAKVITVES QKKIVIYSKQ HINVNEEITY DYKFPIEDVK IPCLCGSENC 1980
RGTLN 1985 
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
 GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
 GO:0048188; C:Set1C/COMPASS complex; ISS:UniProtKB.
 GO:0042800; F:histone methyltransferase activity (H3-K4 specific); IEA:Compara.
 GO:0000166; F:nucleotide binding; IEA:InterPro.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR024657; COMPASS_Set1_N-SET.
 IPR015722; Histone-lysine_MeTfrase.
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR003616; Post-SET_dom.
 IPR000504; RRM_dom.
 IPR001214; SET_dom. 
Pfam
 PF11764; N-SET
 PF00076; RRM_1
 PF00856; SET 
SMART
 SM00508; PostSET
 SM00360; RRM
 SM00317; SET 
PROSITE
 PS50868; POST_SET
 PS50102; RRM
 PS50280; SET 
PRINTS