CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-023286
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Histone-lysine N-methyltransferase SETD1B 
Protein Synonyms/Alias
 Lysine N-methyltransferase 2G; SET domain-containing protein 1B; hSET1B 
Gene Name
 SETD1B 
Gene Synonyms/Alias
 KIAA1076; KMT2G; SET1B 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type         References
1790      KKKLKFCKSHIHDWG  acetylation  [1]
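The modification row can be cross-checked against the Protein Sequence field. The sketch below is a minimal illustration, assuming that each display line ends with the 1-based index of its last residue and that the peptide is a window of 7 residues on either side of the modified lysine (an assumption inferred from the 15-residue peptide, not stated in the record). The helper names `parse_numbered_line` and `window` are hypothetical.

```python
def parse_numbered_line(line):
    """Split one display line into (residues, 1-based start position).

    Assumes the last whitespace-separated token is the index of the
    final residue on the line, as in the Protein Sequence field.
    """
    *blocks, end = line.split()
    residues = "".join(blocks)
    start = int(end) - len(residues) + 1
    return residues, start


def window(residues, start, position, flank=7):
    """Return the residues within `flank` of a 1-based `position`.

    flank=7 is an assumption chosen to reproduce a 15-residue peptide.
    """
    i = position - start
    if not 0 <= i < len(residues):
        raise ValueError("position is outside this line")
    lo = max(0, i - flank)
    return residues[lo:i + flank + 1]


# Sequence line copied from the record (residues 1741-1800).
LINE_1800 = ("PAQPHASTRA GSERRSEQRR LLSSFTGSCD SDLLKFNQLK "
             "FRKKKLKFCK SHIHDWGLFA 1800")

residues, start = parse_numbered_line(LINE_1800)
peptide = window(residues, start, 1790)
print(start, residues[1790 - start], peptide)
```

Under these assumptions the window centred on residue 1790 reproduces the listed peptide, and the central residue is a lysine.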
Reference
 [1] Regulation of cellular metabolism by protein lysine acetylation.
 Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, Guan KL.
 Science. 2010 Feb 19;327(5968):1000-4. [PMID: 20167786]
Functional Description
 Histone methyltransferase that specifically methylates 'Lys-4' of histone H3 when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overlapping localization with SETD1A suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. Specifically tri-methylates 'Lys-4' of histone H3 in vitro. 
Sequence Annotation
 DOMAIN 93 181 RRM.
 DOMAIN 1784 1901 SET.
 DOMAIN 1907 1923 Post-SET.
 MOD_RES 1616 1616 Phosphoserine.
 MOD_RES 1620 1620 Phosphoserine.  
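The annotation lines above follow a simple "KEY start end description." layout, so they can be used to ask which annotated features cover a given residue, for example the acetylated lysine at position 1790 listed under Lysine Modification. The following is a rough sketch, assuming that layout holds for every line; `Feature`, `parse_features` and `features_at` are hypothetical names introduced here for illustration.

```python
import re
from typing import NamedTuple


class Feature(NamedTuple):
    kind: str   # e.g. DOMAIN, MOD_RES
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    note: str   # description without the trailing period


# Assumed layout: KEY, start, end, free-text description, optional period.
FEATURE_RE = re.compile(r"^\s*(\w+)\s+(\d+)\s+(\d+)\s+(.*?)\.?\s*$")


def parse_features(text):
    """Parse annotation lines into Feature tuples, skipping blank lines."""
    features = []
    for line in text.splitlines():
        m = FEATURE_RE.match(line)
        if m:
            kind, start, end, note = m.groups()
            features.append(Feature(kind, int(start), int(end), note))
    return features


def features_at(features, position):
    """Return the features whose range contains `position`."""
    return [f for f in features if f.start <= position <= f.end]


# Lines copied from the Sequence Annotation field of this record.
ANNOTATION = """\
 DOMAIN 93 181 RRM.
 DOMAIN 1784 1901 SET.
 DOMAIN 1907 1923 Post-SET.
 MOD_RES 1616 1616 Phosphoserine.
 MOD_RES 1620 1620 Phosphoserine.
"""

features = parse_features(ANNOTATION)
hits = features_at(features, 1790)
print([f.note for f in hits])  # ['SET']
```

With the ranges given in this record, residue 1790 lies inside the SET domain (1784 to 1901) and outside every other annotated feature.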
Keyword
 3D-structure; Activator; Chromatin regulator; Chromosome; Complete proteome; Methyltransferase; Nucleus; Phosphoprotein; Reference proteome; RNA-binding; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1923 AA 
Protein Sequence
MENSHPPHHH HQQPPPQPGP SGERRNHHWR SYKLMIDPAL KKGHHKLYRY DGQHFSLAMS 60
SNRPVEIVED PRVVGIWTKN KELELSVPKF KIDEFYVGPV PPKQVTFAKL NDNIRENFLR 120
DMCKKYGEVE EVEILYNPKT KKHLGIAKVV FATVRGAKDA VQHLHSTSVM GNIIHVELDT 180
KGETRMRFYE LLVTGRYTPQ TLPVGELDAV SPIVNETLQL SDALKRLKDG GLSAGCGSGS 240
SSVTPNSGGT PFSQDTAYSS CRLDTPNSYG QGTPLTPRLG TPFSQDSSYS SRQPTPSYLF 300
SQDPAVTFKA RRHESKFTDA YNRRHEHHYV HNSPAVTAVA GATAAFRGSS DLPFGAVGGT 360
GGSSGPPFKA QPQDSATFAH TPPPAQATPA PGFKSAFSPY QTPVAHFPPP PEEPTATAAF 420
GARDSGEFRR APAPPPLPPA EPLAKEKPGT PPGPPPPDTN SMELGGRPTF GWSPEPCDSP 480
GTPTLESSPA GPEKPHDSLD SRIEMLLKEQ RTKLLFLREP DSDTELQMEG SPISSSSSQL 540
SPLAPFGTNS QPGFRGPTPP SSRPSSTGLE DISPTPLPDS DEDEELDLGL GPRPPPEPGP 600
PDPAGLLSQT AEVALDLVGD RTPTSEKMDE GQQSSGEDME ISDDEMPSAP ITSADCPKPM 660
VVTPGAAAVA APSVLAPTLP LPPPPGFPPL PPPPPPPPPQ PGFPMPPPLP PPPPPPPPAH 720
PAVTVPPPPL PAPPGVPPPP ILPPLPPFPP GLFPVMQVDM SHVLGGQWGG MPMSFQMQTQ 780
VLSRLMTGQG ACPYPPFMAA AAAAASAGLQ FVNLPPYRGP FSLSNSGPGR GQHWPPLPKF 840
DPSVPPPGYM PRQEDPHKAT VDGVLLVVLK ELKAIMKRDL NRKMVEVVAF RAFDEWWDKK 900
ERMAKASLTP VKSGEHKDED RPKPKDRIAS CLLESWGKGE GLGYEGLGLG IGLRGAIRLP 960
SFKVKRKEPP DTTSSGDQKR LRPSTSVDEE DEESERERDR DMADTPCELA KRDPKGVGVR 1020
RRPARPLELD SGGEEDEKES LSEEQESTEE EEEAEEEEEE EDDDDDDSDD RDESENDDED 1080
TALSEASEKD EGDSDEEETV SIVTSKAEAT SSSESSESSE FESSSESSPS SSEDEEEVVA 1140
REEEEEEEEE EMVAEESMAS AGPEDFEQDG EEAALAPGAP AVDSLGMEEE VDIETEAVAP 1200
EERPSMLDEP PLPVGVEEPA DSREPPEEPG LSQEGAMLLS PEPPAKEVEA RPPLSPERAP 1260
EHDLEVEPEP PMMLPLPLQP PLPPPRPPRP PSPPPEPETT DASHPSVPPE PLAEDHPPHT 1320
PGLCGSLAKS QSTETVPATP GGEPPLSGGS SGLSLSSPQV PGSPFSYPAP SPSLSSGGLP 1380
RTPGRDFSFT PTFSEPSGPL LLPVCPLPTG RRDERSGPLA SPVLLETGLP LPLPLPLPLP 1440
LALPAVLRAQ ARAPTPLPPL LPAPLASCPP PMKRKPGRPR RSPPSMLSLD GPLVRPPAGA 1500
ALGRELLLLP GQPQTPVFPS THDPRTVTLD FRNAGIPAPP PPLPPQPPPP PPPPPVEPTK 1560
LPFKELDNQW PSEAIPPGPR GRDEVTEEYM ELAKSRGPWR RPPKKRHEDL VPPAGSPELS 1620
PPQPLFRPRS EFEEMTILYD IWNGGIDEED IRFLCVTYER LLQQDNGMDW LNDTLWVYHP 1680
STSLSSAKKK KRDDGIREHV TGCARSEGFY TIDKKDKLRY LNSSRASTDE PPADTQGMSI 1740
PAQPHASTRA GSERRSEQRR LLSSFTGSCD SDLLKFNQLK FRKKKLKFCK SHIHDWGLFA 1800
MEPIAADEMV IEYVGQNIRQ VIADMREKRY EDEGIGSSYM FRVDHDTIID ATKCGNFARF 1860
INHSCNPNCY AKVITVESQK KIVIYSKQHI NVNEEITYDY KFPIEDVKIP CLCGSENCRG 1920
TLN 1923 
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
 GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
 GO:0048188; C:Set1C/COMPASS complex; IDA:UniProtKB.
 GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC.
 GO:0000166; F:nucleotide binding; IEA:InterPro.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
 GO:0051568; P:histone H3-K4 methylation; IDA:UniProtKB.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
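Each Gene Ontology line above has three semicolon-separated parts: the GO identifier, an aspect letter with the term name (C for cellular component, F for molecular function, P for biological process), and an evidence code with its source. A minimal parsing sketch follows, assuming term names contain no semicolons; `parse_go_line` is a hypothetical helper, and only three of the record's lines are reproduced to keep the example short.

```python
from collections import defaultdict

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Parse 'GO:id; A:term name; CODE:source.' into a dict.

    Assumes exactly three ';'-separated fields per line.
    """
    go_id, term, evidence = (
        part.strip() for part in line.strip().rstrip(".").split(";")
    )
    aspect, name = term.split(":", 1)
    code, source = evidence.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect],
        "term": name,
        "evidence": code,
        "source": source,
    }


# Three lines copied from the Gene Ontology field of this record.
GO_LINES = [
    " GO:0048188; C:Set1C/COMPASS complex; IDA:UniProtKB.",
    " GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC.",
    " GO:0051568; P:histone H3-K4 methylation; IDA:UniProtKB.",
]

records = [parse_go_line(line) for line in GO_LINES]

# Group term names by evidence code.
by_evidence = defaultdict(list)
for rec in records:
    by_evidence[rec["evidence"]].append(rec["term"])

print(dict(by_evidence))
```

Grouping by evidence code separates the experimentally supported terms (IDA, inferred from direct assay) from the electronically inferred ones (IEA).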
Interpro
 IPR024657; COMPASS_Set1_N-SET.
 IPR015722; Histone-lysine_MeTfrase.
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR003616; Post-SET_dom.
 IPR000504; RRM_dom.
 IPR001214; SET_dom. 
Pfam
 PF11764; N-SET
 PF00076; RRM_1
 PF00856; SET 
SMART
 SM00508; PostSET
 SM00360; RRM
 SM00317; SET 
PROSITE
 PS50868; POST_SET
 PS50102; RRM
 PS50280; SET 
PRINTS