CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-000356
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Histone-lysine N-methyltransferase SETD1A 
Protein Synonyms/Alias
 Lysine N-methyltransferase 2F; SET domain-containing protein 1A; hSET1A; Set1/Ash2 histone methyltransferase complex subunit SET1 
Gene Name
 SETD1A 
Gene Synonyms/Alias
 KIAA0339; KMT2F; SET1; SET1A 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
297STTSTSFKPRRSENSacetylation[1]
Reference
 [1] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377
Functional Description
 Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys- 9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1B suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression. 
Sequence Annotation
 DOMAIN 84 172 RRM.
 DOMAIN 1568 1685 SET.
 DOMAIN 1691 1707 Post-SET.
 REGION 1415 1450 Interaction with CFP1.
 REGION 1450 1537 Interaction with ASH2L, RBBP5 and WDR5.
 MOTIF 1299 1303 HCFC1-binding motif (HBM).
 MOD_RES 508 508 Phosphoserine.  
Keyword
 3D-structure; Activator; Chromatin regulator; Chromosome; Complete proteome; Methyltransferase; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; RNA-binding; S-adenosyl-L-methionine; Transcription; Transcription regulation; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1707 AA 
Protein Sequence
MDQEGGGDGQ KAPSFQWRNY KLIVDPALDP ALRRPSQKVY RYDGVHFSVN DSKYIPVEDL 60
QDPRCHVRSK NRDFSLPVPK FKLDEFYIGQ IPLKEVTFAR LNDNVRETFL KDMCRKYGEV 120
EEVEILLHPR TRKHLGLARV LFTSTRGAKE TVKNLHLTSV MGNIIHAQLD IKGQQRMKYY 180
ELIVNGSYTP QTVPTGGKAL SEKFQGSGAA TETAESRRRS SSDTAAYPAG TTAVGTPGNG 240
TPCSQDTSFS SSRQDTPSSF GQFTPQSSQG TPYTSRGSTP YSQDSAYSSS TTSTSFKPRR 300
SENSYQDAFS RRHFSASSAS TTASTAIAAT TAATASSSAS SSSLSSSSSS SSSSSSSQFR 360
SSDANYPAYY ESWNRYQRHT SYPPRRATRE EPPGAPFAEN TAERFPPSYT SYLPPEPSRP 420
TDQDYRPPAS EAPPPEPPEP GGGGGGGGPS PEREEVRTSP RPASPARSGS PAPETTNESV 480
PFAQHSSLDS RIEMLLKEQR SKFSFLASDT EEEEENSSMV LGARDTGSEV PSGSGHGPCT 540
PPPAPANFED VAPTGSGEPG ATRESPKANG QNQASPCSSG DDMEISDDDR GGSPPPAPTP 600
PQQPPPPPPP PPPPPPYLAS LPLGYPPHQP AYLLPPRPDG PPPPEYPPPP PPPPHIYDFV 660
NSLELMDRLG AQWGGMPMSF QMQTQMLTRL HQLRQGKGLI AASAGPPGGA FGEAFLPFPP 720
PQEAAYGLPY ALYAQGQEGR GAYSREAYHL PMPMAAEPLP SSSVSGEEAR LPPREEAELA 780
EGKTLPTAGT VGRVLAMLVQ EMKSIMQRDL NRKMVENVAF GAFDQWWESK EEKAKPFQNA 840
AKQQAKEEDK EKTKLKEPGL LSLVDWAKSG GTTGIEAFAF GSGLRGALRL PSFKVKRKEP 900
SEISEASEEK RPRPSTPAEE DEDDPEQEKE AGEPGRPGTK PPKRDEERGK TQGKHRKSFA 960
LDSEGEEASQ ESSSEKDEED DEEDEEDEDR EEAVDTTKKE TEVSDGEDEE SDSSSKCSLY 1020
ADSDGENDST SDSESSSSSS SSSSSSSSSS SSSSSSSSES SSEDEEEEER PAALPSASPP 1080
PREVPVPTPA PVEVPVPERV AGSPVTPLPE QEASPARPAG PTEESPPSAP LRPPEPPAGP 1140
PAPAPRPDER PSSPIPLLPP PKKRRKTVSF SAIEVVPAPE PPPATPPQAK FPGPASRKAP 1200
RGVERTIRNL PLDHASLVKS WPEEVSRGGR SRAGGRGRLT EEEEAEPGTE VDLAVLADLA 1260
LTPARRGLPA LPAVEDSEAT ETSDEAERPR PLLSHILLEH NYALAVKPTP PAPALRPPEP 1320
VPAPAALFSS PADEVLEAPE VVVAEAEEPK PQQLQQQREE GEEEGEEEGE EEEEESSDSS 1380
SSSDGEGALR RRSLRSHARR RRPPPPPPPP PPRAYEPRSE FEQMTILYDI WNSGLDSEDM 1440
SYLRLTYERL LQQTSGADWL NDTHWVHHTI TNLTTPKRKR RPQDGPREHQ TGSARSEGYY 1500
PISKKEKDKY LDVCPVSARQ LEGVDTQGTN RVLSERRSEQ RRLLSAIGTS AIMDSDLLKL 1560
NQLKFRKKKL RFGRSRIHEW GLFAMEPIAA DEMVIEYVGQ NIRQMVADMR EKRYVQEGIG 1620
SSYLFRVDHD TIIDATKCGN LARFINHCCT PNCYAKVITI ESQKKIVIYS KQPIGVDEEI 1680
TYDYKFPLED NKIPCLCGTE SCRGSLN 1707 
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
 GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
 GO:0048188; C:Set1C/COMPASS complex; IDA:UniProtKB.
 GO:0042800; F:histone methyltransferase activity (H3-K4 specific); IDA:UniProtKB.
 GO:0000166; F:nucleotide binding; IEA:InterPro.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR024657; COMPASS_Set1_N-SET.
 IPR015722; Histone-lysine_MeTfrase.
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR003616; Post-SET_dom.
 IPR000504; RRM_dom.
 IPR001214; SET_dom. 
Pfam
 PF11764; N-SET
 PF00076; RRM_1
 PF00856; SET 
SMART
 SM00508; PostSET
 SM00360; RRM
 SM00317; SET 
PROSITE
 PS50868; POST_SET
 PS50102; RRM
 PS50280; SET 
PRINTS