CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-015323
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 ELM2 and SANT domain-containing protein 1 
Protein Synonyms/Alias
  
Gene Name
 ELMSAN1 
Gene Synonyms/Alias
 C14orf117; C14orf43 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
169PEALKREKAGGPQLDubiquitination[1]
843AERKLFNKGIAIYKKubiquitination[2, 3, 4]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [3] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
 [4] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
  
Sequence Annotation
 DOMAIN 721 813 ELM2.
 DOMAIN 828 879 SANT.
 MOD_RES 1 1 N-acetylmethionine.
 MOD_RES 461 461 Phosphoserine.
 MOD_RES 655 655 Phosphothreonine.
 MOD_RES 661 661 Phosphoserine.
 MOD_RES 704 704 Phosphothreonine.
 MOD_RES 709 709 Phosphoserine.
 MOD_RES 715 715 Phosphothreonine.  
Keyword
 Acetylation; Complete proteome; DNA-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1045 AA 
Protein Sequence
MNLQAQPKAQ NKRKRCLFGG QEPAPKEQPP PLQPPQQSIR VKEEQYLGHE GPGGAVSTSQ 60
PVELPPPSSL ALLNSVVYGP ERTSAAMLSQ QVASVKWPNS VMAPGRGPER GGGGGVSDSS 120
WQQQPGQPPP HSTWNCHSLS LYSATKGSPH PGVGVPTYYN HPEALKREKA GGPQLDRYVR 180
PMMPQKVQLE VGRPQAPLNS FHAAKKPPNQ SLPLQPFQLA FGHQVNRQVF RQGPPPPNPV 240
AAFPPQKQQQ QQQPQQQQQQ QQAALPQMPL FENFYSMPQQ PSQQPQDFGL QPAGPLGQSH 300
LAHHSMAPYP FPPNPDMNPE LRKALLQDSA PQPALPQVQI PFPRRSRRLS KEGILPPSAL 360
DGAGTQPGQE ATGNLFLHHW PLQQPPPGSL GQPHPEALGF PLELRESQLL PDGERLAPNG 420
REREAPAMGS EEGMRAVSTG DCGQVLRGGV IQSTRRRRRA SQEANLLTLA QKAVELASLQ 480
NAKDGSGSEE KRKSVLASTT KCGVEFSEPS LATKRAREDS GMVPLIIPVS VPVRTVDPTE 540
AAQAGGLDED GKGPEQNPAE HKPSVIVTRR RSTRIPGTDA QAQAEDMNVK LEGEPSVRKP 600
KQRPRPEPLI IPTKAGTFIA PPVYSNITPY QSHLRSPVRL ADHPSERSFE LPPYTPPPIL 660
SPVREGSGLY FNAIISTSTI PAPPPITPKS AHRTLLRTNS AEVTPPVLSV MGEATPVSIE 720
PRINVGSRFQ AEIPLMRDRA LAAADPHKAD LVWQPWEDLE SSREKQRQVE DLLTAACSSI 780
FPGAGTNQEL ALHCLHESRG DILETLNKLL LKKPLRPHNH PLATYHYTGS DQWKMAERKL 840
FNKGIAIYKK DFFLVQKLIQ TKTVAQCVEF YYTYKKQVKI GRNGTLTFGD VDTSDEKSAQ 900
EEVEVDIKTS QKFPRVPLPR RESPSEERLE PKREVKEPRK EGEEEVPEIQ EKEEQEEGRE 960
RSRRAAAVKA TQTLQANESA SDILILRSHE SNAPGSAGGQ ASEKPREGTG KSRRALPFSE 1020
KKKKTETFSK TQNQENTFPC KKCGR 1045 
Gene Ontology
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR000949; ELM2_dom.
 IPR009057; Homeodomain-like.
 IPR001005; SANT/Myb.
 IPR017884; SANT_dom. 
Pfam
 PF01448; ELM2 
SMART
 SM00717; SANT 
PROSITE
 PS51156; ELM2
 PS51293; SANT 
PRINTS