CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-013464
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Single-strand selective monofunctional uracil DNA glycosylase 
Protein Synonyms/Alias
  
Gene Name
 SMUG1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 185       TPAELPAKQREQLLG  ubiquitination  [1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
 Responsible for recognizing base lesions in the genome and initiating base excision DNA repair. Acts as a monofunctional DNA glycosylase specific for uracil (U) residues in DNA, with a preference for single-stranded DNA substrates. The activity is greater against mismatches (U/G) than against matches (U/A). Excises uracil (U), 5-formyluracil (fU) and uracil derivatives bearing an oxidized group at C5 [5-hydroxyuracil (hoU) and 5-hydroxymethyluracil (hmU)] in ssDNA and dsDNA, but not the analogous cytosine derivatives (5-hydroxycytosine and 5-formylcytosine) or other oxidized damage. The activity depends on both the type of damage and the salt concentration. At low salt concentration the order of substrate preference is ssDNA > dsDNA (G pair) = dsDNA (A pair). At high salt concentration it is dsDNA (G pair) > dsDNA (A pair) > ssDNA. 
Sequence Annotation
 REGION 173 187 DNA binding (By similarity).
 BINDING 84 84 Substrate; via amide nitrogen (By similarity).
 BINDING 98 98 Substrate; via amide nitrogen.
 BINDING 163 163 Substrate.
 BINDING 239 239 Substrate.  
Keyword
 Alternative splicing; Complete proteome; Direct protein sequencing; DNA damage; DNA repair; DNA-binding; Glycosidase; Hydrolase; Nucleus; Polymorphism; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 270 AA 
Protein Sequence
MPQAFLLGSI HEPAGALMEP QPCPGSLAES FLEEELRLNA ELSQLQFSEP VGIIYNPVEY 60
AWEPHRNYVT RYCQGPKEVL FLGMNPGPFG MAQTGVPFGE VSMVRDWLGI VGPVLTPPQE 120
HPKRPVLGLE CPQSEVSGAR FWGFFRNLCG QPEVFFHHCF VHNLCPLLFL APSGRNLTPA 180
ELPAKQREQL LGICDAALCR QVQLLGVRLV VGVGRLAEQR ARRALAGLMP EVQVEGLLHP 240
SPRNPQANKG WEAVAKERLN ELGLLPLLLK 270 
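The modification site listed under Lysine Modification can be cross-checked against the sequence above. The following is a minimal sketch, not part of the CPLM record: it strips the block spacing and running position counts from the sequence text, confirms that residue 185 is a lysine, and extracts the 15-residue window centred on it. The names `parse_sequence` and `window` are hypothetical helpers introduced only for this illustration, and the 7-residue flank is an assumption inferred from the length of the listed peptide.

```python
# Sequence block copied verbatim from the record (spaces and counts included).
RAW_SEQUENCE = """
MPQAFLLGSI HEPAGALMEP QPCPGSLAES FLEEELRLNA ELSQLQFSEP VGIIYNPVEY 60
AWEPHRNYVT RYCQGPKEVL FLGMNPGPFG MAQTGVPFGE VSMVRDWLGI VGPVLTPPQE 120
HPKRPVLGLE CPQSEVSGAR FWGFFRNLCG QPEVFFHHCF VHNLCPLLFL APSGRNLTPA 180
ELPAKQREQL LGICDAALCR QVQLLGVRLV VGVGRLAEQR ARRALAGLMP EVQVEGLLHP 240
SPRNPQANKG WEAVAKERLN ELGLLPLLLK 270
"""


def parse_sequence(raw: str) -> str:
    """Keep only letters, dropping whitespace and the running position counts."""
    return "".join(ch for ch in raw if ch.isalpha())


def window(seq: str, position: int, flank: int = 7) -> str:
    """Return the residues around a 1-based position, clipped at the termini."""
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(seq))
    return seq[start:end]


sequence = parse_sequence(RAW_SEQUENCE)

site = 185                      # position from the Lysine Modification entry
peptide = "TPAELPAKQREQLLG"     # peptide from the Lysine Modification entry

print(len(sequence))                       # matches the stated protein length
print(sequence[site - 1])                  # residue at the modified position
print(window(sequence, site) == peptide)   # does the window match the peptide?
```

Positions in the record are 1-based, so the residue at position 185 is `sequence[184]` in Python.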
Gene Ontology
 GO:0005730; C:nucleolus; IDA:HGNC.
 GO:0005654; C:nucleoplasm; TAS:Reactome.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0017065; F:single-strand selective uracil DNA N-glycosylase activity; IDA:HGNC.
 GO:0045008; P:depyrimidination; TAS:Reactome. 
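Each Gene Ontology line above has three semicolon-separated fields: the GO identifier, an aspect letter with the term name, and an evidence code with its source. As a rough sketch of how such lines could be split into structured fields, assuming the aspect letters C, F and P stand for cellular component, molecular function and biological process (the `GoTerm` type and `parse_go_line` function are hypothetical names, not part of any CPLM or UniProt tool):

```python
from typing import NamedTuple


class GoTerm(NamedTuple):
    go_id: str
    aspect: str
    name: str
    evidence: str
    source: str


# Assumed expansion of the one-letter aspect prefixes used in the record.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line: str) -> GoTerm:
    """Split one 'GO:id; X:name; CODE:source.' line into its fields."""
    fields = [part.strip() for part in line.strip().rstrip(".").split(";")]
    go_id, term, evidence = fields
    aspect, name = term.split(":", 1)
    code, source = evidence.split(":", 1)
    return GoTerm(go_id, ASPECTS[aspect], name, code, source)


lines = [
    " GO:0005730; C:nucleolus; IDA:HGNC.",
    " GO:0005654; C:nucleoplasm; TAS:Reactome.",
    " GO:0003677; F:DNA binding; IEA:UniProtKB-KW.",
    " GO:0017065; F:single-strand selective uracil DNA N-glycosylase activity; IDA:HGNC.",
    " GO:0045008; P:depyrimidination; TAS:Reactome. ",
]

terms = [parse_go_line(line) for line in lines]
for term in terms:
    print(term.go_id, term.aspect, term.name, term.evidence, term.source, sep=" | ")
```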
Interpro
 IPR005122; Uracil-DNA_glycosylase-like. 
Pfam
 PF03167; UDG 
SMART
  
PROSITE
  
PRINTS