CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-019280 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Target of EGR1 protein 1 
Protein Synonyms/Alias
  
Gene Name
 TOE1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
74        LSGLGDRKSLLNQCI  ubiquitination  [1, 2]
164       PYHKGNDKGDESQSQ  ubiquitination  [3]
481       NKVYLSGKAVPLTVA  ubiquitination  [4]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [3] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [4] Methods for quantification of in vivo changes in protein ubiquitination following proteasome and deubiquitinase inhibition.
 Udeshi ND, Mani DR, Eisenhaure T, Mertins P, Jaffe JD, Clauser KR, Hacohen N, Carr SA.
 Mol Cell Proteomics. 2012 May;11(5):148-59. [PMID: 22505724]
Functional Description
 Inhibits cell growth rate and cell cycle. Induces CDKN1A expression as well as TGF-beta expression. Mediates the inhibitory growth effect of EGR1. 
Sequence Annotation
 ZN_FING 294 322 C3H1-type.
 MOTIF 335 347 Nuclear localization signal.
 MOD_RES 2 2 N-acetylalanine.
 MOD_RES 5 5 Phosphoserine.  
Keyword
 3D-structure; Acetylation; Complete proteome; Direct protein sequencing; Metal-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 510 AA 
Protein Sequence
MAADSDDGAV SAPAASDGGV SKSTTSGEEL VVQVPVVDVQ SNNFKEMWPS LLLAIKTANF 60
VAVDTELSGL GDRKSLLNQC IEERYKAVCH AARTRSILSL GLACFKRQPD KGEHSYLAQV 120
FNLTLLCMEE YVIEPKSVQF LIQHGFNFNQ QYAQGIPYHK GNDKGDESQS QSVRTLFLEL 180
IRARRPLVLH NGLIDLVFLY QNFYAHLPES LGTFTADLCE MFPAGIYDTK YAAEFHARFV 240
ASYLEYAFRK CERENGKQRA AGSPHLTLEF CNYPSSMRDH IDYRCCLPPA THRPHPTSIC 300
DNFSAYGWCP LGPQCPQSHD IDLIIDTDEA AAEDKRRRRR RREKRKRALL NLPGTQTSGE 360
AKDGPPKKQV CGDSIKPEET EQEVAADETR NLPHSKQGNK NDLEMGIKAA RPEIADRATS 420
EVPGSQASPN PVPGDGLHRA GFDAFMTGYV MAYVEVSQGP QPCSSGPWLP ECHNKVYLSG 480
KAVPLTVAKS QFSRSSKAHN QKMKLTWGSS 510 
Gene Ontology
 GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
 GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR006941; RNase_CAF1.
 IPR012337; RNaseH-like_dom.
 IPR000571; Znf_CCCH. 
Pfam
 PF04857; CAF1
 PF00642; zf-CCCH 
SMART
  
PROSITE
 PS50103; ZF_C3H1 
PRINTS