CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041310
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription elongation regulator 1 
Protein Synonyms/Alias
 Transcription elongation regulator 1, isoform CRA_c 
Gene Name
 TCERG1 
Gene Synonyms/Alias
 hCG_38116 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
522EKAAQKAKPVATAPIubiquitination[1]
579IQEPPHKKGMEELKKubiquitination[1]
585KKGMEELKKLRHPTPubiquitination[1]
586KGMEELKKLRHPTPTubiquitination[1]
624INEDEPVKAKKRKRDubiquitination[2]
640NKDIDSEKEAAMEAEubiquitination[2]
683SAFSTWEKELHKIVFubiquitination[1, 3, 4]
687TWEKELHKIVFDPRYubiquitination[1]
700RYLLLNPKERKQVFDubiquitination[1, 4]
711QVFDQYVKTRAEEERubiquitination[1]
740KKMMEEAKFNPRATFubiquitination[1]
944LEREEKEKLFNEHIEubiquitination[3]
992KEDPRCIKFSSSDRKubiquitination[1, 4]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1032 AA 
Protein Sequence
MAERGGDGGE SERFNPGELR MAQQQALRFR GPAPPPNAVM RGPPPLMRPP PPFGMMRGPP 60
PPPRPPFGRP PFDPNMPPMP PPGGIPPPMG PPHLQRPPFM PPPMSSMPPP PGMMFPPGMP 120
PVTAPGTPAL PPTEEIWVEN KTPDGKVYYY NARTRESAWT KPDGVKVIQQ SELTPMLAAQ 180
AQVQAQAQAQ AQAQAQAQAQ AQAQAQAQAQ AQAQAQAQAQ AQAQAQAQAQ AQAQAQAQAQ 240
AQVQAQVQAQ VQAQAVGAST PTTSSPAPAV STSTSSSTPS STTSTTTTAT SVAQTVSTPT 300
TQDQTPSSAV SVATPTVSVS TPAPTATPVQ TVPQPHPQTL PPAVPHSVPQ PTTAIPAFPP 360
VMVPPFRVPL PGMPIPLPGV AMMQIVSCPY VKTVATTKTG VLPGMAPPIV PMIHPQVAIA 420
ASPATLAGAT AVSEWTEYKT ADGKTYYYNN RTLESTWEKP QELKEKEKLE EKIKEPIKEP 480
SEEPLPMETE EEDPKEEPIK EIKEEPKEEE MTEEEKAAQK AKPVATAPIP GTPWCVVWTG 540
DERVFFYNPT TRLSMWDRPD DLIGRADVDK IIQEPPHKKG MEELKKLRHP TPTMLSIQKW 600
QFSMSAIKEE QELMEEINED EPVKAKKRKR DDNKDIDSEK EAAMEAEIKA ARERAIVPLE 660
ARMKQFKDML LERGVSAFST WEKELHKIVF DPRYLLLNPK ERKQVFDQYV KTRAEEERRE 720
KKNKIMQAKE DFKKMMEEAK FNPRATFSEF AAKHAKDSRF KAIEKMKDRE ALFNEFVAAA 780
RKKEKEDSKT RGEKIKSDFF ELLSNHHLDS QSRWSKVKDK VESDPRYKAV DSSSMREDLF 840
KQYIEKIAKN LDSEKEKELE RQARIEASLR EREREVQKAR SEQTKEIDRE REQHKREEAI 900
QNFKALLSDM VRSSDVSWSD TRRTLRKDHR WESGSLLERE EKEKLFNEHI EALTKKKREH 960
FRQLLDETSA ITLTSTWKEV KKIIKEDPRC IKFSSSDRKK QREFEEYIRD KYITAKADFR 1020
TLLKETKFIT YS 1032 
Gene Ontology
  
Interpro
 IPR002713; FF_domain.
 IPR001202; WW_dom. 
Pfam
 PF01846; FF
 PF00397; WW 
SMART
 SM00441; FF
 SM00456; WW 
PROSITE
 PS01159; WW_DOMAIN_1
 PS50020; WW_DOMAIN_2 
PRINTS