CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-026826
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 THO complex subunit 3 
Protein Synonyms/Alias
 THOC3 protein 
Gene Name
 THOC3 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
86ASVFLLEKDRLVKENubiquitination[1]
91LEKDRLVKENNYRGHubiquitination[2, 3]
133IWDVRTTKCIATVNTubiquitination[3, 4]
171VVTFIDAKTHRSKAEubiquitination[5]
Reference
 [1] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661]
 [2] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [3] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [5] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 327 AA 
Protein Sequence
MAVPAAAMGP SALGQSGPGS MAPWCSVSSG PSRYVLGMQE LFRGHSKTRE FLAHSAKVHS 60
VAWSCDGRRL ASGSFDKTAS VFLLEKDRLV KENNYRGHGD SVDQLCWHPS NPDLFVTASG 120
DKTIRIWDVR TTKCIATVNT KGENINICWS PDGQTIAVGN KDDVVTFIDA KTHRSKAEEQ 180
FKFEVNEISW NNDNNMFFLT NGNGCINILS YPELKPVQSI NAHPSNCICI KFDPMGKYFA 240
TGSADALVSL WDVDELVCVR CFSRLDWPVR TLSFSHDGKM LASASEDHFI DIAEVETGNF 300
MRIYRLSPLA VRTSLVISSL HVTTSPA 327 
Gene Ontology
  
Interpro
 IPR020472; G-protein_beta_WD-40_rep.
 IPR011659; PD40.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF07676; PD40
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS
 PR00320; GPROTEINBRPT.