CPLM 1.0 - Compendium of Protein Lysine Modification
Tag	Content
CPLM ID CPLM-031099
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 TOX high mobility group box family member 4 
Protein Synonyms/Alias
 cDNA FLJ54372, highly similar to Epidermal Langerhans cell protein LCP1 
Gene Name
 TOX4 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position    Peptide            Type              References
184         TVVVEAGKKQKAPKK    acetylation       [1, 2]
185         VVVEAGKKQKAPKKR    ubiquitination    [2]
219         RDTQAAIKGQNPNAT    ubiquitination    [3]
Reference
 [1] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [2] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [3] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 598 AA 
Protein Sequence
METFHTPSLG DEEFEIPPIS LDSDPSLAVS DVVGHFDDLA DPSSSQDGSF SAQYGVQTLD 60
MPVGMTHGLM EQGGGLLSGG LTMDLDHSIG TQYSANPPVT IDVPMTDMTS GLMGHSQLTT 120
IDQSELSSQL GLSLGGGTIL PPAQSPEDRL STTPSPTSSL HEDGVEDFRR QLPSQKTVVV 180
EAGKKQKAPK KRKKKDPNEP QKPVSAYALF FRDTQAAIKG QNPNATFGEV SKIVASMWDS 240
LGEEQKQVYK RKTEAAKKEY LKALAAYKDN QECQATVETV ELDPAPPSQT PSPPPMATVD 300
PASPAPASIE PPALSPSIVV NSTLSSYVAN QASSGAGGQP NITKLIITKQ MLPSSITMSQ 360
GGMVTVIPAT VVTSRGLQLG QTSTATIQPS QQAQIVTRSV LQAAAAAAAA ASMQLPPPRL 420
QPPPLQQMPQ PPTQQQVTIL QQPPPLQAMQ QPPPQKVRIN LQQQPPPLQI KSVPLPTLKM 480
QTTLVPPTVE SSPERPMNNS PEAHTVEAPS PETICEMITD VVPEVESPSQ MDVELVSGSP 540
VALSPQPRCV RSGCENPPIV SKDWDNEYCS NECVVKHCRD VFLAWVASRN SNTVVFVK 598 
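The peptides listed under Lysine Modification appear to be 15-residue windows centered on the modified lysine (7 residues on each side). The record does not state this, so treat it as an inference from the three entries. The sketch below checks it against residues 121-240 of the sequence above; the names `FRAGMENT`, `FRAGMENT_START`, and `window` are hypothetical helpers introduced for illustration, not part of CPLM.

```python
# Residues 121-240 of the protein sequence in this record, copied with the
# 10-residue block spacing and then stripped of spaces.
FRAGMENT_START = 121  # 1-based position of the first residue in FRAGMENT
FRAGMENT = (
    "IDQSELSSQL GLSLGGGTIL PPAQSPEDRL STTPSPTSSL HEDGVEDFRR QLPSQKTVVV "
    "EAGKKQKAPK KRKKKDPNEP QKPVSAYALF FRDTQAAIKG QNPNATFGEV SKIVASMWDS"
).replace(" ", "")


def window(position, flank=7):
    """Return residues position-flank .. position+flank (1-based, inclusive).

    The flank of 7 is an assumption inferred from the 15-residue peptides
    listed in the record.
    """
    idx = position - FRAGMENT_START  # convert to a 0-based index in FRAGMENT
    if idx - flank < 0 or idx + flank >= len(FRAGMENT):
        raise ValueError("window extends beyond the fragment")
    return FRAGMENT[idx - flank : idx + flank + 1]


# Positions taken from the Lysine Modification table.
for pos in (184, 185, 219):
    peptide = window(pos)
    # The middle residue of each window should be the modified lysine (K).
    print(pos, peptide, peptide[7])
```

Only a fragment of the sequence is embedded to keep the example short, so positions outside 128-233 raise `ValueError` rather than returning a truncated window.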
Gene Ontology
 GO:0005634; C:nucleus; IDA:HPA. 
Interpro
 IPR009071; HMG_box_dom. 
Pfam
 PF00505; HMG_box 
SMART
 SM00398; HMG 
PROSITE
 PS50118; HMG_BOX_2 
PRINTS