CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-037094
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Treacle protein 
Protein Synonyms/Alias
  
Gene Name
 TCOF1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
146MPHPATGKTVANLLSacetylation[1]
155VANLLSGKSPRKSAEacetylation[2]
296SQVKASEKILQVRAAacetylation[1, 2, 3]
600GPVAVQVKAEKPMDNacetylation[2, 3]
746TAPVLPGKTGPTVTQacetylation[1]
811GTISAPGKVVTAAAQacetylation[1, 2]
904RAALAPAKESPRKGAacetylation[2]
1414SLGSQGAKDEPEEELacetylation[2]
Reference
 [1] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [2] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [3] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1488 AA 
Protein Sequence
MAEARKRREL LPLIYHHLLR AGYVRAAREV KEQSGQKCFL AQPVTLLDIY THWQQTSELG 60
RKRKAEEDAA LQAKKTRVSD PISTSESSEE EEEAEAETAK ATPRLASTNS SVLGADLPSS 120
MKEKAKAETE KAGKTGNSMP HPATGKTVAN LLSGKSPRKS AEPSANTTLV SETEEEGSVP 180
AFGAAAKPGM VSAGQADSSS EDTSSSSDET DVEGKPSVKP AQVKASSVST KESPARKAAP 240
APGKVGDVTP QVKGGALPPA KRAKKPEEES ESSEEGSESE EEAPAGTRSQ VKASEKILQV 300
RAASAPAKGT PGKGATPAPP GKAGAVASQT KAGKPEEDSE SSSEESSDSE EETPAAKALL 360
QAKASGKTSQ VGAASAPAKE SPRKGAAPAP PGKTGPAVAK AQAGKREEDS QSSSEESDSE 420
EEAPAQAKPS GKAPQVRAAS APAKESPRKG AAPAPPRKTG PAAAQVQVGK QEEDSRSSSE 480
ESDSDREALA AMNAAQVKPL GKSPQVKPAS TMGMGPLGKG AGPVPPGKVG PATPSAQVGK 540
WEEDSESSSE ESSDSSDGEV PTAVAPAQEK SLGNILQAKP TSSPAKGPPQ KAGPVAVQVK 600
AEKPMDNSES SEESSDSADS EEAPAAMTAA QAKPALKIPQ TKACPKKTNT TASAKVAPVR 660
VGTQAPRKAG TATSPAGSSP AVAGGTQRPA EDSSSSEESD SEEEKTGLAV TVGQAKSVGK 720
GLQVKAASVP VKGSLGQGTA PVLPGKTGPT VTQVKAEKQE DSESSEEESD SEEAAASPAQ 780
VKTSVKKTQA KANPAAARAP SAKGTISAPG KVVTAAAQAK QRSPSKVKPP VRNPQNSTVL 840
ARGPASVPSV GKAVATAAQA QTGPEEDSGS SEEESDSEEE AETLAQVKPS GKTHQIRAAL 900
APAKESPRKG AAPTPPGKTG PSAAQAGKQD DSGSSSEESD SDGEAPAAVT SAQKDSNSKP 960
ARSKTLAPAP PERNTEGSSE SSEEELPLTQ VIKPPLIFVD PNRSPAGPAA TPAQAQAAST 1020
PRKARASEST ARSSSSESED EDVIPATQCL TPGIRTNVVT MPTAHPRIAP KASMAGASSS 1080
KESSRISDGK KQEGPATQVD SAVGTLPATS PQSTSVQAKG TNKLRKPKLP EVQQATKAPE 1140
SSDDSEDSSD SSSGSEEDGE GPQGAKSAHT LVGPTPSRTE TLVEETAAES SEDDVVAPSQ 1200
SLLSGYMTPG LTPANSQASK ATPKLDSSPS VSSTLAAKDD PDGKQEAKPQ QAAGMLSPKT 1260
GGKEAASGTT PQKSRKPKKG AGNPQASTLA LQSNITQCLL GQPWPLNEAQ VQASVVKVLT 1320
ELLEQERKKV VDTTKESSRK GWESRKRKLS GDQPAARTPR SKKKKKLGAG EGGEASVSPE 1380
KTSTTSKGKA KRDKASGDVK EKKGKGSLGS QGAKDEPEEE LQKGMGTVEG GDQSNPKSKK 1440
EKKKSDKRKK DKEKKEKKKK AKKASTKDSE SPSQKKKKKK KKTAEQTV 1488 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005730; C:nucleolus; IDA:HPA. 
Interpro
 IPR006594; LisH_dimerisation.
 IPR003993; TCS_treacle.
 IPR017859; Treacle-like_TCS. 
Pfam
 PF03546; Treacle 
SMART
 SM00667; LisH 
PROSITE
 PS50896; LISH 
PRINTS
 PR01503; TREACLE.