CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-029074
UniProt Accession
Genbank Protein ID
 U52112 
Genbank Nucleotide ID
  
Protein Name
 HCF N-terminal chain 5 
Protein Synonyms/Alias
  
Gene Name
 HCFC1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
129       RLKAKTPKNGPPPCP  ubiquitination  [1]
148       SFSLVGNKCYLFGGL  ubiquitination  [1, 2, 3, 4]
163       ANDSEDPKNNIPRYL  ubiquitination  [1, 3]
211       TAVVYTEKDNKKSKL  ubiquitination  [1]
214       VYTEKDNKKSKLVIY  ubiquitination  [1]
215       YTEKDNKKSKLVIYG  ubiquitination  [1]
266       SATTIGNKMYVFGGW  ubiquitination  [1]
288       VKVATHEKEWKCTNT  acetylation     [4, 5, 6]
288       VKVATHEKEWKCTNT  ubiquitination  [1, 4]
345       SGRDGYRKAWNNQVC  ubiquitination  [1]
354       WNNQVCCKDLWYLET  ubiquitination  [1]
363       LWYLETEKPPPPARV  ubiquitination  [1, 4, 7]
488       QGVPAVLKVTGPQAT  ubiquitination  [1]
511       RPASQAGKAPVTVTS  methylation     [8]
511       RPASQAGKAPVTVTS  ubiquitination  [1]
665       TKTITLVKSPISVPG  ubiquitination  [1, 2, 3, 4]
682       ALISNLGKVMSVVQT  ubiquitination  [1]
690       VMSVVQTKPVQTSAV  ubiquitination  [1]
713       VTQIIQTKGPLPAGT  ubiquitination  [1]
813       SGTGAPAKIITAVPK  acetylation     [4, 6]
813       SGTGAPAKIITAVPK  ubiquitination  [1, 2, 3, 4]
820       KIITAVPKIATGHGQ  ubiquitination  [1]
836       GVTQVVLKGAPGQPG  ubiquitination  [1, 2, 3, 4]
866       PVTVSAVKPAVTTLV  ubiquitination  [4]
1163      LEAAQGSKSQCQTRQ  ubiquitination  [1, 3, 4]
1217      QLAPLSSKVRLSSPS  ubiquitination  [1, 3, 4]
1226      RLSSPSIKDLPAGRH  ubiquitination  [1, 4]
1897      VPDYNQLKKQELQPG  ubiquitination  [1]
1898      PDYNQLKKQELQPGT  ubiquitination  [1]
1908      LQPGTAYKFRVAGIN  ubiquitination  [1, 2, 9]
1946      PCAIKISKSPDGAHL  ubiquitination  [1, 2]
2021      AHIDYTTKPAIIFRI  ubiquitination  [1]
2034      RIAARNEKGYGPATQ  acetylation     [4]
2050      RWLQETSKDSSGTKP  acetylation     [4]
2050      RWLQETSKDSSGTKP  ubiquitination  [1]
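
Each Lysine Modification row pairs a position with a peptide, a modification type, and bracketed reference numbers. The Python sketch below shows one possible way to split such a row into fields, whether the columns are fused together or separated by whitespace. It is illustrative only: the names `Site`, `parse_row`, and `centre_is_lysine` are hypothetical and not part of CPLM, and the check assumes that the peptide is a window centred on the modified lysine, which the rows in this record suggest but the record does not state.

```python
import re
from typing import NamedTuple, Tuple

# Digits (position), uppercase letters (peptide), lowercase letters (type),
# then a bracketed, comma-separated list of reference numbers.
# Whitespace between the fields is optional.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[([\d,\s]+)\]\s*$")


class Site(NamedTuple):
    """One modification row (hypothetical container, not a CPLM type)."""
    position: int
    peptide: str
    mod_type: str
    references: Tuple[int, ...]


def parse_row(line: str) -> Site:
    """Split one table row into its four fields."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    position, peptide, mod_type, refs = match.groups()
    ref_numbers = tuple(int(r) for r in refs.split(","))
    return Site(int(position), peptide, mod_type, ref_numbers)


def centre_is_lysine(site: Site) -> bool:
    """Assumption: the modified residue sits in the middle of the peptide."""
    return site.peptide[len(site.peptide) // 2] == "K"


if __name__ == "__main__":
    site = parse_row("288VKVATHEKEWKCTNTacetylation[4, 5, 6]")
    print(site, centre_is_lysine(site))
```

Grouping the parsed rows by `position` would then show which lysines carry more than one modification type, such as positions 288, 511, 813, and 2050 in this record.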
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [3] Methods for quantification of in vivo changes in protein ubiquitination following proteasome and deubiquitinase inhibition.
 Udeshi ND, Mani DR, Eisenhaure T, Mertins P, Jaffe JD, Clauser KR, Hacohen N, Carr SA.
 Mol Cell Proteomics. 2012 May;11(5):148-59. [PMID: 22505724]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [5] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773]
 [6] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [7] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [8] Mass spectrometry-based identification and characterisation of lysine and arginine methylation in the human proteome.
 Bremang M, Cuomo A, Agresta AM, Stugiewicz M, Spadotto V, Bonaldi T.
 Mol Biosyst. 2013 Jul 30;9(9):2231-47. [PMID: 23748837]
 [9] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2080 AA 
Protein Sequence
MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH 60
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW 120
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP 180
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIDTL 240
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN 300
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE 360
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV 420
PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAPAPPTTTT IQVLPTVPGS SISVPTAART 480
QGVPAVLKVT GPQATTGTPL VTMRPASQAG KAPVTVTSLP AGVRMVVPTQ SAQGTVIGSS 540
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTM AVTPGTTTLP ATVKVASSPV 600
MVSNPATRML KTAAAQVGTS VSSATNTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT 660
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT 720
ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ 780
AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG 840
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGGH 900
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV 960
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET 1020
HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQEAAASLV TSTVGQQNGS VVRVCSNPPC 1080
ETHETGTTNT ATTATSNMAG QHGCSNPPCE THETGTTNTA TTAMSSVGAN HQRDARRACA 1140
AGTPAVIRIS VATGALEAAQ GSKSQCQTRQ TSATSTTMTV MATGAPCSAG PLLGPSMARE 1200
PGGRSPAFVQ LAPLSSKVRL SSPSIKDLPA GRHSHAVSTA AMTRSSVGAG EPRMAPVCES 1260
LQGGSPSTTV TVTALEALLC PSATVTQVCS NPPCETHETG TTNTATTSNA GSAQRVCSNP 1320
PCETHETGTT HTATTATSNG GTGQPEGGQQ PPAGRPCETH QTTSTGTTMS VSVGALLPDA 1380
TSSHRTVESG LEVAAAPSVT PQAGTALLAP FPTQRVCSNP PCETHETGTT HTATTVTSNM 1440
SSNQDPPPAA SDQGEVESTQ GDSVNITSSS AITTTVSSTL TRAVTTVTQS TPVPGPSVPK 1500
ISSMTETAPR ALTTEVPIPA KITVTIANTE TSDMPFSAVD ILQPPEELQV SPGPRQQLPP 1560
RQLLQSASTA LMGESAEVLS ASQTPELPAA VDLSSTGEPS SGQESAGSAV VATVVVQPPP 1620
PTQSEVDQLS LPQELMAEAQ AGTTTLMVTG LTPEELAVTA AAEAAAQAAA TEEAQALAIQ 1680
AVLQAAQQAV MAGTGEPMDT SEAAATVTQA ELGHLSAEGQ EGQATTIPIV LTQQELAALV 1740
QQQQLQEAQA QQQHHHLPTE ALAPADSLND PAIESNCLNE LAGTVPSTVA LLPSTATESL 1800
APSNTFVAPQ PVVVASPAKL QAAATLTEVA NGIESLGVKP DLPPPPSKAP MKKENQWFDV 1860
GVIKGTNVMV THYFLPPDDA VPSDDDLGTV PDYNQLKKQE LQPGTAYKFR VAGINACGRG 1920
PFSEISAFKT CLPGFPGAPC AIKISKSPDG AHLTWEPPSV TSGKIIEYSV YLAIQSSQAG 1980
GELKSSTPAQ LAFMRVYCGP SPSCLVQSSS LSNAHIDYTT KPAIIFRIAA RNEKGYGPAT 2040
QVRWLQETSK DSSGTKPANK RPMSSPEMKS APKKSKADGQ 2080 
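
The sequence block above is laid out in groups of ten residues with a running residue count at the end of each line. As a rough sketch, the Python below joins such a block into one string and extracts the residues flanking a 1-based position, which can be compared against the peptides listed under Lysine Modification. The function names `parse_sequence_block` and `peptide_window` are hypothetical, and the default of seven flanking residues is an assumption inferred from the 15-residue peptides in this record.

```python
def parse_sequence_block(text: str) -> str:
    """Join a grouped sequence block into one string.

    Purely alphabetic tokens are kept as residues; the numeric
    end-of-line counters (60, 120, ...) are skipped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():
                residues.append(token.upper())
    return "".join(residues)


def peptide_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues around a 1-based position.

    The window is truncated, not padded, near either end of the sequence.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError("position outside sequence")
    start = max(0, position - 1 - flank)
    return sequence[start:position + flank]


if __name__ == "__main__":
    # First three lines of the sequence block in this record.
    block = """\
MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH 60
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW 120
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP 180
"""
    sequence = parse_sequence_block(block)
    print(peptide_window(sequence, 129))
```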
Gene Ontology
 GO:0005739; C:mitochondrion; IDA:HPA.
 GO:0005634; C:nucleus; IDA:HPA. 
Interpro
 IPR003961; Fibronectin_type3.
 IPR015916; Gal_Oxidase_b-propeller.
 IPR013783; Ig-like_fold.
 IPR015915; Kelch-typ_b-propeller.
 IPR006652; Kelch_1. 
Pfam
 PF01344; Kelch_1 
SMART
 SM00060; FN3 
PROSITE
  
PRINTS