CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-029971
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Host cell factor 1 
Protein Synonyms/Alias
  
Gene Name
 Hcfc1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
288       VKVATHEKEWKCTNT  acetylation     [1, 2, 3]
345       SGRDGYRKAWNNQVC  acetylation     [1]
354       WNNQVCCKDLWYLET  acetylation     [1]
363       LWYLETEKPPPPARV  ubiquitination  [4]
665       TKTITLVKSPISVPG  ubiquitination  [4]
690       VMSVVQTKPVQTSAV  acetylation     [1]
802       PITIITTKVMTSGTG  acetylation     [1]
813       SGTGAPAKIITAVPK  acetylation     [1, 2, 3]
820       KIITAVPKIATGHGQ  acetylation     [1]
836       GVTQVVLKGAPGQPG  acetylation     [1]
836       GVTQVVLKGAPGQPG  ubiquitination  [4]
1863      PSKAPVKKENQWFDV  ubiquitination  [4]
1918      LQPGTAYKFRVAGIN  acetylation     [1]
1918      LQPGTAYKFRVAGIN  ubiquitination  [4]
1956      PCAIKISKSPDGAHL  acetylation     [1]
2044      RIAARNEKGYGPATQ  acetylation     [1, 3]
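Each site entry above pairs a 1-based lysine position with a 15-residue peptide in which the modified lysine appears to be the 8th residue (seven flanking residues on each side). The following is a minimal Python sketch, assuming that layout, of how one entry can be checked against the protein sequence. The helper name `window_matches` and the use of a short sequence fragment (residues 281-300, copied from the Protein Sequence field) are illustrative choices, not part of CPLM.

```python
def window_matches(fragment, fragment_start, position, peptide, flank=7):
    """Check that `peptide` equals the residues position-flank..position+flank.

    fragment       -- a stretch of the protein sequence (plain letters)
    fragment_start -- 1-based position of the first residue in `fragment`
    position       -- 1-based position of the modified lysine
    flank          -- residues on each side of the lysine (assumed to be 7)
    """
    idx = position - fragment_start          # 0-based index inside the fragment
    lo, hi = idx - flank, idx + flank + 1    # slice bounds of the window
    if lo < 0 or hi > len(fragment):
        raise ValueError("window extends beyond the supplied fragment")
    # The central residue must be a lysine and the window must equal the peptide.
    return fragment[idx] == "K" and fragment[lo:hi] == peptide


# Residues 281-300 of the sequence listed in this record.
FRAGMENT = "VKVATHEKEWKCTNTLACLN"
FRAGMENT_START = 281

ok = window_matches(FRAGMENT, FRAGMENT_START, 288, "VKVATHEKEWKCTNT")
```

Windows near the protein termini would be shorter than 15 residues; the sketch raises an error in that case rather than guessing how such entries are padded.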
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [3] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
 [4] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2090 AA 
Protein Sequence
MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH 60
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW 120
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP 180
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIETL 240
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN 300
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE 360
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV 420
PANPPKSPAP AAAAPAVQPL TQVGITLVPQ AATAPPSTTT IQVLPTVPGS SISVPTAART 480
QGVPAVLKVT GPQATTGTPL VTMRPASQAG KAPVTVTSLP ASVRMVVPTQ SAQGTVIGSN 540
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTV AVTPGTTTLP ATVKVASSPV 600
MVSNPATRML KTAAAQVGTS VSSAANTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT 660
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT 720
ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ 780
AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG 840
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGAH 900
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV 960
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET 1020
HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQETAASLV TSAVGQQNGN VVRVCSNPPC 1080
ETHETGTTNT ATTATSNMAG QHGCSNPPCE THETGTTSTA TTAMSSMGTG QQRDTRRTTN 1140
TPTVVRITVA PGALERVQGT VKPQCQTQQT NMTTTTMTVQ ATGAPCSAGP LLRPSVALES 1200
GSHSPAFVQL ALPSVRVGLS GPSSKDMPTG RQPETYHTYT TNTPTTTRSI MVAGELGAAR 1260
VVPTSTYESL QASSPSSTMT MTALEALLCP SATVTQVCSN PPCETHETGT TNTATTSNAG 1320
SAQRVCSNPP CETHETGTTH TATTATSNGG AGQPEGGQQP ASGHPCETHQ TTSTGTTMSV 1380
SVGTLIPDAT SSHGTLESGL EVVAVPTVTS QAGSTLLASF PTQRVCSNPP CETHETGTTH 1440
TATTVTSNMS SNQDPPPAAS DQGEVASTQG DSTNITSASA ITTSVSSTLP RAVTTVTQST 1500
PVPGPSVPNI SSLTETTPGA LNSEVPIPAT ITVTIANTET SDMPFSAVDI LQPPEELQVS 1560
PGPRQQLPPR QLLQSASTPL MGESTEVLSA SQTPELQAAV DLSSTGDPSS GQEPTTSAVV 1620
ATVVVQPPPP TQSEVDQLSL PQELMAEAQA GTTTLMVTGL TPEELAVTAA AEAAAQAAAT 1680
EEAQALAIQA VLQAAQQAVM AGTGEPMDTS EAAAAVTQAE LGHLSAEGQE GQATTIPIVL 1740
TQQELAALVQ QQQQLQEAQA QAQQQHHLPT EALAPADSLN DPSIESNCLN ELASAVPSTV 1800
ALLPSTATES LAPSNTFVAP QPVVASPAKM QAAATLTEVA NGIESLGVKP DLPPPPSKAP 1860
VKKENQWFDV GVIKGTSVMV THYFLPPDDA VQSDDDSGTV PDYNQLKKQE LQPGTAYKFR 1920
VAGINACGRG PFSEISAFKT CLPGFPGAPC AIKISKSPDG AHLTWEPPSV TSGKIIEYSV 1980
YLAIQSSQAS GEPKSSTPAQ LAFMRVYCGP SPSCLVQSSS LSNAHIDYTT KPAIIFRIAA 2040
RNEKGYGPAT QVRWLQETSK DSSGTKPASK RPMSSPEMKS APKKSKADGQ 2090 
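The sequence above is laid out in space-separated groups of ten residues, with the cumulative residue count closing each line. The following is a minimal Python sketch, assuming that layout, for recovering the plain residue string from one such line. The function name `parse_sequence_line` is a hypothetical helper, and the sample is the first line of this record.

```python
def parse_sequence_line(line):
    """Split one formatted sequence line into (residues, cumulative_count).

    Assumes whitespace-separated residue groups followed by a trailing integer
    giving the position of the last residue on the line.
    """
    tokens = line.split()
    if not tokens or not tokens[-1].isdigit():
        raise ValueError("expected a trailing residue count")
    count = int(tokens[-1])
    residues = "".join(tokens[:-1])   # drop the count, join the residue groups
    return residues, count


first_line = ("MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA "
              "IKELIVVFGG GNEGIVDELH 60")
residues, count = parse_sequence_line(first_line)
```

Concatenating the residues from every line, and comparing the final count with the stated Protein Length, gives a quick consistency check on a copied record.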
Gene Ontology
  
Interpro
 IPR003961; Fibronectin_type3.
 IPR013783; Ig-like_fold.
 IPR015915; Kelch-typ_b-propeller.
 IPR006652; Kelch_1.
 IPR011498; Kelch_2. 
Pfam
 PF01344; Kelch_1
 PF07646; Kelch_2 
SMART
 SM00060; FN3 
PROSITE
  
PRINTS