CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-044074
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Host cell factor, isoform E 
Protein Synonyms/Alias
  
Gene Name
 Hcf 
Gene Synonyms/Alias
 CG1710; Dmel_CG1710 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position  Peptide          Type         References
690       TTTSIGGKQYFIQKP  acetylation  [1]
782       PHTMAGGKLIMKNSN  acetylation  [1]
865       ISNQSGVKMLRNISS  acetylation  [1]
941       TTALSARKSFVFNAG  acetylation  [1]
959       RTVTLATKSINAKSI  acetylation  [1]
964       ATKSINAKSIPQSQP  acetylation  [1]
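Each row of the Lysine Modification table carries a position, a 15-residue peptide, a modification type, and a reference number. The sketch below is one possible way to read such rows into records. It is not part of CPLM: the `Site` type, the `parse_site` and `central_residue` helpers, and the regular expression are all hypothetical names introduced here. It assumes that the peptide is upper-case, the type is lower-case, and the reference is a bracketed integer, and it tolerates rows with or without whitespace between the fields. That the modified lysine sits at the centre of the peptide is an observation about the six rows in this record, not a documented guarantee.

```python
import re
from typing import NamedTuple


class Site(NamedTuple):
    """One row of the Lysine Modification table (hypothetical record type)."""
    position: int   # 1-based position in the protein sequence
    peptide: str    # residues surrounding the modified lysine
    mod_type: str   # e.g. "acetylation"
    reference: int  # number of the entry in the Reference field


# Assumed row layout: digits, upper-case peptide, lower-case type, "[n]".
# Whitespace between fields is optional.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[(\d+)\]\s*$")


def parse_site(row: str) -> Site:
    """Parse one table row; raise ValueError if it does not fit the layout."""
    match = ROW.match(row)
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    position, peptide, mod_type, reference = match.groups()
    return Site(int(position), peptide, mod_type, int(reference))


def central_residue(peptide: str) -> str:
    """Return the middle residue of an odd-length peptide."""
    return peptide[len(peptide) // 2]


# The six rows of this record, copied from the table.
rows = [
    "690 TTTSIGGKQYFIQKP acetylation [1]",
    "782 PHTMAGGKLIMKNSN acetylation [1]",
    "865 ISNQSGVKMLRNISS acetylation [1]",
    "941 TTALSARKSFVFNAG acetylation [1]",
    "959 RTVTLATKSINAKSI acetylation [1]",
    "964 ATKSINAKSIPQSQP acetylation [1]",
]
sites = [parse_site(row) for row in rows]

for site in sites:
    print(site.position, central_residue(site.peptide), site.mod_type)
```

For the rows in this record, `central_residue` returns `K` for every peptide, which is a quick sanity check that a row was split at the right boundaries.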
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1484 AA 
Protein Sequence
MEGSDFVDPA FSSGERISAS DLNSEHIIQA ENHSFANRIS MDMDVPDGHQ LDSNLTGFRW 60
KRVLNPTGPQ PRPRHGHRAI NIKELMVVFG GGNEGIVDEL HVYNTVTNQW YVPVLKGDVP 120
NGCAAYGFVV EGTRMFVFGG MIEYGKYSNE LYELQATKWE WRKMYPESPD SGLSPCPRLG 180
HSFTMVGEKI FLFGGLANES DDPKNNIPKY LNDLYILDTR GVHSHNGKWI VPKTYGDSPP 240
PRESHTGISF ATKSNGNLNL LIYGGMSGCR LGDLWLLETD SMTWSKPKTS GEAPLPRSLH 300
SSTMIGNKMY VFGGWVPLVI NDSKSTTERE WKCTNTLAVL DLETMTWENV TLDTVEENVP 360
RARAGHCAVG IQSRLYVWSG RDGYRKAWNN QVRVCCKDLW YLEVSKPLYA VKVALVRAST 420
HALELSWTAT TFAAAYVLQI QKIEQPLNTS SKLLSNNIVQ QGTPTSAETS GINISANRSG 480
SALGLGVEAT STVLKLEKES LQLSGCQPET NVQPSVNDLL QSMSQPSSPA SRADKDPLSS 540
GGGTTFNLST SVASVHPQIS VISSTAAVTG NDTASPSGAI NSILQKFRPV VTAVRTSTTT 600
AVSIATSTSD PLSVRVPSTM SANVVLSSSS STLRIVPSVT ASHSLRIASS QASGNNCRSS 660
SAINILKTAL PNVAVQSQPT SSTTTSIGGK QYFIQKPLTL APNVQLQFVK TSGGMTVQTL 720
PKVNFTASKG TPPHGISIAN PHLASGITQI QGSTVPGSQI QKPIVSGNVL KLVSPHTMAG 780
GKLIMKNSNI LQMGKGTPLG NQQIIIVTTG GNVRSVPTST VMTSAGGSAS GTNIVSIVNS 840
TSTTPSPLQA LSGQKTLISN QSGVKMLRNI SSVQASSSMA FGQKQSGTPI HQKTALYIGG 900
KAVTVMSTNT SMAASGNKVM VLPGTSSNNS PATTTALSAR KSFVFNAGGS PRTVTLATKS 960
INAKSIPQSQ PVTETNNHSV ATIKDTDPMD DIIEQLDGAG DLLKLSESEG QHGSEENENN 1020
GENATSSSAS ALFTGGDTAG PSRAQNPIVM EHPVDIIEDV SGVSSTTDVN ETAIVSGDTI 1080
ESLKMSEKEN DDVKSMGEKS ILSDDCHQPT TSETEAATIL TTIKSAEALV LETAEIRKDH 1140
TGCTIGSLKE NQDENKKFKQ RQESSPSQNI HQFQNVDGSQ LEALASAALL QAATSDTTAL 1200
ALKELIERPE SETNTRSSNI AEIQQNNVQS TLAVVVPNTS QNENQKWHTV GVFKDLSHTV 1260
TSYIDSNCIS DSFFDGIDVD NLPDFSKFPR TNLEPGTAYR FRLSAINSCG RGEWGEISSF 1320
KTCLPGFPGA PSAIKISKDV KEGAHLTWEP PPAQKTKEII EYSVYLAVKP TAKDKALSTP 1380
QLAFVRVYVG AANQCTVPNA SLSNAHVDCS NKPAIIFRIA ARNQKGYGPA TQVRWLQDPA 1440
AAKQHTPTVT PNLKRGPEKS TIGSSNIANT FCSPHKRGRN GLHD 1484 
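The Protein Sequence field is laid out in groups of ten residues, with a running residue count at the end of each line. The sketch below shows one way to turn an excerpt of that layout back into a plain string and to look up the window around a listed position. The helper names `parse_block` and `window` are hypothetical, and the excerpt is the three lines of this record that end at 720, 780 and 840. The start position of the excerpt is inferred from the first line's trailing count, on the assumption that the count is the position of the last residue on that line.

```python
def parse_block(text: str) -> tuple[int, str]:
    """Return (start_position, residues) for a numbered sequence excerpt.

    Each line is expected to be space-separated residue groups followed by
    the 1-based position of the line's last residue.
    """
    start = None
    residues = ""
    for line in text.strip().splitlines():
        *groups, count = line.split()
        chunk = "".join(groups)
        end = int(count)
        if start is None:
            # Infer where the excerpt begins from the first trailing count.
            start = end - len(chunk) + 1
        residues += chunk
        # The running count must agree with the residues seen so far.
        if start + len(residues) - 1 != end:
            raise ValueError(f"count mismatch on line ending at {end}")
    if start is None:
        raise ValueError("empty sequence block")
    return start, residues


def window(start: int, residues: str, position: int, flank: int = 7) -> str:
    """Return the residues from position - flank to position + flank."""
    index = position - start
    if index - flank < 0 or index + flank >= len(residues):
        raise ValueError("window extends beyond the excerpt")
    return residues[index - flank:index + flank + 1]


# Three lines copied from the Protein Sequence field of this record.
excerpt = """
SAINILKTAL PNVAVQSQPT SSTTTSIGGK QYFIQKPLTL APNVQLQFVK TSGGMTVQTL 720
PKVNFTASKG TPPHGISIAN PHLASGITQI QGSTVPGSQI QKPIVSGNVL KLVSPHTMAG 780
GKLIMKNSNI LQMGKGTPLG NQQIIIVTTG GNVRSVPTST VMTSAGGSAS GTNIVSIVNS 840
"""
start, residues = parse_block(excerpt)

print(start)                         # first position covered by the excerpt
print(window(start, residues, 690))  # window around the site listed at 690
print(window(start, residues, 782))  # window around the site listed at 782
```

The two windows can be compared with the peptides listed for positions 690 and 782 in the Lysine Modification table to confirm that the table and the sequence use the same numbering.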
Gene Ontology
  
Interpro
 IPR003961; Fibronectin_type3.
 IPR015916; Gal_Oxidase_b-propeller.
 IPR013783; Ig-like_fold.
 IPR015915; Kelch-typ_b-propeller.
 IPR006652; Kelch_1. 
Pfam
 PF01344; Kelch_1 
SMART
 SM00060; FN3 
PROSITE
  
PRINTS