CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-022503
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 CpG-binding protein 
Protein Synonyms/Alias
 CXXC-type zinc finger protein 1; PHD finger and CXXC domain-containing protein 1 
Gene Name
 CXXC1 
Gene Synonyms/Alias
 CFP1; CGBP; PCCX1; PHF18 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 63        RITEKMAKAIREWYC  acetylation     [1]
 539       DVYNPQSKTYCKRLQ  ubiquitination  [2]
 557       PEHSRDPKVPADEVC  ubiquitination  [2, 3]
 595       NRHYCWEKLRRAEVD  ubiquitination  [2]
 611       ERVRVWYKLDELFEQ  ubiquitination  [2]
Reference
 [1] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [3] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
Functional Description
 Transcriptional activator that exhibits a unique DNA binding specificity for CpG unmethylated motifs with a preference for CpGG. 
Sequence Annotation
 ZN_FING 28 76 PHD-type.
 ZN_FING 160 209 CXXC-type.
 MOD_RES 1 1 N-acetylmethionine.
 MOD_RES 6 6 Phosphoserine.
 MOD_RES 19 19 Phosphoserine.
 MOD_RES 227 227 Phosphothreonine.  
Keyword
 3D-structure; Acetylation; Activator; Alternative splicing; Coiled coil; Complete proteome; DNA-binding; Metal-binding; Nucleus; Phosphoprotein; Reference proteome; Transcription; Transcription regulation; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 656 AA 
Protein Sequence
MEGDGSDPEP PDAGEDSKSE NGENAPIYCI CRKPDINCFM IGCDNCNEWF HGDCIRITEK 60
MAKAIREWYC RECREKDPKL EIRYRHKKSR ERDGNERDSS EPRDEGGGRK RPVPDPDLQR 120
RAGSGTGVGA MLARGSASPH KSSPQPLVAT PSQHHQQQQQ QIKRSARMCG ECEACRRTED 180
CGHCDFCRDM KKFGGPNKIR QKCRLRQCQL RARESYKYFP SSLSPVTPSE SLPRPRRPLP 240
TQQQPQPSQK LGRIREDEGA VASSTVKEPP EATATPEPLS DEDLPLDPDL YQDFCAGAFD 300
DHGLPWMSDT EESPFLDPAL RKRAVKVKHV KRREKKSEKK KEERYKRHRQ KQKHKDKWKH 360
PERADAKDPA SLPQCLGPGC VRPAQPSSKY CSDDCGMKLA ANRIYEILPQ RIQQWQQSPC 420
IAEEHGKKLL ERIRREQQSA RTRLQEMERR FHELEAIILR AKQQAVREDE ESNEGDSDDT 480
DLQIFCVSCG HPINPRVALR HMERCYAKYE SQTSFGSMYP TRIEGATRLF CDVYNPQSKT 540
YCKRLQVLCP EHSRDPKVPA DEVCGCPLVR DVFELTGDFC RLPKRQCNRH YCWEKLRRAE 600
VDLERVRVWY KLDELFEQER NVRTAMTNRA GLLALMLHQT IQHDPLTTDL RSSADR 656 
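The peptides in the Lysine Modification field appear to be 15-residue windows centred on the modified lysine (seven residues on each side). The following is a minimal Python sketch of how that can be checked against the sequence block above. The names `clean_sequence`, `site_window`, and `SITES` are illustrative helpers, not part of CPLM, and the window width of 7 is an assumption inferred from this entry only.

```python
import re

# Sequence block copied from the Protein Sequence field above, including the
# 10-residue grouping and the running position counters at each line end.
RAW_SEQUENCE = """
MEGDGSDPEP PDAGEDSKSE NGENAPIYCI CRKPDINCFM IGCDNCNEWF HGDCIRITEK 60
MAKAIREWYC RECREKDPKL EIRYRHKKSR ERDGNERDSS EPRDEGGGRK RPVPDPDLQR 120
RAGSGTGVGA MLARGSASPH KSSPQPLVAT PSQHHQQQQQ QIKRSARMCG ECEACRRTED 180
CGHCDFCRDM KKFGGPNKIR QKCRLRQCQL RARESYKYFP SSLSPVTPSE SLPRPRRPLP 240
TQQQPQPSQK LGRIREDEGA VASSTVKEPP EATATPEPLS DEDLPLDPDL YQDFCAGAFD 300
DHGLPWMSDT EESPFLDPAL RKRAVKVKHV KRREKKSEKK KEERYKRHRQ KQKHKDKWKH 360
PERADAKDPA SLPQCLGPGC VRPAQPSSKY CSDDCGMKLA ANRIYEILPQ RIQQWQQSPC 420
IAEEHGKKLL ERIRREQQSA RTRLQEMERR FHELEAIILR AKQQAVREDE ESNEGDSDDT 480
DLQIFCVSCG HPINPRVALR HMERCYAKYE SQTSFGSMYP TRIEGATRLF CDVYNPQSKT 540
YCKRLQVLCP EHSRDPKVPA DEVCGCPLVR DVFELTGDFC RLPKRQCNRH YCWEKLRRAE 600
VDLERVRVWY KLDELFEQER NVRTAMTNRA GLLALMLHQT IQHDPLTTDL RSSADR 656
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw)


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return residues position-flank..position+flank (1-based, inclusive).

    The window is clipped at the sequence ends rather than padded.
    """
    start = max(position - flank - 1, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


SEQUENCE = clean_sequence(RAW_SEQUENCE)

# Positions and peptides as listed in the Lysine Modification field.
SITES = {
    63: "RITEKMAKAIREWYC",
    539: "DVYNPQSKTYCKRLQ",
    557: "PEHSRDPKVPADEVC",
    595: "NRHYCWEKLRRAEVD",
    611: "ERVRVWYKLDELFEQ",
}

for position, peptide in SITES.items():
    residue = SEQUENCE[position - 1]
    matches = site_window(SEQUENCE, position) == peptide
    print(position, residue, peptide, matches)
```

Each printed row shows the site position, the residue found there, the listed peptide, and whether the window cut from the sequence reproduces it. Clipping at the termini is a design choice of this sketch; how CPLM treats sites closer than seven residues to either end is not shown in this entry.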
Gene Ontology
 GO:0016363; C:nuclear matrix; IEA:Compara.
 GO:0016607; C:nuclear speck; IDA:LIFEdb.
 GO:0048188; C:Set1C/COMPASS complex; IDA:UniProtKB.
 GO:0045322; F:unmethylated CpG binding; IDA:UniProtKB.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0006987; P:activation of signaling protein activity involved in unfolded protein response; TAS:Reactome.
 GO:0051568; P:histone H3-K4 methylation; IDA:UniProtKB.
 GO:0045893; P:positive regulation of transcription, DNA-dependent; IDA:UniProtKB.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
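Each Gene Ontology line above packs three semicolon-separated fields: the GO identifier, an aspect letter with the term name (C for cellular component, F for molecular function, P for biological process), and an evidence code with its source. A minimal Python sketch of splitting one such line follows. `GoAnnotation` and `parse_go_line` are hypothetical names introduced for illustration, and the parser assumes the exact `; ` delimiter and trailing period seen in this record.

```python
from typing import NamedTuple


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


# Aspect letters as used in UniProt-style GO cross-references.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line: str) -> GoAnnotation:
    """Split one 'GO:id; A:term; EVID:source.' line into its parts."""
    # Term names may contain commas, so split only on the '; ' delimiter.
    go_id, term_field, evidence_field = line.strip().rstrip(".").split("; ")
    aspect, term = term_field.split(":", 1)
    evidence, source = evidence_field.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect], term, evidence, source)


example = " GO:0045893; P:positive regulation of transcription, DNA-dependent; IDA:UniProtKB."
print(parse_go_line(example))
```

Splitting on the first colon only keeps term names and sources such as `UniProtKB-KW` intact.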
Interpro
 IPR022056; CpG-bd_C.
 IPR019786; Zinc_finger_PHD-type_CS.
 IPR002857; Znf_CXXC.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF00628; PHD
 PF12269; zf-CpG_bind_C
 PF02008; zf-CXXC 
SMART
 SM00249; PHD 
PROSITE
 PS51058; ZF_CXXC
 PS01359; ZF_PHD_1
 PS50016; ZF_PHD_2 
PRINTS