CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-011270
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Cleavage stimulation factor subunit 1 
Protein Synonyms/Alias
 CF-1 50 kDa subunit; Cleavage stimulation factor 50 kDa subunit; CSTF 50 kDa subunit; CstF-50 
Gene Name
 CSTF1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
143       DTERMLAKSAMPIEV  ubiquitination  [1]
198       GSRDYTLKLFDYSKP  ubiquitination  [2]
204       LKLFDYSKPSAKRAF  ubiquitination  [1, 2, 3]
208       DYSKPSAKRAFKYIQ  ubiquitination  [2]
326       KYILSSGKDSVAKLW  ubiquitination  [2]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [3] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
Functional Description
 One of the multiple factors required for polyadenylation and 3'-end cleavage of mammalian pre-mRNAs. May be responsible for the interaction of CSTF with other factors to form a stable complex on the pre-mRNA. 
Sequence Annotation
 REPEAT 106 145 WD 1.
 REPEAT 171 210 WD 2.
 REPEAT 215 254 WD 3.
 REPEAT 260 301 WD 4.
 REPEAT 303 343 WD 5.
 REPEAT 395 430 WD 6.
 REGION 14 35 Hydrophobic.  
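The feature ranges above can be intersected with the modified lysine positions listed in this record to see which annotated region each site falls in. The following is a minimal sketch, assuming the ranges are 1-based and inclusive (the usual UniProt convention); the names `FEATURES`, `SITES`, and `features_at` are illustrative and not part of CPLM.

```python
# Feature ranges copied from the Sequence Annotation block (1-based, inclusive; assumed).
FEATURES = [
    ("WD 1", 106, 145),
    ("WD 2", 171, 210),
    ("WD 3", 215, 254),
    ("WD 4", 260, 301),
    ("WD 5", 303, 343),
    ("WD 6", 395, 430),
    ("Hydrophobic", 14, 35),
]

# Modified lysine positions copied from the Lysine Modification table.
SITES = [143, 198, 204, 208, 326]


def features_at(pos, features):
    """Return the names of all features whose range contains pos."""
    return [name for name, start, end in features if start <= pos <= end]


site_map = {pos: features_at(pos, FEATURES) for pos in SITES}

for pos, names in site_map.items():
    print(pos, ", ".join(names) if names else "(no annotated feature)")
```

Under these assumptions, every listed site lies inside a WD repeat: position 143 in WD 1, positions 198, 204, and 208 in WD 2, and position 326 in WD 5.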
Keyword
 Complete proteome; Direct protein sequencing; mRNA processing; Nucleus; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 431 AA 
Protein Sequence
MYRTKVGLKD RQQLYKLIIS QLLYDGYISI ANGLINEIKP QSVCAPSEQL LHLIKLGMEN 60
DDTAVQYAIG RSDTVAPGTG IDLEFDADVQ TMSPEASEYE TCYVTSHKGP CRVATYSRDG 120
QLIATGSADA SIKILDTERM LAKSAMPIEV MMNETAQQNM ENHPVIRTLY DHVDEVTCLA 180
FHPTEQILAS GSRDYTLKLF DYSKPSAKRA FKYIQEAEML RSISFHPSGD FILVGTQHPT 240
LRLYDINTFQ CFVSCNPQDQ HTDAICSVNY NSSANMYVTG SKDGCIKLWD GVSNRCITTF 300
EKAHDGAEVC SAIFSKNSKY ILSSGKDSVA KLWEISTGRT LVRYTGAGLS GRQVHRTQAV 360
FNHTEDYVLL PDERTISLCC WDSRTAERRN LLSLGHNNIV RCIVHSPTNP GFMTCSDDFR 420
ARFWYRRSTT D 431 
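Each peptide in the Lysine Modification table appears to be a 15-residue window centered on the modified lysine (7 residues on either side). The sketch below checks that reading against the sequence above. The window width is inferred from the peptide lengths rather than stated in the record, and the helper names (`clean_sequence`, `window`, `check_sites`) are hypothetical.

```python
# Sequence block copied verbatim from the record; spaces and the running
# residue counters at the end of each line are layout only.
RAW_SEQUENCE = """
MYRTKVGLKD RQQLYKLIIS QLLYDGYISI ANGLINEIKP QSVCAPSEQL LHLIKLGMEN 60
DDTAVQYAIG RSDTVAPGTG IDLEFDADVQ TMSPEASEYE TCYVTSHKGP CRVATYSRDG 120
QLIATGSADA SIKILDTERM LAKSAMPIEV MMNETAQQNM ENHPVIRTLY DHVDEVTCLA 180
FHPTEQILAS GSRDYTLKLF DYSKPSAKRA FKYIQEAEML RSISFHPSGD FILVGTQHPT 240
LRLYDINTFQ CFVSCNPQDQ HTDAICSVNY NSSANMYVTG SKDGCIKLWD GVSNRCITTF 300
EKAHDGAEVC SAIFSKNSKY ILSSGKDSVA KLWEISTGRT LVRYTGAGLS GRQVHRTQAV 360
FNHTEDYVLL PDERTISLCC WDSRTAERRN LLSLGHNNIV RCIVHSPTNP GFMTCSDDFR 420
ARFWYRRSTT D 431
"""

# (position, peptide) pairs copied from the Lysine Modification table.
SITES = [
    (143, "DTERMLAKSAMPIEV"),
    (198, "GSRDYTLKLFDYSKP"),
    (204, "LKLFDYSKPSAKRAF"),
    (208, "DYSKPSAKRAFKYIQ"),
    (326, "KYILSSGKDSVAKLW"),
]


def clean_sequence(raw):
    """Keep only letters, dropping whitespace and residue counters."""
    return "".join(ch for ch in raw if ch.isalpha())


def window(seq, pos, flank=7):
    """Return the residues around a 1-based position, clipped at the termini."""
    start = max(0, pos - 1 - flank)
    end = min(len(seq), pos + flank)
    return seq[start:end]


def check_sites(seq, sites):
    """Map each position to True if it is a lysine and its window matches."""
    return {
        pos: seq[pos - 1] == "K" and window(seq, pos) == peptide
        for pos, peptide in sites
    }


sequence = clean_sequence(RAW_SEQUENCE)
report = check_sites(sequence, SITES)
print(len(sequence), report)
```

The cleaned sequence has 431 residues, matching the stated protein length, and all five listed positions are lysines whose flanking windows match the tabulated peptides.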
Gene Ontology
 GO:0005654; C:nucleoplasm; TAS:Reactome.
 GO:0003723; F:RNA binding; TAS:ProtInc.
 GO:0006379; P:mRNA cleavage; TAS:ProtInc.
 GO:0006378; P:mRNA polyadenylation; TAS:ProtInc.
 GO:0000398; P:mRNA splicing, via spliceosome; TAS:Reactome.
 GO:0006369; P:termination of RNA polymerase II transcription; TAS:Reactome. 
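The Gene Ontology lines above follow a three-field layout separated by semicolons: the GO identifier, an aspect letter with the term name (C for cellular component, F for molecular function, P for biological process), and an evidence code with its source (TAS is "traceable author statement"). A minimal parsing sketch, assuming that layout holds for every line; `parse_go_line` and the dictionary keys are illustrative names only.

```python
# GO lines copied from the record.
GO_LINES = """\
GO:0005654; C:nucleoplasm; TAS:Reactome.
GO:0003723; F:RNA binding; TAS:ProtInc.
GO:0006379; P:mRNA cleavage; TAS:ProtInc.
GO:0006378; P:mRNA polyadenylation; TAS:ProtInc.
GO:0000398; P:mRNA splicing, via spliceosome; TAS:Reactome.
GO:0006369; P:termination of RNA polymerase II transcription; TAS:Reactome.
"""

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Split one 'ID; aspect:term; evidence:source.' line into a dict."""
    fields = [part.strip() for part in line.strip().rstrip(".").split(";")]
    go_id, aspect_term, evidence_source = fields
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect],
        "term": term,
        "evidence": evidence,
        "source": source,
    }


records = [parse_go_line(line) for line in GO_LINES.splitlines() if line.strip()]

for rec in records:
    print(rec["id"], rec["aspect"], rec["term"], rec["evidence"], rec["source"], sep=" | ")
```

Splitting on semicolons rather than commas keeps term names such as "mRNA splicing, via spliceosome" intact.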
Interpro
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS