CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-006286
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Cleavage stimulation factor subunit 2 
Protein Synonyms/Alias
 CF-1 64 kDa subunit; Cleavage stimulation factor 64 kDa subunit; CSTF 64 kDa subunit; CstF-64 
Gene Name
 CSTF2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
189VDPEIALKILHRQTNubiquitination[1, 2, 3]
572ILKEQIQKSTGAP**ubiquitination[2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [3] Proteome-wide identification of ubiquitylation sites by conjugation of engineered lysine-less ubiquitin.
 Oshikawa K, Matsumoto M, Oyamada K, Nakayama KI.
 J Proteome Res. 2012 Feb 3;11(2):796-807. [PMID: 22053931
Functional Description
 One of the multiple factors required for polyadenylation and 3'-end cleavage of mammalian pre-mRNAs. This subunit is directly involved in the binding to pre-mRNAs (By similarity). 
Sequence Annotation
 DOMAIN 16 94 RRM.
 REPEAT 410 414 1; approximate.
 REPEAT 415 419 2.
 REPEAT 420 424 3.
 REPEAT 425 429 4; approximate.
 REPEAT 430 434 5; approximate.
 REPEAT 435 439 6.
 REPEAT 440 444 7.
 REPEAT 445 449 8.
 REPEAT 450 454 9.
 REPEAT 455 459 10; approximate.
 REPEAT 460 464 11.
 REPEAT 465 469 12; approximate.
 REGION 108 248 Interactions with CSTF3 and SYMPK.
 REGION 410 469 12 X 5 AA tandem repeats of M-E-A-R-[AG].
 REGION 514 577 Interaction with RPO2TC1.
 MOD_RES 518 518 Phosphoserine.
 MOD_RES 524 524 Phosphoserine.  
Keyword
 3D-structure; Alternative splicing; Complete proteome; mRNA processing; Nucleus; Phosphoprotein; Reference proteome; Repeat; RNA-binding. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 577 AA 
Protein Sequence
MAGLTVRDPA VDRSLRSVFV GNIPYEATEE QLKDIFSEVG PVVSFRLVYD RETGKPKGYG 60
FCEYQDQETA LSAMRNLNGR EFSGRALRVD NAASEKNKEE LKSLGTGAPV IESPYGETIS 120
PEDAPESISK AVASLPPEQM FELMKQMKLC VQNSPQEARN MLLQNPQLAY ALLQAQVVMR 180
IVDPEIALKI LHRQTNIPTL IAGNPQPVHG AGPGSGSNVS MNQQNPQAPQ AQSLGGMHVN 240
GAPPLMQASM QGGVPAPGQM PAAVTGPGPG SLAPGGGMQA QVGMPGSGPV SMERGQVPMQ 300
DPRAAMQRGS LPANVPTPRG LLGDAPNDPR GGTLLSVTGE VEPRGYLGPP HQGPPMHHVP 360
GHESRGPPPH ELRGGPLPEP RPLMAEPRGP MLDQRGPPLD GRGGRDPRGI DARGMEARAM 420
EARGLDARGL EARAMEARAM EARAMEARAM EARAMEVRGM EARGMDTRGP VPGPRGPIPS 480
GMQGPSPINM GAVVPQGSRQ VPVMQGTGMQ GASIQGGSQP GGFSPGQNQV TPQDHEKAAL 540
IMQVLQLTAD QIAMLPPEQR QSILILKEQI QKSTGAP 577 
Gene Ontology
 GO:0071920; C:cleavage body; IDA:UniProtKB.
 GO:0005847; C:mRNA cleavage and polyadenylation specificity factor complex; IDA:UniProtKB.
 GO:0000166; F:nucleotide binding; IEA:InterPro.
 GO:0003723; F:RNA binding; TAS:ProtInc.
 GO:0006379; P:mRNA cleavage; TAS:ProtInc.
 GO:0006378; P:mRNA polyadenylation; TAS:ProtInc.
 GO:0000398; P:mRNA splicing, via spliceosome; TAS:Reactome.
 GO:0006369; P:termination of RNA polymerase II transcription; TAS:Reactome. 
Interpro
 IPR025742; CSTF2_hinge.
 IPR026896; CSTF_C.
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR000504; RRM_dom. 
Pfam
 PF14327; CSTF2_hinge
 PF14304; CSTF_C
 PF00076; RRM_1 
SMART
 SM00360; RRM 
PROSITE
 PS50102; RRM 
PRINTS