CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-014593
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Splicing factor 1 
Protein Synonyms/Alias
 CW17; Mammalian branch point-binding protein; BBP; mBBP; Transcription factor ZFM1; mZFM; Zinc finger gene in MEN1 locus; Zinc finger protein 162 
Gene Name
 Sf1 
Gene Synonyms/Alias
 Zfm1; Zfp162 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
227       EQIRNILKQGIETPE  ubiquitination  [1]
306       DPQSAQDKARMDKEY  ubiquitination  [1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
 Necessary for the ATP-dependent first step of spliceosome assembly. Binds to the intron branch point sequence (BPS) 5'-UACUAAC-3' of the pre-mRNA. May act as a transcription repressor (By similarity). 
Sequence Annotation
 DOMAIN 141 222 KH.
 ZN_FING 277 296 CCHC-type.
 MOTIF 15 19 Nuclear localization signal (Potential).
 MOD_RES 2 2 N-acetylalanine (By similarity).
 MOD_RES 20 20 Phosphoserine; by PKG (By similarity).
 MOD_RES 80 80 Phosphoserine.
 MOD_RES 82 82 Phosphoserine.  
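The annotation lines above follow a "key, start, end, description" layout. A minimal sketch of reading them into records, assuming whitespace-separated fields; the names `Feature`, `parse_features`, and `features_covering` are hypothetical helpers for illustration and are not part of CPLM or UniProt tooling:

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    key: str          # feature key, e.g. DOMAIN or MOD_RES
    start: int        # first residue (1-based, inclusive)
    end: int          # last residue (1-based, inclusive)
    description: str  # free text with the trailing period removed


# Annotation lines copied from this record.
ANNOTATION = """\
DOMAIN 141 222 KH.
ZN_FING 277 296 CCHC-type.
MOTIF 15 19 Nuclear localization signal (Potential).
MOD_RES 2 2 N-acetylalanine (By similarity).
MOD_RES 20 20 Phosphoserine; by PKG (By similarity).
MOD_RES 80 80 Phosphoserine.
MOD_RES 82 82 Phosphoserine.
"""


def parse_features(text: str) -> List[Feature]:
    """Split each non-empty line into key, start, end and description."""
    features = []
    for line in text.splitlines():
        if not line.strip():
            continue
        key, start, end, description = line.split(None, 3)
        features.append(
            Feature(key, int(start), int(end), description.strip().rstrip("."))
        )
    return features


def features_covering(features: List[Feature], position: int) -> List[Feature]:
    """Return every feature whose residue range contains the position."""
    return [f for f in features if f.start <= position <= f.end]


FEATURES = parse_features(ANNOTATION)

if __name__ == "__main__":
    # The two ubiquitinated lysines listed under Lysine Modification.
    for site in (227, 306):
        print(site, features_covering(FEATURES, site))
```

With the ranges as listed, neither lysine 227 nor lysine 306 falls inside an annotated feature: the KH domain ends at residue 222 and the CCHC-type zinc finger at residue 296.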
Keyword
 Acetylation; Alternative splicing; Complete proteome; Metal-binding; mRNA processing; mRNA splicing; Nucleus; Phosphoprotein; Reference proteome; Repressor; RNA-binding; Spliceosome; Transcription; Transcription regulation; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 653 AA 
Protein Sequence
MATGANATPL DFPSKKRKRS RWNQDTMEQK TVIPGMPTVI PPGLTREQER AYIVQLQIED 60
LTRKLRTGDL GIPPNPEDRS PSPEPIYNSE GKRLNTREFR TRKKLEEERH TLITEMVALN 120
PDFKPPADYK PPATRVSDKV MIPQDEYPEI NFVGLLIGPR GNTLKNIEKE CNAKIMIRGK 180
GSVKEGKVGR KDGQMLPGED EPLHALVTAN TMENVKKAVE QIRNILKQGI ETPEDQNDLR 240
KMQLRELARL NGTLREDDNR ILRPWQSSET RSITNTTVCT KCGGAGHIAS DCKFQRPGDP 300
QSAQDKARMD KEYLSLMAEL GEAPVPASVG STSGPATTPL ASAPRPAAPA SNPPPPSLMS 360
TTQSRPPWMN SGPSENRPYH GMHGGGPGGP GGGPHSFPHP LPSLTGGHGG HPMQHNPNGP 420
PPPWMQPPPP PMNQGPHPPG HHGPPPMDQY LGSTPVGSGV YRLHQGKGMM PPPPMGMMPP 480
PPPPPSGQPP PPPSGPLPPW QQQQQQPPPP PPPSSSMASS TPLPWQQNTT TTTTSAGTGS 540
IPPWQQQQAA AAASPGTPQM QGNPTMVPLP PGVQPPLPPG APPPPPCSIE CLLCLLSSPN 600
SLCLSPNRAA RIPPRGSDGP SHESEDFPRP LVTLPGRQPQ QRPWWTGWFG KAA 653 
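The sequence block is laid out in groups of ten residues with a running residue count at the end of each line. A minimal sketch of turning that layout back into a plain string and cutting out the 15-residue window around a modified lysine, as shown in the Lysine Modification table. The helper names `parse_sequence_block` and `lysine_window` are hypothetical, and the 7-residue flank is an assumption inferred from the 15-residue peptides in this record. Only the first six lines (residues 1-360) are embedded here to keep the example short:

```python
# First six lines of the sequence block above (residues 1-360), copied verbatim.
SEQUENCE_BLOCK = """\
MATGANATPL DFPSKKRKRS RWNQDTMEQK TVIPGMPTVI PPGLTREQER AYIVQLQIED 60
LTRKLRTGDL GIPPNPEDRS PSPEPIYNSE GKRLNTREFR TRKKLEEERH TLITEMVALN 120
PDFKPPADYK PPATRVSDKV MIPQDEYPEI NFVGLLIGPR GNTLKNIEKE CNAKIMIRGK 180
GSVKEGKVGR KDGQMLPGED EPLHALVTAN TMENVKKAVE QIRNILKQGI ETPEDQNDLR 240
KMQLRELARL NGTLREDDNR ILRPWQSSET RSITNTTVCT KCGGAGHIAS DCKFQRPGDP 300
QSAQDKARMD KEYLSLMAEL GEAPVPASVG STSGPATTPL ASAPRPAAPA SNPPPPSLMS 360
"""


def parse_sequence_block(block: str) -> str:
    """Join the residue groups and check each line's running counter."""
    residues = []
    total = 0
    for line in block.strip().splitlines():
        *groups, counter = line.split()
        chunk = "".join(groups)
        total += len(chunk)
        if total != int(counter):
            raise ValueError(f"counter {counter} does not match length {total}")
        residues.append(chunk)
    return "".join(residues)


def lysine_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the peptide centred on a 1-based lysine position."""
    if sequence[position - 1] != "K":
        raise ValueError(f"residue {position} is not a lysine")
    start = max(0, position - 1 - flank)
    return sequence[start:position + flank]


SEQUENCE = parse_sequence_block(SEQUENCE_BLOCK)

if __name__ == "__main__":
    for site in (227, 306):
        print(site, lysine_window(SEQUENCE, site))
```

Checking the running counter on every line catches dropped or duplicated residues, which is a common failure when a sequence is copied out of a formatted page.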
Gene Ontology
 GO:0005737; C:cytoplasm; ISS:MGI.
 GO:0005634; C:nucleus; ISS:MGI.
 GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0033327; P:Leydig cell differentiation; IGI:MGI.
 GO:0030238; P:male sex determination; IGI:MGI.
 GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
 GO:0050810; P:regulation of steroid biosynthetic process; IGI:MGI.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
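Each Gene Ontology line above carries three semicolon-separated fields: the GO identifier, a one-letter aspect with the term name, and an evidence code with its source. A minimal sketch of splitting one line into those parts, assuming the layout seen in this record; `GoAnnotation` and `parse_go_line` are hypothetical names used only for illustration:

```python
from typing import NamedTuple

# One-letter aspect prefixes and the GO namespaces they stand for.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}


class GoAnnotation(NamedTuple):
    go_id: str     # e.g. GO:0005634
    aspect: str    # expanded namespace name
    term: str      # term name; may itself contain commas
    evidence: str  # evidence code, e.g. IEA, ISS, IGI
    source: str    # assigning resource, e.g. MGI


def parse_go_line(line: str) -> GoAnnotation:
    """Split 'GO:id; X:term; CODE:source.' into its five parts."""
    go_id, aspect_term, evidence_source = [
        part.strip() for part in line.strip().rstrip(".").split(";")
    ]
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect], term, evidence, source)


if __name__ == "__main__":
    print(parse_go_line(" GO:0005634; C:nucleus; ISS:MGI."))
    print(parse_go_line(
        " GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW."
    ))
```

Splitting on semicolons rather than commas matters here, because term names such as "regulation of transcription, DNA-dependent" contain commas.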
Interpro
 IPR004087; KH_dom.
 IPR004088; KH_dom_type_1.
 IPR001878; Znf_CCHC. 
Pfam
 PF00013; KH_1
 PF00098; zf-CCHC 
SMART
 SM00322; KH
 SM00343; ZnF_C2HC 
PROSITE
 PS50084; KH_TYPE_1
 PS50158; ZF_CCHC 
PRINTS