CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-012599
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Splicing factor 1 
Protein Synonyms/Alias
 Mammalian branch point-binding protein; BBP; mBBP; Transcription factor ZFM1; Zinc finger gene in MEN1 locus; Zinc finger protein 162 
Gene Name
 SF1 
Gene Synonyms/Alias
 ZFM1; ZNF162 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
227EQIRNILKQGIETPEubiquitination[1]
454MGKSVPGKYACGLWGubiquitination[1]
Reference
 [1] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
 Necessary for the ATP-dependent first step of spliceosome assembly. Binds to the intron branch point sequence (BPS) 5'-UACUAAC-3' of the pre-mRNA. May act as transcription repressor. 
Sequence Annotation
 DOMAIN 141 222 KH.
 ZN_FING 277 296 CCHC-type.
 MOTIF 15 19 Nuclear localization signal (Potential).
 MOD_RES 2 2 N-acetylalanine.
 MOD_RES 20 20 Phosphoserine; by PKG.
 MOD_RES 80 80 Phosphoserine.
 MOD_RES 82 82 Phosphoserine.  
Keyword
 3D-structure; Acetylation; Alternative splicing; Complete proteome; Direct protein sequencing; Metal-binding; mRNA processing; mRNA splicing; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repressor; RNA-binding; Spliceosome; Transcription; Transcription regulation; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 639 AA 
Protein Sequence
MATGANATPL DFPSKKRKRS RWNQDTMEQK TVIPGMPTVI PPGLTREQER AYIVQLQIED 60
LTRKLRTGDL GIPPNPEDRS PSPEPIYNSE GKRLNTREFR TRKKLEEERH NLITEMVALN 120
PDFKPPADYK PPATRVSDKV MIPQDEYPEI NFVGLLIGPR GNTLKNIEKE CNAKIMIRGK 180
GSVKEGKVGR KDGQMLPGED EPLHALVTAN TMENVKKAVE QIRNILKQGI ETPEDQNDLR 240
KMQLRELARL NGTLREDDNR ILRPWQSSET RSITNTTVCT KCGGAGHIAS DCKFQRPGDP 300
QSAQDKARMD KEYLSLMAEL GEAPVPASVG STSGPATTPL ASAPRPAAPA NNPPPPSLMS 360
TTQSRPPWMN SGPSESRPYH GMHGGGPGGP GGGPHSFPHP LPSLTGGHGG HPMQHNPNGP 420
PPPWMQPPPP PMNQGPHPPG HHGPPPMGKS VPGKYACGLW GLSPASRKRY DAATTYGHDA 480
AAAAASQWAA PTPSLWSSSP MATTAAAASA TPSAQQQYGF QYPLAMAAKI PPRGGDGPSH 540
ESEDFPRPLV TLPGRQPQQR PWWTGWFGKA A 571 
Gene Ontology
 GO:0005840; C:ribosome; NAS:UniProtKB.
 GO:0005681; C:spliceosomal complex; IDA:HGNC.
 GO:0003723; F:RNA binding; TAS:UniProtKB.
 GO:0003714; F:transcription corepressor activity; TAS:ProtInc.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0022402; P:cell cycle process; IEA:Compara.
 GO:0033327; P:Leydig cell differentiation; IEA:Compara.
 GO:0030238; P:male sex determination; IEA:Compara.
 GO:0000389; P:mRNA 3'-splice site recognition; TAS:HGNC.
 GO:0050810; P:regulation of steroid biosynthetic process; IEA:Compara.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR004087; KH_dom.
 IPR004088; KH_dom_type_1.
 IPR001878; Znf_CCHC. 
Pfam
 PF00013; KH_1
 PF00098; zf-CCHC 
SMART
 SM00322; KH
 SM00343; ZnF_C2HC 
PROSITE
 PS50084; KH_TYPE_1
 PS50158; ZF_CCHC 
PRINTS