CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016864
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 SURP and G-patch domain-containing protein 1 
Protein Synonyms/Alias
 Splicing factor 4 
Gene Name
 Sugp1 
Gene Synonyms/Alias
 Sf4 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
120PTPSSLKKPLVLSKRacetylation[1]
520LTQMGRGKHFIGDFLacetylation[2]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337
Functional Description
 Plays a role in pre-mRNA splicing (By similarity). 
Sequence Annotation
 REPEAT 187 229 SURP motif 1.
 REPEAT 262 305 SURP motif 2.
 DOMAIN 560 607 G-patch.
 MOTIF 378 384 Nuclear localization signal (Potential).
 MOD_RES 407 407 Phosphoserine (By similarity).
 MOD_RES 409 409 Phosphoserine (By similarity).
 MOD_RES 412 412 Phosphoserine (By similarity).
 MOD_RES 483 483 Phosphoserine.  
Keyword
 3D-structure; Complete proteome; mRNA processing; mRNA splicing; Nucleus; Phosphoprotein; Reference proteome; Repeat; Spliceosome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 643 AA 
Protein Sequence
MSLKMDNRDV AGKANRWFGM AQPKSGKMNM NILHQEELIA QKKREIEARM EQKARQSHVP 60
SPQPPHPGEI ADAHNSCISN KFANDGSFLQ QFLKLQKAQT STDSAPRAPP SMPTPSSLKK 120
PLVLSKRTGL GLSSPTGPVK NYSHAKQLPV AHRPSVFQSP DDDEEEDYEQ WLEIKVSPPE 180
GAETRRVIEK LARFVAEGGP ELEKVAMEDY KDNPAFTFLH DKNSREFLYY RRKVAEIRKE 240
AQKPQAATQK VSPPEDEEAK NLAEKLARFI ADGGPEVETI ALQNNRENQA FSFLYDPNSQ 300
GYRYYRQKLD EFRKAKAGST GSFPAPAPNP SLRRKSAPEA LSGAVPPITA CPTPVAPAPA 360
VNPTPSIPGK PTATAAVKRK RKSRWGPEED KVELPPAELA QRDIDASPSP LSVQDLKGLG 420
YEKGKPVGLV GVTELSDAQK KQLKEQQEMQ QMYDMIMQHK RAMQDMQLLW EKALQQHQHG 480
YDSDEEVDSE LGTWEHQLRR MEMDKTREWA EQLTQMGRGK HFIGDFLPPD ELEKFMETFK 540
ALKEGREPDY SEYKEFKLTV ENIGYQMLMK MGWKEGEGLG TEGQGIKNPV NKGATTIDGA 600
GFGIDRPAEL SKEDDEYEAF RKRMMLAYRF RPNPLNNPRR PYY 643 
Gene Ontology
 GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
 GO:0003723; F:RNA binding; IEA:InterPro.
 GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
 GO:0008380; P:RNA splicing; IEA:UniProtKB-KW. 
Interpro
 IPR000467; G_patch_dom.
 IPR000061; Surp. 
Pfam
 PF01585; G-patch
 PF01805; Surp 
SMART
 SM00443; G_patch
 SM00648; SWAP 
PROSITE
 PS50174; G_PATCH
 PS50128; SURP 
PRINTS