CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-044054
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein SON 
Protein Synonyms/Alias
  
Gene Name
 Son 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
16FRSFVVSKFREIQQEacetylation[1, 2]
16FRSFVVSKFREIQQEubiquitination[3]
273EYQKSVLKSLETMPPacetylation[1]
2388AMKDLSGKHPVSALMacetylation[1]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
 [3] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2477 AA 
Protein Sequence
MAADIEQVFR SFVVSKFREI QQELSSGRSE GQLNGETNPP IEGNQAGDTA ASARSLPNEE 60
IVQKIEEVLS GVLDTELRYK PDLKEASRKS RCVSVQTDPT DEVPTKKSKK HKKHKNKKKK 120
KKKEKEKKYK RQPEESESKL KSHHDGNLES DSFLKFDSEP SAAALEHPVR AFGLSEASET 180
ALVLEPPVVS MEVQESHVLE TLKPATKAAE LSVVSTSVIS EQSEQPMPGM LEPSMTKILD 240
SFTAAPVPMS TAALKSPEPV VTMSVEYQKS VLKSLETMPP ETSKTTLVEL PIAKVVEPSE 300
TLTIVSETPT EVHPEPSPST MDFPESSTTD VQRLPEQPVE APSEIADSSM TRPQESLELP 360
KTTAVELQES TVASALELPG PPATSILELQ GPPVTPVPEL PGPSATPVPE LSGPLSTPVP 420
ELPGPPATVV PELPGPSVTP VPQLSQELPG PPAPSMGLEP PQEVPEPPVM AQELSGVPAV 480
SAAIELTGQP AVTVAMELTE QPVTTTEFEQ PVAMTTVEHP GHPEVTTATG LLGQPEAAMV 540
LELPGQPVAT TALELSGQPS VTGVPELSGL PSATRALELS GQSVATGALE LPGQLMATGA 600
LEFSGQSGAA GALELLGQPL ATGVLELPGQ PGAPELPGQP VATVALEISV QSVVTTSELS 660
TMTVSQSLEV PSTTALESYN TVAQELPTTL VGETSVTVGV DPLMAQESHM LASNTMETHM 720
LASNTMDSQM LASNTMDSQM LASNTMDSQM LASSTMDSQM LASSTMDSQM LATSTMDSQM 780
LATSSMDSQM LATSSMDSQM LATSSMDSQM LATSSMDSQM LATSSMDSQM LATSSMDSQM 840
LATSSMDSQM LATSSMDSQM LASGAMDSQM LASGTMDAQM LASGTMDAQM LASSTQDSAM 900
MGSKSPDPYR LAQDPYRLAQ DPYRLGHDPY RLGHDAYRLG QDPYRLGHDP YRLTPDPYRV 960
SPRPYRIAPR SYRIAPRPYR LAPRPLMLAS RRSMMMSYAA ERSMMSSYER SMMSYERSMM 1020
SPMAERSMMS AYERSMMSAY ERSMMSPMAE RSMMSAYERS MMSAYERSMM SPMADRSMMS 1080
MGADRSMMSS YSAADRSMMS SYSAADRSMM SSYTDRSMMS MAADSYTDSY TDSYTEAYMV 1140
PPLPPEEPPT MPPLPPEEPP MTPPLPPEEP PEGPALSTEQ SALTADNTWS TEVTLSTGES 1200
LSQPEPPVSQ SEISEPMAVP ANYSMSESET SMLASEAVMT VPEPAREPES SVTSAPVESA 1260
VVAEHEMVPE RPMTYMVSET TMSVEPAVLT SEASVISETS ETYDSMRPSG HAISEVTMSL 1320
LEPAVTISQP AENSLELPSM TVPAPSTMTT TESPVVAVTE IPPVAVPEPP IMAVPELPTM 1380
AVVKTPAVAV PEPLVAAPEP PTMATPELCS LSVSEPPVAV SELPALADPE HAITAVSGVS 1440
SLEPSVPILE PAVSVLQPVM IVSEPSVPVQ EPTVAVSEPA VIVSEHTQIT SPEMAVESSP 1500
VIVDSSVMSS QIMKGMNLLG GDENLGPEVG MQETLLHPGE EPRDGGHLKS DLYENEYDRN 1560
ADLTVNSHLI VKDAEHNTVC ATTVGPVGEA SEEKILPISE TKEITELATC AAVSEADIGR 1620
SLSSQLALEL DTVGTSKGFE FVTASALISE SKYDVEVSVT TQDTEHDMVI STSPSGGSEA 1680
DIEGPLPAKD IHLDLPSTNF VCKDVEDSLP IKESAQAVAV ALSPKESSED TEVPLPNKEI 1740
VPESGYSASI DEINEADLVR PLLPKDMERL TSLRAGIEGP LLASEVERDK SAASPVVISI 1800
PERASESSSE EKDDYEIFVK VKDTHEKSKK NKNRDKGEKE KKRDSSLRSR SKRSKSSEHK 1860
SRKRTSESRS RARKRSSKSK SHRSQTRSRS RSRRRRRSSR SRSKSRGRRS VSKEKRKRSP 1920
KHRSKSRERK RKRSSSRDNR KAARARSRTP SRRSRSHTPS RRRRSRSVGR RRSFSISPSR 1980
RSRTPSRRSR TPSRRSRTPS RRSRTPSRRS RTPSRRRRSR SAVRRRSFSI SPVRLRRSRT 2040
PLRRRFSRSP IRRKRSRSSE RGRSPKRLTD LDKAQLLEIA KANAAAMCAK AGVPLPPNLK 2100
PAPPPTIEEK VAKKSGGATI EELTEKCKQI AQSKEDDDVI VNKPHVSDEE EEEPPFYHHP 2160
FKLSEPKPIF FNLNIAAAKP TPPKSQVTLT KEFPVSSGSQ HRKKEADSVY GEWVPVEKNG 2220
EESKDDDNVF SSSLPSEPVD ISTAMSERAL AQKRLSENAF DLEAMSMLNR AQERIDAWAQ 2280
LNSIPGQFTG STGVQVLTQE QLANTGAQAW IKKDQFLRAA PVTGGMGAVL MRKMGWREGE 2340
GLGKNKEGNK EPILVDFKTD RKGLVAVGER AQKRSGNFSA AMKDLSGKHP VSALMEICNK 2400
RRWQPPEFLL VHDSGPDHRK HFLFRVLING SAYQPSFASP NKKHAKATAA TVVLQAMGLV 2460
PKDLMANATC FRSASRR 2477 
Gene Ontology
 GO:0003725; F:double-stranded RNA binding; IEA:InterPro. 
Interpro
 IPR001159; Ds-RNA-bd.
 IPR014720; dsRNA-bd-like_dom.
 IPR000467; G_patch_dom. 
Pfam
 PF00035; dsrm
 PF01585; G-patch 
SMART
 SM00358; DSRM
 SM00443; G_patch 
PROSITE
 PS50137; DS_RBD
 PS50174; G_PATCH 
PRINTS