CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016861
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription factor SOX-30 
Protein Synonyms/Alias
  
Gene Name
 Sox30 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
201AKLDGTGKALDGRRSacetylation[1]
326PLTPVPIKMQSLLEPacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
 Transcriptional activator. Binds to the DNA sequence 5'- ACAAT-3' and shows a preference for guanine residues surrounding this core motif (By similarity). 
Sequence Annotation
 DNA_BIND 366 434 HMG box.  
Keyword
 Activator; Complete proteome; DNA-binding; Nucleus; Reference proteome; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 782 AA 
Protein Sequence
MERARPEPPP PPPPPPRQPP RPTPPRPLRP APPAQPVEAA TFRAAAAERS QSPSAQATAA 60
MAAVASSCGE AAAAGAQAAG TRRLLQVKPE QVLLLPPGGP GVPPAPDEGA AAAAAAAAAA 120
ASSAQARLLQ LRPELLLLPP QSAADGGPCR PELHPMQPRT LLVKAEKQEL GAGLDLSVGS 180
RRTTEAGPRA SRAAKLDGTG KALDGRRSDE KKAKLEAEEA PRDALKGGEG KSLLAIGEGV 240
IKTEEPDRPR DDCRLGTEAT SNGLVHSSKE AILAQPPSAF GPHQQDLRFP LTLHTVPPGA 300
RIQFQGPPPS ELIRLSKVPL TPVPIKMQSL LEPSVKIETK DVPLTVLPSD AGIPDTPFSK 360
DRNGHVKRPM NAFMVWARIH RPALAKANPA ANNAEISVQL GLEWNKLSEE QKKPYYDEAQ 420
KIKEKHREEF PGWVYQPRPG KRKRFPLSVS NVFSGTTQNI ISTNPTTIYP YRSPTYSVVI 480
PGLQNTITHP VGEAPPAIQL PTPAVQRPSP ITLFQPSVSS TGPVAVPPPS LTPRPSLPPQ 540
RFSGPSQTDI HRLPSGSSRS VKRSTPGSLE STTRIPAGAS TAHARFATSP IQPPKEYASV 600
STCPRSTPIP PATPIPHSHV YQPPPLGHPA TLFGTPPRFS FHHPYFLPGP HYFPSSTCPY 660
SRPPFGYGNF PSSMPECLGY YEDRYQKHEA IFSALNRDYP FRDYPDEHTH SEDSRSCESM 720
DGPPYYSSHG HGGEEYLNAM PTLDIGALEN VFTAPASAPS GVQQVNVTDS DEEEEEKVLR 780
NL 782 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0043565; F:sequence-specific DNA binding; IEA:Compara.
 GO:0000981; F:sequence-specific DNA binding RNA polymerase II transcription factor activity; IEA:Compara.
 GO:0031960; P:response to corticosteroid stimulus; IEA:Compara.
 GO:0007283; P:spermatogenesis; IEP:UniProtKB. 
Interpro
 IPR009071; HMG_box_dom. 
Pfam
 PF00505; HMG_box 
SMART
 SM00398; HMG 
PROSITE
 PS50118; HMG_BOX_2 
PRINTS