CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-035397
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein LOC100911180 
Protein Synonyms/Alias
  
Gene Name
 Setdb1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position    Peptide            Type           References
681         PYVLVDRKFQPFKPF    acetylation    [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Chromosome; Complete proteome; Methyltransferase; Nucleus; Reference proteome; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1302 AA 
Protein Sequence
MSSLPGCMSL AAAPAAADSA EIAELQQAVV EELGISMEEL RQFIDEELEK MDCIQQRKKQ 60
LAELETWVLQ KESEVAYVDR LFDDASREVT NCESLVKDFY SKLGLQYHDS SSDDEASQPT 120
EIIEIPDEDD DVLSIDSGDA GSRTPKDQKL REAMAALRKS AQDVQKFMDA VNKKSSSQDL 180
HKGTLGQVSG ELSKDGDLIV SMRILGKKRT KTWHKGTLIA IQTVGLGKKY KVKFDNKGKS 240
LLSGNHIAYD YHPPADKLFV GSRVVAKYKD GNQVWLYAGI VAETPNVKNK LRFLIFFDDG 300
YASYVTQSEL YPICRPLKKT WEDIEDSSCR DFIEEYITAY PNRPMVLLKS GQLIKTEWEG 360
TWWKSRVEEV DGSLVRILFL DDKRCEWIYR GSTRLEPMFS MKTSSASAME KKQGGQLRTR 420
PNMGAVRSKG PVVQYTQDLT STGIQFKPME PLQPIVPPAP LPMLPLSPQA GDSESLESQL 480
AQSRKQVAKK STSFRPGSVS SGHSSPTSPT LSENVPPGKI GMNQTYRSPS ASVTSTPAPA 540
APSGPPAPPG PLAPPGPPAP PAFHGMLERA PAEPSYRAPM EKLFYLPHVC SYTCLSRIRP 600
MRNEQYRGKN PLLVPLLYDF RRMTARRRVN RKMGFHVIYK TPCGLCLRTM QEIERYLFET 660
GCDFLYLEMF CLDPYVLVDR KFQPFKPFYY ILDITYGKED VPLSCVNEID TTPPPQVAYS 720
KERIPGKGVF INTGPEFLVG CDCKDGCRDK SKCACHQLTV QATACTPGGQ INPSSGYQHK 780
RLEECLPTGV YECNKRCKCD PNMCTNRLVQ HGLQVRLQLF KTQNKGWGIR CLDDIAKGSF 840
VCIYAGKILT DDFADKEGLE MGDEYFANLD HIESVENFKE GYESDVPSSS DSSGVDMKDQ 900
EDGNSGSEDP EESNDDSSDD NFCKDEDFST SSVWRSYATR RQTRGQKESE LSEVTSKDSR 960
APDRGPPHVP ITPSGSVGGC NPPSSEETPK NKVASWLSCN SVSEGGFADS DSRSSFKTSE 1020
GGDGRAGGGR GEAERASTSG LSFKDEGDSK QSKKEDPEDR NKMSVVTEGS QNHGHNPPMK 1080
SEGLRRPASK ISMLQSQRVV TSTQSNPDDI LTLSSSTESE GESGTSRKPT TGQTSATAVD 1140
SDDIQTISSG SDGDDFEDKK NLSGPTKRQV AVKSTRGFAL KSTHGIAIKS TNMASVDKGE 1200
SAPVRKNTRQ FYDGEESCYI IDAKLEGNLG RYLNHSCSPN LFVQNVFVDT HDLRFPWVAF 1260
FASKRIRAGT ELTWDYNYEV GSVEGKELLC CCGAIECRGR LL 1302 
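The modification site listed under Lysine Modification can be cross-checked against the numbered sequence block. The following is a minimal sketch, not part of the CPLM record: the helper names `parse_row` and `window` are hypothetical, and it assumes each sequence row consists of space-separated residue blocks followed by the position of the row's last residue. It uses only the row covering residues 661-720, copied from this entry.

```python
# Row of the sequence block covering residues 661-720, copied from this entry.
ROW = "GCDFLYLEMF CLDPYVLVDR KFQPFKPFYY ILDITYGKED VPLSCVNEID TTPPPQVAYS 720"


def parse_row(row):
    """Return (first_position, residues) for one numbered sequence row.

    Assumes the trailing number is the 1-based position of the last residue.
    """
    *blocks, end = row.split()
    residues = "".join(blocks)
    return int(end) - len(residues) + 1, residues


def window(row, position, flank=7):
    """Return the residue at a 1-based position and its flanking peptide."""
    start, residues = parse_row(row)
    i = position - start  # 0-based index within this row
    if not 0 <= i < len(residues):
        raise ValueError("position not in this row")
    return residues[i], residues[max(0, i - flank): i + flank + 1]


residue, peptide = window(ROW, 681)
print(residue, peptide)
```

With a flank of 7 residues on each side, the window around position 681 can be compared directly with the 15-residue peptide given in the Lysine Modification table.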
Gene Ontology
 GO:0005694; C:chromosome; IEA:UniProtKB-KW.
 GO:0005794; C:Golgi apparatus; IEA:Compara.
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0005886; C:plasma membrane; IEA:Compara.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0001833; P:inner cell mass cell proliferation; IEA:Compara. 
Interpro
 IPR016177; DNA-bd_integrase-typ.
 IPR025796; Hist-Lys_N-MeTrfase_SETDB1.
 IPR001739; Methyl_CpG_DNA-bd.
 IPR003616; Post-SET_dom.
 IPR007728; Pre-SET_dom.
 IPR003606; Pre-SET_Zn-bd_sub.
 IPR001214; SET_dom.
 IPR002999; Tudor. 
Pfam
 PF01429; MBD
 PF05033; Pre-SET
 PF00856; SET 
SMART
 SM00391; MBD
 SM00508; PostSET
 SM00468; PreSET
 SM00317; SET
 SM00333; TUDOR 
PROSITE
 PS50982; MBD
 PS50868; POST_SET
 PS50867; PRE_SET
 PS51573; SAM_MT43_SUVAR39_1
 PS50280; SET 
PRINTS