CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-039090
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Wdr33 
Protein Synonyms/Alias
  
Gene Name
 Wdr33 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
46        QQLTFDGKRMRKAVN  acetylation  [1]
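The listed peptide appears to be a 15-residue window centred on the modified lysine (7 residues on each side of position 46). The sketch below is one way to check that against the record's own sequence. It is not part of CPLM: the helper names `clean_sequence` and `site_window` are hypothetical, and the 7-residue flank is an assumption inferred from this single entry.

```python
# Residues 1-60, copied from the first line of this record's sequence block,
# including its block spacing and trailing running count.
raw_line = "MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK AVNRKTIDYN 60"


def clean_sequence(text):
    """Keep only letters, dropping spaces and the running residue count."""
    return "".join(ch for ch in text if ch.isalpha())


def site_window(sequence, position, flank=7):
    """Return residues position-flank .. position+flank (1-based, inclusive).

    The window is clipped at the sequence ends. flank=7 is an assumption
    inferred from the 15-residue peptide shown in this entry.
    """
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


seq = clean_sequence(raw_line)
residue = seq[46 - 1]          # 1-based position 46 -> 0-based index 45
window = site_window(seq, 46)

print(residue, window)
```

For this entry the residue at position 46 is a lysine (K), and the window reproduces the peptide given in the table.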
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1331 AA 
Protein Sequence
MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK AVNRKTIDYN 60
PSVIKYLENR IWQRDQRDMR AIQPDAGYYN DLVPPIGMLN NPMNAVTTKF VRTSTNKVKC 120
PVFVVRWTPE GRRLVTGASS GEFTLWNGLT FNFETILQAH DSPVRAMTWS HNDMWMLTAD 180
HGGYVKYWQS NMNNVKMFQA HKEAIREASF SPTDNKFATC SDDGTVRIWD FLRCHEERIL 240
RGHGADVKCV DWHPTKGLVV SGSKDSQQPI KFWDPKTGQS LATLHAHKNT VMEVKLNLNG 300
NWLLTASRDH LCKLFDIRNL KEELQVFRGH KKEATAVAWH PVHEGLFASG GSDGSLLFWH 360
VGVEKEVGGM EMAHEGMIWS LAWHPLGHIL CSGSNDHTSK FWTRNRPGDK MRDRYNLNLL 420
PGMSEDGVEY DDLEPNSLAV IPGMGIPEQL KLAMEQEQMG KDESNDIEMT IPGLDWGMEE 480
VMQKDQKKVP QKKVPYAKPI PAQFQQAWMQ NKVPIPAPNE VLNDRKEDIK LEEKKKTQAE 540
IEQEMATLQY TNPQLLEQLK IERLAQKQAD QIQPPPSSGT PLLGPQPFSG QGPMSQIPQG 600
FQQPHPSQQM PLVAQMGPPG PQGQFRAPGP QGQMGPQGPP LHQGGGGPQG FMGPQGPQGP 660
PQGLPRPQDM HGPQGMQRHP GPHGPLGPQG PPGPQGSSGP QGHMGPQGPP GPQGHIGPQG 720
PPAPQGHMGP QGPPGTQGMQ GPPGPRGMQG PPHPHGIQGG PTSQGIQGPL MGLNPRGMQG 780
PPGPRENQGP APQGIMIGHP PQEMRGPHPP SGLLGHGPQE MRGPQEMRGM QGPPPQGSML 840
GPPQELRGPS GSQGQQGPPQ GSLGPPPQGG MQGPPGPQGQ QNPARGPHPS QGPIPFQQQK 900
APLLGDGPRA PFNQEGQSTG PPPLIPGLGQ QGAQGRIPPL NPGQGPGPNK GDSRGPPNHH 960
LGPMSERRHE QSGGPEHGPD RGPFRGGQDC RGPPDRRGSH PDFPDDFSRP DDFHPDKRFG 1020
HRLREFEGRG GPLPQEEKWR RGGPGPPFPP DHREFNEGDG RGAARGPPGA WEGRRPGDER 1080
FPRDPDDPRF RGRREESFRR GAPPRHEGRA PPRGRDNFPG PDDFGPEEVF DASDEAARGR 1140
DLRGRGRGTP RGGSRKCLLP TPDEFPRFEG GRKPDSWDGN REPGPGHEHF RDAPRPDHPP 1200
HDGHSPASRE RSSSLQGMDM ASLPPRKRPW HDGSGTSEHR EMEAQGGPSE DRGSKGRGGP 1260
GPSQRVPKSG RSSSLDGDHH DGYHRDEPFG GPPGSSSSSR GGRSGSNWGR GSNMNSGPPR 1320
RGTSRGSGRG R 1331 
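The sequence above is laid out in blocks of 10 residues, with each line ending in the cumulative residue count. As a minimal sketch, assuming that layout holds for every line, the block can be turned into a plain string and the counts checked at the same time. The function name `parse_sequence_block` is hypothetical, and only the first two lines of this record are used to keep the example short.

```python
def parse_sequence_block(lines):
    """Join whitespace-separated residue blocks into one string.

    Each line is assumed to end with the cumulative residue count; a
    ValueError is raised if the running length disagrees with it.
    """
    sequence = ""
    for line in lines:
        parts = line.split()
        blocks, count = parts[:-1], int(parts[-1])
        sequence += "".join(blocks)
        if len(sequence) != count:
            raise ValueError(
                f"expected {count} residues after this line, got {len(sequence)}"
            )
    return sequence


# First two lines of this record's sequence block, copied verbatim.
first_lines = [
    "MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK AVNRKTIDYN 60",
    "PSVIKYLENR IWQRDQRDMR AIQPDAGYYN DLVPPIGMLN NPMNAVTTKF VRTSTNKVKC 120",
]

parsed = parse_sequence_block(first_lines)
print(len(parsed))
```

Passing all lines of the block through the same function would be expected to finish at the stated protein length of 1331 residues, provided the final short line follows the same convention.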
Gene Ontology
 GO:0005634; C:nucleus; IEA:Compara. 
Interpro
 IPR008160; Collagen.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF01391; Collagen
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS