CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016848
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 WD repeat-containing protein 47 
Protein Synonyms/Alias
 Neuronal enriched MAP interacting protein; Nemitin 
Gene Name
 Wdr47 
Gene Synonyms/Alias
 Kiaa0893 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
25ILDFLNSKKLHISMLubiquitination[1]
188KLSEAGFKASNNRLFubiquitination[1]
290LLTPLISKLSPYPSSubiquitination[1]
878DLQGDLTKQLPLMVVubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
 DOMAIN 10 42 LisH.
 DOMAIN 45 102 CTLH.
 REPEAT 605 644 WD 1.
 REPEAT 660 699 WD 2.
 REPEAT 707 749 WD 3.
 REPEAT 754 792 WD 4.
 REPEAT 799 838 WD 5.
 REPEAT 841 880 WD 6.
 REPEAT 887 919 WD 7.
 MOD_RES 285 285 Phosphothreonine (By similarity).
 MOD_RES 292 292 Phosphoserine (By similarity).
 MOD_RES 297 297 Phosphoserine (By similarity).
 MOD_RES 312 312 Phosphoserine.  
Keyword
 Complete proteome; Cytoplasm; Cytoskeleton; Developmental protein; Microtubule; Phosphoprotein; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 920 AA 
Protein Sequence
MTAEETVNVK EVEIIKLILD FLNSKKLHIS MLALEKESGV INGLFSDDML FLRQLILDGQ 60
WDEVLQFIQP LECMEKFDKK RFRYIILKQK FLEALCVNNA MSAEDEPQHL EFTMQEAVQC 120
LHALEEYCPS KDDYSKLCLL LTLPRLTNHA EFKDWNPSTA RVHCFEEVCV MVAEFIPADR 180
KLSEAGFKAS NNRLFQLVMK GLLYECCVEF CQSKATGEEI TESEVLLGID LLCGNGCDDL 240
DLSLLSWLQN LPSSVFSCAF EQKMLNIHVD KLLKPTKAAY ADLLTPLISK LSPYPSSPMR 300
RPQSADAYMT RSLNPALDGL TCGLTSHDKR ISDLGNKTSP MSHSFANFHY PGVQNLSRSL 360
MLENTECHSI YEESPERSDT PVEAQQPVSS EAMCQGSGLE KEPANGAQNP VPAKQEKNEL 420
RDSTEQFQEY YRQRLRYQQH LEQKEQQRQM YQQMLLEGGV NQEDGPDQQQ NLTEQFLNRS 480
IQKLGELNIG MDSLGNEVPV LNQQCSGSKN NGSNNSSVTS FSTPPQDSSQ RLIHDTANIH 540
TSTPRNPGST NHIPFHEDSP CGSQNSSEHS VIKPSPGDSS GNLSRSKGEE DDKSKKQFVC 600
INTLEDTQAV RAVAFHPSGS LYAVGSNSKT LRVCAYPEKM DASAHDNPKQ PVVRFKRNKH 660
HKGSIYCVAW SPCGQLLATG SNDKYVKVLP FNAETCNATG PDLEFSMHDG TIRDLAFMEG 720
PESGGAILIS AGAGDCNIYT TDCQRGQGLH ALSGHTGHIL ALYTWSGWMI ASGSQDKTVR 780
FWDLRVPSCV RVVGTTFHGT GSAVASVAVD PSGRLLATGQ EDSSCMLYDI RGGRMVQSYH 840
PHSSDVRSVR FSPGAHYLLT GSYDMKIKVT DLQGDLTKQL PLMVVGEHKD KVIQCRWHTQ 900
DLSFLSSSAD RTVTLWTYSG 920 
Gene Ontology
 GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
 GO:0005874; C:microtubule; IEA:UniProtKB-KW.
 GO:0007275; P:multicellular organismal development; IEA:UniProtKB-KW. 
Interpro
 IPR006595; CTLH_C.
 IPR006594; LisH_dimerisation.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00400; WD40 
SMART
 SM00668; CTLH
 SM00667; LisH
 SM00320; WD40 
PROSITE
 PS50897; CTLH
 PS50896; LISH
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS