CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041434
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 DDB1- and CUL4-associated factor 5 
Protein Synonyms/Alias
 WD repeat domain 22, isoform CRA_c 
Gene Name
 DCAF5 
Gene Synonyms/Alias
 WDR22; hCG_22244 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
363IKIWSPYKQPGCTGDubiquitination[1, 2]
664RKIYKAYKWLRYSYIubiquitination[1, 3, 4]
677YISYSNNKDGETSLVubiquitination[1, 2, 3, 4, 5]
698GRAGTSHKDNPAPSSubiquitination[3]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [3] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [4] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661]
 [5] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 941 AA 
Protein Sequence
MKRRAGLGGS MRSVVGFLSQ RGLHGDPLLT QDFQRRRLRG CRNLYKKDLL GHFGCVNAIE 60
FSNNGGQWLV SGGDDRRVLL WHMEQAIHSR VKPIQLKGEH HSNIFCLAFN SGNTKVFSGG 120
NDEQVILHDV ESETLDVFAH EDAVYGLSVS PVNDNIFASS SDDGRVLIWD IRESPHGEPF 180
CLANYPSAFH SVMFNPVEPR LLATANSKEG VGLWDIRKPQ SSLLRYGGNL SLQSAMSVRF 240
NSNGTQLLAL RRRLPPVLYD IHSRLPVFQF DNQGYFNSCT MKSCCFAGDR DQYILSGSDD 300
FNLYMWRIPA DPEAGGIGRV VNGAFMVLKG HRSIVNQVRF NPHTYMICSS GVEKIIKIWS 360
PYKQPGCTGD LDGRIEDDSR CLYTHEEYIS LVLNSGSGLS HDYANQSVQE DPRMMAFFDS 420
LVRREIEGWS SDSDSDLSES TILQLHAGVS ERSGYTDSES SASLPRSPPP TVDESADNAF 480
HLGPLRVTTT NTVASTPPTP TCEDAASRQQ RLSALRRYQD KRLLALSNES DSEENVCEVE 540
LDTDLFPRPR SPSPEDESSS SSSSSSSEDE EELNERRAST WQRNAMRRRQ KTTREDKPSA 600
PIKPTNTYIG EDNYDYPQIK VDDLSSSPTS SPERSTSTLE IQPSRASPTS DIESVERKIY 660
KAYKWLRYSY ISYSNNKDGE TSLVTGEADE GRAGTSHKDN PAPSSSKEAC LNIAMAQRNQ 720
DLPPEGCSKD TFKEETPRTP SNGPGHEHSS HAWAEVPEGT SQDTGNSGSV EHPFETKKLN 780
GKALSSRAEE PPSPPVPKAS GSTLNSGSGN CPRTQSDDSE ERSLETICAN HNNGRLHPRP 840
PHPHNNGQNL GELEVVAYSS PGHSDTDRDN SSLTGTLLHK DCCGSEMACE TPNAGTREDP 900
TDTPATDSSR AVHGHSGLKR QRIELEDTDS ENSSSEKKLK T 941 
Gene Ontology
  
Interpro
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS