CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-023835
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 WD repeat-containing protein 7 
Protein Synonyms/Alias
 Rabconnectin-3 beta; TGF-beta resistance-associated protein TRAG 
Gene Name
 WDR7 
Gene Synonyms/Alias
 KIAA0541; TRAG 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 302       SFRSDVGKAVENLIP  ubiquitination  [1]
 663       LASEASDKGNLPKYS  ubiquitination  [1]
 877       PASEGVGKGTYGVSR  ubiquitination  [1]
 1112      SSVPQMKKISTSYEE  ubiquitination  [1]
 1484      ILMAHDGKEHRFMV*  ubiquitination  [1]
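The peptide entries above appear to be 15-residue windows centred on the modified lysine (7 residues on each side), with `*` filling positions that run past the end of the protein, as in the 1484 entry. The sketch below reproduces such a window from a sequence and a 1-based position. The function name `peptide_window` and the padding rule are assumptions inferred from the rows of this record, not a documented CPLM procedure.

```python
def peptide_window(sequence: str, position: int, flank: int = 7, pad: str = "*") -> str:
    """Return the residue at `position` (1-based) with `flank` residues on each side.

    Positions that fall outside the sequence are filled with `pad`
    (assumed convention, inferred from the C-terminal entry of this record).
    """
    if not 1 <= position <= len(sequence):
        raise ValueError("position is outside the sequence")
    idx = position - 1  # convert to 0-based index
    left = sequence[max(0, idx - flank):idx]
    right = sequence[idx + 1:idx + 1 + flank]
    return (pad * (flank - len(left)) + left
            + sequence[idx]
            + right + pad * (flank - len(right)))


# Residues 1441-1490 of this record, so record position 1484 is index 44 here.
c_terminal = "LRCIKTYQVPPVQPASPGSHNALKLARLIWTSNRNVILMAHDGKEHRFMV"
window = peptide_window(c_terminal, 1484 - 1440)
```

With the C-terminal fragment above, `window` matches the peptide listed for position 1484, including the trailing `*`.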
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
  
Sequence Annotation
 REPEAT 17 56 WD 1.
 REPEAT 62 104 WD 2.
 REPEAT 156 199 WD 3.
 REPEAT 324 366 WD 4.
 REPEAT 404 443 WD 5.
 REPEAT 462 507 WD 6.
 REPEAT 558 597 WD 7.
 REPEAT 1351 1390 WD 8.
 REPEAT 1392 1432 WD 9.
 MOD_RES 935 935 Phosphoserine (By similarity).
 MOD_RES 1152 1152 Phosphoserine (By similarity).
 MOD_RES 1154 1154 Phosphoserine (By similarity).
 MOD_RES 1456 1456 Phosphoserine (By similarity).  
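Each annotation line above follows the pattern feature key, start, end, description. A minimal sketch of reading one such line into a tuple is shown below; the name `parse_feature` is hypothetical, and the field layout is assumed from the lines in this record only.

```python
from typing import Tuple


def parse_feature(line: str) -> Tuple[str, int, int, str]:
    """Split 'KEY START END DESCRIPTION' into typed fields.

    Assumes whitespace-separated fields with integer coordinates,
    as seen in the annotation lines of this record.
    """
    key, start, end, description = line.strip().split(maxsplit=3)
    return key, int(start), int(end), description


repeat = parse_feature(" REPEAT 17 56 WD 1.")
mod_res = parse_feature(" MOD_RES 935 935 Phosphoserine (By similarity).")
```

Here `repeat` holds the first WD repeat spanning residues 17 to 56, and `mod_res` holds a single-residue feature whose start and end coincide.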
Keyword
 Alternative splicing; Complete proteome; Phosphoprotein; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1490 AA 
Protein Sequence
MAGNSLVLPI VLWGRKAPTH CISAVLLTDD GATIVTGCHD GQICLWDLSV ELQINPRALL 60
FGHTASITCL SKACASSDKQ YIVSASESGE MCLWDVSDGR CIEFTKLACT HTGIQFYQFS 120
VGNQREGRLL CHGHYPEILV VDATSLEVLY SLVSKISPDW ISSMSIIRSH RTQEDTVVAL 180
SVTGILKVWI VTSEISDMQD TEPIFEEESK PIYCQNCQSI SFCAFTQRSL LVVCSKYWRV 240
FDAGDYSLLC SGPSENGQTW TGGDFVSSDK VIIWTENGQS YIYKLPASCL PASDSFRSDV 300
GKAVENLIPP VQHILLDRKD KELLICPPVT RFFYGCREYF HKLLIQGDSS GRLNIWNISD 360
TADKQGSEEG LAMTTSISLQ EAFDKLNPCP AGIIDQLSVI PNSNEPLKVT ASVYIPAHGR 420
LVCGREDGSI VIVPATQTAI VQLLQGEHML RRGWPPHRTL RGHRNKVTCL LYPHQVSARY 480
DQRYLISGGV DFSVIIWDIF SGEMKHIFCV HGGEITQLLV PPENCSARVQ HCICSVASDH 540
SVGLLSLREK KCIMLASRHL FPIQVIKWRP SDDYLVVGCS DGSVYVWQMD TGALDRCVMG 600
ITAVEILNAC DEAVPAAVDS LSHPAVNLKQ AMTRRSLAAL KNMAHHKLQT LATNLLASEA 660
SDKGNLPKYS HNSLMVQAIK TNLTDPDIHV LFFDVEALII QLLTEEASRP NTALISPENL 720
QKASGSSDKG GSFLTGKRAA VLFQQVKETI KENIKEHLLD DEEEDEEIMR QRREESDPEY 780
RSSKSKPLTL LEYNLTMDTA KLFMSCLHAW GLNEVLDEVC LDRLGMLKPH CTVSFGLLSR 840
GGHMSLMLPG YNQPACKLSH GKTEVGRKLP ASEGVGKGTY GVSRAVTTQH LLSIISLANT 900
LMSMTNATFI GDHMKKGPTR PPRPSTPDLS KARGSPPTSS NIVQGQIKQV AAPVVSARSD 960
ADHSGSDPPS APALHTCFLV NEGWSQLAAM HCVMLPDLLG LDKFRPPLLE MLARRWQDRC 1020
LEVREAAQAL LLAELRRIEQ AGRKEAIDAW APYLPQYIDH VISPGVTSEA AQTITTAPDA 1080
SGPEAKVQEE EHDLVDDDIT TGCLSSVPQM KKISTSYEER RKQATAIVLL GVIGAEFGAE 1140
IEPPKLLTRP RSSSQIPEGF GLTSGGSNYS LARHTCKALT FLLLQPPSPK LPPHSTIRRT 1200
AIDLIGRGFT VWEPYMDVSA VLMGLLELCA DAEKQLANIT MGLPLSPAAD SARSARHALS 1260
LIATARPPAF ITTIAKEVHR HTALAANTQS QQNMHTTTLA RAKGEILRVI EILIEKMPTD 1320
VVDLLVEVMD IIMYCLEGSL VKKKGLQECF PAICRFYMVS YYERNHRIAV GARHGSVALY 1380
DIRTGKCQTI HGHKGPITAV AFAPDGRYLA TYSNTDSHIS FWQMNTSLLG SIGMLNSAPQ 1440
LRCIKTYQVP PVQPASPGSH NALKLARLIW TSNRNVILMA HDGKEHRFMV 1490 
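The sequence above is printed in blocks of 10 residues with a running residue count at the end of each line. To work with it as one string, the spaces and counters need to be removed. The sketch below does this for the last two lines of the record; `parse_sequence_block` is a hypothetical helper name, and it assumes counters are the only purely numeric tokens.

```python
def parse_sequence_block(block: str) -> str:
    """Join 10-residue groups into one sequence, dropping line-end counters."""
    residues = []
    for token in block.split():
        if token.isdigit():  # running residue count, not sequence
            continue
        residues.append(token.upper())
    return "".join(residues)


last_two_lines = """
DIRTGKCQTI HGHKGPITAV AFAPDGRYLA TYSNTDSHIS FWQMNTSLLG SIGMLNSAPQ 1440
LRCIKTYQVP PVQPASPGSH NALKLARLIW TSNRNVILMA HDGKEHRFMV 1490
"""
tail = parse_sequence_block(last_two_lines)
```

Applied to the full sequence block, the length of the result can be checked against the stated protein length of 1490 AA.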
Gene Ontology
  
Interpro
 IPR011047; Quinonprotein_ADH-like_supfam.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS