CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-018628
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 WD repeat-containing protein 7 
Protein Synonyms/Alias
 TGF-beta resistance-associated protein TRAG 
Gene Name
 Wdr7 
Gene Synonyms/Alias
 Kiaa0541; Trag 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
319QHSLLDQKDKELVICubiquitination[1]
337TRFFYGCKEYLHKLLubiquitination[1]
364NIADIAEKQEADEGLubiquitination[1]
663LASEASDKGNLPKYSubiquitination[1]
729KASGSSDKGGSFLTGubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
 REPEAT 17 56 WD 1.
 REPEAT 62 104 WD 2.
 REPEAT 156 199 WD 3.
 REPEAT 324 366 WD 4.
 REPEAT 404 443 WD 5.
 REPEAT 462 507 WD 6.
 REPEAT 558 597 WD 7.
 REPEAT 1350 1389 WD 8.
 REPEAT 1391 1431 WD 9.
 MOD_RES 935 935 Phosphoserine.
 MOD_RES 1151 1151 Phosphoserine.
 MOD_RES 1153 1153 Phosphoserine.
 MOD_RES 1455 1455 Phosphoserine.  
Keyword
 Alternative splicing; Complete proteome; Phosphoprotein; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1489 AA 
Protein Sequence
MAGNSLVLPI VLWGRKAPTH CISSILLTDD GGTIVTGCHD GQICLWDVSV ELEVNPRALL 60
FGHTASITCL SKACASGDKR YTVSASANGE MCLWDVNDGR CIEFTKLACT HTGIQFYQFS 120
VGNQQEGRLL CHGHYPEILV VDATSLEVLY SLVSKISPDW ISSMSIIRSQ RTQEDTVVAL 180
SVTGILKVWI VTSEMSGMQD TEPIFEEESK PIYCQNCQSI SFCAFTQRSL LVVCSKYWRV 240
FDAGDYSLLC SGPSENGQTW TGGDFVSADK VIIWTENGQS YIYKLPASCL PASDSFRSDV 300
GKAVENLIPP VQHSLLDQKD KELVICPPVT RFFYGCKEYL HKLLIQGDSS GRLNIWNIAD 360
IAEKQEADEG LKMTTCISLQ EAFDKLKPCP AGIIDQLSVI PNSNEPLKVT ASVYIPAHGR 420
LVCGREDGSI IIVPATQTAI VQLLQGEHML RRGWPPHRTL RGHRNKVTCL LYPHQVSARY 480
DQRYLISGGV DFSVIIWDIF SGEMKHIFCV HGGEITQLLV PPENCSARVQ HCICSVASDH 540
SVGLLSLREK KCIMLASRHL FPIQVIKWRP SDDYLVVGCT DGSVYVWQMD TGALDRCAMG 600
ITAVEILNAC DEAVPAAVDS LSHPAVNLKQ AMTRRSLAAL KNMAHHKLQT LATNLLASEA 660
SDKGNLPKYS HNSLMVQAIK TNLTDPDIHV LFFDVEALII QLLTEEASRP NTALISPENL 720
QKASGSSDKG GSFLTGKRAA VLFQQVKETI KENIKEHLLD EEEDEEEARR QSREDSDPEY 780
RASKSKPLTL LEYNLTMDTA KLFMSCLHAW GLNEVLDEVC LDRLGMLKPH CTVSFGLLSR 840
GGHMSLMLPG YNQAAGKLLH AKAEVGRKLP AAEGVGKGTY TVSRAVTTQH LLSIISLANT 900
LMSMTNATFI GDHMKKGPTR PPRPGTPDLS KARDSPPPSS NIVQGQIKQA AAPVVSARSD 960
ADHSGSDSAS PALPTCFLVN EGWSQLAAMH CVMLPDLLGL ERFRPPLLEM LARRWQDRCL 1020
EVREAAQALL LAELRRIEQA GRKETIDTWA PYLPQYMDHV ISPGVTAEAM QTMAAAPDAS 1080
GPEAKVQEEE HDLVDDDITA GCLSSVPQMK KISTSYEERR KQATAIVLLG VIGAEFGAEI 1140
EPPKLLTRPR SSSQIPEGFG LTSGGSNYSL ARHTCKALTY LLLQPPSPKL PPHSTIRRTA 1200
TDLIGRGFTV WEPYMDVSAV LMGLLELCAD AEKQLANITM GLPLSPAADS ARSARHALSL 1260
IATARPPAFI TTIAKEVHRH TALAANTQSQ QSIHTTTLAR AKGEILRVIE ILIEKMPTDV 1320
VDLLVEVMDI IMYCLEGSLV KKKGLQECFP AICRFYMVSY YERSHRIAVG ARHGSVALYD 1380
IRTGKCQTIH GHKGPITAVS FAPDGRYLAT YSNTDSHISF WQMNTSLLGS IGMLNSAPQL 1440
RCIKTYQVPP VQPASPGSHN ALKLARLIWT SNRNVILMAH DGKEHRFMV 1489 
Gene Ontology
  
Interpro
 IPR011047; Quinonprotein_ADH-like_supfam.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS