CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-015550
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 WD repeat- and FYVE domain-containing protein 4 
Protein Synonyms/Alias
  
Gene Name
 WDFY4 
Gene Synonyms/Alias
 C10orf64; KIAA1607 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
1317      VDSRLIAKEMNISSR  ubiquitination  [1, 2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
Functional Description
  
Sequence Annotation
 DOMAIN 2527 2821 BEACH.
 REPEAT 2930 2970 WD 1.
 REPEAT 2980 3019 WD 2.
 REPEAT 3022 3061 WD 3.
 REPEAT 3071 3109 WD 4.
 REPEAT 3151 3184 WD 5.  
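The annotation lines above follow a "TYPE start end description." layout, so they can be read into a small table for checks such as feature length. The following is a minimal sketch, not part of CPLM itself: the function name parse_features and the regular expression are illustrative assumptions, and coordinates are taken to be 1-based and inclusive.

```python
import re

# Assumed layout of one annotation line: TYPE START END DESCRIPTION.
# (hypothetical pattern written for this record, not an official CPLM format spec)
FEATURE_RE = re.compile(r"^\s*(\w+)\s+(\d+)\s+(\d+)\s+(.*?)\.?\s*$")


def parse_features(lines):
    """Return (type, start, end, description) tuples for lines that match."""
    features = []
    for line in lines:
        match = FEATURE_RE.match(line)
        if match is None:
            continue  # skip lines that do not look like annotations
        kind, start, end, label = match.groups()
        features.append((kind, int(start), int(end), label))
    return features


# Annotation lines copied from this record.
record_lines = [
    " DOMAIN 2527 2821 BEACH.",
    " REPEAT 2930 2970 WD 1.",
    " REPEAT 2980 3019 WD 2.",
    " REPEAT 3022 3061 WD 3.",
    " REPEAT 3071 3109 WD 4.",
    " REPEAT 3151 3184 WD 5.  ",
]

features = parse_features(record_lines)

# Feature length, assuming inclusive 1-based coordinates.
lengths = {label: end - start + 1 for _, start, end, label in features}
```

With inclusive coordinates the BEACH domain spans 295 residues, and the last WD repeat ends at residue 3184, which matches the stated protein length.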
Keyword
 Alternative splicing; Complete proteome; Membrane; Polymorphism; Reference proteome; Repeat; Transmembrane; Transmembrane helix; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3184 AA 
Protein Sequence
MEAEDLSKAE DRNEDPGSKN EGQLAAVQPD VPHGGQSSSP TALWDMLERK FLEYQQLTHK 60
SPIERQKSLL SLLPLFLKAW EHSVGIICFP SLQRLAEDVS DQLAQQLQKA LVGKPAEQAR 120
LAAGQLLWWK GDVDQDGYLL LKSVYVLTGT DSETLGRVAE SGLPALLLQC LYLFFVFPLD 180
KDELLESDLQ VQKMFVQMLL NICSDSQGLE GLLSGSELQS LLIATTCLRE HSCCFWKEPT 240
FCVLRAISKA QNLSIIQYLQ ATDCVRLSLQ NLSRLTDTLP APEVSEAVSL ILGFVKDSYP 300
VSSALFLEFE NSEGYPLLLK VLLRYDGLTQ SEVDPHLEEL LGLVVWLTTC GRSELKVFDS 360
ITYPQLEGFK FHHEASGVTV KNLQAFQVLQ NVFHKASDSV LCIQVLSVIR TMWAWNARNF 420
FLLEWTLQPI SQFVEIMPLK PAPVQEHFFQ LLEALVFELH YVPHEILRKV QHLIKESPGP 480
SCTLMALQSI LSIAGGDPLF TDIFRDSGLL GLLLAQLRKQ AKIMRKSGNK VSTPGVQDPE 540
RELTCVMLRI VVTLLKGSVR NAVVLKDHGM VPFIKIFLDD ECYREASLSI LEQLSAINAE 600
EYMSIIVGAL CSSTQGELQL KLDLLKSLLR ILVTPKGRAA FRVSSGFNGL LSLLSDLEGS 660
LQEPPLQAWG AVSPRQTLEL VLYTLCAVSA ALHWDPVNGY FFRRNGLFEK LAEDLCLLGC 720
FGALEEEGNL LRSWVDTKAR PFADLLGTAF SSSGSLPPRI QSCLQILGFL DSMASGTLHL 780
RGDLKESLRT KQGPVVDVQK GETGSDPQRN FKQWPDLEER MDEGDAAIMH PGVVCIMVRL 840
LPRLYHEDHP QLSEEIQCSL ASHIQSLVKS EKNRQVMCEA GLLGTLMASC HRALVTSGSP 900
LHSRLIRIFE KLASQAIEPD VLRQFLGLGI PSSLSATTKI LDSSHTHRGN PGCSGSQTAQ 960
GLAEGPWPAA PDAGLHPGVT QAPQPLGESQ DSTTALQTAL SLISMTSPRN LQPQRAALAP 1020
SFVEFDMSVE GYGCLFIPTL STVMGTSTEY SVSGGIGTGA TRPFPPPGGL TFSCWFLISR 1080
HGAATEGHPL RFLTLVRHLA RTEQPFVCFS VSLCPDDLSL VVSTEEKEFQ PLDVMEPEDD 1140
SEPSAGCQLQ VRCGQLLACG QWHHLAVVVT KEMKRHCTVS TCLDGQVIGS AKMLYIQALP 1200
GPFLSMDPSA FVDVYGYIAT PRVWKQKSSL IWRLGPTYLF EEAISMETLE VINKLGPRYC 1260
GNFQAVHVQG EDLDSEATPF VAEERVSFGL HIASSSITSV ADIRNAYNEV DSRLIAKEMN 1320
ISSRDNAMPV FLLRNCAGHL SGSLRTIGAV AVGQLGVRVF HSSPAASSLD FIGGPAILLG 1380
LISLATDDHT MYAAVKVLHS VLTSNAMCDF LMQHICGYQI MAFLLRKKAS LLNHRIFQLI 1440
LSVAGTVELG FRSSAITNTG VFQHILCNFE LWMNTADNLE LSLFSHLLEI LQSPREGPRN 1500
AEAAHQAQLI PKLIFLFNEP SLIPSKISTI IGILACQLRG HFSTQDLLRI GLFVVYTLKP 1560
SSVNERQICM DGALDPSLPA GSQTSGKTIW LRNQLLEMLL SVISSPQLHL SSESKEEMFL 1620
KLGPDWFLLL LQGHLHASTT VLALKLLLYF LASPSLRTRF RDGLCAGSWV ERSTEGVDIV 1680
MDNLKSQSPL PEQSPCLLPG FRVLNDFLAH HVHIPEVYLI VSTFFLQTPL TELMDGPKDS 1740
LDAMLQWLLQ RHHQEEVLQA GLCTEGALLL LEMLKATMSQ PLAGSEDGAW AQTFPASVLQ 1800
FLSLVHRTYP QDPAWRAPEF LQTLAIAAFP LGAQKGVGAE STRNTSSPEA AAEGDSTVEG 1860
LQAPTKAHPA RRKLREFTQL LLRELLLGAS SPKQWLPLEV LLEASPDHAT SQQKRDFQSE 1920
VLLSAMELFH MTSGGDAAMF RDGKEPQPSA EAAAAPSLAN ISCFTQKLVE KLYSGMFSAD 1980
PRHILLFILE HIMVVIETAS SQRDTVLSTL YSSLNKVILY CLSKPQQSLS ECLGLLSILG 2040
FLQEHWDVVF ATYNSNISFL LCLMHCLLLL NERSYPEGFG LEPKPRMSTY HQVFLSPNED 2100
VKEKREDLPS LSDVQHNIQK TVQTLWQQLV AQRQQTLEDA FKIDLSVKPG EREVKIEEVT 2160
PLWEETMLKA WQHYLASEKK SLASRSNVAH HSKVTLWSGS LSSAMKLMPG RQAKDPECKT 2220
EDFVSCIENY RRRGQELYAS LYKDHVQRRK CGNIKAANAW ARIQEQLFGE LGLWSQGEET 2280
KPCSPWELDW REGPARMRKR IKRLSPLEAL SSGRHKESQD KNDHISQTNA ENQDELTLRE 2340
AEGEPDEVGV DCTQLTFFPA LHESLHSEDF LELCRERQVI LQELLDKEKV TQKFSLVIVQ 2400
GHLVSEGVLL FGHQHFYICE NFTLSPTGDV YCTRHCLSNI SDPFIFNLCS KDRSTDHYSC 2460
QCHSYADMRE LRQARFLLQD IALEIFFHNG YSKFLVFYNN DRSKAFKSFC SFQPSLKGKA 2520
TSEDTLSLRR YPGSDRIMLQ KWQKRDISNF EYLMYLNTAA GRTCNDYMQY PVFPWVLADY 2580
TSETLNLANP KIFRDLSKPM GAQTKERKLK FIQRFKEVEK TEGDMTVQCH YYTHYSSAII 2640
VASYLVRMPP FTQAFCALQG GSFDVADRMF HSVKSTWESA SRENMSDVRE LTPEFFYLPE 2700
FLTNCNGVEF GCMQDGTVLG DVQLPPWADG DPRKFISLHR KALESDFVSA NLHHWIDLIF 2760
GYKQQGPAAV DAVNIFHPYF YGDRMDLSSI TDPLIKSTIL GFVSNFGQVP KQLFTKPHPA 2820
RTAAGKPLPG KDVSTPVSLP GHPQPFFYSL QSLRPSQVTV KDMYLFSLGS ESPKGAIGHI 2880
VSTEKTILAV ERNKVLLPPL WNRTFSWGFD DFSCCLGSYG SDKVLMTFEN LAAWGRCLCA 2940
VCPSPTTIVT SGTSTVVCVW ELSMTKGRPR GLRLRQALYG HTQAVTCLAA SVTFSLLVSG 3000
SQDCTCILWD LDHLTHVTRL PAHREGISAI TISDVSGTIV SCAGAHLSLW NVNGQPLASI 3060
TTAWGPEGAI TCCCLMEGPA WDTSQIIITG SQDGMVRVWK TEDVKMSVPG RPAGEEPPAQ 3120
PPSPRGHKWE KNLALSRELD VSIALTGKPS KTSPAVTALA VSRNHTKLLV GDERGRIFCW 3180
SADG 3184 
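The sequence is printed in groups of ten residues with a running residue count at the end of each line, so the modified lysine can be located by stripping the counts and indexing. The sketch below is illustrative only: the helper names parse_numbered_block and peptide_window are assumptions, and it uses just the two sequence lines covering residues 1261 to 1380 from this record.

```python
def parse_numbered_block(lines):
    """Join the residue groups of each line, dropping the trailing residue count."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isdigit():
                continue  # running residue count at the end of the line
            residues.append(token)
    return "".join(residues)


def peptide_window(seq, pos, first=1, flank=7):
    """Return the residue at 1-based position `pos` and its +/- `flank` window.

    `first` is the position of seq[0], so a fragment of the protein can be
    used instead of the full sequence.
    """
    i = pos - first
    if not 0 <= i < len(seq):
        raise ValueError("position outside the supplied fragment")
    return seq[i], seq[max(0, i - flank): i + flank + 1]


# The two lines of this record that cover residues 1261-1380.
lines = [
    "GNFQAVHVQG EDLDSEATPF VAEERVSFGL HIASSSITSV ADIRNAYNEV DSRLIAKEMN 1320",
    "ISSRDNAMPV FLLRNCAGHL SGSLRTIGAV AVGQLGVRVF HSSPAASSLD FIGGPAILLG 1380",
]

fragment = parse_numbered_block(lines)
residue, peptide = peptide_window(fragment, 1317, first=1261)
print(residue, peptide)  # K VDSRLIAKEMNISSR
```

The residue at position 1317 is a lysine, and the 15-residue window around it reproduces the peptide listed under Lysine Modification.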
Gene Ontology
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW. 
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR000409; BEACH_dom.
 IPR023362; PH-BEACH_dom.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF02138; Beach
 PF00400; WD40 
SMART
 SM01026; Beach
 SM00320; WD40 
PROSITE
 PS50197; BEACH
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS