CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-035615
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Wdfy4 
Protein Synonyms/Alias
  
Gene Name
 RGD1564142 
Gene Synonyms/Alias
 Wdfy4 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
3008      AVTRNQAKLLVGDEK  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3025 AA 
Protein Sequence
MEAEDCSKAE DRPEDPGFQN EGQSSAVKPT FSLEEQSPGP TVLWDMLEQK FLEYQQLIHK 60
NPEERRRNLL SLLPLFLKAW EHSVGIICFR SLQRLAEDVS DQLAQEIQQA LSGKPAEQAR 120
AAAGQLLQWK SDVDQDGYLL LKSVYVLTGT DSETLGRVVD SGLPALLLQC LYLFFAFPVE 180
KGEHLEIDVQ GQRMFVQMLL NICSESQGLE GLLSGSELQS LLIATTCLRE HSCYFWKQPT 240
FCVLRAISKA QSPSVIQYLR TADCVRLSVQ NLSKLADTLP APEVSEAVSL ILNFVRDSYP 300
ISSALLLEFE NGEGYPLLLK VLLRYNGVTQ GVVEPHLEEL IELVMWLTTC GRSELKVFDS 360
ITYPQLEGFK FHQEASGVTV KNLQAFQVLQ NLFHRASDSV LCIQVLLAIK TMWAWNPRNF 420
FLLEWTLQPI SQFVEIIPLK PTPVQEHFFQ LLETLVFKLL YVPHEVLAKV QGLIKDSQEL 480
SCTLVALRSI LRITASDRLF TDIFRDSGLL GLLLAQLRKQ AKIMRKSGNK ECSPGVQDLE 540
RELTCVMLKT VVALLQGSVR NAVVLKDHGM VPFIKIFLDD ECYRGSALSI LEQLSVINAE 600
EYMSIIVGAL CSSTQGELQL KLDLLKSLLR ILETPKGHAA FRVSSGFNGL LSLLSDLEGS 660
LQLPAVTTCS AVSPSQTLEL VLHTLCVVSA ALHLDPVNKH FFRSNGLFEK LAEDLCLLGC 720
FGAPEEEGAQ RYSSSDMKAR PFVDLLSGAF SSSCQFPPRV QSCLQILSFL ESMAGGTLHL 780
HRDLMEPSKA GQVPSLDARK GEPGSRQGKF KQWPDPEERM DEGDVTIMHP GIICIMVRLL 840
PRLYLGDHPQ LSEEIQCSMA HHLLSLVKSE KNRQVMCEAG MLRTLMTFCH RTLSTGGSAL 900
HSVLIRIFEK LGSQAIEPDV LSPCPRGSQD LRSSPTSEDS ATALQTTLSL ISMTSPRNLQ 960
SQRAALTPSF VEFDMSAEGY GCLFIPTLST VMGTSTEHSI SGGTGSGAPR PFPPPGGLTF 1020
SCWFLISRQA NVMEGHPLRF LTLVRHLART EQPFVCFSVS LCMDDLSLVV STEEKEFQPL 1080
DIMEPEDEAE PSAGRQLQVR CSQLLACGQW YHLAVVVSKE MKRNCTVTMY LDGQAIGSAK 1140
MLYIQALPGS FFSMDPSSFV DVYGYIGTPR VWKQKSSLTW RLGPAYLFEE DISPDTLELI 1200
IKLGPRYCGN FQAVHLQGDV PDGEATPLIS EERVSFGLYV PSSSITSIIN IRNTYNEVDS 1260
RLIAKEMNIS SRDNATPVFL LRNCAGHLFG SLRTLGAVAV GQLGVRVFHS SPAASSLDYI 1320
GGPAILLGLI SLATDDHTMY AAMKVLHSVL TSNAMCDYLM QHIDGYQILA FLLRKKTPFL 1380
NHRIFQLILS VAGTAELGFR PSAVTNMCIF RHVLCNFELW TNTADNLELT LFSHLLEILQ 1440
SPREGPRNAE RAHQAQLVPK LIFLFNEPSL ALSKVSTIIA ILGCQLKGHF NIQDLLRVGL 1500
FVIYTLKPSS VNERQICLDG VQDPSAPAGS QTSGKAIWLR NQLLEMLFGV ITAPQLHLSS 1560
ELKEQVFLSL GPDWFLLLLQ GHLHPSTTTL ALKLLLYFLS SPSLCGRFRD GLSAGCWVES 1620
CLEGVDIMMD NLKNHPAVPD QSPCLLPGFR VLNDFLGYHV LIPEVYLIVS AFFLQMPLTE 1680
LTDGPRESLD LMIQWLLQKH YQQEVLQAGL CIEGALLLLG MLKAIMSQPR AGSGDGTWEQ 1740
TLPSSILQFL GLVHRSYPQD PAWRSPDFLQ TMAIITFSVE TQKEPQSNAE AGTVPSLASV 1800
SYFTQKLVEK LHSGMFSANP KHILLFITEH IISVIENPSS QKDTVMSALY SSLNKVILHC 1860
LSKPQQSLSE CLGLLSILDF LQEQWDIIFA TYNSNVNFLL CLMHCLLLLN ARSYPEGFGL 1920
EPKPRINPYH QVFLSPSEEV KDKKEEGLPS LGDVQHSIQK SVRALWQQLV AQRRQTLEDA 1980
FKIDLSVKAG ESEVKIEEVT PLWEETMLRA WQHYLASEKK SLASRSSVTH HSKATSWSGS 2040
LSSAMRLMPG RQAKDPECRA EDFVSCIENY RRKGQELYAS IYKDYVQRRK RDSIKAATAW 2100
ARMQEQLFGE LGLWGRMRES TSCSQWELDR REGPARMRKR IRRLCAWETL NLGSCKESQD 2160
QRGDVSQTNA ENQDELTIEE AESRPDEVGV DCTQLTFFPA LHESLHSEDF LELCRERQII 2220
LQELLDGEKV SQKVPMVIVQ GHLVSEGILL FGQQHFYVCE NFTLSPTGDV YCTHHCLSNI 2280
SDPFIFNMCS KDRSSDHYSC QRHAYSDLRE LRQARFLLQD IALEIFFQNG YSKLLVFYNS 2340
DRSKAFKSFC TFQPSLKGKG TTEDPFNLRR HPGFDRTMLQ KWQKREMSNF EYLMYLNTLA 2400
GRTYNDYMQY PVFPWVLADY TSEMLNLTNP KTFRDLSKPM GAQSKERKLK FTQRFKDIEK 2460
IEGDMTVQCH YYTHYSSAII VASYLVRMPP FTQAFCSLQG GSFDVADRMF HSVKNTWESA 2520
SKENMSDVRE LTPEFFYLPE FLTNCNAVEF GRMQDGTTLG DVQLPPWADG DPRKFISLHR 2580
QALESDFVSS NLHHWIDLIF GYKQQGPAAV EAVNTFHPYF YGDRIDLSSI TDPLIKSTIL 2640
GFVSNFGQVP KQIFTKPHPS RNTTGKSPVP GKETSTPTGL PGHSQSFLHS LPALRPSQVT 2700
VKDMYLFSLG SESPKGAIGH IVPTEKSILA VEKNKLLMPP LWNRVFSWGF DDFTCCLGSY 2760
GSDKILMTFE NLAAWGPCLC AVCPSPTMII TSGASAVVCV WELSLVKGRP KGLKLRQALY 2820
GHTQAVTCLT ASVTFSLLVS GSQDCTCMLW DLDHLSRVAC LPVHREGISA VAISDVSGTI 2880
VSCAGAHLSL WDVNGQPLAS ITTAWGPEGA ITCCCIVERP AWDASHVIIT GSKDGMVRIW 2940
KTEDMKMSVP RQAVMEEPST EPLSPRGHKW AKNLALSREL DVSVALSGKP SKASPAVTAL 3000
AVTRNQAKLL VGDEKGRIFC WSADG 3025 
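The sequence block above is laid out in groups of ten residues, with a running residue count at the end of each line. A minimal sketch of how such a block could be parsed, and how the annotated modification site could be checked against it, is given below. It uses only the last two lines of the block (residues 2941-3025). The helper names `parse_sequence_block` and `residue_at` are hypothetical and not part of CPLM.

```python
def parse_sequence_block(lines):
    """Join the residue groups on each line, skipping the trailing residue counter."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isdigit():  # running residue count at the end of the line
                continue
            residues.append(token)
    return "".join(residues)


# Last two lines of the sequence block in this record.
tail_lines = [
    "KTEDMKMSVP RQAVMEEPST EPLSPRGHKW AKNLALSREL DVSVALSGKP SKASPAVTAL 3000",
    "AVTRNQAKLL VGDEKGRIFC WSADG 3025",
]

PROTEIN_LENGTH = 3025  # "Protein Length" field of this record
tail = parse_sequence_block(tail_lines)

# 1-based position in the full protein of the first residue in `tail`.
tail_start = PROTEIN_LENGTH - len(tail) + 1


def residue_at(position):
    """Return the residue at a 1-based protein position covered by `tail`."""
    return tail[position - tail_start]


site = 3008                  # "Position" field of the modification table
peptide = "AVTRNQAKLLVGDEK"  # "Peptide" field of the modification table

# 1-based protein position where the peptide begins.
peptide_start = tail.find(peptide) + tail_start

print(residue_at(site))                # residue at the annotated site
print(peptide[site - peptide_start])   # same residue, read from the peptide
```

Both lookups should return a lysine (K), consistent with the acetylation entry. The peptide contains a second lysine at its C-terminal end, so the position field is needed to tell which residue is annotated.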
Gene Ontology
  
Interpro
 IPR016024; ARM-type_fold.
 IPR000409; BEACH_dom.
 IPR023362; PH-BEACH_dom.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF02138; Beach
 PF00400; WD40 
SMART
 SM01026; Beach
 SM00320; WD40 
PROSITE
 PS50197; BEACH
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS