CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-023812
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 DmX-like protein 1 
Protein Synonyms/Alias
 X-like 1 protein 
Gene Name
 DMXL1 
Gene Synonyms/Alias
 XL1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
2108EDLPHQTKVKQLRENubiquitination[1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983
Functional Description
  
Sequence Annotation
 REPEAT 108 145 WD 1.
 REPEAT 166 206 WD 2.
 REPEAT 229 277 WD 3.
 REPEAT 476 516 WD 4.
 REPEAT 580 621 WD 5.
 REPEAT 628 665 WD 6.
 REPEAT 848 895 WD 7.
 REPEAT 968 1010 WD 8.
 REPEAT 1134 1175 WD 9.
 REPEAT 1211 1251 WD 10.
 REPEAT 2742 2783 WD 11.
 REPEAT 2785 2824 WD 12.
 REPEAT 2836 2878 WD 12.
 REPEAT 2884 2923 WD 14.
 REPEAT 2926 2965 WD 15.
 REPEAT 2978 3016 WD 16.
 MOD_RES 324 324 Phosphoserine.
 MOD_RES 436 436 Phosphoserine.
 MOD_RES 917 917 Phosphoserine (By similarity).
 MOD_RES 924 924 Phosphoserine.
 MOD_RES 1970 1970 Phosphoserine.  
Keyword
 Complete proteome; Phosphoprotein; Polymorphism; Reference proteome; Repeat; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3027 AA 
Protein Sequence
MNLHQVLTGA VNPGDHCFSV GSIGDQRFTA YASGCDIVIL GSDFERLQII PGAKHGNIQV 60
GCVDCSMQQG KIAASYGNVI SIFEPVNLPK QKKNLELYSQ WQKSGQFFLE SIAHNITWDP 120
TGSRLLTGSS YLQLWSNTNL EKPTEDENLN KTDLNFGDWK CIWHCKTASQ VHLMKFSPDG 180
EFFATAGKDD CLLKVWYNVE NWRTAVTSPD GSSEKQSQGE IDFSFVYLAH PRAVNGFSWR 240
KTSKYMPRAS VCNVLLTCCK DNVCRLWVET FLPNDCLLYG GDCSHWTESI NLTNNFKRNA 300
SSKERVQNAL EVNLRHFRRG RRRSLALVAH TGYLPHQQDP HHVHRNTPLH ANALCHFHIA 360
ASINPATDIP LLPSITSLSL NENEEKTGPF VVHWLNNKEL HFTLSMEVFL QQLRKSFEQP 420
SSEASVEDSN QADVKSDEET DDGVDDLKIN PEKKELGCDK MVPNSSFTSL SSAAIDHQIE 480
VLLSEWSKNA DMLFSIHPMD GSLLVWHVDW LDEYQPGMFR QVQVSFVSRI PVAFPTGDAN 540
SLCKSIMMYA CTKNVDLAIQ QGKQKPSGLT RSTSMLISSG HNKSSNSLKL SIFTPNVMMI 600
SKHADGSLNQ WLVSFAEESA FSTVLSISHK SRYCGHRFHL NDLACHSVLP LLLTTSHHNA 660
LRTPDVDNPE QPFDALNIEE CSLTQQNKST VDVAFQDPSA VYSELILWRV DPVGPLSFSG 720
GVSELARINS LHVSAFSNVA WLPTLIPSYC LGAYCNSPSA CFVASDGQYL RLYEAVIDAK 780
KLLSELSNPE ISKYVGEVFN IVSQQSTARP GCIIALDPIT KLHGRKTQLL HVFEEDFILN 840
NLEKKSLGKD SILSNAGSSP NGFSEKFYLI VIECTQDNRS LLHMWNLHLK SIPVSLDEKV 900
DTKLSEAVWQ PEEHYSSSPE KILSPFSQKY QACRANLQST SRLTLFSEMV YSQELHLPEG 960
VEIISIKPSA GHLSSSSIYP ACSAPYLLAT SCSDEKVRFW RCRVTDGESA TSKNGKIDLA 1020
YIWEEWPLLI EDGLQSNSSI TVPGRPVEVS CAHTNRLAVA YKQPASNSRS SQDFVMHVSI 1080
FECESTGGSC WVLEQTIHLD ELSTVLDSGI SVDSNLVAYN KQDMYLSSKE NITSNTKHLV 1140
HLDWMSREDG SHILTVGIGS KLFMYGPLAG KVQDQTGKET LAFPLWESTK VVPLSKFVLL 1200
RSVDLVSSVD GSPPFPVSLS WVRDGILVVG MDCEMHVYCQ WQPSSKQEPV ITDSYSGSTP 1260
SITSLIKQSN SSSGLHPPKK TLTRSMTSLA QKICGKKTAF DPSVDMEDSG LFEAAHVLSP 1320
TLPQYHPLQL LELMDLGKVR RAKAILSHLV KCIAGEVVAL NEAESNHERR LRSLTISASG 1380
STTRDPQAFN KAENTDYTEI DSVPPLPLYA LLAADDDSCY SSLEKSSNES TLSKSNQLSK 1440
ESYDELFQTQ LLMTDTHMLE TDEENTKPRV IDLSQYSPTY FGPEHAQVLS GHLLHSSLPG 1500
LSRMEQMSLM ALADTIATTS TDIGESRDRS QGGETLDECG LKFLLAVRLH TFLTTSLPAY 1560
RAQLLHQGLS TSHFAWAFHS VAEEELLNML PAMQKDDPTW SELRAMGVGW WVRNTRILRK 1620
CIEKVAKAAF YRKNDPLDAA IFYLAMKKKA VIWGLYRAEK NTRMTQFFGH NFEDERWRKA 1680
ALKNAFSLLG KQRFEHSAAF FLLAGCLRDA IEVCLEKLND IQLALVIARL YESEFDTSAA 1740
YKSILRKKVL GIDSPVSELC SLNINMHHDP FLRSMAYWIL EDYSGALETL IKQPIRENDD 1800
QVLSASNPTV FNFYNYLRTH PLLLRRHFGS SDTFSTHMSL TGKSGLAGTI NLSERRLFFT 1860
TASAHLKAGC PMLALEVLSK MPKVIKKTRP FYRASSFLDT SKDCSPSSPL KLDAREDKSS 1920
AVDWSQSLIN GFGSSSEGSS EKQSNSTLSF DWSQPSVVFQ DDSLELKWDS DNDEENEDVP 1980
ISMKELKPLQ RKTDKKLDDI SSNYTESFST LDENDLLNPS EDIIAVQLKF RACLKILTVE 2040
LRTLSTGYEI DGGKLRYQLY HWLEKEVIAL QRTCDFCSDA EELQSAFGRN EDEFGLNEDA 2100
EDLPHQTKVK QLRENFQEKR QWLLKYQSLL RMFLSYCILH GSHGGGLASV RMELILLLQE 2160
SQQETSEPLF SSPLSEQTSV PLLFACTANA KTVVANPLLH LSNLTHDILH AIINFDSPPH 2220
PDIQSNKVYV MHTLAASLSA CIYQCLCGSH NYSSFQTNQF TGMVYQTVLL PHRPSLKTGS 2280
LDEALTPNTS PAQWPGITCL IRLLNSSGEE AQSGLTVLLC EILTAVYLSL FIHGLATHSS 2340
NELFRIVAHP LNEKMWSAVF GGGAHVPSKE QTHSKTLPVS SLVEEGEKQN KRFRPSKMSC 2400
RESAPLTPSS APVSQESLAV KEKFIPPELS IWDYFIAKPF LPSSQSRAEY DSEESLGSDD 2460
DDNDDDDDVL ASDFHLQEHS NSNSYSWSLM RLAMVQLVLN NLKTFYPFAG HDLAELPVSS 2520
PLCHAVLKTL QCWEQVLLRR LEIHGGPPQN YIASHTAEES LSAGPAILRH KALLEPTNTP 2580
FKSKHHLALS VKRLWQYLVK QEEIQETFIK NIFTKKRCLN EIEADLGYPG GKARIIHKES 2640
DIITAFAVNK ANRNCIAIAS SHDVQELDVS GILATQVYTW VDDDIEVETK GSEDFLVIHA 2700
RDDLTAVQGT TPYTHSNPGT PINMPWLGST QTGRGASVMI KKAINNVRRM TSHPTLPYYL 2760
TGAQDGSVRM FEWGHSQQIT CFRSGGNSRV TRMRFNYQGN KFGIVDADGY LSLYQTNWKC 2820
CPVTGSMPKP YLTWQCHNKT ANDFVFVSSS SLIATAGLST DNRNVCLWDT LVAPANSLVH 2880
AFTCHDSGAT VLAYAPKHQL LISGGRKGFT YVFDLCQRQQ RQLFQSHDSP VKAVAVDPTE 2940
EYFVTGSAEG NIKIWSLSTF GLLHTFVSEH ARQSIFRNIG TGVMQIETGP ANHIFSCGAD 3000
GTMKMRILPD QFSPLNEVLK NDVKFML 3027 
Gene Ontology
  
Interpro
 IPR022033; Rav1p_C.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF12234; Rav1p_C
 PF00400; WD40 
SMART
 SM00320; WD40 
PROSITE
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS