CPLM 1.0 - Compendium of Protein Lysine Modification
Tag / Content
CPLM ID
 CPLM-025612
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Mutagen-sensitive 201, isoform A 
Protein Synonyms/Alias
  
Gene Name
 mus201 
Gene Synonyms/Alias
 CG10890; Dmel_CG10890 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position  Peptide          Type         References
1243      QMRLNKAKAAEILKN  acetylation  [1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1257 AA 
Protein Sequence
MGVTGLWKLI EPCGKPVPVE TLEGKILAVG KHRFSPPNEI LADWSTFRPA DISIWLHQVV 60
KGFQDNKGSA LSNAHLLGLF HRLCKLLYYR VRPVFIFDGC VPQLKRDTIA RRQQQRNKLS 120
NEADRIQALL LQSLAKEKVV QQALGKNAEL LLKSPVKRPP PAKKNDEDDL FKLPELPAAS 180
VQDNQDESEQ DTSASASDSS FDESTARHSY NSSLQAIDVK SQHFRNLPAD VRHEILTDIK 240
ETRKQSSWGR LHELPARSDD FCSFQMKRLL KRRAVQESLE QAEQEMGGHT LTYAELCDFF 300
NEEGILTPTA IEQCTRQISS DEHTRFLLVR DLKKKAMEST KQEVKMEMIE EVPAEEEDEK 360
PSTSTKKEAV KSVDLGTEFD EDLAKALSMS MEETKVYDEK DYEYDSDQEL RLNRAQTKQL 420
RHAAKGPARA YMIEYGGMND EEVGNIMEAT QFNDTQSLEK LLEITTVPTD MADNSIEEAK 480
LISQAIEESK QLSQAIEESK KNLNEDKVEI VDTDTDSDLE EVMEVQELDK GKKNLEICVD 540
ITGQADSNDL FADIFEDGEA NKIEKTISVE EDDDFIEVKD SEELKLDTED ENKPITNKSI 600
KESNEVKPFI DEIIEVKDSQ EAVPAEPNLK PDLESILNDL KKQTAAVKDI QLNVNEEEKP 660
KPKVEISSIL DELKVKMADV KNITLDNVKL SNSVPIILSS DDESTLKSSK IVPKQELIEL 720
CDSDDNKNNR LSPNKTPSKN KSIKDFFETS YVVKRTPDKS QASNETSPGT PKTPKPFFRK 780
RTPKSGRKRA SDANEDSDEE VSPTKRSSKA SKSLFEPKEP EEEKTVDPEE IIKDAAEALK 840
SQKTSEELQE LATNLAQERK ELEIERNRQD RMGMSISQRM SIDCQELLRL FGIPYIVAPM 900
EAEAQCAFLN ATDLTHGTIT DDSDIWLFGG RTVYKNFFAQ NKHVMEFRAE QIEQTFNCNR 960
GKLIQLACLV GSDYTTGIHG IGAVTALEIL ASFSGQDANG PGICNQSVLQ TLIKFRDWWQ 1020
AHKCSNLPPG SSARLALRKK LKNIELHEGF PSGAVVEAYL APTIDDNRDA FSWGTPDVES 1080
IREFTRKSFG WTTSKTDDIL MPVMKKINEK KIQGSIRNYF TAKSALRVQQ PHVSKRVQLA 1140
IDKMSGKIDE TPEKPKKVTR TRRAKAAPPT DDDLAIADVA TKAARPKRGK RKAAPESVVL 1200
DGELPSTSQS IPKPEKCPRI PSSVEVIPQR EKDLEQMRLN KAKAAEILKN SAKANRK 1257 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0004519; F:endonuclease activity; IEA:InterPro.
 GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
 GO:0090305; P:nucleic acid phosphodiester bond hydrolysis; IEA:GOC.
 GO:0006289; P:nucleotide-excision repair; IEA:InterPro. 
Interpro
 IPR020045; 5-3_exonuclease_C.
 IPR008918; HhH2.
 IPR003903; Ubiquitin-int_motif.
 IPR006086; XPG-I_dom.
 IPR006084; XPG/Rad2.
 IPR001044; XPG/Rad2_eukaryotes.
 IPR019974; XPG_CS.
 IPR006085; XPG_DNA_repair_N. 
Pfam
 PF00867; XPG_I
 PF00752; XPG_N 
SMART
 SM00279; HhH2
 SM00726; UIM
 SM00484; XPGI
 SM00485; XPGN 
PROSITE
 PS50330; UIM
 PS00841; XPG_1
 PS00842; XPG_2 
PRINTS
 PR00853; XPGRADSUPER.
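
The modification site in this record (position 1243, peptide QMRLNKAKAAEILKN) can be cross-checked against the Protein Sequence block. The sketch below is one way to do that, assuming 1-based positions and a peptide window of 7 residues on each side of the site. The helper names `parse_sequence_block` and `site_window` are hypothetical and are not part of CPLM or any library. To keep the example short, only the last line of the sequence block (residues 1201-1257) is used, with an explicit offset.

```python
import re


def parse_sequence_block(block):
    """Strip spaces, newlines, and the running residue counts at line ends."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def site_window(seq, position, flank=7, offset=1):
    """Return (residue, peptide) for a 1-based protein position.

    offset is the 1-based protein position of seq[0]; it lets the
    function work on a fragment instead of the full sequence.
    """
    i = position - offset
    if not 0 <= i < len(seq):
        raise ValueError("position outside the supplied sequence")
    # The window is clipped at the fragment start; slicing clips the end.
    return seq[i], seq[max(0, i - flank): i + flank + 1]


# Last line of the Protein Sequence block, residues 1201-1257.
tail = parse_sequence_block(
    "DGELPSTSQS IPKPEKCPRI PSSVEVIPQR EKDLEQMRLN KAKAAEILKN SAKANRK 1257"
)

residue, peptide = site_window(tail, 1243, offset=1201)
print(residue, peptide)  # K QMRLNKAKAAEILKN
```

With the full 1257-residue sequence pasted into `parse_sequence_block`, the default `offset=1` applies and the call becomes `site_window(seq, 1243)`.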