CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-031908
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 CG42232 
Protein Synonyms/Alias
  
Gene Name
 CG14896, CG42232, Dmel_CG42232 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position  Peptide          Type         References
239       SPRETGVKIEKSFSC  acetylation  [1]
939       LARMEQGKPLDNVLK  acetylation  [1]
2647      ETEDASEKLEKAEDH  acetylation  [1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3441 AA 
Protein Sequence
MSLDGGRLRY EDENGKTVVL AAKGRTFQVG SYYNCDLILE GPEEERLICE INCDAFGRVI 60
IYNKSSGDPI HLNDVAIHAA GKRPLLHGGK ITIRDKVYTW EFPKTSEVQD APCTPERLLP 120
NEQASNSCPS LKQSHRLQAE KRLTVHNFHY SINSDDEGNT SIESRDQSES HLEDVSLNES 180
IQRSTEETAY ESPKVNLLEA TQNKENTATP PGSHQKLLRL CAISDVVITS YSPRETGVKI 240
EKSFSCVRKP GYTTTSLAVS TPKSVYSTPK GNVLSELNED SCSRDLMDFS TPSTSKKTKR 300
DSSMFLIDLT TPSKLRQTPK QTLGFKLTPK QTPISVDSTD ESSDASPLVI DITNSDTPPS 360
ASPSQRYKTP KRPAGITTPN RTPQSLMKRA LLTSIKKQIA SNQPDKTTPI ATPKRTSLLE 420
ARRHCLTTPR RLPFHPHRRT PVQREGHTRG NAPKTSPRKR ISLMESPREN KVSQLRKSFA 480
AAKRSPGVDK SNKLVAKARR SLNSPKSGSP KPGSSKPSSP CQKKIDVSQI KSSTPEEPDD 540
SKDELSRTFT ILHDTTQGND QSGAALAIEA VTALITGEVD DDVSFQLSSL IEKRLPSVTS 600
EEPAPIQDKV ESIGSESDVV GELKYAQEDM NPKPDAVILD QNERDENELK TGKNEDECKE 660
KDCELPIDKQ LEDAVIEDNI CEEVHANIPQ QTKNSNSEKI IEESICEELP LESTTGNSNS 720
TGGISQKENQ PTPSAETPVA RRSLRRHSVE QRAVSTTPRR STRRSSMEAS SKALPEKDNK 780
RSRRASCSAM DSQVAVTPRR KRQFTQEMST PTRQSLRIKN TPNRELQVDE SVGDMGVILE 840
EVGSEDDSKS FIADDENYGN ELPNDDIDKV DFHGLRDLLK TPKSCSTPRF KGLRELMRTP 900
KIPASPIIGN MAELLETSVG RTPYHDRRST ALARMEQGKP LDNVLKTPSA RNIMVPNEPA 960
STVLKSRKDF LGATTEYDLN VTNNTLHLDK IFEDMSETTG ANMEDTEEIN VTTISTATGV 1020
DPLDSINQNK TVASEALMNV GHTATASASL EDPLTSTTYK ATMHANPNLS DFRSVSPNPN 1080
EMSGIQLLDQ SSDSMFSEAL VVSGVESCEV TVDETRPLGQ TNPPIDNIED RSDTDSNVGL 1140
TEPLVFSDDE EEPKKSEGTT KQSDSAQEPS IAYKLEESVN LTEHKTVDEN SSKYDVETGA 1200
ISEISLIEVE DTTIEASTCE QNLESCKLQE EKSVDIASVN SDENIESAVV ELTILESEIF 1260
PLDTTADSSI NSTNVLDSSV EKTIPDRNLK GGKLQEEKSL DIASVNSDEN IESAVELTIL 1320
ESEIFPLDTT VDSSINSTNV LDSSVEKTIP EHNLNQEDAK ATNDAQDTSV TDEKSSTDLY 1380
RSVGEVLPSN KSDLHTDEPP IVISSSKELA KEAKSGDGNP IDNDNSKPYI ERSEELSEET 1440
KPDKVSSAAE SDRPVTETSP TPKAPGAESK QPLTESHSDE LVKETNSNEV SVDGELIQPA 1500
IETLPFVEQK QEPDVSTDAE SNQHCIESSP LNELSENIQR EEVHTLDNTK QEVKPYPVME 1560
VLKETKTDES IQSVTNTYPT EELQEEAKAD ELPTVSEMVQ AFKETKTDDT IKPIIETSPE 1620
REPLEEAHPD ELPSSSEKDQ VIVETPSAET RPTIETSCDE PLSTAGTDIV PDLSQIDQLI 1680
FEKSPAEEPH EKAKPDETTQ EVIETSHAEK DPIEASPKED SSVSKMDNAV VETNPQTDKL 1740
IPPDEITNCG PVDICIDEFS LRNSSSAFEI EHSEEVATTS LDLDLSEISK EKGNMEALPN 1800
DNACVKHIAC DLANVSSLEE PTNLSPKTRL RGTSMSQDEA AQVDEQNEND HVEEHVAMEV 1860
SSNEIAAHKE SDEKFEDIDE DALSGVKQSS LEENALIHQS LTDQNKEVSI MEESEAVVLE 1920
EPKDSQPNQD FKAEVIRLDA SIIVEKDTLD ESTSVGEDKD QSTEISMLKE QSNADQIKDV 1980
SSMDKKEFTE CIPKKDSDEK DNDDKVIQLD ASSLDESYTE ASVLVEKHSD MEVNTKETEN 2040
RNQNHSDDEV ISLDSSSIAE ECPLNKDSII EDPNDDSIIE GIVIEDNDVK ECVSEESSNQ 2100
NEVTEPPNHI SIISEEPSEK EICPKISTLN AANSSPRQVE SQKMYEQINQ VETAVLKNSS 2160
SPNGELPNAS SAREIELPIP KVEDVSDAMA DFQCAEDGGE VEKVDIMSEE IVKEQQPQPG 2220
DPANSSSTGQ GPESTVVEAI IVLPHEDAVA KIEPAPFNSS TNSLVVPSAN EPHEVVSIEE 2280
SADSTKTSQG EDSTDNLQGK GNFPMEPDTN QEQEDGTLTK PVSGDTSLTK TESTSAVDSD 2340
QNLLSVKDEA GSSQDKPSEK SIHSTSTDSK SKERTDITTS KRTIFPESTE MSGLISKPSN 2400
SEEKLTHEEE SKPVKRVNKK GSASADGATE AERPKRVSRK PSAEVLEIED HGSDTKQEDD 2460
HSIKSRGRAR KPSSDVDEDK KEAITEKRRG RVRTPSVELS ETTANVEQKE IAGHKSDEAL 2520
KPILEEEPEP EVEKRIESNA EQVIIIDKPK KRLRKASSKE SDTLLDNKEK INTDLEVANK 2580
AESTHSNNSE QRDVREPFVS TTKSEVEADL EGEMEVQETN QKHTTKERPK RRGRKASANE 2640
TEDASEKLEK AEDHLKAINT DGKLAVVLPT TSNEIDNTNT EIVADLEIKP KRRARKASAK 2700
ETNTTSDKKE KTVDILAPVK EVEPSLSIKK NSDHSESKGP QEPAPVATLN ISKEEQDPDN 2760
VPNVSSENMA PKEPKKQERP KRRVRKASED IVEVNAAIDA HNPETALASV QEDHLNQADQ 2820
EMPKKRPRKS AELMLANVEE HAKEDHLERP KRRGRKPSLD TDHVSQDVPE KPKRRGKTPA 2880
DRETQPLGDE KQEPSLPVVE PAKKTRRNAR KASAETVEAV STSGEEHLAQ IEEATEPVTE 2940
SEPAQSELLP EEGEENKTRR RGRKPTVDTD EGPIKKTPAE EPVPLPSHSR RRGRKATEDE 3000
VLPTADLAVP KTKLRGRRAS MEPEHKDELT AESSSEVVEL PTAKTTRRGR KPSVDMEATI 3060
PEKKPPSRRG RKASASVDEE QPAAKKTAIR RGRKNEAHED EERGHIDLQD LPTEIASPLV 3120
DTSGSPSKAS DAEELTPRRR EGRNLPRKNY EEAPDDEKPH SGLRRARKPA ATKSLASKAP 3180
EPDPVTPSPV TNQPPVKSED TPDNTVSLSE PTTSQRREGR NLPRKNYTED LDDEMPTPAR 3240
SRRVRNLTAK ALELIVDSSP RPVTPKRPKG KAANSEEPPA KKPTPEFPST TEAAGPEHEP 3300
IPATKGRGTR RKADDTDLEK PDVKTAKKTV RGAGGKTKVE TETEKQPPIK KPRGGARTKT 3360
PSEEAPQEEE QVKKSAARSR AKGTKAVEPE EPAEDPQVEA SFSKSTASVR GGRARKVHFE 3420
AAPETSASEV APQRATRSRR K 3441 
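
The peptides listed under Lysine Modification appear to be 15-residue windows of this sequence, with the modified lysine in the middle and 7 residues on each side. That window size is inferred from the three entries in this record, not stated by it. The following minimal Python sketch shows how a listed position can be checked against the numbered sequence block. The helper names `parse_sequence_block` and `site_window` are hypothetical, and only residues 181-300 are pasted in to keep the example short.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Drop spaces, line breaks and running residue counts, keeping only letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def site_window(sequence: str, position: int, flank: int = 7, offset: int = 1) -> str:
    """Return the residues within `flank` of a 1-based `position`.

    `offset` is the 1-based position of the first residue in `sequence`,
    so a fragment of the full protein can be used (assumption: flank of 7
    matches the 15-residue peptides shown in this record).
    """
    idx = position - offset
    if not 0 <= idx < len(sequence):
        raise ValueError("position lies outside the supplied sequence")
    start = max(0, idx - flank)
    return sequence[start: idx + flank + 1]


# Residues 181-300, copied from the two sequence lines ending in 240 and 300.
fragment = """
IQRSTEETAY ESPKVNLLEA TQNKENTATP PGSHQKLLRL CAISDVVITS YSPRETGVKI 240
EKSFSCVRKP GYTTTSLAVS TPKSVYSTPK GNVLSELNED SCSRDLMDFS TPSTSKKTKR 300
"""

seq = parse_sequence_block(fragment)
site_residue = seq[239 - 181]            # residue at position 239
peptide = site_window(seq, 239, offset=181)
print(site_residue, peptide)
```

For position 239 the residue is a lysine and the window matches the first peptide in the table. The same check can be repeated for positions 939 and 2647 by pasting in the corresponding sequence lines and adjusting `offset`.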
Gene Ontology
  
Interpro
 IPR000253; FHA_dom.
 IPR008984; SMAD_FHA_domain. 
Pfam
  
SMART
  
PROSITE
  
PRINTS