CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-036513
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Enhancer of bithorax, isoform E 
Protein Synonyms/Alias
 Enhancer of bithorax, isoform J 
Gene Name
 E(bx) 
Gene Synonyms/Alias
 CG32346; Dmel_CG32346 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position
Peptide
Type
References
1167KKSYIGTKDVLDQTLacetylation[1]
1542GSRRVIVKNPDGTTRacetylation[1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702
Functional Description
  
Sequence Annotation
  
Keyword
 Bromodomain; Complete proteome; Metal-binding; Reference proteome; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2669 AA 
Protein Sequence
MSGRGSRKRG RPPKTPNERA SGRFNYQLLK KPKYLSEGKS QPSTPSASRG ISPQSDEGSR 60
SSHNNHTNRS RGSAAKRGRG RKSAVQPNTS SYSGRKGYES EYHYGSDFGD SEEDKSDNED 120
DMLLTPSDDE SLEVANESES EFSVCSFNQN GVGRPPRPPS PEPVWLQEGR QYAALDLPDS 180
SEDLFIANTH VLRALSIYEV LRRFRHMVRL SPFRFEDLCA ALACEEQSAL LTEVHIMLLK 240
AILREEDAQG THFGPLDQKD TVNISLYLID SITWPEVLRS YVESDKTFDR NVFHILSHTE 300
YPYTGIDNRL EVLQFLSDQF LTSNSIRDVM LQEGPIHYDD HCRVCHRLGD LLCCETCPAV 360
YHLECVDPPM NDVPTEDWQC GLCRSHKVSG VVDCVLPQEK QGVLIRHDSL GVDRHGRKYW 420
FIARRIFIED QENFTCWYYS TTSKLKLLLS RLDAEELETR LHSQITERRD EIERQMKLTE 480
TLTNEHKHTK RSVIEIEQEA KNELLEKEVL DEDEKDGDAK SESQSIEGTK KQEECKMVTR 540
QKSNQLTNGT LHFKLGMEQG FKNYVNQYST NPIALNKPQR NEERDKRRHL SHKFSLTTAS 600
DFKWIGITMG TTDNMITTLR QTLINFESNI AASFLNINWV VNKKIWNAAV MNARRPSEFA 660
VVLLLFQASL KSVVFANVWH EQLGHTTLQR ITSAEREERK KLEKREKRER DDEEERNRLA 720
FNYIKYTLGL KHQVWKQKGE EYRVHGQWGW LWLSSSRRCG VRARRAQPLT HNRVYVHYTM 780
GEENDVNEII LVDPRTQRFM QQCESSNVDG QVCHYLPDQY KNVKVIEDVT EKIKGHIDVS 840
KALNAPGRTY YSKVARKSRL DDLLDRRLKL AEVEEQMASK IPSDMKPLLV SSQNNTANSK 900
QTFLEKRLLR LTEVQAKGGP ANVNLELVNS LAKQIQTVRL QFSQLNRFAK VFRCYTKECN 960
TNSNAVSQIT QNTCYSPLCL QKARAKKELL LLLRKAHTAG NGSKETVAAI LGAVKKPSIL 1020
EQKLTEGKRE STQVAVDDSE EGKPAESEAP LDLLQDWEHA RAHAVPFSDS LLTECILVDQ 1080
ECVTNTKIKQ EVNASSGCNT TPDSNTQDSD KIDYIESMDV CSNVEIESTE DSIVTGLNSG 1140
NAEDVDMTPG WRRKRNQKSK KSYIGTKDVL DQTLDKDIPL NKQNRRFPIT ARPVKRECVK 1200
KYERETFENG NERVYSTSSP RGRVYLLNDA AKLYEQAVKT EDKSTITKKP SYSRYPLISN 1260
FLTHKKKRSL LVLPRFELLK LARLGGKSST NGFHHAAKNN TIWQYQCSRP LFRTCWSYRT 1320
SNATSLSSLA LQLRILWSCL RWDDMIAKPP STDGKHQVTT DTEIVTLELL KLRHSGRYGE 1380
KTSYLRRKVV IPLEMPKTVR EVTSIRSGLR KRKRAESPQP TEPQITEEWV DEDKLELWEI 1440
KFMGEKQEKA RLSAVTRSVA SRQLEASGSN GSNTSTNGAL GVAGRVQLAP KLSEDVKEKM 1500
EQQLKLQRAV HQQRKLVATG EITRSVTPVK GQVIGSRRVI VKNPDGTTRI IQQAVTQVSR 1560
TGGANTAAAA ASPTVGGSTS TQSNPSTSTP HKVQIIRGPD GKVSVRGLNP GQQLVQMPDG 1620
KLHVLTTTTS SNSAGQGNKM KVPIKPASTS SSPAISSAQT TTNPVTPVIK QIAVKHVTKN 1680
SATQSIASSS RVALPLAQIK NKLLLAQQQQ QSTSSSPATS SSPVQKIVSK VVNTSTSGQT 1740
LQQVFVQSGS KLVVGQNAQG QKVIISTSAA QQQGTSPVQQ QQLVQSQPIQ QSPQQISMTQ 1800
VGNQPTQKVI QQIVNTSNVQ QQIVVGGQRI ILSPGQTIVT QRNVPQSQAL QMVQQQIQTQ 1860
QQQQQHHVVQ PQQQFVVQSN QIVQSSPSAQ TKLVKQLVVQ QQSQQTIEEK TQITTTDSNE 1920
TGTQQVLVPN STLAQQLAQG KLQVATVNGQ QVIVKPLGNN QAQIVAHIKH QGDGNAHIVT 1980
SNSATAVPQA NPQTSPVKQQ ALPPQSPQQV VVQQQQIHQQ SPTNFESGVT PITQQPVLTQ 2040
AVQAPAQQQA LSVEESLLQN QPPGTVIKCV TAQVLQTEHG PRIVLQGLVG NDFTAQQLQL 2100
VQTQVKQQLM KAQESNGKLG VLGPTKIYLA VQPENAVQSQ PPPLTPVHQS AAHQQTNNIE 2160
IDADTLATTY EANSTIKDIA INNGDDQENS KCAETENSNI TTNESFAGTS SLLEGSEHDE 2220
PTNLAGLDIS ETDLENKQNE SFVVTRGYIQ KSISNALKQG NLSPELEEKL VCMQKQQENA 2280
NSTNEWETCS RGSVNEEALT PSRQTDDTEW KIRTSLRRPN AMTTSSQFNR ILKKNRSKND 2340
EVAELGEQKQ SQLERHKELL KKNILRKRSL LERNLQSEIH EDVKTKVQRH VRPLSNASPD 2400
EQSENERSGE PNLDFKRTEV QNPRHGAGRP KKLTRKKEKL YCICRTPYDD TKFYVGCDLC 2460
SNWFHGDCVS ITEEASKKLS EFICIDCKRA RETQQLYCSC RQPYDESQFY ICCDKCQDWF 2520
HGRCVGILQS EAEFIDEYVC PECQRKNDAN AANMKKLTSN DVEELKNLIK QMQLHKSAWP 2580
FMEPVDPKEA PDYYKVIKEP MDLKRMEIKL ESNTYTKLSE FIGDMTKIFD NCRYYNPKES 2640
SFYKCAEALE SYFVQKIKNF RENVFDQRT 2669 
Gene Ontology
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif.
 IPR001487; Bromodomain.
 IPR018359; Bromodomain_CS.
 IPR004022; DDT_dom.
 IPR018500; DDT_dom_subgr.
 IPR018501; DDT_dom_superfamily.
 IPR019786; Zinc_finger_PHD-type_CS.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF00439; Bromodomain
 PF02791; DDT
 PF00628; PHD 
SMART
 SM00384; AT_hook
 SM00297; BROMO
 SM00571; DDT
 SM00249; PHD 
PROSITE
 PS00633; BROMODOMAIN_1
 PS50014; BROMODOMAIN_2
 PS50827; DDT
 PS01359; ZF_PHD_1
 PS50016; ZF_PHD_2 
PRINTS
 PR00503; BROMODOMAIN.