CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032743
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Bromodomain-containing protein, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_055290 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1881RLPAAGGKDADLRAQacetylation[1, 2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Bromodomain; Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2648 AA 
Protein Sequence
MNAERQAAPS GAPAHSADPS GRLSVAAASA FPTYPPFSNL TSPGHPVHLV GSTPAANASF 60
PHRLPFSTSV ASASYAASLS FAAPFLAPGP AQAPGISGQV LGISGQAPVV PGQAPVVPGQ 120
APGAPHSGSR PPAGFGPGGA PLRPGVGPPY LKDGAPRPGL GAEALAEKEG ASGAGRRDLG 180
RLAFDSQLEA QRMQPASAAP ALLRQGYVSS VSTVPLSPEQ IEAAARSAAG DLEAASPLSG 240
VQSRLRPAVA QTFSPSSALP PASRGPAGPA APGFLDVASS SSHPTAAWSS SSTPALAPSS 300
STRAPAAAAP PLDLSPVSPL CPAGPSSTPQ ELLDLLYPAV SAMPWHSASA KSFAASHAAA 360
SLSSRRLPTQ KIGATAREQS CNASPGSAAS PHFQQLRCGL VVTQVDQRDL WRRPREGTLE 420
AQAPDASPNC MREPGAGSES EEGMVSEDRE SRQDEGLWAE REGGDPAVCS GQDAEASVGR 480
GHKALGRGEG TLFESRVKCQ DFVEETLEQV YLLREFGGIF SVARERPESL TLFGCLGKAC 540
QFAPSPSVSA GVSPEDFASV LFAPAPASPL ALFSGASAGS LSPALAAEWR RLFASLRLSA 600
CTFWLLTVQT EMEIAEANRR FRKTVEVWQL QREVLDVQDR LYKHQEEAPR GASSSVARHA 660
WKVKRRALFH SYAQIAQKYR EALSEQQRIF DVTADAFSQV TASQLFDANE ILPNFFARTG 720
NSQLTLRNLG FPRCCCTDAA AAVAAAGALR GFRRLPGCTH TCGCRVCVFG LAGRVGPTPE 780
ARETTGEGDA SEGSSFPAYA YGGDSGAADS LPTSRVSNCK GVETGTGVFD SAFVEAGPGR 840
ASLASTAPLA VSADTPGTQE AETARESGAE KTGADPTLGP ESLHRSPLPF RDSAGVSRPS 900
SSVDLPGGSS SAFCPEAPKA GELRHAEPVS RPPVESDENA YSDGNARVLP LSAKPLEAGK 960
DAAALAGPRS GSPPEDSVVA APLTGGQRAH ALSGGTHGGS LRADLDDEFE FDDADLEDAH 1020
HALHPGARAP AEAAPAATAG VSEGAAGPGG AAEPLGRHAG PRASGSRDLP AGGAGASPGA 1080
PGGAQGPNAG TQSVSGVSAF PEDEGPTEGK VLARPVVLRE PLWGEVEGTL ECDMTRMHRL 1140
LHWKKRDTET LAVAAAAAAR SAAEKAGSLL ISDSGVEEGL AALSSAATAA ALAVEEEETA 1200
LATERTELLK QAEGNLERAE EDEVSEHGVT LAVSGGASGG PPETPGDSSS AMAAIGAFYD 1260
AEGALAGVSR TAALRLRRKA KQEVLKLDKQ KRIELVKQGK SALELHAMAA AEQETRSSKA 1320
RSPLQEFVVT LGEFSLKEKL GSLLQRLNTS FDAFQADRRL HCRGKEQGLR GGIPLIHTKI 1380
AFQLQGTHPV AGRRGRLAML RFHRPDFRAG LVSSRTFFDG SAASSCTRED RRRKAAVQSV 1440
EQSPWVILPP EPSAVMNRLR EELCRGAPEG EASFPRLEEE SSAGSRETGP QREQAAERGP 1500
GTTGALRLDG EGSGETIHLA NLFYTPQDLS LRDDAPIAVL EYMEQQPLLM HNPGMAIRIV 1560
RHFVPHNAPN SELSRADQLV HAERQVKGRL GPFGELQLQQ DDAKLTLFGT RLPLARGQGQ 1620
AVAESPLLKA PVYVHPTAPN APVKRDSRFH DYRDTDFILV RTRAKDRCKV YLRPLLYPPP 1680
EKNSAREGFS FYRSGGSLES LSVLGSPANC GVYTVGQCEP RMEVHAPNSK KHVEDRKLHA 1740
KAWALRFAAE KNVTDMKRVK DLVKQRFCPP VVEKEVTQML KLLSPIAPHR LPRLDDAALR 1800
SIIRPELICC LEAAHAALFR LKAIGIMTLT SADGLAGVAQ FIEKEERHAQ EKVLAAKKRL 1860
KEIEMKFRQQ AEARLPAAGG KDADLRAQSL EAANAACQRA HAALSSVCTA YAESRIGKRY 1920
GHLVRFIEEL LLLTPWNLTK ECKDVLTNKG SAQFMLSGFG DPSGGRGEGI SLIKRLNRER 1980
AGLSFLSGAS RNFGFSRDGA RSLAGAFGPG GIFLGAHLGA AAGGGVAGTG EDLRKLSMQE 2040
LRRRLVQYGL SESVIRTLPR WDQVALVRQY RDGFGNADFE NEEEGKKAKG GLAKTGRLRG 2100
EEYEERLTDI LQRQKKALEA DAPAITDTED EGEEDAAREN AADVDLLEEQ KAKKGKKVTK 2160
DEERHKAHAK KVGTASPGNL ANGDPNHGQD KGVEDALLAA LDGEEEEKEA DEDELERREL 2220
QALRQRQEAR AAKPLTAEEQ QRADMKVQAV PCLRWIRRFR QQAGEPFATE RVVLIYGEKN 2280
IRNFIQWRTK RLEDTRARML ASTRSKEALA GKRVCRACGQ PGHIASNVNC PLYSGPKRPA 2340
AALTPPPPVK KRRKENLQLE EDLLMGLCPE GENESEVNAF GLSQTSGTPG RQRAGGAASS 2400
GGRGRSARGA VWEEEDEFGD AASSLVSYSR RRAGTRSAGS SADGHTQFFG GKAGEAGGRK 2460
KASRRRGAGG GGGGRGRESL EEEEERTQDE EDEEEERWTP ERNRRSAIDE LNIALARIVN 2520
VVLQQQIFKP FWQRVDERYA PNYYRVIANP MWLQKIASKC KQREYVSGEQ FLADVRLVVT 2580
NCFLFNPPNS PSAWLRERAT NLEETVRQKF EEQRATVAEC EAVIMSHGVG ITGMAADGVA 2640
AHMGYPMM 2648 
Gene Ontology
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR001487; Bromodomain.
 IPR018359; Bromodomain_CS.
 IPR022591; TFIID_sub1_DUF3591.
 IPR001878; Znf_CCHC. 
Pfam
 PF00439; Bromodomain
 PF12157; DUF3591 
SMART
 SM00297; BROMO
 SM00343; ZnF_C2HC 
PROSITE
 PS00633; BROMODOMAIN_1
 PS50014; BROMODOMAIN_2 
PRINTS
 PR00503; BROMODOMAIN.