CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031793
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 ARID/BRIGHT DNA-binding domain-containing protein 
Protein Synonyms/Alias
 ARID/BRIGHT DNA-binding domain-containing protein, putative 
Gene Name
 TGME49_046170, TGVEG_005630 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1895AIFLGQGKLDSDASSacetylation[1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; DNA-binding; Hydrolase; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2624 AA 
Protein Sequence
MDREQFLESL EKYHLERGHP YVPGRLLGQK VDLYEMFMTA MKQGGFPRIN KNNKWGYLAK 60
HLRLVPKDKP PSAQDLEQVK RYYVKWIRHF EGERVPQHIK KDLIPPGMEL PCKRSSSSSA 120
QAGVSSGARP AGGALGAGPA GLSPNASLLF DPQNPLAAFS STPPNFFPTL GFGPNGVLAA 180
SPYFGLASGG PAAASGTDSP LVGLQLASAH DEQLLGLGGA GPRTRNERRL AQQFGAASGS 240
AGAAGLGPGG GVLEMNRKRS KLFLKYSPEA VAERRWKRRR REALMARGIL PPAVASRVAL 300
LHDPLRPADL LPPLPFSSSG ASAATVSEDE ASLHIVSLVA ATLSNILFEA GNVNFLAGVS 360
RQCLLDASVA AGLQVAREGD RDRVSEPASG SASSFWEEVF AAARGASLLR TGGPIVHAAS 420
GGRFSGVSGG SKSLSTAVAS PTAGCLGAAV GETTVADGLL VAEGGGNRHA EEERRLAVLR 480
EDERRTRNEV NAEFADEEEE CSETEAVELE EEILLGLSEK SDIAFTRQCR MFRRPPSSSG 540
SSEASSLTGG LLLGAFSSPG SAGAGVSSSG ALGSNGFLSL SSGGGLGASG DGAGPSGGRT 600
REQERLAARV RERTLKGLAS ALLSALDCCA IAAYEGERRI STVCTESAMR MLDAFLVRER 660
KQRQTQLSKA VDAASPVVSG ELGAGKRAGD GEKEVEAAVA EKGEEVEVSA EKKVGEESAM 720
KEVSWSPRHR SQGSDAPCLL LSNAKKAVGA LADALAAVGD DEAREEGKEL REQQVCRLLV 780
AAAHGDVAQL LQLLIQEQAS GVAREESNAA SFLAECKRQE TRELEAPAFG LLAGYGAARK 840
AVAIAASGRE AGLMRAEAEE RRRAEGPMEK DESGDQADGG RARESAREEK RRDPSASESE 900
ISAYSCSLAS PDPSAGVTVG VSSAGPLATD VEMMETEAPQ RQRSFERTLN TRLEGESLLP 960
TLSTSPAKEV NRPSGATTSV VASVNGGRLV ASSASLKGRA VEEVVSSAEV SETQNLLFGP 1020
LVSLVAENSS PLFSASILRP SGKTEKDGDK KGLCFSQSAC GCQTPPVSRL PGEPRPEGRM 1080
REEEMARREE AEGEETPSEP DGVLALAATP LLLGHLEADQ SEFAFDFAAF LSPGVSREEA 1140
ASRLEALARA DGRAASLVSQ NPLVKQFVSD LGLFKSQATV RPTKKKKASP LEKASALSLD 1200
RFLSSEVCRA LQAVQRVSEA TPASRLKQLA FYRHTARILE SLLSFSASSL GQAPASWEQP 1260
HGLAAFLLAL QLLMLGDPQT LGKALEPVYS RVVAAAALLL PGIGTRQLSP PHTGAEGLRG 1320
PEAEKKLDGD AERREGETKP DVEGERHRVE KHESERREGE KEVGECGEEE ASNEKIAREK 1380
RAQVEGEETG AIGSENKSAS KGKRDDAKAK DGSAEISRRE EDTADGELAN AERIRLAGWA 1440
KLKEHSTQLS PHLETGIACV KAFTALVALL PPPPSPSLFA SSSSLLYRDA PQSSWLALSF 1500
VSAVARFLKS LQEISKTRIP RVLQCAPGLH FLQAVAEFAS QMLLFRQTAI YAHLEIFLAV 1560
ASAVLELEIS VPSIQLPRIL IKACLAVIYL TADGPAEALC EVLECVGKPC GRETQETSTR 1620
QRQGRKRPSP AAVEAAAKLF IAPLLRFLQF RMHRTVRLQC QSQLADPLAP STALFEAEAN 1680
RRLVSPLCPS PFATVIPHFF VSACPKGSVS LEKSRGGWAY SHYIGAIDEI PDAAWQATKA 1740
PPSKPGEPSG EPKATAQASP AALSALTSRP DDADSAPQGP FFTAPRVPLG EESATPGLLP 1800
LSEGEEREGW KRRAGTEDDD CETEAYRRSA QVIAECAFPF VVPDAARARD EEKETPKTRA 1860
LRTEAVGRKT EDTPTGKKGS GEGIGAVAIF LGQGKLDSDA SSGKMPRPRD VGGGPPSRTR 1920
QGPGTCEAEP AERAGATVLG KDPLPALSSL LPGEKEREEE PSPTSSSHSA LEKKRDDGSD 1980
VEGREDDREE TSGPSPLFGQ DETHGAPQPS NRDDEDRGAS ANAFYANASG YYPSGPPSQG 2040
DPAGGASYPF YYASSEPHGP SAPGMWQTAP IGGSPQGPGM YPGSSFGPQG FAAQAQVANR 2100
VNPQGQWMGM HPSSQHYASS YFPENYYSSS SSWRQQQWLQ WQHEQQRAVA RQVCGAGSAG 2160
SGGGDSPFFP SAYPHMVQGQ PPVQNWGGNA YQGGWQPGQG ESRDIPFFDV NSAGAPALGA 2220
PGSQQGAGGV AAGARVGNEG EEPQNAWPTP HMQGSQEAGA VGRSEDPPEA TRGPPAHRTS 2280
HPGVAPFEAR PRDGECPTYP REMGGPHGPD ARPSHPSLAQ MSPGYTPASG QAPVASFLPS 2340
APFGASQDRA QMYGGAGAPG QEAPRPEAYG SGDPGTLGGA SQGDGAHAAL RDDASCAHGL 2400
LRESGEKEET RNEREEREGK GGCRSGSPTS PGADGAGASK APPRPAAVHA ASPTGTKKKM 2460
KKEDLKAGVT DSLSAAAVAA AVAFGDEEAK VDEDVSVGVS ALLLLASNSL TLPLLRPHLP 2520
TITELAWYKT TASSCLFSVV QELLSCFRDA PPVHLRGFHD GAAYFPSCVK EATGTADAAN 2580
EAARKARISE EFAKLGNSQH LLCVRKVGQA GKLLSVPEII EMRG 2624 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW. 
Interpro
 IPR001606; ARID/BRIGHT_DNA-bd. 
Pfam
 PF01388; ARID 
SMART
  
PROSITE
 PS51011; ARID 
PRINTS