CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032689
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_084040 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
128RNSAAFGKTVSRLDTacetylation[1]
1269GGETASEKKEMAEPVacetylation[2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2564 AA 
Protein Sequence
MDGSGESSGH LFKPGHEASA NFPRLHKSVH ALDDKMRGLD AQLYVRPHQT LPLQPRLRET 60
DLCRNGEDGR PGKFDSPHLG SSAGPYGHSF LANPQLTPFV PQHLSSSPPQ PVLSPPGEEG 120
RNSAAFGKTV SRLDTGGGER QDSSEDQVGG TGRQSDQATK ANSGSTPAGC AQTAGLLTDV 180
QSSGTNVEHG REHFSTPQNP ADGSARTCGF RETRVSPSNS SLPRTACRSR LDAFLPQKSV 240
SPDHEHVRGT GGARAFVGGD SPFPEKPDAL PATVTAEIAT EAPPASRDPP VEEFPGAHEL 300
ESLPPPHSGR PPIGEKDGEA ASPGVSRLPS QERVHTLLYP NEKDASSLSR CCPSSMQPPP 360
AGPRQEEARS FSVSAASAPG APPGIVYQAS ACASPATVAS FATPLTTPVG ASAQSEPAAL 420
HAHSRSRTGA HPEALPPGVP GVTSQLGRGA RGDRETLAGG ARPGQDGVCE RRGDVARGRL 480
GGVSVAGDEA AEGTSHKAAL EGAYVQDGCS PQPSNPHAPS GISAPTNGSS EFASSAIPAS 540
TCHDAFVRSP VSGSDCMSVA NPGGPPGALG GLFPSPRGPS GPRPTPHPAQ MAFAFVGQQP 600
VFPGFDASQP AGSTFQYPPI RGAVSGVSPQ PPMHPSSFAQ PVWSPTSVPS SSVSSSVSSS 660
GVSSSAPPPL AVGFQNPCPW RPTAPRDRSE GGAGSPGVSC GSAPPAPTHP TGKGGAAGRA 720
GKQLGQATRF LSSVSGVVYD KGGEKWIARW SENGKPFKKT FAVGKHGFDA ARKMAEDCRL 780
QALYAKRWNS ASGLPASFSK SNSLGRSTPG DRGKTEPTNS AKCKRDTSGE SGCTDTGLRS 840
LNMGGAGDLS SLGHPGTPPR DQEGAPASFL LEGTGVVRSS QVQTPFRLYD SVPSPLRSGD 900
ALGAQRGLVP QLLNNALVGV PFAPPPGASH SGCSAALPPG PGAPVQVSSP HTGFVAPADV 960
EAPPRDGLEG LGGAAEVSPQ IAVQDGGKKG EGLLGSASLS VRRRRKREPD EKFSPGESNA 1020
AVKKTPRPGS FHPHSCPGSE GFRSHDGPGD STEARCAGLP AFQHATAPSS VCWPSTASLP 1080
SLDKAGQRAE HAGPSAFSSF SSVQQSPGSV ETWRPEGDGG PASPARDAGR RGAESEERET 1140
SELAGPFAGV SASAGSASRK GQQKQLTRQI QRQQQLYRQQ EALLQNQEEL FSRLLRRRSR 1200
QERSDVRRRM QRDVSSLRRL PTMLLSPLRD ALVASAARLP LATRGTKRES QKERRDCGAG 1260
IGGETASEKK EMAEPVRVHR RDRGGARDEE KPSTEGVRQA DPKGRKAEGF PTWVIPPNEE 1320
LKAAQVLRAL RVQRRAAARE GKLLESLLVH RGEGEGTFSE ETEGNTEIED AGTESDATVT 1380
QETAEKVVEN VQKMEELESE VEKENERRRE AEDETPKQSS EEAPGVQQSP HKLSTNNEND 1440
ASPQKLTKSV RFAESVAGSS SAVETACAAD EEPLATETLE GRRVGGIPVP ATSSPAPVFP 1500
CTAAQLGDLC MDTLYALGTV RPQWRRQDHR RAFGWHLSQI KPDLILPSLH ASRVLRRLSP 1560
RPSNAVEFPR EELAAASSAA GLVYGEGLSS HHTLRSYVDA FRPLFSSPSS PPLEFLHLSS 1620
GDLLMSLWQL EEGGRAAVID NVLLALDALY ERHTGRRLRG TAPPPFAVSS PSSAPSSLFA 1680
LAHLQGGATS TTPLPATALP SPPFPRVSSA PDSPVFAPDA SHGPSQRRQV SPHVTFETPP 1740
THPRDRDSET SVERNASPET SPQAATLAAP APCDGDREEN FVLAYNPEAK ALRQVNFLAV 1800
GVRVFLHLEV VEEMLHLQAK MQRTPGRDDR ATASSGPSVD DGSGLMTSLP STCSGVSGKK 1860
DPMHWSALFV TVPAPSVSTA ASKPLFVVAE MVDRRLQVPC GEQLLFRPLP LSPAAPSALL 1920
AFAPARVCQL LRAGAMCLTR FTEKEGGKRP RGSAQRCSAA SSFFYSPPPL DLSHLASFAP 1980
AASTLTPPSS PASSPSASAS QTGPGRAKSR GTSPVGPESP EAASTTADGL AVPGSASAVS 2040
TPGVPAGASG ASLGAPAPSP MASPGGSPGR PPKPVCCPAA PGIETAWRCK CSHRRHELQL 2100
EIKQKLRQDK KRCLALIREY PDLSLLVGAP PATPREKETG AKRQAPEGRR TATPSGSGTL 2160
TAKGGDLQGS TPSGAGLLSL ARTSQLEMLA YLVEVDPWKY AKNRQDAPKP EEIPGLLAKY 2220
KAAVRTAEYG RMLQKWRAGQ SREDEGRGGA DGRKDGDGLL SPTASPPSRR KQGKDSSPNS 2280
ASSQASGPAP SPSLSPGAGA AAVLETEKPE PQSPQESPCP LEPAAGQEPR ATSSALPAGS 2340
PPWALPLVPP GGSPRASVSP SVLEELLRIQ TAMSQLAIGT AICVRVKALL GLPAGAEQHI 2400
RGVVTRNALK FPWEKPAAPQ VQAAGPSAGA SRTSPSRRLS GGVVPGDEAG ERREKGGARR 2460
GVSEGDIEKK EDEGTALCAG SRETEADGAG YLTLSLNNRK EEFILSFREV QCLVAQDDLR 2520
LVRTRARQWV SSFGPQPSAD RKGEREEEKE TGGRTRKFVV DEDF 2564 
Gene Ontology
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro. 
Interpro
 IPR001471; AP2/ERF_dom. 
Pfam
 PF00847; AP2 
SMART
  
PROSITE
  
PRINTS