CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032782
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_079860 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
57LPPSASGKLLPDQSSacetylation[1]
2175AGGSGADKGNFWSGRacetylation[1, 2]
2777SESDGAGKFMSATSAacetylation[2]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
 [2] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2806 AA 
Protein Sequence
MPPGGPPPSA MGLGPHAAAS ESGAASRPSG SSGAHPASRL SSEYNRTTPL PPSASGKLLP 60
DQSSNPSTVA ASDFSSQDGA TPGSPVPRDA SLSGEAGPLP SSSESERGDT ANPVSGGVSE 120
SRCSADSAAP VSDKSRTDRG LSPVRDTATS GADPAAGREA AEAAGATGKA AGHADARKGR 180
KDPASEDAGG TREPSAASAP PRGPAPQPPP VPKVAAALRK LGVSELPQIS FDSAGGDGEG 240
RGRSERRQRR TQAHTKGLFK RMQDGLGREV ETPSQTTSGG GKRGGQGAGA SQKQDSHKTP 300
PADSGQATSR QWRARPGTSP AEGAAGMPGT STHGGSGAAG GLPRASGLSP TAVFPAELGS 360
QNSSAPMGVF AEQGANASSP SPLPFGRIEG PAGPGNPSGV PCGVGSSLKK EAGGQRHPKK 420
GVSGAAQKRR NATGGTEGSD RGGEWRQPLP GQSLDAGRGG DGAHRPPLDS ALASMVRDVR 480
IEQFGPDGVE KNAENVMAIS TWRVTWIDGS WKVHTAAFEP NPSRDPYPLA ATEACAHFAL 540
RLHRMFRETY TQQGAATPPR FPPLGHLLGG GAVTGAPPGG PWDGVKPTKV WNMEEMQRLH 600
QHVATVVFAT VGPASASGSQ FLGPGGPAPP RDPRHPGAYF YLHRASASPG AGTGLDGSVD 660
PRAGSMSPSL ASVSSTAPCT SSPAVPVGEG DGKLAFSPFQ PSSLPGGPYS AGAARFPFLG 720
GGPNPGVESQ GFGFNMFPGT PTRVHGTLEH PAHAYTPQGM GVSSSQAVPG FAGGIFPGGS 780
PVFEGGFQSP FYGAFSISGV AGDQRSLYAP VARAGVPEQM QGSQLSGRAR AVDSGPSFFV 840
PESNTLFHPG CSPTSAASRG TIASSLPSRV GGALDEAAGG SCRLHSAGSK SAKGFESLFP 900
TCSRAKRSAV SVSATRRGGA PLWGVNSHVA AALAAAAARR TAEEQSLLEE ARREAEREAL 960
LSKLLGEDDT FFESGLSMRS HRLFRATPRP AERGLIFKEK ASLCEQIDQE ASGGNEDEGR 1020
EVRSDNSPHA TRGRKGDGQE ENRPRSQAAS TSEDNAGIAP TVHIKREAGQ QGTARGRGDR 1080
GETPQTDRAT EADLASRKDD MSPNDGEESK KPKTNRKLPS GREAEGERTD KRRRRPSHLL 1140
ESGRASGAGE QQRDVVDAAG AEALSFDSEK RTSTVSVLLP SGKDEATETI PTVEEHPVPH 1200
AKDISRSFVF EPSNTGLIVV ELSEKDEKGE VSEDEDALDV SLSGAAGASE DSKSPANQEN 1260
LLPWEFPDKT RLLLQSVANQ SADFSETCSF GADQRTRDAA LIRDLEPFLR LGEKNPYFDA 1320
RGTQKNILSG CMMSSSSLFN VWKIVNRLRL QDERRPEERD RLARVAQEQS QGARRRLGAV 1380
LNELAALERE DPDDVESAEG DEGEGEQSGK EMKDAPETVD AKASHSSGDT LASPVAFLPA 1440
AAEGEAENGH EEPANGDPSA AKSGGSLLAQ GEREKTREDL PDAFKGNDKL RSDMPQSTPI 1500
DGLPALGAAC SPLELETKTP AILASGGEAH RCCPAGEGGE DRMPEDAKDR TRTGELESTS 1560
DRANLSTGSA QLKGASAAEK RLEGGAEAKQ TLRDSNDGAE GSENDTNQLL PTQGGPQERT 1620
QNAPGSPLEG EQALSTVGRC GGGDKEREKE KDALIDTGER SPGIGERSSK TTAPELEKQV 1680
GDTVKGDRGR DETALAGTAE SLSLGEEGTD QGEKAFGDMG EQLVSPPQGA GLSREEGNAK 1740
VFGDRSALQL TPVSCPPLTL SGEDGDKEKN APALLSAFVK TEEGCATDIQ GKEEGAKESP 1800
TPTPRGSPST SARNMAPRGS AGERQDAKRL GKSYATRRAQ KRKRLELLSS SCEEIEQRLR 1860
RLSDTFPWLP VLNEAFFSAE SKGEVLTCVD TLLERSDEDA ALKGGHGDPS FASRPCGGAG 1920
LLEAVGGGLK GPFRDAVFAS EGEELEKMDE SEDEEAALVP EDLWLARASA LAEAKAEKRA 1980
LADARSAAAN ACAAAAMAFS SYSPRPYSME PFPQASRRPF VVPPPAMMPA GAEEAGAAVR 2040
VNSVQMHASP VPFSPFVGGR VSGMPDEAEG GFAGTRRFSA STQSSRARGG AVLDAPHMDP 2100
AMAEHLQNRS SSASSRESLS TGGVATQGDV WAEGYLDGAL SADTGSSAAN AGWGSHLQGS 2160
AFRSGKRAGG SGADKGNFWS GRGPAETYPG ELGSGAAWGS GAEMAGLDSS ARGVRGSDSH 2220
GSGRRQKASG ATGHAPSGDR TRRRSSVASS VTSSVSASRG GSGLPASGLR SDCAGSVHMT 2280
EGSDTGSQGP SRSASIIMGP PSPIRPSQYT PASAYPNSPA VSHASCHTSF QPQHAPREGG 2340
AERGASWQSL QHAAAGGTGN LEEGDALAGR SAPMGDPSSA GGGGRFRPTV GAMSAQMRGY 2400
PAHPFAAHGP GLGSTLPGLP SNYGKADGYP AGLTFPGTPA SGTATPFQGG YPISDASFGG 2460
NASGACSMGY SSMPAPGYGP EMPQVPGRFA NFSPFFRGEK GLSAGDRTPQ LAAGTPSSPS 2520
TSGMQTGQSA PPLSPWFDHN NAVQHEAGDF GAREHMSSPQ VSASCLGASQ LAGSRQAYAT 2580
IGGENINVPD SETEPLHGRG AATDTSTGAK GGSGARAGAA SGGGEGKASR KRKPKSKKEG 2640
EGHSAVGDEV APQHVLQETN ACSSATGAAP LPVCPKQMVP LRGGEGEGHE QSPLPEVPES 2700
GGQTTCSPYS ISPSSVSGGT GSGTEFGAGY AATVVLHGTP PTQRKTLADK KRRSKSVSDN 2760
RGTCGSSLIS ESDGAGKFMS ATSAAAAATT PSRKKVPPAA ASGGRK 2806 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS