CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032767
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_030930 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1927PSAEGEGKFAPDAGAacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3817 AA 
Protein Sequence
MDFERGESPG DSRGSVAFLH HTEKLERLPG TGETTIRGMS VTPPYICMER PPRANCEALR 60
LKSPPENRAF SSRSESPSPT PFARECASLG LVWGDEGTRA GLLRTRLFTP PDHTPHLLAE 120
TRASCLEDLF PGTVPQLLSH LPSPPPVPGV ARASRSSSPL AGGSSLACAS PWPRSATPFF 180
AANMPALLPG RGPMRVTKWL DGQVSDPQSD SCLRGGASRV ERAAALLCGR SEEEQERERS 240
VDERRLRKAI GVTDEDESER ERETEGGVHE KLSRCAAATA ADRANNLLGL GVERGPEVAG 300
GRLGGYWTTE SEVYPQRIGE LEGEGLGSPD PVAASALVTA VQDSRENLNC LTGVLTTLRL 360
SSRDSEGDFD LPLFVQSRKW RAKYNRRSLD LKRTVARSKA LGYPAGVQIP ETYRDLKNCM 420
QRPPSIDAAD SRAWRSAEAP RVAKKVFSEG RRATDRDEQV AFVEDEVTEQ LLFNANAAVE 480
GTTLYNNLLC KYGLETRCFS TSSAPGNTAF ESRLARSDAD PTASSQSASA LSHAAVSPSL 540
ASALPVSSLL LEDAADAVGD RSELETGSQA EAAIPTSEAS CMRREKHVGE ESRADKGAFL 600
RRASDSTHAE EDGLSGGKDA SSREGGSEER EEAAHEAADS LWSLVLNRNI AALPGLMTVG 660
RYECDLLPKR SAFSRKQLAG LVAGSRPLPV LPSSSDTPGS ASTELLAERV ACALTLDEGE 720
AWNPSDASDL DDFLESSCAP NALRRGRQAV APVRGARRRR GADLGLSPPP SSPAVRCRSL 780
VRWSQQRPFF SNVSACAGAA DSRREEWKDA GKVAKPGSES AWASRELHAS TGLVNAALDS 840
SEQKSGERES SLSPQERILT QVKKELEDER VREKQTIRDK DSEKGQAGES NHHMPGTANG 900
QRTPNEGEAP METEEASTLE PSNGMHRDGQ DAGARMHSSS TRVLEGAVED EPKVTLPDKD 960
EPHASALCGE REKQRQSFFS SVSSREDAQD EDSRWCVAGG MYNGWKGTYD VWIYRRVSAA 1020
LREGKGEEEK RREGEKRKTG KGKQSVHTAS LGAGGAQGPS PGETQAAGLA SGSTPLGSAG 1080
TLSAGRNGEE TCESTGSPAG AFASSSSLAA KGQNGHASVE DLKTQKEESL GCVLSASALP 1140
LNPHSGETRE DSAGRDEEKG EERERDENEP PLYEWRVKRF SALILGHEKA SRLACKYCVY 1200
LERFGRIRGR LSICSTCCRD ACSGCMPSKK RAAGADFSPH CRNGRDAGVG GAGRAPKRRV 1260
QAKKGAAGAA GVCGDRARKG KGEDEPERDG LDRREEGGTP SSKQTAERRG AAKKEGREED 1320
DRVDGKGTSL SLENNSFESS CPAMRSSLRA SFEVKGPLSP SLADDRPNEG AAGRGAPPGS 1380
EGPSRDLALR SHSFSSASSS RKSAKNAAES LRRIAGPLFR SSGDLTASQL GAEVEESDVL 1440
QDVFELYSEA GEAWETCTTP VSFSPSLSVA SRDTLVVLGG SQTTAVARLD SGKMSEAVRR 1500
SSNALSAAAS SFPKGKGFGG ASKKTDSVTL SFLARVCRNL RMFLLLCQHN TVAGGLPGDS 1560
KCVCRAQSGP GGAGLAGADG RAPGDLGDSK GTAVARGPVG AAGRAHGSEP WASPNYTGGP 1620
FFPPAGSAPS GWPPVAQANS RPEVLSAIQG AQGQGPHTAH SLRLAASLSP AQTTSESFLA 1680
PESFAAGVRP LLEGSLSVLI PEEPQVGLGP SAGQQLASSS LSPGVSVKAE PSSYFQSAQG 1740
TGRDVSAGAR TAMPSSFREQ GRPGAAPGHA PSGVGRCPPQ GRDASPGCPG FRTPPAGFDG 1800
PSSSGAGYSL SPYGYPGTEI SPHLAPFFPE PYRRFRESRG GPAWIHSPGS VDVPSSGLQS 1860
PFTGFHATSG SSPPRLGPSE GASFAEASPR ALAGDLGPAG FLGASAGAPA AEGRGPLFDP 1920
SAEGEGKFAP DAGALGTVEG PADCRTQGET GRTADEDEKK KAKKAKKHGR ITDIEERLAR 1980
EEPYDVVEEG DDPEPTRQLG LEATEKEQDV PRSGDSKSPD QDSPGQPADI MHGYFKARVR 2040
NRRVKDGLLL RMTAVLVGKG FYDLETVEPG APRRRGGWGE SGEEEEESET KYLFSNPASQ 2100
KPCDFILYFD TRENRDASVA ILNQALPAPP PRLPPKNGES QARRTLRQLY DHFLEPKCQC 2160
LEDKTLKVKH GVINLLGFPR LYVKLHCSMS WDERLSLFSS FLHWLCREDD SQPPPWSSPE 2220
LHPELLAYLV DLGRKGFASG GAATTAVVNA PDLPLDDSAL SKKNAALIRA YMQQDTGASG 2280
PSGSVGATSS DPEAPRKDDE AEEGEKDDSN AALVEGPAPE TSGDSTGAAQ RCGKGREERE 2340
AGDKRGPRNE GCGKGDGFGS PVAVAGTTAA PGETESVSCP SSTSGGGASS ALSSGPSDSA 2400
AAPDGCESSP VALESASLLS FSPSAARAEV LTVPGVGLVN FSLPDGVKFD KSKLAFRCYW 2460
REGHAGVVTV GAGAAVSPSS GAGTFVPSRP TVCTAQNKSR TFSCRKYGLY QSRVLALQAR 2520
LLSELLWPQP PSPARLRVSA MAAVVYGLIA APMPFTDPWQ AVCGVSVAED ALRQRREVWK 2580
NLLDPRQRRP APAPISQLSL PPVSGPPHAS GATQELPNRP GTPWPGQETV CGARGPAPGL 2640
ASAWATYGNP GDRDAAEPQS TYVGRGPAGA EGPGGGIAVH REWARNSGSE AAQPCQFGRA 2700
VERPVPGPQS SLGPGGDNRG DHMAYDQSPA GPASNAPGPT PPFVGPFSPG LVLRHGPPAF 2760
SQDPSLHRPP FAAGTGPAGQ RLASDSPYPL KNEASPQLAM AHAPGFEKSD GFQGEQPLAK 2820
QRKIEGASDR PVPDEGQVLG TISHGKSPAA RPVDGGFAPD GRSPFFSQDA SGVGGGRPSG 2880
VGGQLAAGGK GHFATAPFGS GTLPTTRGPS QPGGDGLSHR SGTEPAAAYS SPAGAAYPSA 2940
SNASPIYGAA PKREGDSPFG PAPPSGYCRP GSPAVDPKLP GSVPSSGNLD SVNYGSFFPG 3000
QQAPQGDGRI APWGSGHVGA PRGEARGSER VGHAGASRGL TGHELEEGQG GPGEEGAGRE 3060
RQRKRRKSAM SMSSQGENTP LFAPTSLPPV PFASGDSLAD GSGSDFGQQL GPPFSHGSHA 3120
PPFPEANAVG SQHFTADNLE TPGLPADLGG GDGRRQSGST HEEVSGPRAG GEKGEFSLEG 3180
APQAAAQQLS AETLTFLLGT NVVWEENEKR WRVQVRPPSP RGCDGEGADG KLGGEKKKRK 3240
RDGFSAGGER RRSSTGNEPD DQHKAGTLEW VSMAQLHQAQ KLQNQLVGKM ERGKGEGGDE 3300
ERLGGDGRGN IFFDANGSDE NAKKAALLKA RRWLRRRIVQ GQILVTGLSR DGLFSSRPDE 3360
PERSSSVSTG AFTGSSPNDK PTDLNAAVPP LSPFFSPIPF GATTAPHRPS PGFYPPAPAH 3420
PTEDGCRPPM PAPVPMHAPQ GPVDSRTYRG ARPVYPGSDV TPQTCHGVRP ESMQEEGRAA 3480
LLAEQGSAFF VSGDGKGDNR GATVGQIRQG TVRVMQSQTA SQSLDQGFDL PHPPAPGPAY 3540
RGVPVGHGPS GPYYLNGGCV AQRPYATFSN LAGPVQGSFP PLEFSNGGLP TTAVGRRGSD 3600
SGPQGPGRNA SQMQPGFASR PHGPERLGRE SAPQSGAPPG FSPHAPGRGE RDRPSFSGAT 3660
TMPLANLTAF SQPAAGPMFV GTEGRGQQGD IHPNLCGVAP VGGPRGPAHA PMPAYGPGGA 3720
AGPPRDDRRA EGGAPGVSHS DIFLANDRRL HPEMCLHSAP SWGPAGTFAS PDNRQNAEPW 3780
PAAHASSNNF FDYTGVNMPA AGPPIQLDWS KVRGAGG 3817 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS