CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032767
UniProt Accession
B9QN68_TOXGO; B9QN68
Genbank Protein ID
EEE28214.1
Genbank Nucleotide ID
EQ970698
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_030930
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position    Peptide              Type           References
1927        PSAEGEG K FAPDAGA    acetylation    [1]
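The peptide column appears to show the modified lysine with seven flanking residues on each side. A minimal sketch of how that window could be recovered from the protein sequence is given below; `site_window` is a hypothetical helper written for illustration (it is not part of CPLM), and the fragment is residues 1861-1980 copied from the Protein Sequence field of this record.

```python
def site_window(sequence, position, flank=7):
    """Return (left flank, residue, right flank) for a 1-based position.

    Hypothetical helper for illustration only; flank=7 is an assumption
    inferred from the 7 + K + 7 layout of the peptide column.
    """
    idx = position - 1  # convert to 0-based index
    if not 0 <= idx < len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    left = sequence[max(0, idx - flank):idx]
    right = sequence[idx + 1:idx + 1 + flank]
    return left, sequence[idx], right


# Residues 1861-1980 of this record (two rows of the sequence block).
FRAGMENT_START = 1861
FRAGMENT = (
    "PFTGFHATSGSSPPRLGPSEGASFAEASPRALAGDLGPAGFLGASAGAPAAEGRGPLFDP"
    "SAEGEGKFAPDAGALGTVEGPADCRTQGETGRTADEDEKKKAKKAKKHGRITDIEERLAR"
)

# Translate the full-length position 1927 into a position within the fragment.
left, residue, right = site_window(FRAGMENT, 1927 - FRAGMENT_START + 1)
print(left, residue, right)
```

With the full 3817-residue sequence loaded, the same call would take the position from the table directly, without the offset arithmetic.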
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
3817 AA
Protein Sequence
MDFERGESPG DSRGSVAFLH HTEKLERLPG TGETTIRGMS VTPPYICMER PPRANCEALR 60
LKSPPENRAF SSRSESPSPT PFARECASLG LVWGDEGTRA GLLRTRLFTP PDHTPHLLAE 120
TRASCLEDLF PGTVPQLLSH LPSPPPVPGV ARASRSSSPL AGGSSLACAS PWPRSATPFF 180
AANMPALLPG RGPMRVTKWL DGQVSDPQSD SCLRGGASRV ERAAALLCGR SEEEQERERS 240
VDERRLRKAI GVTDEDESER ERETEGGVHE KLSRCAAATA ADRANNLLGL GVERGPEVAG 300
GRLGGYWTTE SEVYPQRIGE LEGEGLGSPD PVAASALVTA VQDSRENLNC LTGVLTTLRL 360
SSRDSEGDFD LPLFVQSRKW RAKYNRRSLD LKRTVARSKA LGYPAGVQIP ETYRDLKNCM 420
QRPPSIDAAD SRAWRSAEAP RVAKKVFSEG RRATDRDEQV AFVEDEVTEQ LLFNANAAVE 480
GTTLYNNLLC KYGLETRCFS TSSAPGNTAF ESRLARSDAD PTASSQSASA LSHAAVSPSL 540
ASALPVSSLL LEDAADAVGD RSELETGSQA EAAIPTSEAS CMRREKHVGE ESRADKGAFL 600
RRASDSTHAE EDGLSGGKDA SSREGGSEER EEAAHEAADS LWSLVLNRNI AALPGLMTVG 660
RYECDLLPKR SAFSRKQLAG LVAGSRPLPV LPSSSDTPGS ASTELLAERV ACALTLDEGE 720
AWNPSDASDL DDFLESSCAP NALRRGRQAV APVRGARRRR GADLGLSPPP SSPAVRCRSL 780
VRWSQQRPFF SNVSACAGAA DSRREEWKDA GKVAKPGSES AWASRELHAS TGLVNAALDS 840
SEQKSGERES SLSPQERILT QVKKELEDER VREKQTIRDK DSEKGQAGES NHHMPGTANG 900
QRTPNEGEAP METEEASTLE PSNGMHRDGQ DAGARMHSSS TRVLEGAVED EPKVTLPDKD 960
EPHASALCGE REKQRQSFFS SVSSREDAQD EDSRWCVAGG MYNGWKGTYD VWIYRRVSAA 1020
LREGKGEEEK RREGEKRKTG KGKQSVHTAS LGAGGAQGPS PGETQAAGLA SGSTPLGSAG 1080
TLSAGRNGEE TCESTGSPAG AFASSSSLAA KGQNGHASVE DLKTQKEESL GCVLSASALP 1140
LNPHSGETRE DSAGRDEEKG EERERDENEP PLYEWRVKRF SALILGHEKA SRLACKYCVY 1200
LERFGRIRGR LSICSTCCRD ACSGCMPSKK RAAGADFSPH CRNGRDAGVG GAGRAPKRRV 1260
QAKKGAAGAA GVCGDRARKG KGEDEPERDG LDRREEGGTP SSKQTAERRG AAKKEGREED 1320
DRVDGKGTSL SLENNSFESS CPAMRSSLRA SFEVKGPLSP SLADDRPNEG AAGRGAPPGS 1380
EGPSRDLALR SHSFSSASSS RKSAKNAAES LRRIAGPLFR SSGDLTASQL GAEVEESDVL 1440
QDVFELYSEA GEAWETCTTP VSFSPSLSVA SRDTLVVLGG SQTTAVARLD SGKMSEAVRR 1500
SSNALSAAAS SFPKGKGFGG ASKKTDSVTL SFLARVCRNL RMFLLLCQHN TVAGGLPGDS 1560
KCVCRAQSGP GGAGLAGADG RAPGDLGDSK GTAVARGPVG AAGRAHGSEP WASPNYTGGP 1620
FFPPAGSAPS GWPPVAQANS RPEVLSAIQG AQGQGPHTAH SLRLAASLSP AQTTSESFLA 1680
PESFAAGVRP LLEGSLSVLI PEEPQVGLGP SAGQQLASSS LSPGVSVKAE PSSYFQSAQG 1740
TGRDVSAGAR TAMPSSFREQ GRPGAAPGHA PSGVGRCPPQ GRDASPGCPG FRTPPAGFDG 1800
PSSSGAGYSL SPYGYPGTEI SPHLAPFFPE PYRRFRESRG GPAWIHSPGS VDVPSSGLQS 1860
PFTGFHATSG SSPPRLGPSE GASFAEASPR ALAGDLGPAG FLGASAGAPA AEGRGPLFDP 1920
SAEGEGKFAP DAGALGTVEG PADCRTQGET GRTADEDEKK KAKKAKKHGR ITDIEERLAR 1980
EEPYDVVEEG DDPEPTRQLG LEATEKEQDV PRSGDSKSPD QDSPGQPADI MHGYFKARVR 2040
NRRVKDGLLL RMTAVLVGKG FYDLETVEPG APRRRGGWGE SGEEEEESET KYLFSNPASQ 2100
KPCDFILYFD TRENRDASVA ILNQALPAPP PRLPPKNGES QARRTLRQLY DHFLEPKCQC 2160
LEDKTLKVKH GVINLLGFPR LYVKLHCSMS WDERLSLFSS FLHWLCREDD SQPPPWSSPE 2220
LHPELLAYLV DLGRKGFASG GAATTAVVNA PDLPLDDSAL SKKNAALIRA YMQQDTGASG 2280
PSGSVGATSS DPEAPRKDDE AEEGEKDDSN AALVEGPAPE TSGDSTGAAQ RCGKGREERE 2340
AGDKRGPRNE GCGKGDGFGS PVAVAGTTAA PGETESVSCP SSTSGGGASS ALSSGPSDSA 2400
AAPDGCESSP VALESASLLS FSPSAARAEV LTVPGVGLVN FSLPDGVKFD KSKLAFRCYW 2460
REGHAGVVTV GAGAAVSPSS GAGTFVPSRP TVCTAQNKSR TFSCRKYGLY QSRVLALQAR 2520
LLSELLWPQP PSPARLRVSA MAAVVYGLIA APMPFTDPWQ AVCGVSVAED ALRQRREVWK 2580
NLLDPRQRRP APAPISQLSL PPVSGPPHAS GATQELPNRP GTPWPGQETV CGARGPAPGL 2640
ASAWATYGNP GDRDAAEPQS TYVGRGPAGA EGPGGGIAVH REWARNSGSE AAQPCQFGRA 2700
VERPVPGPQS SLGPGGDNRG DHMAYDQSPA GPASNAPGPT PPFVGPFSPG LVLRHGPPAF 2760
SQDPSLHRPP FAAGTGPAGQ RLASDSPYPL KNEASPQLAM AHAPGFEKSD GFQGEQPLAK 2820
QRKIEGASDR PVPDEGQVLG TISHGKSPAA RPVDGGFAPD GRSPFFSQDA SGVGGGRPSG 2880
VGGQLAAGGK GHFATAPFGS GTLPTTRGPS QPGGDGLSHR SGTEPAAAYS SPAGAAYPSA 2940
SNASPIYGAA PKREGDSPFG PAPPSGYCRP GSPAVDPKLP GSVPSSGNLD SVNYGSFFPG 3000
QQAPQGDGRI APWGSGHVGA PRGEARGSER VGHAGASRGL TGHELEEGQG GPGEEGAGRE 3060
RQRKRRKSAM SMSSQGENTP LFAPTSLPPV PFASGDSLAD GSGSDFGQQL GPPFSHGSHA 3120
PPFPEANAVG SQHFTADNLE TPGLPADLGG GDGRRQSGST HEEVSGPRAG GEKGEFSLEG 3180
APQAAAQQLS AETLTFLLGT NVVWEENEKR WRVQVRPPSP RGCDGEGADG KLGGEKKKRK 3240
RDGFSAGGER RRSSTGNEPD DQHKAGTLEW VSMAQLHQAQ KLQNQLVGKM ERGKGEGGDE 3300
ERLGGDGRGN IFFDANGSDE NAKKAALLKA RRWLRRRIVQ GQILVTGLSR DGLFSSRPDE 3360
PERSSSVSTG AFTGSSPNDK PTDLNAAVPP LSPFFSPIPF GATTAPHRPS PGFYPPAPAH 3420
PTEDGCRPPM PAPVPMHAPQ GPVDSRTYRG ARPVYPGSDV TPQTCHGVRP ESMQEEGRAA 3480
LLAEQGSAFF VSGDGKGDNR GATVGQIRQG TVRVMQSQTA SQSLDQGFDL PHPPAPGPAY 3540
RGVPVGHGPS GPYYLNGGCV AQRPYATFSN LAGPVQGSFP PLEFSNGGLP TTAVGRRGSD 3600
SGPQGPGRNA SQMQPGFASR PHGPERLGRE SAPQSGAPPG FSPHAPGRGE RDRPSFSGAT 3660
TMPLANLTAF SQPAAGPMFV GTEGRGQQGD IHPNLCGVAP VGGPRGPAHA PMPAYGPGGA 3720
AGPPRDDRRA EGGAPGVSHS DIFLANDRRL HPEMCLHSAP SWGPAGTFAS PDNRQNAEPW 3780
PAAHASSNNF FDYTGVNMPA AGPPIQLDWS KVRGAGG 3817
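The sequence block above is laid out in groups of ten residues, with a running residue count at the end of each row. A minimal sketch of turning such a block into a plain string, and checking the counters along the way, might look like the following; `parse_sequence_block` is a hypothetical name used for illustration, and the example input is only the first two rows of this record.

```python
def parse_sequence_block(text):
    """Join a numbered sequence block into one string.

    Assumes every row ends with a running residue count, as in the
    block above; raises ValueError if a counter disagrees with the
    number of residues read so far.
    """
    chunks = []
    total = 0
    for line in text.strip().splitlines():
        *groups, counter = line.split()
        chunk = "".join(groups)
        total += len(chunk)
        if int(counter) != total:
            raise ValueError(
                f"counter {counter} does not match {total} residues read"
            )
        chunks.append(chunk)
    return "".join(chunks)


# First two rows of the Protein Sequence field of this record.
BLOCK = """
MDFERGESPG DSRGSVAFLH HTEKLERLPG TGETTIRGMS VTPPYICMER PPRANCEALR 60
LKSPPENRAF SSRSESPSPT PFARECASLG LVWGDEGTRA GLLRTRLFTP PDHTPHLLAE 120
"""

sequence = parse_sequence_block(BLOCK)
print(len(sequence))
```

Run over the whole block, the final counter should agree with the Protein Length field (3817 AA).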
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS