CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031821
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_033160, TGVEG_022380 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1060PGSALSEKRLFFDSNacetylation[1]
1610HGNVEGGKKRLLADAacetylation[1, 2]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
 [2] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1874 AA 
Protein Sequence
MAEGAETPGK GAVSAQEEER RGRSTRGRYP ARTLSVSVRE KETVDLGARD ESDRGPQQLT 60
PSHQKGDTGD RRDRRDNGDE PAETAEALEC RGSGAGEERD AGSPQVSDSE EGRASFSVEP 120
PSHGSSERAD EKEGDSAAHE DDETRKARRE RKRERRRQET LAAEASIEQD GPKRYAFREK 180
RQKRERFEAD IFAKKSYTAV PTGYQLSAEI PPDQVQGLQL FVYRKKRRAR AGSARPGGDA 240
GANGETRGLE PRDRLESVSV TGTVSDAGDA EDEAVAAGVR EGVSRAGSAE EASRDRRHQK 300
PTRNGPGRPR IHPPSASLLG HRGMGEAARR AGLGMGALRS SQNQKEERRA PVQDLEGRRE 360
EESAVGLDAN RGGLPKKRTE ALKAASEKKK LPSDGSRGFK RTVVAGGKKR DGRRKPSDGE 420
DEDEESEVAV ISEDNARKIR EQFFQDAAAV EEYLPFGLYY GDSLASDTAL LERQEDTRKE 480
RYLEAKKAFQ REMFQRLFLA SVMREEGVKA ERARRENEGA GVIATSEASS ARAPARAAER 540
TEDAERGEET GRGEEAEITE EAEVNEEAEK GKASQEAEED GRGVKVEGGE CASSAESRPR 600
ETSERQGCRE REKALERSSS AAGDVGDSSP SLTRRRERRQ SSVASPASSS AAQEGLADER 660
VFSVSAACPR AVKTEEPAGG AALTDGEARD SSALRGKAAS REASDGAAAK DDEASRMEDE 720
ATVVEVPGGR SGETPDGTAS REERDANKEN KAAQHLWEYA LLPFPAQAPT AEGFFSEILA 780
RETVKEDTET RRQATKSLQA SAASLLQAAA RLAESRPFMN SLAASFSSSG PRWREEEKED 840
AKSEEQEKEK KEKREEERTK ERDRGEKEES GEQKTRACSR EGLRVGSLLS PLEGREASRG 900
PAPVGLFGAV QEDAKKLKPA FSAVTSSLSR LLRVLPPSVL HDSLPPRMRL SSGALAALAK 960
EERAGRGEEP QTKREEAQVA RGGSESSKDA EATEGDTQRE GEKSRWESRS PGHLSSTPAA 1020
STFAVGTAEK ETRTREETNA EEPDVAPPHR GRPGSALSEK RLFFDSNTFA HVPLRQLRCW 1080
AAPPGLRPAR PVSWRRVRAL LRAAGARRLA RLQAESQRQG VFYAPASGDW EALSVIHPVV 1140
SWEGNEGLEG LRHPGLLFAE EGEETDSCDC SDDEGVATAG SGCTKKNKDS GSLAAELQAD 1200
APRASPPAGA SSGRGVSWRG LVLAEATGSA VGSAPEPGGS SSGSSGAAPA AGASHAKAAV 1260
PALPASLPAA VCQPFLAARA TLQARQEEKR MGLPPPLPVR PHCLSADLAA VAKALWSPTG 1320
GCHVVGSADR HWQRVDRVLN AFHLSQVAQT KGTAKPGETE EKGEDRRETK PETLPAFDAS 1380
LLPANFKNLL EPVHFFPAEQ DLEAQARLLE QARALARANS PSQGAATAKG PKAAGSRGPG 1440
DPLGSPTGGG DSSGKASSKK RPGLDPFGPA LSEEERKGEQ RSAAGPGARD GAGDVEGARK 1500
DSQGGSQGGE KSGLTCSAAS LKKLQTDAAN NPEVEADIRQ LLLWRWRKES ASPVLFGPGA 1560
TALEPQGPGD EETPRPDTAT EKPESEEERE NGDARAEACA KTHGNVEGGK KRLLADAERK 1620
EDRDGLREEE SGKRRRNEEE RGKEGKRLST SARASAGRPG ADCGAHDGDE TPTSLSAATT 1680
AAPGSTVGSA SAEEGEKAFG RDKPAGTGQG DSGARPRRSA ARTRLAEVLT AVKGGSGPGS 1740
GESGPGGTPS GPGASSAPFS VSSASQPFGA PASPAEALGS SGSASPGALG ATGEGAVAVA 1800
VAAPGTPPLK CLLRLLRHYP LSVLAQRLGV HRSTVARWAK MWTRQQIEEE GEEDGDELEG 1860
PVASSASSVR GLDA 1874 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS