CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-031743
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_113270, TGVEG_097300 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
1058      QHGRGSGKGCQDSGD  acetylation  [1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1546 AA 
Protein Sequence
MLPFPSPMRG APQANGGTGP VPETDGPHAV GASGGRERPG RGSRPGTSPT RDNWRAEKAP 60
AAPGAGGFCD DPSRPASSGG RGGRDEDRAA GLVSGARSQG GQQYFHGAYS HASGAASQPG 120
ASPLLQGGTE RGGPGGPQRP PLIANAPLLG FAPPAGGIDA GRELRSKAFP ASAGAAQGGV 180
GRSASATRGP ETGPQGDSQA PVACGDSGPQ GAETTFVKDL HKNATLWQLS SASFPSGVPR 240
KLGANNASGG EGAEGAGLLR KPVGGAAGAG EKRAAASGAS GAPDAPAAQP VPASGGSGDL 300
SFKALLSTRP APTVLVATHH STKSDEGAAT CSTGGGAAGP EGLHVGLAPE PVVHGAAGDK 360
TRGPGAAGVL GDTPHQATDG DGAVTRGPGG KVFDVDERGK DRARKGERGA AFGARGAAGG 420
DSAAGAGEPA GPEGGIGISG AHPQVGNRGQ EKSHAKADEG ERGAGRNAGA YRKAGGNADG 480
SQRVWGVVEK PGKAPVSRGR DEEKNQAPCA ETTGRPGASG RRRSQQPFEG EREDISGAGT 540
GRGPHSKPQR GGHGRREGEE TAHAGEGGSR VRGAAASTGN KDSASARRGV GVGRGGAEAG 600
RARGGASEVG RAPSPGGAGD NAAYPKGRDQ GLLGTRGGKG GRGSLRGGAA SGRGSAYVGE 660
RATGGMSESN AQPDVHAPED GASRHRGQGG PISRDRAAGT GAAVWLPKGA TKKGETVAAP 720
GAPEAGAGPR RGGETGQLAG SRGPAESRQG DRGSPSASAG PENQGGAAPH THARQAASGL 780
RDSRSNSGSM PPAVADALLT RVNGALSVGA GLVCVTQEDN QEETAGDDEG FQLVKSKRRE 840
QQEKRKKAEH QQRGSDRGAA AKATPFSSPA APGSARSNSR REKGQSGTAA GGDALGPASG 900
TPGEARVPGS TASKSEGPVS SPRDQGTGAE RTEPDDSGAS GRGPRHEGGG RRPSASPRGG 960
GRGGAQQRGE GRKEFPGKGL RTEVALTASS TMEVSTVSSE VTSSVADHRT RQRGQEEKKT 1020
EDGEAEKKEL TSASGLLPLP GGGSQGGGRG QHGRGSGKGC QDSGDAHAYP GEGRASEKSD 1080
RGEGSGFSRG AGAGSSREDG AHGTGEKGRE RRGSSGRGAD RRVGGHMEEK RRPESPAEVE 1140
GGSQRQAHSA PTNVGDPWYA SQDLDAANAM SHRLSSDPSQ SAVSDPQGTD LMIPSGPTGH 1200
QGNAGAQGAD RERAGASPSA GAAPAPPTYS LWSSNFNQPA TASSRVAPET IAAIWARDPG 1260
SGPTAVGGAP RGARLPAGPG GAPADMAGLA SHHPMTPVGA AAATLFQPGC YGAPGGGQLP 1320
PGAPRMPAQR NVTSPLTGGP QSLSKNEMLA PPLLPGRTAP GLVTLAPGQG SASPGAASSS 1380
PRLDLYDPRN PLSGPSFSPV PGVSAPGAHP GVVAGAHSSS YVVPEAGHQL LPGFAHPQGA 1440
AFSPSLSHDG APSPAPGAPG SGRAAGQALW VPGGPAPQGS AFVPSAPGHH QQGRGHSIGR 1500
GNDMLAPQGF GSFIAAQGGR NGAQRFGAMG AGQPGAGAPG GGEFYM 1546 
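The modified position listed under Lysine Modification can be checked against the numbered sequence block above. The following is a minimal sketch, not part of CPLM itself: the helper names `parse_numbered_line` and `site_window` are hypothetical, and the flank of 7 residues on each side is an assumption inferred from the 15-residue peptide shown in the record.

```python
def parse_numbered_line(line):
    """Split a sequence line such as 'AAAA BBBB 60' into (residues, end_position).

    The trailing integer is taken to be the 1-based position of the
    last residue on that line, as in the record's sequence block.
    """
    *blocks, end = line.split()
    return "".join(blocks), int(end)


def site_window(residues, end_position, site, flank=7):
    """Return (residue_at_site, peptide) for a 1-based site on this line.

    flank=7 is an assumption that reproduces a 15-residue peptide.
    The window is clipped at the start and end of the given line.
    """
    start_position = end_position - len(residues) + 1
    idx = site - start_position
    if not 0 <= idx < len(residues):
        raise ValueError(f"site {site} is not on this line")
    peptide = residues[max(0, idx - flank): idx + flank + 1]
    return residues[idx], peptide


# The sequence line of this record that ends at position 1080.
line = ("EDGEAEKKEL TSASGLLPLP GGGSQGGGRG QHGRGSGKGC "
        "QDSGDAHAYP GEGRASEKSD 1080")

residues, end_position = parse_numbered_line(line)
residue, peptide = site_window(residues, end_position, site=1058)
print(residue, peptide)
```

For this record the residue at position 1058 is a lysine (K), and the window matches the peptide QHGRGSGKGCQDSGD listed in the modification table.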
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS