CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032590
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGGT1_075030, TGME49_094710, TGVEG_013140 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
626EALPSGDKKPTGPAAacetylation[1]
1126AEDSLGAKDAVGPKPacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1151 AA 
Protein Sequence
MLGGASGLGA APTRALYSTA NVLYIKNLSP LVTEDHIRQI FTHCGEILNI GFKAYLNNPA 60
QRYCVLEFKD SAGITAASQL NNTPLLNVPM TVTVVEPAGG GTFIAAPEAL AGASGALGSS 120
SLEALAKQAA LNVQHPVGQP ALAPGQVGEG VRQLQQLQQQ QLQQFEANGA LGAGAQLATA 180
GAVAALGGQA GGAAAFGAMP LNSQMSVVEI AMQNHLSNLQ AVQSAALQAK ILAEKKKTEL 240
GLGVSIGPGA GGLLPAAASL VTGCPHKETE LQQLGRTVFV ENFPEEYEKE ELHLLLRDFG 300
KISNLRFGQH PEKGSKFAVV EFETAEEAQF LRGMDAKPIG NLTLTIKESQ SVVNFRDPDG 360
VLFDVPPPPI LAVLKQQQTP EAQQEALQSK LKEVKYAKLE IEEKFKAATR PKRESRSRGV 420
SSRRRRRNRS SSGSVEAPRG RSLSPGRRGS VERKRRKEER PERGRDSSRS LSAGARGGSG 480
AGRSLRGGGR GERRGASRSP RRSLERRGDS RGRLGSEGAG RGRERAVASG ARAPRPVEAR 540
RSISRSPDMT ETRRSGETAQ KDEVQRDGSR LSPREAEREG DGCSDEARKH VSRSASREAG 600
RARKPEGRPK NLGVTAGPEA LPSGDKKPTG PAAENGLARC AEASESEPES PGLGRRARDT 660
KERDGRGGDL RSPRRRGTLS NKRDAFSCSS RSPSADRRRA LRETERKRPL DASASRSPSG 720
PRQRPPGAWP HGPQTQRGAD PRSSSRSPEG RSGARRRLSP FSKKKEDRDL SRERGGRAGP 780
SRLSVDGRRG GPERRGRRSG SRRHSSLSDS DRSPSFSPGR PQRVPGPRGP SGESGLAYGR 840
PRPRLSGRPD AERGPSPFSR RRRRSVSRSG DRNSVEDGRL SRGGSRGRRP GPSRRPSDRL 900
EPGEEAMRDR DDDYSRGGFP GPRIRGDRRG GAPYGPGGRR PGSREPSFGR ELHPGEKGDG 960
PARSGARVPV RGRGTGAQLA ETASTAWGGP GRMEKGREDG PFGMRGEGPR GRGRQPGRRR 1020
LGRRDGDSGD EGGRDARMDR YKSQSRSLSE DSDSASPVSG GGVGPGVGGS RRGSRSSGLA 1080
GREALEGEGA PAASREEVER NGGARAPQGA GEFRSGSAAE DSLGAKDAVG PKPMNALCGG 1140
RPGTQGSAGT A 1151 
Gene Ontology
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0000166; F:nucleotide binding; IEA:InterPro. 
Interpro
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR000504; RRM_dom. 
Pfam
 PF00076; RRM_1 
SMART
 SM00360; RRM 
PROSITE
 PS50102; RRM 
PRINTS