CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032613
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGGT1_054250, TGME49_095770, TGVEG_050190 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
793GNAEDWSKLMPSDKPacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1967 AA 
Protein Sequence
MLFRRCVLLA ADWVADALGP LAREVATNFH VVFQLQGLLQ FLLPPFPARS RRVSFPPKSL 60
PGRPPFRPSS PPPEVPTHVC SLAASSIGRA SLAVSFSPGY TAQLEEVPNG LRSSLAAIRE 120
AACVGDAEAV AGATLSSLPF AASEAGAGLE PGLDAGLLPP TAVALLGWFF PEEEDKEKQL 180
ASGQMLQRAT LLRLKFFIKR LRSALQAGRL REASRLARRA FHSCPSYGGE LMHATALSLI 240
AVQEGDPVHA LRLLSFAFSR GSLLLSLSVS SAEASSGEKG ETKEEDASRG AAGLRVTDGS 300
FLSCLLSSIP LSREATQGLL GGRETRTQGA FFLSSDKDPV SPSALCLYVI GFLSLRIARL 360
LISFPDLLSL LTCDPVDTLF SLYPPTASPV PLPGLRLPFF RCLRAPQAGE ESGQEAENPG 420
LRGSDSSVSS ATPSGAGVSV SASVFSRAGE GRAKTEAEGS DEVSGVSAQS ALGGAGARLR 480
PAESPLSRRS LLSPPQRQTT RLPCAYSATG EEQAAVTTGA SDRRGFGEPP TEGGATRGGL 540
LWATSLSRFT EELGSTRTGE TVRPAKKKRV NGGNNFFGEP KERTSLRQSS CIPDSQKPFL 600
LHDEGDQGHR RVFCSAVRKL SARRPLDADS SFSLDSLFDH GTLVSPSSSP CTPSLNSSLP 660
VFFCHARSLY ESRPSEERGA VVGVGYNSLQ TSSLEAFLSH ARKRMKRRRD EERGLMQRAA 720
AEREDGRSRQ SRGPRRSLDR LGEDSETDEG GSDQDEAWSS GKRQRQRARA VLLGAAWTAA 780
KENARGNAED WSKLMPSDKP SVFRAFGPET KESLAASSPS VSRSTGILGI LASNLFLTLA 840
LRCCELLNLP RFVQASPSLL QDVADQQSLQ ASSRFPSFSS FCRSPSRTST HAFSCRPSCP 900
QGSSSLLLLC RQLHLKCLVA CLVSAWRQSE REAEAARRRE NRRDARRPSH REEDEEEEAY 960
ALSAPKQFLG LGSETDLLHE VLSLMSAFVQ GSGRNRETPE SPADCGPSER QEQGFFMQRS 1020
TFWAAVRSAN LSDQEGWDIL ALLCATLSAD GEGDGEARDR ESPVCVQQYR KAEAGMRRDF 1080
CALRRLLFQR EAALKSRTSR GGSWNRQPGE AETVTDCGAL TSDACREDER TARDKSKNGR 1140
DVDALAQEAA ETVALEHQQG GREQGEEGDA SVLSTGKRYP FRRCDVVQAL HAQAAYRLSV 1200
MAGAASEIPL DSFLQNLPVS FFLSQTGAPR AMVASGGTLG LEEGRRSSAE GQGEAEEQTK 1260
HKWLLEEFIL AADREAEAVF SAGLPASLNE YWGTGPDSSG FTAYEDEREN IFTFLQDFPT 1320
LQQLLSLAGA KHGEDCEEEE RDGARDELFI AADLRLLLIL VHLRNLSPIR HLLLRLDDSV 1380
ARLQAKRGMG RRGPAIASRG EICFVKGMTL AFLAGAEKRR MRERLDSENG APGPSKKSKH 1440
IRQVWMPART DKGEKHPNAE VRRTSLRPWS ASPAVGSRRK LTLPFPLSGQ PCAGNSVSFS 1500
PRGRGQCCGA SEAKKRSLSA QTKGDGAHGS PASALSVVSS CFAASLSNTP KRLRRCLSPY 1560
HRQASLERNE ERLEDEAETE GGAGTRQRAT EEATSSQTRK TQNGTGCCQG DQAAALLACE 1620
DEAGAGKAEG ETGEAPDGTE TDQEGFFEMH ARNQRDDRLR SVNATPAHPV GLSSTRDSDM 1680
FSPLSAASSA SRASSLSSLA RPRHLGSLAA GASQEAFWPL RGAQVKRAVG SGAAEERGDF 1740
SRWSSSSASW RDSEDSSCVA AWQEEGDERA DATKRGSGDL LGLGLSLLPR DPLCEARRLG 1800
ERGTQAEQSV IRRETEMSLL LLHARLHLER ALSIWRTQGF SVGVFGGYQQ RAWVKDAYCV 1860
LLAACGQERR LLEKLAEHGG AGSSFSSLGV RWVAKAMAGG TQTCEGNQPT GGCESHEGLE 1920
EKYKIKEVLA HQLERVKEHI FFYRTKLAEA SDPHFQFSRV FRDQRAN 1967 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS