CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031766
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_058970, TGVEG_075480 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1048SATPVAGKLKRRSRFacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2283 AA 
Protein Sequence
MGETSQDSSQ PPSYPQASSP PFLRVAWYRP APAAAGFCGW PSHISRSGDG AVGSRPGPNG 60
VAPGGLCGSR HRKLKLGYYA QGAAVLLFEP EEGEKEKSRR DGEREGRQRA ASESQAQMVV 120
PAFERLVAIV PDLEVSAREA TPSTSSWCHA PHDQRCQQSS STFSGSSRRI QPLKESFAAF 180
PLCAFGSPAF GSSFAPGESL WVGELPCSFG DSAVHALSGA AAFTLPSPFD IAKRIHWRYL 240
PYRDVLSSSS PATRRASSTS RGARGASGGA YGGYSGGTGL SPQPGASFAS FSARPSGTSF 300
LNGNLGALPA PPTNPLFSPV EKTIAVAPPP FSSFAVHGRQ AYGAHAFLGF GRRQEGPELA 360
VPLSALPALA RDQALPSRRV LILTSRCVLS LTFCSLEEIY KKLATRLIRQ VESRALFGPQ 420
VRPGPSGSEA RRDPQGGSAS PQGQGGGSFL PFFKRKSTHE TGDETEKAML AAVVAATDSR 480
GREGEVRDTA REGWRRNSAT AALPVEQRIQ RVYAQLLARQ TRQEAERSPQ TIRGAYASGV 540
WTPHDPRSGV PTPYGGSPGL GSEGARGWTR LVDGSSGGFF SGASFGGSSR DGKLAADFPP 600
ASRAPGFLEE LFLAYLRDVY TPEQVAAVCW QLLIDFPTLL APSRSLSAPS SASNLHQALR 660
ALLVHASEAR GDVSLAGLLT AHAGLLFHAA PALAAPSVTL PFSSRGDRAF LDLFLRSTVT 720
LSTPLPPAAS PLVLQSLRRG DGAQEEADGR RMRFLDAHAA EQFLTPLRRG YVGGAQTRGE 780
EGGPRSRLSS ATETGLRFAD LRGDGCMREP GASLGAPGTQ SHAAQAQSAS RSAAALLSSA 840
GAPFFEVEEQ GGPAAPGLSP TEVAALLFLS WRVLVGEKET PGGAGASRSA WRSSDQEADV 900
ARLKELENEL QQLRQRTACD GADRRRERSE GKGDLGRLEN SRSPRCWLLG ATNDPGKPAS 960
LQEQQLELQI KQLQLQIDQR NYHAQLSTNA APAAADFFSV SPTAKGLLLF VSRLLRPLLF 1020
APLFEAGHDA SGILRSPAST SATPVAGKLK RRSRFSSFLF GDPSESDAVA PPAASLLAVA 1080
SLAGVQASQL TPGAVPVPIC MRGRFSLSAV STLQKKLALL FMVVDAVYTA IFERHVMRRQ 1140
EQIAASQVSP TPSFVSCAPF APSPVTSALP PHWPLAAFSA SAAADARDAD SVAAGFLGGA 1200
KTQDTNLADS VAAASRAAIV WGSLPSHEQE ELQQLFAVSR MLLFCNEALA AYRLLLREVE 1260
SCAQLGRRRI IDPLQFDRVL CLNLSLLCSE RSSREDFRQF VFSCVSVQSP ALGQLCTGAL 1320
FTPQELRAHS ALMQLKSFID EGREALEQNM FQQKVFLSRF LAQHGTEAEA TCGASSLPSL 1380
PAPSPAVLSL QTQILQFVGS QLVPFLSFFP MQTLAELLVS VQMYRLLVSV VHAKAEELST 1440
RHRLAGGACG RSLTRLPSPA EQEKEIEETC YSLVTSWLQR FLDLHLDARQ RQRASQAVPV 1500
SSSPQGDGDA EALFWRMQAA ALVSAVLSCD DENLQSSVFS WIAQAGQKTF PESLLTLNSP 1560
YLLPWLQRHF RHFALDVGAF YAAAGLSSRA AAEYLAQGLR PWREEDCPAV ADSPLPPRPL 1620
DGARRPPNPP NATDRERTNA ENQGAPPTRD GDSEDGVARE QTAQLLAFDA ERDAFCVEKT 1680
SLIWRRTAAY VSEQARAPSL QRRLFLLAQA KDALQQQAQE LHAEKEVFPG LQRASGDTPV 1740
ERRQFASPSL GNARAAWLSA QAVSDCMQNP FSSLLVGCVS IEQISESIQM VGCQQGLLRD 1800
CLRVFCFLAL EFAAAAARVK SELACSRRGA SLCGRSAETG EGCSSGEKTA AAEREKGERE 1860
QLLREAVEDF LRDFFAHREE IDRSSAAGCH ERTPTSPQHQ QNAEQWEIFD SLLDLMKALQ 1920
TTVYSRSELL TLADVFWRAN RFVQDESFSS IFPSITALQL LARASPPDGR RSRDGDTERT 1980
ASGGARRDAG EYRADAEERV SELDAGEGKI AAACGCYLAF AAAASRALEN EKILHSALDD 2040
LLSYPDVGLR RVQRRLMPEY LLLLHKFVQE RELSLSPLLT RDRNDFSGEQ SRFVEELLLH 2100
PSRRLGGDDA CTECLLLPPA LWPAWMILHR RVPPETLVDA YLHLVRTRSP GMTLHISLTP 2160
QVQATIFAWL FDAWLMNEEN STSGQTFIKT FSFYLSEGKL PFFETPNDCL QAQLSREEQE 2220
GAADACRKLT RMVLTIGAAL AAVAADVPQL KQLEMYRSAC LALDTFQSKL SLYASALQQP 2280
IDS 2283 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS