CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031766
UniProt Accession
B6KBC7_TOXGO
;
B6KBC7
;
B9QBF0
Genbank Protein ID
DS984728
;
EQ970683
Genbank Nucleotide ID
EEA97970.1
;
EEE32341.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_058970, TGVEG_075480
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
1048
SATPVAG
K
LKRRSRF
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2283 AA
Protein Sequence
MGETSQDSSQ PPSYPQASSP PFLRVAWYRP APAAAGFCGW PSHISRSGDG AVGSRPGPNG 60
VAPGGLCGSR HRKLKLGYYA QGAAVLLFEP EEGEKEKSRR DGEREGRQRA ASESQAQMVV 120
PAFERLVAIV PDLEVSAREA TPSTSSWCHA PHDQRCQQSS STFSGSSRRI QPLKESFAAF 180
PLCAFGSPAF GSSFAPGESL WVGELPCSFG DSAVHALSGA AAFTLPSPFD IAKRIHWRYL 240
PYRDVLSSSS PATRRASSTS RGARGASGGA YGGYSGGTGL SPQPGASFAS FSARPSGTSF 300
LNGNLGALPA PPTNPLFSPV EKTIAVAPPP FSSFAVHGRQ AYGAHAFLGF GRRQEGPELA 360
VPLSALPALA RDQALPSRRV LILTSRCVLS LTFCSLEEIY KKLATRLIRQ VESRALFGPQ 420
VRPGPSGSEA RRDPQGGSAS PQGQGGGSFL PFFKRKSTHE TGDETEKAML AAVVAATDSR 480
GREGEVRDTA REGWRRNSAT AALPVEQRIQ RVYAQLLARQ TRQEAERSPQ TIRGAYASGV 540
WTPHDPRSGV PTPYGGSPGL GSEGARGWTR LVDGSSGGFF SGASFGGSSR DGKLAADFPP 600
ASRAPGFLEE LFLAYLRDVY TPEQVAAVCW QLLIDFPTLL APSRSLSAPS SASNLHQALR 660
ALLVHASEAR GDVSLAGLLT AHAGLLFHAA PALAAPSVTL PFSSRGDRAF LDLFLRSTVT 720
LSTPLPPAAS PLVLQSLRRG DGAQEEADGR RMRFLDAHAA EQFLTPLRRG YVGGAQTRGE 780
EGGPRSRLSS ATETGLRFAD LRGDGCMREP GASLGAPGTQ SHAAQAQSAS RSAAALLSSA 840
GAPFFEVEEQ GGPAAPGLSP TEVAALLFLS WRVLVGEKET PGGAGASRSA WRSSDQEADV 900
ARLKELENEL QQLRQRTACD GADRRRERSE GKGDLGRLEN SRSPRCWLLG ATNDPGKPAS 960
LQEQQLELQI KQLQLQIDQR NYHAQLSTNA APAAADFFSV SPTAKGLLLF VSRLLRPLLF 1020
APLFEAGHDA SGILRSPAST SATPVAGKLK RRSRFSSFLF GDPSESDAVA PPAASLLAVA 1080
SLAGVQASQL TPGAVPVPIC MRGRFSLSAV STLQKKLALL FMVVDAVYTA IFERHVMRRQ 1140
EQIAASQVSP TPSFVSCAPF APSPVTSALP PHWPLAAFSA SAAADARDAD SVAAGFLGGA 1200
KTQDTNLADS VAAASRAAIV WGSLPSHEQE ELQQLFAVSR MLLFCNEALA AYRLLLREVE 1260
SCAQLGRRRI IDPLQFDRVL CLNLSLLCSE RSSREDFRQF VFSCVSVQSP ALGQLCTGAL 1320
FTPQELRAHS ALMQLKSFID EGREALEQNM FQQKVFLSRF LAQHGTEAEA TCGASSLPSL 1380
PAPSPAVLSL QTQILQFVGS QLVPFLSFFP MQTLAELLVS VQMYRLLVSV VHAKAEELST 1440
RHRLAGGACG RSLTRLPSPA EQEKEIEETC YSLVTSWLQR FLDLHLDARQ RQRASQAVPV 1500
SSSPQGDGDA EALFWRMQAA ALVSAVLSCD DENLQSSVFS WIAQAGQKTF PESLLTLNSP 1560
YLLPWLQRHF RHFALDVGAF YAAAGLSSRA AAEYLAQGLR PWREEDCPAV ADSPLPPRPL 1620
DGARRPPNPP NATDRERTNA ENQGAPPTRD GDSEDGVARE QTAQLLAFDA ERDAFCVEKT 1680
SLIWRRTAAY VSEQARAPSL QRRLFLLAQA KDALQQQAQE LHAEKEVFPG LQRASGDTPV 1740
ERRQFASPSL GNARAAWLSA QAVSDCMQNP FSSLLVGCVS IEQISESIQM VGCQQGLLRD 1800
CLRVFCFLAL EFAAAAARVK SELACSRRGA SLCGRSAETG EGCSSGEKTA AAEREKGERE 1860
QLLREAVEDF LRDFFAHREE IDRSSAAGCH ERTPTSPQHQ QNAEQWEIFD SLLDLMKALQ 1920
TTVYSRSELL TLADVFWRAN RFVQDESFSS IFPSITALQL LARASPPDGR RSRDGDTERT 1980
ASGGARRDAG EYRADAEERV SELDAGEGKI AAACGCYLAF AAAASRALEN EKILHSALDD 2040
LLSYPDVGLR RVQRRLMPEY LLLLHKFVQE RELSLSPLLT RDRNDFSGEQ SRFVEELLLH 2100
PSRRLGGDDA CTECLLLPPA LWPAWMILHR RVPPETLVDA YLHLVRTRSP GMTLHISLTP 2160
QVQATIFAWL FDAWLMNEEN STSGQTFIKT FSFYLSEGKL PFFETPNDCL QAQLSREEQE 2220
GAADACRKLT RMVLTIGAAL AAVAADVPQL KQLEMYRSAC LALDTFQSKL SLYASALQQP 2280
IDS 2283
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS