CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032700
UniProt Accession
B9QEZ4_TOXGO
;
B9QEZ4
Genbank Protein ID
EQ970685
Genbank Nucleotide ID
EEE31197.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_081940
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
616
QQAGSGG
K
EAAPRSA
acetylation
[1]
Reference
[1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
Jeffers V, Sullivan WJ Jr.
Eukaryot Cell. 2012 Jun;11(6):735-42. [
PMID: 22544907
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1226 AA
Protein Sequence
MTEQQRDRTL LDNRLDAETA ASSASARPLQ HTSSSSCSPS SLSCSASSVS PGSVSSDAPG 60
ASALKDEGAD ASGVRPAKRV RVSSRCVDSV ESPSLEEEER EKASGEASGD STTDTETRIR 120
KEEEAKSVAE RQKRLYSSFL PPGLLNDFVS AALALDASGE SPAEPGCTYT PDSGPPGGSP 180
VSRHLGAPRD PQTPCEALLS FDRESLLTFA SLFHLLVSGR GTDEGELQEA AQADKREAEE 240
AAQQTELEGD VSTRGCWGGE VGARVQPEVT KKTRKSSASE EVVEGTSSGD RTRKKALEMS 300
SSPLVFLRGR RSREVLVRRL LRQLRGDDGE GQNSGAASSG GRASLSSAVH PPQAVLFEAW 360
MKALKIFLSF VTSSALVGSH AKALALVAVF DILLLLLSPE EDLSPSSASF SSASSAGAST 420
GKVVDGAGKA ADAASKGRRR EGASRPVLDF NNFADLLVEF ARTVPLREIG SVISFLEKNK 480
ESLIAAFQRK GRDCLSGSKQ LDTAQAKRAL QAAGAKVIGM VKSIEEPLMH ANEKQYIFAL 540
RRLLMETLSM THAGLSNRTM QRAEATPVVL DSLEAWNALS HFGVGVESPS ASTHADRSRS 600
KKSSGKSTQQ AGSGGKEAAP RSAKSLENER KSEKGGGDSL SSFGACSYAV YAAYGKALAF 660
LQAPDRVLEQ PAEVVTDVLS SLDTVLSYFE KHPAVSTRAA AETRSPAVSL DRTLDQDEAE 720
VKRACDNLLL LSSSGNPHYL SAGAFRERID NSFFRRELLT SACVAFHFLG QAVGLGGQEK 780
RGKTDEQSSQ GKAAKKETPV RPAGGGETGA TAQRMQQKIR ETYSALNEKT ARRNVDRARS 840
LASDVEEERS GIEKLLQTEQ LWLWWKNRNC FDSVLKPSGS DSLSFSAADV PAPALLPFAV 900
QAALSSSNLA KTLEKQAHAE TRKKSETEKS PEQPAQPEPA KNAQKLASSQ SASSQSASGE 960
EAKAKEVQLA LQESKERSDE KSESKADTMD RGGAEEAQTG EKEEAEPDKL LSRALALAKQ 1020
AEMGENSWWP ELHRGSKPRK DSVVMRRLVQ WLHAWEAAEG SSAASSPSVS SGLSSEQEGG 1080
EPLAGRVGLA IPGPLRLLRQ GEAWFTSDSR NLPSDYDDLL FMDKLRVKLN DLVQKMDDDD 1140
NPENDVDEEE RSKHNPVFSL RLRKLFALFY GDAYVAMPYK ETKCEDLRQA IAEYDTKGLA 1200
ARLGLKAVEK RSVSQATNEQ GAEQES 1226
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS