CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032726
UniProt Accession
B9QHE8_TOXGO
;
B9QHE8
Genbank Protein ID
EQ970688
Genbank Nucleotide ID
EEE30213.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_053050
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
2312
QSVSLSS
K
KASTPVR
acetylation
[1]
Reference
[1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
Jeffers V, Sullivan WJ Jr.
Eukaryot Cell. 2012 Jun;11(6):735-42. [
PMID: 22544907
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2336 AA
Protein Sequence
MLLQSDTSGN MAFHQADSAT LPHKDPPPVS SFRVFSSSLP FSQPTVPQAS FCSFPSTTTM 60
DCNTGPASRA SMDRVLPLLT QSSSLLLPPA WLAADGSAFP LLSSSRPAAN RSHAFLHAFG 120
ERAASDPRVE EISSEETEDE GEQPATGGTA TGDGPGGLRG IDDACRGAKT LPSQSQRSQG 180
SEPRKACGEG DRQQAWRLRE GERDSGTQSS LSLFSRASTS SLPPRKLRSP LIPASASSLS 240
AHRRPVPWTA VRQAYPRVQV DPRKHAVSAY SFSSLSRRSF SPPRTPRALL NSMHQTNAPF 300
PEPSEATKPD RVGSRRRIGA RLTSRLLSPA EVSRQLLEQY PGPPAQAVSG EPDAEKGTSG 360
VSGLFPEGKS RTEPVLPSSK GASEKHLPHG RTTLSLASTR ATRLSSSSSR LLSRLTLASL 420
STGKRHESEQ RRVSGIPSVR SRSDSQASGV RTPGEPPSSE PRAAYTHSPA FSRLLSRELP 480
LKRHSSCPRN AEATEASVAI QPPVGERVPR MHPVLSKRSV STASRVSSVK RRDREALQKR 540
NREEEGPLSG LWRLLAFASE EGGAEVSEAE SETSDASAEV EAVDAWGAAL DRVDTGDPWR 600
SHPAPPVSFR TLKSLPPKSP LLPPKPLSSK SFSVRSPASL REKEVLEAAQ PLRGPSQRGP 660
IFQGSESVHA RRSSEEAREE RSAPLHARPC GDSDALPRPP FGDAERDRKG TSSAKEKQAP 720
FSEPGGAPEP RAALQSLEND DERCLQSTCS SLPKPVRVPP LQLSVERPSP RWSSPSHSPA 780
PTGNTKQVLN SGLRRRSEET EGRHRVDPMS PARDGPHLTG LQLQRAGSVL SRSRESLLSL 840
EEESRQSDDG EALGEDEAAP RETEPGETEG RERVRSHAKR RLHSAFLSNE ETAGTSSTLS 900
SFLGVSPANE ERGSSASSIL VGGLEGRRRL ASPRDFPPPT FERILFGLPC DGDQDGTGGD 960
AATEEEEEEE AREEEEEEGE EEEEAREEVE EPREHVRGED ERKGPREQKR SEEKRNGSVP 1020
SGEMRSADVM REKRKRRSRD NKRASAALEV KEQKEDPAAD VGSTVAQGLS LLPSFFAAML 1080
ARGEDMSPPQ TAWNVASEVD ETPREERKKA GSLTRESAVP AKRGLSSSAS LEGSSMRKKL 1140
LSLVQSHSDV SKAKSRRPMN LRHLRDLPSL ASLSPKEKFR DLIVERPRQR LSPPTRKTAG 1200
TSSHAFLEES ASDGNLNRHR SLSSFFSFWR DPQRENSRFS ASSQARLGRP GGARMRAHGR 1260
SSAQRSLRRR DRRGFCSLSH EEGEWSSDRV WRHESEGRRT DRHRASATHG PARRIHSFPG 1320
DRLARAESMP SRVGFRCTST TRRELWDGVA ASALSPDASR VRRQRNCGWR EPSPGDGFQQ 1380
PFGETEGQGE EEASDGEAQA AWLRELRARR RERRGREMAF GPLLRRPQQL QASVRHLGVS 1440
TADFEDARDA PRGCDRRQLP CTDDSLPASP SETFLIVRRR LRPPFKHRRE EAARERRCLR 1500
AESFDGSPPR ASPRRNSLLS PSSPHLSSPV CLSEFEEGQL QGRDFERDRR HTCDSEERRE 1560
RRERRSGRER RSGRERTTRM VREREGSPSS DFIPPHTLYI PPLRLTSERS KRLPDGSRLD 1620
SRRLSDSVRS AVPRGRFAEC SDEEDEGARS VRKDRDLLTS LFELPFASSP ARRHLQQVIR 1680
RHALFKLYDE DEPDSSESLL PEASQTDACG LPAPPGKTKK RDIPRALLEQ LEAPKTPVHH 1740
EQRLSSPSAS LSPSSQSFPS SQSSPSSQSS PSSQSSPSSQ SSPSSQSSPS SRSFPSSQCS 1800
PSSESSPSPS TRSSASSALV FSPLASCARS KRKDQEEQTP ERHGREWRDD FGKEKRRRGR 1860
HREMSRRQRV SDASPASSIA RRRSGEQPRE RRARETSREA EGWNRHACLP QEPLHNVLRR 1920
QNERVHAPKK APRSLVGGKV SESRSSAANG REETGERQAA LRRQGRSRNT QKGDRGTNKE 1980
GNVKTTKAPQ ERLGQTESAE LRQSRRNASS SSLSSISSVS SSSSAAASLR GRLEKDSAPR 2040
RGRKGRKDKR RRLSASSVSS SACSSSPSTS HERTNTDSVQ KRGRRKKSER RSTRKEQELK 2100
QGAQLLEPKF TTEAELVAVV RRLLDQVTER KHSEADSPPV ASLHPLSNLH AVRAALLRRQ 2160
QDLEAERREL TASCLDDGDS YGAKELRDTH RARRDPHGDR GDRKSSREQS PLEKDSLVGA 2220
KPSQPGLLFV KLKVPEREKT SDKGGRGDTE DCVRSVGQKG ALPHPPFAAT RLRELASKSA 2280
ERRGGSPVSD FSCHSSSSDE DIPSQSVSLS SKKASTPVRR QRQRRINRQV WGCMHR 2336
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS