CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-032726
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_053050 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
2312      QSVSLSSKKASTPVR  acetylation  [1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2336 AA 
Protein Sequence
MLLQSDTSGN MAFHQADSAT LPHKDPPPVS SFRVFSSSLP FSQPTVPQAS FCSFPSTTTM 60
DCNTGPASRA SMDRVLPLLT QSSSLLLPPA WLAADGSAFP LLSSSRPAAN RSHAFLHAFG 120
ERAASDPRVE EISSEETEDE GEQPATGGTA TGDGPGGLRG IDDACRGAKT LPSQSQRSQG 180
SEPRKACGEG DRQQAWRLRE GERDSGTQSS LSLFSRASTS SLPPRKLRSP LIPASASSLS 240
AHRRPVPWTA VRQAYPRVQV DPRKHAVSAY SFSSLSRRSF SPPRTPRALL NSMHQTNAPF 300
PEPSEATKPD RVGSRRRIGA RLTSRLLSPA EVSRQLLEQY PGPPAQAVSG EPDAEKGTSG 360
VSGLFPEGKS RTEPVLPSSK GASEKHLPHG RTTLSLASTR ATRLSSSSSR LLSRLTLASL 420
STGKRHESEQ RRVSGIPSVR SRSDSQASGV RTPGEPPSSE PRAAYTHSPA FSRLLSRELP 480
LKRHSSCPRN AEATEASVAI QPPVGERVPR MHPVLSKRSV STASRVSSVK RRDREALQKR 540
NREEEGPLSG LWRLLAFASE EGGAEVSEAE SETSDASAEV EAVDAWGAAL DRVDTGDPWR 600
SHPAPPVSFR TLKSLPPKSP LLPPKPLSSK SFSVRSPASL REKEVLEAAQ PLRGPSQRGP 660
IFQGSESVHA RRSSEEAREE RSAPLHARPC GDSDALPRPP FGDAERDRKG TSSAKEKQAP 720
FSEPGGAPEP RAALQSLEND DERCLQSTCS SLPKPVRVPP LQLSVERPSP RWSSPSHSPA 780
PTGNTKQVLN SGLRRRSEET EGRHRVDPMS PARDGPHLTG LQLQRAGSVL SRSRESLLSL 840
EEESRQSDDG EALGEDEAAP RETEPGETEG RERVRSHAKR RLHSAFLSNE ETAGTSSTLS 900
SFLGVSPANE ERGSSASSIL VGGLEGRRRL ASPRDFPPPT FERILFGLPC DGDQDGTGGD 960
AATEEEEEEE AREEEEEEGE EEEEAREEVE EPREHVRGED ERKGPREQKR SEEKRNGSVP 1020
SGEMRSADVM REKRKRRSRD NKRASAALEV KEQKEDPAAD VGSTVAQGLS LLPSFFAAML 1080
ARGEDMSPPQ TAWNVASEVD ETPREERKKA GSLTRESAVP AKRGLSSSAS LEGSSMRKKL 1140
LSLVQSHSDV SKAKSRRPMN LRHLRDLPSL ASLSPKEKFR DLIVERPRQR LSPPTRKTAG 1200
TSSHAFLEES ASDGNLNRHR SLSSFFSFWR DPQRENSRFS ASSQARLGRP GGARMRAHGR 1260
SSAQRSLRRR DRRGFCSLSH EEGEWSSDRV WRHESEGRRT DRHRASATHG PARRIHSFPG 1320
DRLARAESMP SRVGFRCTST TRRELWDGVA ASALSPDASR VRRQRNCGWR EPSPGDGFQQ 1380
PFGETEGQGE EEASDGEAQA AWLRELRARR RERRGREMAF GPLLRRPQQL QASVRHLGVS 1440
TADFEDARDA PRGCDRRQLP CTDDSLPASP SETFLIVRRR LRPPFKHRRE EAARERRCLR 1500
AESFDGSPPR ASPRRNSLLS PSSPHLSSPV CLSEFEEGQL QGRDFERDRR HTCDSEERRE 1560
RRERRSGRER RSGRERTTRM VREREGSPSS DFIPPHTLYI PPLRLTSERS KRLPDGSRLD 1620
SRRLSDSVRS AVPRGRFAEC SDEEDEGARS VRKDRDLLTS LFELPFASSP ARRHLQQVIR 1680
RHALFKLYDE DEPDSSESLL PEASQTDACG LPAPPGKTKK RDIPRALLEQ LEAPKTPVHH 1740
EQRLSSPSAS LSPSSQSFPS SQSSPSSQSS PSSQSSPSSQ SSPSSQSSPS SRSFPSSQCS 1800
PSSESSPSPS TRSSASSALV FSPLASCARS KRKDQEEQTP ERHGREWRDD FGKEKRRRGR 1860
HREMSRRQRV SDASPASSIA RRRSGEQPRE RRARETSREA EGWNRHACLP QEPLHNVLRR 1920
QNERVHAPKK APRSLVGGKV SESRSSAANG REETGERQAA LRRQGRSRNT QKGDRGTNKE 1980
GNVKTTKAPQ ERLGQTESAE LRQSRRNASS SSLSSISSVS SSSSAAASLR GRLEKDSAPR 2040
RGRKGRKDKR RRLSASSVSS SACSSSPSTS HERTNTDSVQ KRGRRKKSER RSTRKEQELK 2100
QGAQLLEPKF TTEAELVAVV RRLLDQVTER KHSEADSPPV ASLHPLSNLH AVRAALLRRQ 2160
QDLEAERREL TASCLDDGDS YGAKELRDTH RARRDPHGDR GDRKSSREQS PLEKDSLVGA 2220
KPSQPGLLFV KLKVPEREKT SDKGGRGDTE DCVRSVGQKG ALPHPPFAAT RLRELASKSA 2280
ERRGGSPVSD FSCHSSSSDE DIPSQSVSLS SKKASTPVRR QRQRRINRQV WGCMHR 2336 
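The reported modification site can be checked against the sequence block above. The following is a minimal sketch, not part of the CPLM record. It assumes each sequence line consists of space-separated residue chunks followed by the index of the line's last residue, as in the block above. The helper names `parse_block_line` and `residue_at` are hypothetical. Only the final line of the block is used, since it contains position 2312.

```python
def parse_block_line(line):
    """Split one sequence-block line into (residues, index of last residue)."""
    *chunks, end = line.split()
    return "".join(chunks), int(end)


def residue_at(line, position):
    """Return the residue at a 1-based protein position covered by this line."""
    residues, end = parse_block_line(line)
    start = end - len(residues) + 1  # 1-based position of the line's first residue
    if not start <= position <= end:
        raise ValueError(f"position {position} is outside {start}-{end}")
    return residues[position - start]


# Final line of the sequence block, copied from the record.
last_line = "ERRGGSPVSD FSCHSSSSDE DIPSQSVSLS SKKASTPVRR QRQRRINRQV WGCMHR 2336"

site = 2312                    # reported modification position
peptide = "QSVSLSSKKASTPVR"    # reported peptide

residues, end = parse_block_line(last_line)
line_start = end - len(residues) + 1

# 1-based protein position where the reported peptide begins.
peptide_start = line_start + residues.find(peptide)

# The reported site should be a lysine and should fall inside the peptide.
assert residue_at(last_line, site) == "K"
assert peptide[site - peptide_start] == "K"
```

Deriving positions from the trailing index on each line, rather than from a line count, keeps the check valid for a short final line such as this one.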
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS