CPLM 1.0 - Compendium of Protein Lysine Modification
Tag | Content
CPLM ID
 CPLM-032687
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_083470 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position | Peptide | Type | References
899 | GREAGEGKETPSSRA | acetylation | [1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2282 AA 
Protein Sequence
MAWPPTPTAS PSAKQTPLAS LLPLHQFFPS SPGCSPTSRH FDVSPQALGF GISESTSPTT 60
GTNALRAALA GMPATSSQAA RGMSLPTSSG GGVFGLSAGH ALSSTPDSPR FSPVRRTASP 120
SAALLSKVLV GGAGSPSGHV TPSSSLQLTT SPTGDPFPLF ATTSTSSDSD TAPFGGAGHG 180
AEGSAAGSET PTLLLPLTRR RSDREPALLF STSVSASSTS DAATMSPTGE RPSGVERGDA 240
GADATPMGAA AAALNQCGLD FSPRLLNTPR SGHLFSRVFS PTGRAGSTAL LTDGGSHPAL 300
LVQQNPLQPL SKGDEAATLL TGALSSGAKD SRQRGGASPP EDSLPEPFAS ENDNCIQKAA 360
SGPPGASITR TMSGARSVSS PASRQGSASR PRGPSSGSSV RRTPPPSLLD PSFLFHPAQG 420
MDSSSCSRSP ALFLFDDAAF GMGGNGLGPG CGSLESVGPS GAVLGAGTRG SSGGGGGGSF 480
DAFGGTPFSP SCFSLPSLFG VGTANAPFFS PSNAAASPAG ASALAAATQR LLCHTGGGAS 540
PGSPLHLPLL SQRQQLLLHQ SSHPRRNSFP SPTSGGGRGP STPTGADPLF GVFSVSTASQ 600
HDSGDTPRAR RSGDCGGSDP REARADRALP AVQTPQGSLP FPSLLELDAG KDRGSGMSGA 660
DPSLFYSSHL FLQNGGTRTL SSPRQGSAVL RSAACGGAQT ATDRGAAGGD TAGLPFSGSV 720
ASQEGEDQER EAATLLEGSQ GTREGLAKEE RPHLGPGHST VSHRASFLPS VLHTPSFLPF 780
PEWQDRSNFA ALMQPGQRPE TLPGASEPDW SKPTVSTVAS TAEERTEGDS AVQGGDGGLV 840
GSQDKPHVAQ GELTTRAGAL ETQPETEGGN LQKAYGVSPS PLGTPSSLGL FGREAGEGKE 900
TPSSRASAEN RSHPATGQSK NAKPYPAAGA LQAGSQPGAA GPVSPTKAET EGSPSGTKAS 960
KSGASGFLAA FEDAAFQQPS EEFIGYCVQK HQARYPTPQN LPGIQMEQQQ RRWCASVYYR 1020
GCQHKRRFSM GRWGPLGAFY AAIEWRQSHY SRLNSLKGIR SPGNSEGAPG SDHKKRRRRS 1080
SSGSCSSRSH PALARGSVGP PGPGSLAESA PSEGPEGTET APGSGLPGMA GGVKKEEAGC 1140
VSEATGAGAA AATGLYPAPR QTSGGDGEPV GFYRRGSGGE GVSVGEKAGA PALVAIQGKE 1200
EESEQLHGLP EAVAEANTQR FGDENARDPQ QPFVGRLESA DGTGTTACDR TGYPFFSGFQ 1260
VPSSEEVDEN AVSEIGGKEG LDVERKRRRE NEFDSRSLLG MSSVPTADGA FGCFQQNGAP 1320
GFGSSLIRGE EDRPDGTGNV HSSHSTPGTC PSFNSPQVPH TQSLSQGSVP EGSSPQLPPP 1380
FLSWNEDRPA SGRDATTTVC SFSGSFLPTG ADSHFHDPRE LGKDVGASLP SQLHVSTNHA 1440
FNPGAVAPYN TSGLPGFPGT LSATPCRQGA GFVSGGVGVY GQETPPGLGM GFMQPQNGPF 1500
DMQHLHSLPS HQMYGHLPHL SGSLGAMGQN GMRPPGASNA SGVPLASSLA AYPTGVVSHS 1560
GQLAIGQPPL SPSQPLPSGG GLSSSHLLPS PHMYGSWMQS ANSQTLNPPN FLLGASGAPT 1620
QTSPMSSTRF PPCQSTVSPS LAPADAHGLT APPFGASVPT GPVQSARAKG GRKEGPGLAC 1680
AGGTDMRKST SREKQAGSGP ARSRRTKGAA SASNGGSLFP VSPYAAPVDP TGPASSFSPP 1740
FRPLSNSGSP PLAPANSDGA APVEWHEGFR RPFGTEGRQQ LLRHLRQVYN ADSAYWGERL 1800
RHANIAFSRI AYATVAELWK IAHVMDCFPI ALAFSNRHSA TASTGVEDGG ALTGAPGTGA 1860
VGLPGAAAVG RRACGAKGEK AKGSRRGNSG NAGATASSKT PQLKQTSALA GGLSSSGAHL 1920
GVISAPQEGS GNLQTSLLFS ESAFPEGGGS PVQSCVDVQD PSGVGTAAAA PETPGDGTGP 1980
EGTGGLQGSR AWDPVTEDFS GAFEHAGVSG RCLAGMPEVS LGARTVGCGR GPSEQGPVSP 2040
GQEDDASASL GPHSRESEKY FSGDQPLCVH TSDEAVHSTE GLILKAGGEE WRSRCLEAPG 2100
DKASGSQMCA GLGGTQAEGT DSEDCMLGMG KAGKVAEYPV KEETGVQGDQ ETESVKPTAA 2160
KPRSMGTSGK RKEEGARQPS LVACARTFVA ADGAEEGDID DASGDRVLSL PRVCTGKKKR 2220
EAKAKKGKEE GNAAVVTRRK EEARAEGHRE GETSAEVAKT LSEFAFVSED VQTLPSVEGA 2280
TQ 2282 
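
The annotated site can be cross-checked against the sequence block above. The following is a minimal sketch, not part of the CPLM record: the helper names `parse_numbered_sequence` and `site_window` are hypothetical, and the flank of 7 residues on each side is an assumption inferred from the 15-residue peptide listed for position 899. It assumes each sequence line consists of 10-residue groups followed by a cumulative residue count.

```python
def parse_numbered_sequence(lines):
    """Join sequence lines, dropping spaces and the trailing residue counts."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isalpha():  # skips the cumulative count, e.g. "900"
                residues.append(token)
    return "".join(residues)


def site_window(sequence, position, flank=7, offset=0):
    """Return (residue, peptide) for a 1-based protein position.

    `offset` is the number of residues that precede `sequence` in the
    full protein, so an excerpt can be used instead of the whole sequence.
    """
    index = position - offset - 1
    start = max(0, index - flank)
    return sequence[index], sequence[start:index + flank + 1]


# Two lines copied from the record, covering residues 841-960.
excerpt = [
    "GSQDKPHVAQ GELTTRAGAL ETQPETEGGN LQKAYGVSPS PLGTPSSLGL FGREAGEGKE 900",
    "TPSSRASAEN RSHPATGQSK NAKPYPAAGA LQAGSQPGAA GPVSPTKAET EGSPSGTKAS 960",
]

fragment = parse_numbered_sequence(excerpt)
residue, peptide = site_window(fragment, 899, offset=840)
print(residue, peptide)  # K GREAGEGKETPSSRA
```

With the full 2282-residue sequence, the same call works with `offset=0`.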
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS