CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032729
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_008680 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
642GGDAQDGKTTQAESQacetylation[1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2124 AA 
Protein Sequence
MEESEALLPG SREAEAESSP QLLDDRGERR SQSVAEELSA SAPGLLDASS YSGSVQASSP 60
HPSLSSDEPY GNACSVAVPT SPLPASADSA PSETGEKEAE PPNVSLPASG RSPLYEAADT 120
DVLLGCKNGL SRSPRNPARD ASSPVTSVHF GVDPASSVGV SSPAAVAVTQ DSPSLPSPPL 180
ETSSPLLSSS PTLSSPETHK SPSVVFSADS QLSLSPVPPS FSSPLSPPRS PPLSPPRSPP 240
LSPPRSPPLS PSRFSAPESS QSSASPPGVS SSLLLAVLRP LLLRLLPASL QGVAADAVLL 300
HLSRKQLQLY DVTLQPQLLD AFHMPFYLLY GYIGSVRVYV PAHDGAGSSA ESHSAAPASS 360
AAEGESRGLS TPERPSEDGG AGTKQLEEGS AGVSASGGGV SASPSTSTAA ALQKQPTGRS 420
TQTPGAHAVT SEERGKGAEG ASASASLVVE VSKLILVFAP KPVEECSEAE FVARLQQRRR 480
RLLDAADLQI MEEDVRLQQG KETQEVDSKT PASGGLTRPF FAAAALLSRW VNRLLQDLLI 540
DVQDVHIRLE AVSVVPRSAA EHLFNPVRPP LPLDRVRSPA HHRNLRDLGS RNLSRYVHSG 600
DQLQTQDSLQ AQDALHPPRH RGAGRERHAR PFRGGGDAQD GKTTQAESQR FESRARGRSS 660
EGELPKTQRC STAAHLTRVL PFAVGVTLRR LQIRQVGAHV FAAAEKADAA GAASFWEEQG 720
QAGGGEAARG EAAREQTSSE LEPREGEWEG TNSGGQSNRV RSRRDGDSSA QDKGEESDAS 780
LNSDAFRFRV FDVEALAVYT HSETLFLSPT TRTNKELHAM LVQRMQTHLS VASSGGGPSG 840
RLEVGAARTA SSAANSGDGF VAPSSSQGLP KTVLANALDP ASHPGTPGLE SGRDVRSAGS 900
QKEQKCVTNV HAATALLEPL LGYLLEPVSA RLFLRQVNGG ENAARTGGGR QLGEKGGAVR 960
AESEKGCDKA LEERGEIHSK NEEAAGEKEN VGREEGKETT EERLALCGNG ETDSEAVAYA 1020
ALLHLDSVQI AVPVQYMYQV VWLWRLLEKH KEKVLLLERR IELFYPYRPQ CQVRGSARLW 1080
WSYAIGCVLR ERRCACAGID PAELSAASHS PGLSPSSRGP PFPADFPAGV LAFGETIKQA 1140
RLHQKGREYQ NHLLAVLRNR ASAEDLRCVE SLQRELPMQI LLSSHIQARR AFLAERKQQP 1200
RGFGATLWGA IWGRSNKGEA PEGQEAASGT SGPPRVSLTQ RDRLLGSRNA IKNTQDDHEQ 1260
LSDEEFLDAW SEAADERGGH QDGSHRRDVR ETLEGPEFLR RSSSSHLADF FDCQEELDEK 1320
QRHEQELQLA ARAAADAAKA ADEEAQILWT QKNQQLDRPF LALSVLLRAA GVELVGARTE 1380
RCRGRKGVSP VALSEDCSRG ETASFSKCSE DRKETPGSPT EAETKQGEAE TFGDGRGGDR 1440
CFQATQERKL GERSGRRHEN QEWDEKVHFR MGGTLREMRI HLVYGDLQTR VRVGLRDIAV 1500
VHQRGCVKQR LVYRQLTLER PLLLVSLSRI LVPGVAEAAQ DVAVLASEVQ NSLRWQVPSY 1560
NSPRTGDSFP GPGHRQRGAI VEEGGKEVSV EMQSMPDRED KTLADAKSTE GCAEPPGSPD 1620
ATREKAMAPG GFHPSALASG ANASHCDLSR AEASHRPAMN RMTPRHLLLR SRERLKALSL 1680
STKYFIQLCA PTVVLPTRME RFGEAPVVAL HLGEIHVYSV DSPDGPQKGA LCQPSSLAEP 1740
FFVSPAGAVC SPAAKAFLRS PATGATAFRL VDLPGACGAA SMAEQAVWPP FSSSGVPRFQ 1800
TGSGEAIHRP SSENVRKANA VDTRRDEAAQ GLRMLVGSRV RHFVVLDEFQ VHRYSSLVAF 1860
TASLDALKTK VAAGREHRDG GETLSGDEEE KRGTAGAANE KGDEKPVVDG GSAGEERTRD 1920
EKRMETQDAS SLRRSPSACD SLLRRRTEGL CLLDRVGMCF AVDITSYQVV DLRALILPAA 1980
LLPSPKAANS LAAASSPDTS AVAAVSGDAG DKVSPDSGPG GTAAVSPVER RPESGDIRQH 2040
WRSASSPLES DAEPEETSGQ QSIMLVKGVL PTVSLHLSSV ALDSVFRYVH AMQKAGRRTR 2100
QRTRDRGHGR SEEREHRGIC CAAT 2124 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS