CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032729
UniProt Accession
B9QHT7_TOXGO
;
B9QHT7
Genbank Protein ID
EQ970689
Genbank Nucleotide ID
EEE29851.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_008680
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
642
GGDAQDG
K
TTQAESQ
acetylation
[1]
Reference
[1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
Jeffers V, Sullivan WJ Jr.
Eukaryot Cell. 2012 Jun;11(6):735-42. [
PMID: 22544907
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2124 AA
Protein Sequence
MEESEALLPG SREAEAESSP QLLDDRGERR SQSVAEELSA SAPGLLDASS YSGSVQASSP 60
HPSLSSDEPY GNACSVAVPT SPLPASADSA PSETGEKEAE PPNVSLPASG RSPLYEAADT 120
DVLLGCKNGL SRSPRNPARD ASSPVTSVHF GVDPASSVGV SSPAAVAVTQ DSPSLPSPPL 180
ETSSPLLSSS PTLSSPETHK SPSVVFSADS QLSLSPVPPS FSSPLSPPRS PPLSPPRSPP 240
LSPPRSPPLS PSRFSAPESS QSSASPPGVS SSLLLAVLRP LLLRLLPASL QGVAADAVLL 300
HLSRKQLQLY DVTLQPQLLD AFHMPFYLLY GYIGSVRVYV PAHDGAGSSA ESHSAAPASS 360
AAEGESRGLS TPERPSEDGG AGTKQLEEGS AGVSASGGGV SASPSTSTAA ALQKQPTGRS 420
TQTPGAHAVT SEERGKGAEG ASASASLVVE VSKLILVFAP KPVEECSEAE FVARLQQRRR 480
RLLDAADLQI MEEDVRLQQG KETQEVDSKT PASGGLTRPF FAAAALLSRW VNRLLQDLLI 540
DVQDVHIRLE AVSVVPRSAA EHLFNPVRPP LPLDRVRSPA HHRNLRDLGS RNLSRYVHSG 600
DQLQTQDSLQ AQDALHPPRH RGAGRERHAR PFRGGGDAQD GKTTQAESQR FESRARGRSS 660
EGELPKTQRC STAAHLTRVL PFAVGVTLRR LQIRQVGAHV FAAAEKADAA GAASFWEEQG 720
QAGGGEAARG EAAREQTSSE LEPREGEWEG TNSGGQSNRV RSRRDGDSSA QDKGEESDAS 780
LNSDAFRFRV FDVEALAVYT HSETLFLSPT TRTNKELHAM LVQRMQTHLS VASSGGGPSG 840
RLEVGAARTA SSAANSGDGF VAPSSSQGLP KTVLANALDP ASHPGTPGLE SGRDVRSAGS 900
QKEQKCVTNV HAATALLEPL LGYLLEPVSA RLFLRQVNGG ENAARTGGGR QLGEKGGAVR 960
AESEKGCDKA LEERGEIHSK NEEAAGEKEN VGREEGKETT EERLALCGNG ETDSEAVAYA 1020
ALLHLDSVQI AVPVQYMYQV VWLWRLLEKH KEKVLLLERR IELFYPYRPQ CQVRGSARLW 1080
WSYAIGCVLR ERRCACAGID PAELSAASHS PGLSPSSRGP PFPADFPAGV LAFGETIKQA 1140
RLHQKGREYQ NHLLAVLRNR ASAEDLRCVE SLQRELPMQI LLSSHIQARR AFLAERKQQP 1200
RGFGATLWGA IWGRSNKGEA PEGQEAASGT SGPPRVSLTQ RDRLLGSRNA IKNTQDDHEQ 1260
LSDEEFLDAW SEAADERGGH QDGSHRRDVR ETLEGPEFLR RSSSSHLADF FDCQEELDEK 1320
QRHEQELQLA ARAAADAAKA ADEEAQILWT QKNQQLDRPF LALSVLLRAA GVELVGARTE 1380
RCRGRKGVSP VALSEDCSRG ETASFSKCSE DRKETPGSPT EAETKQGEAE TFGDGRGGDR 1440
CFQATQERKL GERSGRRHEN QEWDEKVHFR MGGTLREMRI HLVYGDLQTR VRVGLRDIAV 1500
VHQRGCVKQR LVYRQLTLER PLLLVSLSRI LVPGVAEAAQ DVAVLASEVQ NSLRWQVPSY 1560
NSPRTGDSFP GPGHRQRGAI VEEGGKEVSV EMQSMPDRED KTLADAKSTE GCAEPPGSPD 1620
ATREKAMAPG GFHPSALASG ANASHCDLSR AEASHRPAMN RMTPRHLLLR SRERLKALSL 1680
STKYFIQLCA PTVVLPTRME RFGEAPVVAL HLGEIHVYSV DSPDGPQKGA LCQPSSLAEP 1740
FFVSPAGAVC SPAAKAFLRS PATGATAFRL VDLPGACGAA SMAEQAVWPP FSSSGVPRFQ 1800
TGSGEAIHRP SSENVRKANA VDTRRDEAAQ GLRMLVGSRV RHFVVLDEFQ VHRYSSLVAF 1860
TASLDALKTK VAAGREHRDG GETLSGDEEE KRGTAGAANE KGDEKPVVDG GSAGEERTRD 1920
EKRMETQDAS SLRRSPSACD SLLRRRTEGL CLLDRVGMCF AVDITSYQVV DLRALILPAA 1980
LLPSPKAANS LAAASSPDTS AVAAVSGDAG DKVSPDSGPG GTAAVSPVER RPESGDIRQH 2040
WRSASSPLES DAEPEETSGQ QSIMLVKGVL PTVSLHLSSV ALDSVFRYVH AMQKAGRRTR 2100
QRTRDRGHGR SEEREHRGIC CAAT 2124
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS