CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032728
UniProt Accession
B9QHQ3_TOXGO
;
B9QHQ3
Genbank Protein ID
EQ970688
Genbank Nucleotide ID
EEE30318.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_054520
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
1976
RPRGEGG
K
DECELSV
acetylation
[1]
Reference
[1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
Jeffers V, Sullivan WJ Jr.
Eukaryot Cell. 2012 Jun;11(6):735-42. [
PMID: 22544907
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2132 AA
Protein Sequence
MADPPLHLSA RAQVPLRPQV LSDAVRPSAV SPKVAGPPTG NPSAHGFQGI DASPVPGAPL 60
RNSSSDSLGG GTLSHGVSMS KSGLSVALAG VQSRSDVSFV LPPQLTSSSG APESGENNGR 120
ALREPLLSPS AACGAADAQV GQQATPPACR LADPARNAGT PQIVGKAPPF RPQKSPGAPV 180
QNGLQIRPNP QAQERTGLAE NSPADGVGSR LSASSSATLP TGRLHAARST FASTGGSHPG 240
SGNQEAPEPI QVPVASHISS NSPFPMAAAP SLDALSSPEK LSLPSTLNRI SSAAPASSLG 300
RPQAAQPLLG GVRVPDGQGR SGGPLPVQPV SSSALAAASA PSRSHAVSIR GPFSLNCSPT 360
SAPPATSGPP RALAGLHASN VFTGTAVAGA GCAPSHQFQA RAPVPARLPV SAGSSAPATI 420
SSSSAVASTA PSGSRPAGVL SADAAKTPNP VTSAVVNHSA SGGAALPRFT SPFASIPARS 480
LSAAGPPGDS PGAPSASVSP SATAVVAAKR RVLPPGGQPG ATAASPVSRV SAASRLANFA 540
SLGAQATLPT ALNCAATAGA AAGGGAGSGT GGGGPGVPGP NASCPVRYGS LGSSAAAGVQ 600
TPRGSQGPAR LSRGQAGGPT LTHASGRPSP AAKAQCSRSE CSSPRALLSS TPSPPGSPSL 660
RLHPRPGVCT AGKGRQRSTS VGSAHSSAAV LAAAASRLAH LSGKAPSPTA GLGPSRVPSS 720
PSNAGLDPLL SPGGAGGSPL AVGGSPLFGR GLGAQPTDAA SRAAAVVAAA AAAAAAGYSS 780
LAAATSSGLS SCAVLKSSKS ANAPSPRARS QQSADSPSAS GAKRLASGRA GASTPGRRSI 840
SQETPREVLS GDGALKPGPD AEACKEKETQ SPSKEKDGEA DTPAGGRQGS SVASAGAPEK 900
LDKSEDDAEA ARLQILASLF PRAETPVTSG SKTPGPVSGS LEPIAADGAP AHAARQSASE 960
EADDDHLRDF WSQELEISKV PQYLDDNPRG EGWLVVASPL PACVARQDLL REKVRAHLAP 1020
QLEAATGQKQ GPPPGFRDCL STATRNPFPA GTGRTSVDND GRRLSDTASA HQALLSNVLS 1080
SRPSSPKDAS ALAPSRQLSP EERGDSQRPV CGEEGPGATG AQGAQRETDG TSGDGHAPPL 1140
KRRRTDEDCA TAVGNGALTP PDDLLHVLPR LSSVERRRMR QLEQPVLSKE RRQKVTQHLC 1200
ASARNLLQCV QEMQLLGALP RAIAAAPEAP PNGLTDCRSA ALDELRKAML DLPPSALWSG 1260
LHAPGPFSSS RRQASSLSPA ERLGTDGGDE AATAEEEDSE RPDIFFHLAK DSDQESPAPP 1320
GGVGSGLWAF PGGAGSETFS GRGSASAVSG MAPAPHRETS FSPSPIDAEA FWFGDEDSLV 1380
WSESEEEGDA VEAFPFGPGD PYSVLYSGSH EVSASRHPLA DLAGALAASE VAARPAQGAS 1440
DDAYSPKLGA SFPSFSAPRT LREAFQEARA TLPWRVPPHL NEEEAAAWLR EALREQKKRE 1500
TEFRNLRLFQ LERKAVGAQL SLCSPRHLPA GQEVFAIERP GDGLRFLPLT AEEEDELHLL 1560
REQRRRDWEN VEKAKQQLRA VMCEKDVKAQ RHIQARLQRL SERLVNRERR RAEDFRRLRE 1620
LREQELKWCE ERKIQAAQEE REREALKVED ATAAERERAD RVSREREGER EKTGEGALHV 1680
HFRGEDRRPS RLGAAKSTAV SESEELPRSS SELPSDAPNS RPFPVSSTSL LQRQSAEVPT 1740
GGTQAAPSGL SAPQGLAGAA DGASIPAPAE TSNLLEKSST ETAVSSFSLS REISTPEANG 1800
KALEVTPCLK LARSAPVLGQ PATPQGGASE AGDRRSYFLA SSKTEANGGR ASPSEGENGD 1860
DKAPRDSGGP ACNFSEARDN PGDPNVLSQP SKLNVNPVLI ADRQTANSIR SGRAGEVPAG 1920
GTEESVVKAT RGGETTQEGK ENQELKMVEV SAGASPAPLS VGGNQADLRP RGEGGKDECE 1980
LSVSPNEGPR GGTGRDQDAT STLHAESQGQ GCGVPLAHAL QRAPESSMFQ PFEQLQPSQL 2040
QQLLQFQQTQ RLEVPAPGAQ GTHSLEAAPA RLREDPLPPQ LRSVQHQHSL VQLPAHGVQQ 2100
PHLQALLQKQ QLLRQQILQN ALAQQQRQNR PR 2132
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS