CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031743
UniProt Accession
B6K914_TOXGO
;
B6K914
;
B9Q9I2
Genbank Protein ID
DS984727
;
EQ970682
Genbank Nucleotide ID
EEA97397.1
;
EEE33015.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_113270, TGVEG_097300
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
1058
QHGRGSG
K
GCQDSGD
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1546 AA
Protein Sequence
MLPFPSPMRG APQANGGTGP VPETDGPHAV GASGGRERPG RGSRPGTSPT RDNWRAEKAP 60
AAPGAGGFCD DPSRPASSGG RGGRDEDRAA GLVSGARSQG GQQYFHGAYS HASGAASQPG 120
ASPLLQGGTE RGGPGGPQRP PLIANAPLLG FAPPAGGIDA GRELRSKAFP ASAGAAQGGV 180
GRSASATRGP ETGPQGDSQA PVACGDSGPQ GAETTFVKDL HKNATLWQLS SASFPSGVPR 240
KLGANNASGG EGAEGAGLLR KPVGGAAGAG EKRAAASGAS GAPDAPAAQP VPASGGSGDL 300
SFKALLSTRP APTVLVATHH STKSDEGAAT CSTGGGAAGP EGLHVGLAPE PVVHGAAGDK 360
TRGPGAAGVL GDTPHQATDG DGAVTRGPGG KVFDVDERGK DRARKGERGA AFGARGAAGG 420
DSAAGAGEPA GPEGGIGISG AHPQVGNRGQ EKSHAKADEG ERGAGRNAGA YRKAGGNADG 480
SQRVWGVVEK PGKAPVSRGR DEEKNQAPCA ETTGRPGASG RRRSQQPFEG EREDISGAGT 540
GRGPHSKPQR GGHGRREGEE TAHAGEGGSR VRGAAASTGN KDSASARRGV GVGRGGAEAG 600
RARGGASEVG RAPSPGGAGD NAAYPKGRDQ GLLGTRGGKG GRGSLRGGAA SGRGSAYVGE 660
RATGGMSESN AQPDVHAPED GASRHRGQGG PISRDRAAGT GAAVWLPKGA TKKGETVAAP 720
GAPEAGAGPR RGGETGQLAG SRGPAESRQG DRGSPSASAG PENQGGAAPH THARQAASGL 780
RDSRSNSGSM PPAVADALLT RVNGALSVGA GLVCVTQEDN QEETAGDDEG FQLVKSKRRE 840
QQEKRKKAEH QQRGSDRGAA AKATPFSSPA APGSARSNSR REKGQSGTAA GGDALGPASG 900
TPGEARVPGS TASKSEGPVS SPRDQGTGAE RTEPDDSGAS GRGPRHEGGG RRPSASPRGG 960
GRGGAQQRGE GRKEFPGKGL RTEVALTASS TMEVSTVSSE VTSSVADHRT RQRGQEEKKT 1020
EDGEAEKKEL TSASGLLPLP GGGSQGGGRG QHGRGSGKGC QDSGDAHAYP GEGRASEKSD 1080
RGEGSGFSRG AGAGSSREDG AHGTGEKGRE RRGSSGRGAD RRVGGHMEEK RRPESPAEVE 1140
GGSQRQAHSA PTNVGDPWYA SQDLDAANAM SHRLSSDPSQ SAVSDPQGTD LMIPSGPTGH 1200
QGNAGAQGAD RERAGASPSA GAAPAPPTYS LWSSNFNQPA TASSRVAPET IAAIWARDPG 1260
SGPTAVGGAP RGARLPAGPG GAPADMAGLA SHHPMTPVGA AAATLFQPGC YGAPGGGQLP 1320
PGAPRMPAQR NVTSPLTGGP QSLSKNEMLA PPLLPGRTAP GLVTLAPGQG SASPGAASSS 1380
PRLDLYDPRN PLSGPSFSPV PGVSAPGAHP GVVAGAHSSS YVVPEAGHQL LPGFAHPQGA 1440
AFSPSLSHDG APSPAPGAPG SGRAAGQALW VPGGPAPQGS AFVPSAPGHH QQGRGHSIGR 1500
GNDMLAPQGF GSFIAAQGGR NGAQRFGAMG AGQPGAGAPG GGEFYM 1546
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS