CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031799
UniProt Accession
B6KHG4_TOXGO
;
B6KHG4
;
B9QCZ0
Genbank Protein ID
DS984732
;
EQ970684
Genbank Nucleotide ID
EEB00147.1
;
EEE31669.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_049540, TGVEG_002310
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
687
VGKAGAG
K
ETAVTGA
acetylation
[1]
1438
AFLSSSL
K
EPCSERK
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1652 AA
Protein Sequence
MLSAAQQRQL LQCDWRTQGL QKEEGRSCWY AHHLQQHQLI PFDAQAVQEA HHGQNESQAS 60
GSMRVHAPAS KSEAPSGTAS SQSHLSGQFE ARQEGLPARA ASYSVQEDQT EHQAALRDPR 120
QEFQTLHFLN SSGAPAMSPD EYIMRQQIFQ QQQQLHHLHA QGFPHALHSS APAQHAHIPS 180
VLSGTENPQG LLHQYEAFPL TRGGGDGGTP SRSDNEGRSS HELSLSDTSR TNGYGVGSED 240
FRAGSTAPHA PETQGLPTLG RPADTADARG PSDHGYSDAS HGSSPLKELE ASSAPAWSSS 300
QGGACTSQPL CQATATSASR HFSEDQQGCG GLSGVPSHEA PSGSSESGAV GSAASDHASL 360
SSYSGEPGSF SGGTAPCFPQ SGSNGTVGPN LMFQPPPSSV STGYGQQFVA SGRFLVGGYP 420
PQTQACAGGG YGPPGFLNQF GAPYSPHLMH PYAHLPPHQL SPFATSQQNN GSQMDFSFSL 480
PGASGAPPDV SQTSQSFPSL VSASLHHPAH LLPFQGPSTP TRRPPGGAGP SSRRAQKRPS 540
FLATGTPSTT CSPPFNADSL LISASTGLPS EASSLYSGSR RSSAHSALGV RDSGSVAATQ 600
ERLLFDETGT FATCTPGSSH SRRSSKTAGN ARELARGGAN VDGSGKASFT WTGAATRASL 660
AGETLPGTES GATVSGKSAV GKAGAGKETA VTGATGNLRG PYAVFATQIA VPPPPELEEA 720
CQRLAELKAQ PLIQQWLDSR DVSWSFAFAL VFNARLAAGS VNVNVRWARE PTVVGGAFLV 780
GKGKTYSWRR APQEASPHII LQCFEAACKE RQKTFGRSQH VDDYLAVLKT DLATAWRLDA 840
EGLQQAADSL VALHGLDIFC GLPVTSKQVK DLLFFDPHAN CFSLAPESPP LPPLHFFFPG 900
SQDSDLASDL VLGITKETAA LAAPDSVHSS APPSPSGDVQ GFGMRCTYTF RCPAPDCLGL 960
LYTLNRVRLF CLNAKLRSSA AGPSSEAVDS NAGFECAPAT ACSRVPFSSG GRAAAAFPAS 1020
FSGALPAGVH TPHADFGDVP HAQFSGSFSR SGYPEGVAAG SGFGGKGEEP SLSSTFASGA 1080
PGVATCEAGL DPASQKDRSL EVAAFSPDAS LAFASSPRRR TKENPEGPSL YERLVHDGGE 1140
SLPCAEAQEF QGRPGGGFVS SHAVRDAERE SQEQSPSSCG VQAKLTAASA PTRSLLDQEE 1200
ERESTLGAAL KEGKRLRVGG RDEARTPSAR GERSDDEKDA LSPKEASRLA QTNAGDREAP 1260
ERHLENSREV TGSCGASEGL APELAEFGDT EAPGFEGERL DGMDNADFFQ EETKSEELSV 1320
GAAARPSRRA ASRGYAKAKK RRRPALSSSR RKMVHFSEAD ASVTEGGPDA QETLDAPEVE 1380
SQEAAEMQEA SVQDEADARE KDDEDSLSSA GTAGLPRYSA PNSTLSGSSL AFLSSSLKEP 1440
CSERKMDCSA AWMPEEQADR HLEAFFSACV DADRKRNREE TAGEHAGGEN ELDAGEEAPF 1500
LKSQRRQEGE FVGVNEGGLE TCMTHVLERI EEGCEDTESD RGQLSERHRR VHPETRPSDG 1560
GPFFEAGIEE ERLHLRRLKD EEKGMEENAG LRFLDDPNTP SLSPEDPGSR GASRVRGEDE 1620
EKGEDEGRRG EEQKEEGSGR RRRDSRARRA RR 1652
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS