CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032731
UniProt Accession
B9QHY2_TOXGO
;
B9QHY2
Genbank Protein ID
EQ970689
Genbank Nucleotide ID
EEE29896.1
Protein Name
CBS domain-containing protein, putative
Protein Synonyms/Alias
Gene Name
TGVEG_009230
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
556
RGGAAGE
K
TIKTVEM
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1668 AA
Protein Sequence
MFSGLTLGLL TLDIVQLKLL INRPNKTAQD ERNAKYARKI LPLRSDGNYL LVTLLTGNVA 60
VNAGFSILLG DLTDGLVGFL VSTVVITIFG EILPQAACAR HGLVVGGVLA PVVYALEWLL 120
FPVVKPIAMI LNCVLGEDLG TIYDKKQLSA LVDYHNNVVH VLTRDEARIL KGGLEFAFTR 180
AEEVMTPMDE VYGIDVDSKL NYDVLSEVLS SGFSRIPVFD RSNSQCIVGL LFVKDLILVD 240
CHAEVEVRKL LQFFGRGLYA VDDDTPLLEL LKTFKQGHTH LAVVRRVSDD GEGDPFYIHV 300
GIITLEDVME EILQDEINDE FEHDKSQSHR RRRHQNQAVS GAQASLPYFA PSSPLAPRFS 360
VSSTAWEAQG DVDTAPEAES RRSRREKKKE KVKGDKRRFR LWSRGRRSST ASSNAPLTNA 420
CEKEAGEGTD GDVEDKRVDR NEASRRPSKE ASFPATCAST LQAKRPGEAE TDLLSGGSPV 480
GLSSFEKSSE CGNPERGSLG EASSRHSRWT AKRLEEGGAV ESHFLEDEPT RELPRGDTEE 540
GEGQRRRSRG GAAGEKTIKT VEMTPLEPMH AVLTTSREEP PKAEEIRRED ALGHPSLFPW 600
WGWSADTAGS VSKLRMRGTL RMFFDSHRRS RIAEPLTESE ARAIASFLSS SIPAFSPQIV 660
DEKLLVSLLS SFFCIQPPPA SLLWHSAVHP DLFKHKSTDK RGEHEAAFEP FAVIVLYVLG 720
NPAPVLLGQV ASQQPAEFSA PPSSPSPSGA LEFRETPVHM LDQPSCLSAS ADTLVVSSPS 780
RSSSREQLDS SSRPAASSDS SDQDSVCSSQ RLSTATFNSP SISPEAASRR ASSPRLTSRS 840
RPAPCTISTF AVSPVALPGC LPVSAGVTVS RGSSVFPSPC VWDAAIHRKA ASPPRGDWRN 900
AFRSEEPEAK ERKRENGESD RREETQGKEC VQSGAPDGSL DLGSDGPRRR RDIFCVLDAV 960
RENPGHTREE DFLSSTDRRS SSRSSSSSRH KSPREPSTSV PSPKESGTAW LGPWGTRRDS 1020
EEPRYAPLAS CVSSNVDNVE APSCRDGTRP LHFASPGKPP VEGKSRCCSP EVRFSPLKES 1080
TTPLRSLPPS TGLCQRSGGG VYRSLVETTS SLASLSSRGK ETDRETSREA SQPASAVQIA 1140
SERGDPSAKR VCDEEDQEED YEKEGYEEED REVSPLVSRR LLRDGLTSPD QFTLISSHAS 1200
GGCSSSPSLH DASSGGVFVE EAPQGSQTKS PSTGRSEREK TACDVDSACV EIDIGNRRAS 1260
TFEKDAFRRK ETYPQFGDTE RSCRKQNEML SWLRTSATSS VSLESGEEDV ELTAPSQAHS 1320
HVRLEEVCVC GVSLSLPWLS PKKGTDREQR RRAREEGLCL GLSREEPPEG ESSVEGTSLE 1380
KKEEQSAGAS AGRRRLPSEK SSLAFPAFSS QRLSGKSIFR RETAGTGVSE ISHLLRFEKP 1440
QADWSAARGP EPGPRREREE ERQEELSCGW GANRQEGPAK PRVQQSLEEQ LQIQETCLRQ 1500
LAKDMQEKKG VWLAVSSKQR GKEYQRRRDV EFVDEQDKAE RSSQAKSERE GREKQDASAR 1560
GAAETGEAGA ENEEPEGDGK KRIQEVRNRE ALCHEIHSWV QSLQDSKGGS EARGNEWSNE 1620
QTKTLTCEGY APDYTAVTEG ACGLLVFPRW FYVAAVKASL DLARSSRS 1668
Gene Ontology
Interpro
IPR000644
; Cysta_beta_synth_core.
IPR002550
; DUF21.
Pfam
PF01595
; DUF21
SMART
PROSITE
PS51371
; CBS
PRINTS