CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032731
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 CBS domain-containing protein, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_009230 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
556RGGAAGEKTIKTVEMacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1668 AA 
Protein Sequence
MFSGLTLGLL TLDIVQLKLL INRPNKTAQD ERNAKYARKI LPLRSDGNYL LVTLLTGNVA 60
VNAGFSILLG DLTDGLVGFL VSTVVITIFG EILPQAACAR HGLVVGGVLA PVVYALEWLL 120
FPVVKPIAMI LNCVLGEDLG TIYDKKQLSA LVDYHNNVVH VLTRDEARIL KGGLEFAFTR 180
AEEVMTPMDE VYGIDVDSKL NYDVLSEVLS SGFSRIPVFD RSNSQCIVGL LFVKDLILVD 240
CHAEVEVRKL LQFFGRGLYA VDDDTPLLEL LKTFKQGHTH LAVVRRVSDD GEGDPFYIHV 300
GIITLEDVME EILQDEINDE FEHDKSQSHR RRRHQNQAVS GAQASLPYFA PSSPLAPRFS 360
VSSTAWEAQG DVDTAPEAES RRSRREKKKE KVKGDKRRFR LWSRGRRSST ASSNAPLTNA 420
CEKEAGEGTD GDVEDKRVDR NEASRRPSKE ASFPATCAST LQAKRPGEAE TDLLSGGSPV 480
GLSSFEKSSE CGNPERGSLG EASSRHSRWT AKRLEEGGAV ESHFLEDEPT RELPRGDTEE 540
GEGQRRRSRG GAAGEKTIKT VEMTPLEPMH AVLTTSREEP PKAEEIRRED ALGHPSLFPW 600
WGWSADTAGS VSKLRMRGTL RMFFDSHRRS RIAEPLTESE ARAIASFLSS SIPAFSPQIV 660
DEKLLVSLLS SFFCIQPPPA SLLWHSAVHP DLFKHKSTDK RGEHEAAFEP FAVIVLYVLG 720
NPAPVLLGQV ASQQPAEFSA PPSSPSPSGA LEFRETPVHM LDQPSCLSAS ADTLVVSSPS 780
RSSSREQLDS SSRPAASSDS SDQDSVCSSQ RLSTATFNSP SISPEAASRR ASSPRLTSRS 840
RPAPCTISTF AVSPVALPGC LPVSAGVTVS RGSSVFPSPC VWDAAIHRKA ASPPRGDWRN 900
AFRSEEPEAK ERKRENGESD RREETQGKEC VQSGAPDGSL DLGSDGPRRR RDIFCVLDAV 960
RENPGHTREE DFLSSTDRRS SSRSSSSSRH KSPREPSTSV PSPKESGTAW LGPWGTRRDS 1020
EEPRYAPLAS CVSSNVDNVE APSCRDGTRP LHFASPGKPP VEGKSRCCSP EVRFSPLKES 1080
TTPLRSLPPS TGLCQRSGGG VYRSLVETTS SLASLSSRGK ETDRETSREA SQPASAVQIA 1140
SERGDPSAKR VCDEEDQEED YEKEGYEEED REVSPLVSRR LLRDGLTSPD QFTLISSHAS 1200
GGCSSSPSLH DASSGGVFVE EAPQGSQTKS PSTGRSEREK TACDVDSACV EIDIGNRRAS 1260
TFEKDAFRRK ETYPQFGDTE RSCRKQNEML SWLRTSATSS VSLESGEEDV ELTAPSQAHS 1320
HVRLEEVCVC GVSLSLPWLS PKKGTDREQR RRAREEGLCL GLSREEPPEG ESSVEGTSLE 1380
KKEEQSAGAS AGRRRLPSEK SSLAFPAFSS QRLSGKSIFR RETAGTGVSE ISHLLRFEKP 1440
QADWSAARGP EPGPRREREE ERQEELSCGW GANRQEGPAK PRVQQSLEEQ LQIQETCLRQ 1500
LAKDMQEKKG VWLAVSSKQR GKEYQRRRDV EFVDEQDKAE RSSQAKSERE GREKQDASAR 1560
GAAETGEAGA ENEEPEGDGK KRIQEVRNRE ALCHEIHSWV QSLQDSKGGS EARGNEWSNE 1620
QTKTLTCEGY APDYTAVTEG ACGLLVFPRW FYVAAVKASL DLARSSRS 1668 
Gene Ontology
  
Interpro
 IPR000644; Cysta_beta_synth_core.
 IPR002550; DUF21. 
Pfam
 PF01595; DUF21 
SMART
  
PROSITE
 PS51371; CBS 
PRINTS