CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032794
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 MYB-1, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_058530 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
871INASGGFKPDGGCLYacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1528 AA 
Protein Sequence
MLGVKPHLSH VSLTREGKHR GQQTHLRKTG VPTWSAEEDA SLAELVSRKG FKWALISSQL 60
TGAFGIPRTG KQCRERWFNH VNPEVKKGDW SAEEDAMILM LQNELGNRWA TIAKKLRGRT 120
ENAVKNRFIS LSNARLGYGR PKRDGSSADC FSNRRTGSGK SSGITGMPNL CQSVCSAGTT 180
KKDSSESGNH SVMSVATKVF EFSDVAVDSG VSRTRQCTGT SPSCGHPSAG EGDPSHLKNT 240
DVVGREQPIQ RNNNESGKAA EQTAFSGVKT GTLSVSQDAV PVGRLVVASV GPQHIRRSFP 300
TDETLPKFAA KEHNNQQLND EREHLEQSNS TSEGSFLASA HEHADIARSD PDEDTLEPHQ 360
KRRRKRCAIA YQGEERGDSN GLDSIADRAE QAGNFQAMTT ANTDNRKVDY LEPHRYEKLS 420
PCEQVIQPSL RPACDHRGGP QNSVESGEQS PDAQRQLCNQ GCRTSNRTVH SSVYSNEVES 480
NELRGVFRLA EQSLPSQSGD PAWSTAGFQL SILPQKVEVH SRNKCDGQNV MYRCSPGSLP 540
TTHQQTVFHY DRDSSRFPCA AKPAAASGAQ GTIEENDGLV KEGPSMIVTG SSVEVVHCCS 600
VSLRRRDRSL PSAQLWTSQE TESDTNPSPN QQHESCHQYC KRHAAWCGKT DQFSKLTSSH 660
QENSSGKDAC LVSVSPTVKL DDLQKQSRGT VLSAKEEIGK PETWSHVVDN TYSKTDHQRA 720
SLCAENSSGC AEGSTELVRF SGGSVKSGSS MSVDCGSGNP DDCQDCKAEE IWRGEQRYAE 780
RGHSVESRGA GSVGRSTDLT ITDSGSMPLC ASPIGRPPAD NDTLFLSDAR CNIVAQLNHQ 840
DNSRISRLAS CEEEFLACGG ERLINASGGF KPDGGCLYRM QQAGACNTKL HRPVHSCSTI 900
DSEQLEDLPS VEKAVGDRSF SSKRKGDIPP FAEWKKNDEL RELYRGVSEA VSHGQPGDWN 960
GSWPGISGRA HQTSSCFPDR VNASDRRELN SWRLHVSAAA ELGSSHIWNS QSYASASVSR 1020
DKQREPPKNG LTGCDVPEYL GTSQSAGLPA ANAHERGNFY GHDRCRPREG ERVRWVGLQR 1080
NRKPEASVSS GASNSATTAR PKDSTEPDEG NSEGVSTRRK DSGSTAATIS RAVSLGMVTP 1140
SAACENSSSL TDTSPPLSHR PSFSFTHCCE ETLSRCNSSN YLCPPATCHT SDDGRSLGPS 1200
REAQALRSLS LASGYGYPGI PAEATSFWQG SSLEHSIMEP QMVPSDDELR LWVHPRDAAN 1260
WSQSTLKPVA VVSGTDAGDD QHKTPENLTP ESGQAHRRDG HDMQRVQRCD DEGECPPTTV 1320
ELTFPHSHSS DEMQDLPSKV QGNFLLRREL SDSLQHETAE SVAGYGWMRI RNAGDIPNSK 1380
VPCAWEQCMP ASERERGVND HMSSEASRMS KAASSSFVPS SCTDAPVVRV GEDTTKSVCE 1440
EQQLCEGGNR GSLSPEATGF ESLGPPLQLL LVDGYTPFEP VVEKVSQTME QTLFPVPGQE 1500
TDTRDEDGRY NCECLQNRQP PLHSGGLM 1528 
Gene Ontology
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR009057; Homeodomain-like.
 IPR017877; Myb-like_dom.
 IPR017930; Myb_dom.
 IPR001005; SANT/Myb. 
Pfam
  
SMART
 SM00717; SANT 
PROSITE
 PS51294; HTH_MYB
 PS50090; MYB_LIKE 
PRINTS