CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032604
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGGT1_099310, TGME49_015950, TGVEG_038340 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1235PLAVGLGKENNREKEacetylation[1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1717 AA 
Protein Sequence
MAQSATTQLD SSAHRVEHLL QRAGSAEGGS SGPASSLSSA GAAAPLLMPL NNSEGATRST 60
AAAGGFQAHG HLSASHAGPA VFFNPSGSSE GNALVALPSS LPSVSLPLSS SHVDLSQVHP 120
GLSEAHGGGS AAAGASASSH LLSQIGASSL LLLTSPGVSA SSAAVGAKGT VPAGSSVGQG 180
ILFPQASGAT LIANPGAAGP SLIAVSSAPF SGSHPQHAAF LASQGQAPAA WVVDAGGVSF 240
SGPFAVGAPV SGLATAGTLA RRGSPQPSPT AFSSSSFYSP PFQAVPTHEG GVSALAAAIA 300
AVKVEPNAVE ERAEARDATA QAETLGAEPR PECATSPTHL VLSSGASSPH PSPSSLAVSS 360
SVPPASGLLG SSGFSSSFAS AAPLEGPAGA QSPSEGREEP SPGSTAACLL SGDAAPPSGN 420
VLLSPTVGSF ERDLARGEGA AGAVAFEALD AQCRPGQAQS ECETAPTVFF GRQELGNPEG 480
SGVQAGLLST HPSSQPGLAV VSSPILSASF PSFSSSFAPF PSSALTAVTV ERLAALAAAV 540
AGGGPAAPGV AQAVPSGETL GFLGARAPPA VGDANAETPD AAAGPQFATG TQGPEGQAAG 600
KRPRGRPRGS KNKCPRTATR ATPGLGPAPD GAASGAEAGP AREASQEVAA AEAAETGEAP 660
AEASPGAGAP AAKTKRRAGV RKKRADSVGD SGDSGAPAFV SSGPGDVEPT AFVMAHPAGH 720
EAVHGPSLGA SGSPHLPATP EGPAQGPDKE AAAETVAEAP GAEGPACGAF GPPCGAKDPA 780
SFSTTTAASC ADGSGALPGS LGGGSGLEVP APATFSAAGE GEAGADSHSQ GPPVVLVSCA 840
AEGAPEPGHQ APEEEPDPQT RSPERPEGEA AGSVLSAPSV SPPVSVSPVS ASASVSASPA 900
VSDAPAVSAS PSPSEGGPAA LVKADAAGER GMLSGEALAW RGLGMIKAGG IRKSYSASFK 960
LAVVAAAEGM SSNTKAAKQM GVTESLVRRW RMQKAVLEQL PGEKLSRRGR KHGKYVTLEQ 1020
QLCLHVCAVQ QQEGRILKDT EMRRLASEIA GNLAVSDFKA SSTWCFRFKR RWGLDRVQNL 1080
HAGVVPPKRI QAPLEPNEEC VVGNPVGPGG TLPGEETGVG GEAEAGVAAG APAGPADPKK 1140
ADSGLNLGAA AGESLGVGDL GMASQEGPRP DAQAGAQTQE AARAEGETLA LGARGPSQER 1200
ATGDTGSGAS PGVPAEGPPT FVSFSAAPLA VGLGKENNRE KEKQVLFLQA PVDALSGIAA 1260
QSLPGVSGTF VTLATAREAP AGIAAQGLAQ LQGASASSAF ASCPALPSHG FHDRAGAPGA 1320
VARVTGQAPE GLGAFQASFS SGTLPLAFLT PALSTPARAE SEREGGSENG DTGNTPQHGA 1380
LRPPSPPEGL CLLLPEGASQ AEQELALQRK LVELQRQQEA LEREIEQHRV HAQAFEGDTG 1440
EGAGGAAAGN HLPPRTETPE GTGLASPVAS DGRRPEEPKP SEDTQKSDES LPGASHATLA 1500
SPIHTSSPAS PTALQASQMG AGVEDSPSFA SSFAVSSSFV NSPSRREAGD PTPGVSRVSS 1560
APATAGDRGS EHGDRAETET VCGEGRAGLL AQEAPKQASS GASTASFSAF PFPQNPRERV 1620
AFASSGLAAE ALSSLSGPAV PSSALVSAFP VPAPFAVSFA PDGRVTSGDR GEAVSGGPPE 1680
TPQAQQISGA GLQASRADTL EETPGAERAR PEETPQK 1717 
Gene Ontology
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif.
 IPR000637; HMGI/Y_DNA-bd_CS.
 IPR009057; Homeodomain-like.
 IPR006600; HTH_CenpB_DNA-bd_dom. 
Pfam
 PF03221; HTH_Tnp_Tc5 
SMART
 SM00384; AT_hook
 SM00674; CENPB 
PROSITE
 PS00354; HMGI_Y
 PS51253; HTH_CENPB 
PRINTS