CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031796
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_047700, TGVEG_004090 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1277ALRQASEKALAAEGRacetylation[1]
1547LGSLNCEKSCERGHWacetylation[1, 2]
1874VTPTESAKSLAGGQAacetylation[2]
2655GLGGDSAKGRTMSSFacetylation[1]
2812KNLRSLFKSPSAQREacetylation[1, 2]
3085GASGVAAKEGHGSSRacetylation[2]
3188DVSSLNEKALDSPRSacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
 [2] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3837 AA 
Protein Sequence
MAFAPRTSPR LSAGAGGPSE ASRGGTAAGA PLGPVETPTG REPSSPFLRS ASAKRVTRAR 60
AGFLASSPDN QSRSTSPLRP ADALRISEAS SASAPGVRRS LRASNARPPV SASRGLMGKA 120
GEEGEGASGG GRPGGARRTS GGGEDVCASP RDISYRDKGA GGDATASSNS QSPSSVDAAV 180
SSSYSVASSV SSPPASSLSS ALSSSFSSST LSRSGSSVCP RATSAVSATL QQGERSQESS 240
LLAGERETDR DANRPQRETD EGQGAKSETD RAPEDDRGRS RRGSASPVQG PFSPRGFFSN 300
AVTKENSAYP ATSGQSGQEV GCRPNSTLSS VSVCSLSSRP PSTLASDQLL SVPNGDASTV 360
STSSPSLSCS CSSFSSSSSS LSSSSLLSSS PLSSTPSSFF SSSSSSSSSA SVAPPGEGKG 420
RPPVRSGRGA CPRKPAGPPP RLCVPYQCQF NVEKREWRAR YLFRGQKKMR VFSLARYSPE 480
VAVSLAELFL TFLADNDGIP RSEVIAYWAE TLARGPVTAT TGTNPKGGNL LGPGASEEET 540
VGGEGGEDAE SRAAEKEREE EGKASSSSGS SDQNITRVES SEAKEDGEEN SASSKPPGAA 600
SAATEPAGGD ADGRPGRAAS GPGDACRSVT STETEAAVAV APEAKGGPSS DVSCTLDKSR 660
ESRGNGVAGK RENPAWAVSP SSFAAFVETA KARQWVTEAS RLQAASLPPL APAERPARPP 720
ILPTLASSRA RRCTSHSLIS GLSAREGSQR TVSQGDSLSP ASGLAGEPGA VREAEGREAI 780
AVDDETGGEH RDFPHSQGPA GRGRLAGARP SSSDMRGEKR GRRALREGES KRPCRRREDL 840
KSEEGQRERR RRDTAWPAGR REASHGRQDS RVKEETPAPD AGAALALDGR AAAARDRPQK 900
APSPFGTPEA LSSSLTGSGL HPDGRNPHGH PALRVKLAAG RGNGLLAASP ASPSSASHAS 960
SLASPSASWH AAQGEAEIPG ASTGFVDSPC SANGSLDDSG LGGPAAALQK SWRDRKRNRK 1020
KLSKSMHRKS LASLGMRAPP QNACLADPSD VGLGVQMPSD AGTVPGISPP SFGASEQKAS 1080
SSALGLAFRA SSSFSPKNGD VEPAGRNPPQ FLPTASVQRA DPPGTGAPPS QQVVSSASPC 1140
SPSALAATAS PGACRGGASR NGDPQGERFS FPASPTSQYR WYAHPDGGAT GPSCCRQHVG 1200
GSGGGGWPVV WLKQLEMAVN GPPKFCSYVE AVDKHLRLGG LRRPVAFLPL ASRPASPTGL 1260
GGGLGAPGPA LRQASEKALA AEGRQGQNEE KQVGWKSATG SKAGMFQGDS GETTSERGAE 1320
EAEGTGGGRR GILGKEEEDR NGGEGEKAAT PTMGGASPAA SDDALSPMKA DRPAEALGTG 1380
GSAPTHADSR RAPGMPEGEK MTGPSKEQEM AEAGERDRCE RSLERNAELV LKENVSMTSA 1440
SDVSEAAEEK GAPKKLASSP HSVESPCGRT AEKTGTLNTS EKGENTRTAE GDDPGTTIVK 1500
EEFPLLPAPE TPVTVTAQDL LSPTVYTPRW QATVGKSLEL GSLNCEKSCE RGHWADASAC 1560
DLETKDLRLP EDNKSEELKK ETGMFLGVEG EQVEEAKSSK EAFSPEERER EEQKESSKAA 1620
GGGDSCRTPR QQEATPRASE ECQPESRIDM KVSPNTEMMV EKLEETRVQN TEEPEKVEEK 1680
EEGGSVCRDV SVASPLESPN SRLSEKGDQS ETPAGVAPPS SSALEARAGR DSALLSASLP 1740
LSPRASCPPT QSASPASRDP TPASLRVSSV ASGDRNGPTG ILFRPLSSPH KRVSFCLRGG 1800
AEPPQRPLSE AVPYPLNARL QEIVSRFRLL QGVSAARVSS HGKGETSSQA TPKAVQGEAT 1860
VKEKATVTPT ESAKSLAGGQ AETEKGESPS GAEAATQKAD EKEKTPDTDA TQSRSTSSGF 1920
ETQEAKTAPA SILPASSLPS SDRPSASCSD THASRDAVPL ASSPSSSSSP ALRRCSVRGK 1980
DLVSAPVDSF SEGDSSDARP FVSVRDLAVK LYRWLEQGEG LPAAAGEPQG ACGVGAKAQA 2040
REALRIDTVP FISRWRQMLE RSLSIASDLR KLDLQVVHLV ELTEALHIAV YICGQLRRRL 2100
REGAAPDAGA AEDLAPVDVD DPRGCSQQSG DTRDSSSPAT PGGRLAGGAG GAATSPKGQA 2160
FAPRGGEGEI KPQETGNSGD SKAEGKEASG DANTSEGKRL SGEVDKTAEV ETAGSEDINV 2220
ERGVPGAQAE TARTEMNGGV VKGQETSGDI LSVGSSQVLS LSSPSLSHLA SSSGKGPLKP 2280
TSSPSSSLYA LSPSSSAASP FSAQLASPSS HAPLSLSFRS SSSPTSLSSP LASYPFPQTL 2340
QQTSASPSSS ASARPSCASV KPLREAGDLV RAAARAALEQ AQVFGVGGKL SDATHQLAAR 2400
VTVAVRAAML AKGEGGLTRG DVDLLVEETE RFVREARFKA QETAAETTAL PDGVAEVVSS 2460
EAGLGLQTTN HAPVSPAAAP SAGGAFAGLT EAVEVEARQL PEASERVGRV SSPRGSLGFE 2520
AMDLAGELHL VKVLNAFHRH TECLMNERER LIQATNEDLS FLLHAMELAL PSGLDTPLLS 2580
ILEGDVDILP PLPPPNVEAL IYLHAVSLAQ ADASASPSSP SAVAPCLLSP SARLLLAHFA 2640
GASPTAGGLG GDSAKGRTMS SFPGRPGEER HRADERKGSV LPVRRGRPPS SARLNALRRL 2700
HAVGEPAADA GLDTVNGRFR SKRLRAMSQE EEARRAATHA SPTIPYPLSR YLHRPPRLLS 2760
PTDAGHFASS YSSPLSHPLS KGSSLTSPKR QRRSVCSEAP EHERKNLRSL FKSPSAQREE 2820
APRSLTRPFG PLKGEGFSPA SLGTLGSRRQ SELGIRRRDA LVAFPPAGMP CHPASPGRRL 2880
ERPRVDGADM DGERRRRTRC AGDRLEERRR PLGPVYIPTK VRDPATGRVA VCACDTERGE 2940
RVRKVQLFEK PHVGAFWCAR YGPNDEFVRC FSIEKVGSLK ALVSAVRFRQ YVTGHSLGYG 3000
VGNCVPVETI RSAGRRDRNG DVAPDRPLKQ AAASPPPAGV AGALGRGEVG QAQDESGETR 3060
DAVEEEGRGQ EPLGSGEGAS GVAAKEGHGS SRGEGEGAEG RTDSAAGSTA GDRSTEDSSR 3120
LLSEGRDAKH GSSPAGGSEA LAPGGEHALA EGSEKVGRAQ ETEARKEDLR TSQNETHSGE 3180
DVSSLNEKAL DSPRSSAPQG KSDQGREPIA LRIRSTLPPS EVDKQEAAGQ GGSASELAFP 3240
TGVSLASPVS PFSALARSPI SARASSVSPG ACDRPDVSRR HSGSSDEASE ALWDLGEDLG 3300
FAGDDANFPF LDSENSALLF APPRHLMSPG SASPTGGGLG IHYDKTKHRW KATWTTLDGQ 3360
RASTSFSVKV LGMERARELA LEARQRALAG LDPREVRDEM VAGGAAARDR ERERGRQDGR 3420
REGSERRVGF EAEAEGTEAA SERLRRRGER EDGDEERRRK KTRGDELRGA EGDREERELR 3480
RRKTSEERRK GKNEAAKNEA AKNEAAKNEG GKGETWKVRE GGKTPLGVKS HRAKVVGQTV 3540
ERRGEERRRD LRGSRREEGK TVWGQEQDAE HQVFEGVKED DNERGRRRER RRFEERDSLR 3600
GSHGATPSDE QRQMRRQTIL GSREVDGKPL SFDDTHRVDA QLGIQNEVAF PGPQGVGGAG 3660
NSLQFGREGE RFASSSPVAF LRTKEEDEEI VEVFLTPEGS GSERDKASSV SASSAPRDSR 3720
PASPRLRASR LRESARLQRR LEEAEVHDRG SRPLRPEERR VAKRHVAEEN VDATFSAGAG 3780
GTKKIRPHSS HDFSAEGLSK FQELLTWDCE VEIDGTDAHV WRAVAALPGP RPRPRYV 3837 
Gene Ontology
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro. 
Interpro
 IPR001471; AP2/ERF_dom. 
Pfam
 PF00847; AP2 
SMART
  
PROSITE
  
PRINTS