CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-032661
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_068900 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
616       RLKALAKKGAGKDGE  acetylation  [1, 2]
620       LAKKGAGKDGEGPSG  acetylation  [1, 2]
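Each peptide listed for a modification site appears to be a 15-residue window centred on the modified lysine, with seven residues on either side. The minimal Python sketch below checks that reading against the residue numbering of this entry's protein sequence. The ±7 convention is an assumption inferred from the two rows, and `FRAGMENT`, `FRAGMENT_START`, and `site_window` are illustrative names; the fragment is residues 601-660 copied from the Protein Sequence field.

```python
# Residues 601-660 of this entry's protein sequence (spaces removed).
FRAGMENT_START = 601
FRAGMENT = (
    "VDHLTADSRL" "KALAKKGAGK" "DGEGPSGCAS"
    "AGGDIGESGD" "TGLRASEASG" "LGGEPGRDEA"
)


def site_window(fragment: str, fragment_start: int, position: int, flank: int = 7) -> str:
    """Return residues position-flank .. position+flank (1-based, inclusive)."""
    centre = position - fragment_start  # 0-based index of the site in the fragment
    if centre - flank < 0 or centre + flank >= len(fragment):
        raise ValueError("window extends beyond the supplied fragment")
    return fragment[centre - flank : centre + flank + 1]


for position in (616, 620):
    peptide = site_window(FRAGMENT, FRAGMENT_START, position)
    # Under the assumed convention, the central residue is the modified lysine (K).
    print(position, peptide, peptide[7])
```

For positions 616 and 620 the windows should reproduce the two listed peptides, each with K at the centre.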
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3946 AA 
Protein Sequence
MEEDGGAPSS PCGVPATGGV EKAVAYQSED MLGGEANIGC NTAHGVPDVQ ISPPSAGISP 60
SPPSSGTPPL LGQAAGLDVV EANRSGDEAD AAFSATRRPD SPASFMEGED PDTPRSPTQQ 120
TTSPQPPLSP QPSLSPPLVS EVVDALYPES PRPGSGALTR GMCLSAGGPV SSGPPLSSTP 180
PSPASSRSSS SPGGARVSSP VSAAPLVPTF PVPFAPERRA EPGRCRIREG QASSSRVDRY 240
GDSVPAEKAG RGSPSFLERN AFDRETNSGP TGIGGHSPFF SPLQGAGVSS SAGGSGGSNR 300
PQHPLVHAST PSWVKYGEEH YPEEAVPIEY QWVEDDGTCC CYVTDFLLYN KKGERLLQVE 360
TLDQDELADV ALYGKLVHVA LVHPPEALEA RKHPTACQVY PHLQQRKRHQ NSSRGRKHLQ 420
TPEDEPASQQ VSPRAGSALP RDGDKEDRQA RDGDNAWRAD GDSAVRSQPR EDGEKKEEKL 480
LVEGEKETAS GAATPSLGSR KVLRKRGGML HHPYITPKIP RPFRTFPVRV DLTEWMIDYG 540
KSASQTPYVW LVSSLNRYYR LEKPAGRYAP VLQSLKYKFE VASRVIKQLS VNHLYPYQEL 600
VDHLTADSRL KALAKKGAGK DGEGPSGCAS AGGDIGESGD TGLRASEASG LGGEPGRDEA 660
SEEKAGDDAE EEDEKDEADC SLGKGAGSQP ASAGKSRLGE GEEGLTYTER LKDDLSRPSS 720
IAGDDDEDRA GVDLASGLAA SSPSNRVAGD AHASAASSIL GTDFGNLVLP PGLGSPSSSL 780
PLDPSSLFTS SKFISALAIK KNKNCHLLSL SDPDGTGVRV LPDGSRSPES PWGAALAVEG 840
CTEESLIEVF PFLEAQVAAF EAATGTTGLL SSPFMNTLRA RVFRWVGKLD DDNRVEEVPP 900
LQQTTTPEIV GGRTDEGGLL SHQSEDVLLS LNEFASSCAA ERPPLSGSTA FARGGPGDRG 960
QPGPAFDSPA FGDGAGFFGP LMGPRGGRRR GARGSRGARG EGGRGRGRGR GGRDAAFARF 1020
LGDDRDDFFA GDEDARVRME FEKSDRLKRE GDAVDGEDDE PLTNKTGDGC LEDTLGLSDD 1080
DDGRKRIKRE EGGTSQGGPW GGKGDDAQRK AEEEEEEDEL STLPEWGIEG DPVDVKDELD 1140
LEAEDPVYAR VPLPGTSFTP YDFPELLEVQ DFCRVFIEML QLPSYPVDTL EAALLVAASS 1200
TVSQDFSTRA AWPAQAKGTS GLGAVKNGRK KRNPAIPQNP LPFIAASTCL APSPEPFQFL 1260
FPLAAKREDA ASGLRQPGAI RTGAALRDAA AVALANAAKA EESRSAAGHP GQLALAEKAE 1320
EGEQNVRPEA KREEGSADLG DSAATERDFA KGPEDESQRE GERRDDTQEG GESDLPAVSQ 1380
GDGAAAAAAK CLSRPPDEGP ASWSAGSVEP AALLPSSGTH APRSRSPFCG PPLLSGSARL 1440
HASSFPVFVS DALFLRLLQL AFLHISDALA RHNKTASSGD AVNPDARSPL PSLLTGAGAA 1500
SAGIVTPGGG PGVSGVPGTV GGAETPVAAT WLGSARADEA WDLGDPSTGD TSKDSGSALF 1560
GEEGGSRATR AQGEEEGVGS SRSSVASLWV HPGGPGFMNS EGKAGGGRAS KAAVGVAMKQ 1620
FWRHAALGSR LRRLVPRPAD EEEEERDEKR KGVEEQDGER ESGEKARKAG RRKEAGAEGE 1680
APEAETDDAT TEEEGQGRET GRDGRARRPR QTTPGAAGSD ETNNEAEVEP KVLHMLSSAP 1740
LDLRMIDFLT WPLVLQRMLF DCLYSPRRVA VKKPRRRSTR SEDEAREPEK PLGEAALRED 1800
NSLECTMKEG EDEDESLTTA KEEEKDGEEA GPPESKESQK ACEGKADSGP SAEETGQSEV 1860
GEESRGKHEA AVSSLSSSGA FQGCTPSPVN SQASKAEGER GKQDVSQKDG REHDGERASP 1920
GGPEEGACES GRGDREEKED AGDEGDGQMS AECRPVTEET KGDGEDSPDP GRGTREEDSG 1980
ETATASKATD EQEAEMEEQG DEKENGVLLD EDAEQNVEDR RDDDVESFQE EGKKTSRSRE 2040
TKRKRTADED QYEEERGEKD ASQKGESDET SLPADGNELK ASSSRKKTRR TSNDVELGDE 2100
EGDTVSKGRS GEEDDEEEAE EEEVERLEWH LDDQKAAQCG ILPSRAEAMA IAFHEMRSHS 2160
TTDVTRRLQL LQWLIACIAN GWMGKFFLDA KVEALFRARA NTLRLRMASM GTGPIGSSLA 2220
AGACRGQAGA GSASGAKAEG ADGDSGLAPS EAPVPPPSTN TEDSAAVDGS ERPEDSSLPV 2280
SASDTGETCV PCQSPLSGDA AGPAASGTEG ASSPSEPAAL SGAGRGRRAG DGEEKRSGRG 2340
NGADGDSTLG PGGKKKRGGA SAGCGVGVKG AEEEDCVSMS EGASKAASNS AEGFEAEDEM 2400
GRAGAEGVCG ASGPGGENAL NPGGRGVGRG KKPMGRVSNW EMELLAQKFP IRGEILGEDR 2460
FLNRYFLVPT ARGVPRVFVE AFSQSRIDPV DAINGFLDHV TAAAVETEER RKDEKEEEGA 2520
SALGLPPGTP TSLPVRSGAA DSGALSSGSP PAGLEEASNP AACGQGGPLA GDRVAEVGRR 2580
SSRRIAQMKE EQRQQELQDS LSSFSPATNG CRSDDRKRKR ASLSEGKRCG FGGSLLRPSA 2640
ALKDGSVNGE KAKFLRPEET RKHWNDFVRR AVGFESISTY LHYLDETLKA CSLFVVPPGK 2700
PLQQLQHALS PFLVRERALK EKLRRVDQQF QQLAAATPLP TPVPTFVPPS PFAAAIWHGC 2760
RLLLQIEEAL LLPLFSSSWG FASMFQKQQH ILRLLDQTRD KASPSLSASR GRGESRRGGG 2820
AGSEDEGEKE KEEKPERSSK EEEDAATQKE RELLRQLQSR VWPQDMLHKC LSTLAAHANV 2880
MDGARQGGEK REDGTNKMKP DAKAEERTKT MKEEKPSRLP TADDATDGDP AGRACQIQDP 2940
EAEEREREER REEEKEAQWM ERRICLGVFV QLLLYVEDRI HMQSNVHTAA WHLQQHEQWR 3000
RELTALGGAS LLAQLALPLP SVDVSSTALA GDSENREDIG RERGDKNDRE RRSDEDSTLA 3060
SPTGCRRRIL SEDDATAGEG DGEETGEGKA DKGSFGPQSA KGEDAEEDRE KPSLDWQSRD 3120
GDEDGDRERE RHQSEREMES EKAGDASGGR ISPSSTSSKN APTLPPSPLV STAVDPECAV 3180
PVSVPETHSV LSPNSPLRRV RDLIETIGWD CDRVFNFLRE LRIRELDEMK RARDLEGRTP 3240
HKGLLDRELF EGVTLQGAAT FEGEEDPVLS WTEVRNRHDV LAIWGCYLNL WKAHGVDKQK 3300
AVDARTAVDR NTFLSLLAHD EQKAQLPEVG SRLFYFRKGH EQMMKSMQQE YVETTGGKSW 3360
LDSTSLPLSI PFELGRVEEL IVESISYHPG IISPLPRAPS PASLSSLLSG LPGSEAEASS 3420
LLPEKLYSFS IVDDPYALGL LVGDAFRQKA RHGPSQPGGA KREASGPTGA GTTPASTPGV 3480
PSFAGASAPS TADLLLSSRP SSADGNAEGD CLAVPGNTLA ASTPGPASSV VRSPGVEHNS 3540
AAGERGEMDG SATPSDPFLS EKTLPAGVVG KSEFELLCAA DPLAVAAQEA ARGKHLLLQY 3600
LAGKNRDEGD GGETETKVPV ETAPYYRMVC RVLRRSHETT ADNITDRVKS IVAAAAAANE 3660
EKPHTRTSSR LLSYQNRSLS SRSGSRSGAS GERRGGASGG RKGQGGSGDD SLQGSEGAFF 3720
QSPHVSGPQG EREVVLCVPM RADDVEFVVR REKVFHALNL NWNPGMRFRM VFHTTVPAPV 3780
EAAPACGKEK DGAGASGAAG RDGGAGAGGG AGSTGSPNST GSSAGGSNAG GGGAGLGEKG 3840
NAGSGERPSF VSTTTRYTGT IRRVDLLHPD FWENVVVEWE DRSRTGVGAC TAGSSSRGTG 3900
AGIDRGATER GSGQKHSDAD AGLENVSLWE LEPLKRLKRP GGNAGD 3946 
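The sequence above is laid out in blocks of ten residues, with the cumulative residue count at the end of each line. The minimal Python sketch below recovers a plain sequence string from that layout and checks each running count. `parse_numbered_sequence` is an illustrative helper, not part of any CPLM tooling, and only the first two lines of the record are used as input.

```python
def parse_numbered_sequence(text: str) -> str:
    """Join ten-residue blocks into one string, checking each line's trailing count."""
    residues = []
    total = 0
    for line in text.strip().splitlines():
        # The last whitespace-separated token is the cumulative residue count.
        *blocks, count = line.split()
        total += sum(len(block) for block in blocks)
        if total != int(count):
            raise ValueError(f"expected {count} residues, found {total}")
        residues.extend(blocks)
    return "".join(residues)


# First two lines of this entry's protein sequence.
EXAMPLE = """
MEEDGGAPSS PCGVPATGGV EKAVAYQSED MLGGEANIGC NTAHGVPDVQ ISPPSAGISP 60
SPPSSGTPPL LGQAAGLDVV EANRSGDEAD AAFSATRRPD SPASFMEGED PDTPRSPTQQ 120
"""

sequence = parse_numbered_sequence(EXAMPLE)
print(len(sequence))
```

Run on the full field, the same check would compare the final running count against the stated protein length of 3946 AA.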
Gene Ontology
 GO:0005634; C:nucleus; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro.
 GO:0009725; P:response to hormone stimulus; IEA:InterPro. 
Interpro
 IPR010525; Auxin_resp.
 IPR022702; Cytosine_MeTrfase1_RFD. 
Pfam
 PF06507; Auxin_resp
 PF12047; DNMT1-RFD 
SMART
  
PROSITE
  
PRINTS