CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032712
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_046960 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
2256FRDRRAGKDSSEPFAacetylation[1, 2]
2267EPFADSAKGAARQSQacetylation[1, 2]
2513LGAWGAGKRRRARCGacetylation[1, 2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Metal-binding; Reference proteome; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3112 AA 
Protein Sequence
MGSVKVAAVS PPGVGRHRTA SGFALLPSVP SSPSSFASSS STNGRDASTS TPARVSSPVS 60
ETRAASASGG THRAAGHDGA LSSRNGEESR SESPKVRSPT SCSTPSPGRD GGEPVDGDKG 120
RGRTSLRCPS SGGLTASSRA SSVHAETCGS FSPGNSSASA ACRRNDSQAP KQDEVRDSFS 180
SRGCRTDQEQ SSGPGEETPL WSLGSVRRSS RLNQVGADSE QRVSAPSLDA AALPLRSPQQ 240
LLLPTCAQAS SPTSGVSTRR GAVGGPASGA QRVKADRAEV AGPRRGSSKA DRDAASDGLR 300
SEEKNCDENF PSALFGDSRN TPRALTPVTR ATASRLPASA LVPASPLSRH RQTPRKVSPP 360
EGLSGRHHRR GVRGKLKQET AAQKRNRDGE REKKDCSSAS GPGKKAARGG GKAAGPLGRR 420
PSRFQSLDVT QCVAPDGEKK APTSTATLPC GTSLRGGATL DAPLCFASST DASPASSFSL 480
SRHNTPRNGI PRGEEREDSS ALPVAGRTRS FLAFPQQKEA SEYTMLSRLR GGREAAPTRA 540
LVRTAPANRR EDSVAESRSR VGYGSAVAPG SEETGDRGER GNSGASEGDR EGGKGGELAD 600
VEVKLEEREE RSGVVLSGVA NCAFEVVTVE EVARRRKKHV ARTQRLSARA QREQERLRDI 660
LQKQFGKRGS FWGGCGWVPA SSAAKATSGV STPHRSPQVA SHTWQPFIFC PYFGEEHRRV 720
YRHRHPRLYK ALLACVQEGK VHDGVQVIRL LDFRHPVRLV TPEGEDAYSL RYVGPRISAE 780
HRERIIFGEY TGYVSSDEDL RADHPQYCFQ LKFHREAFRD PQVVRTQAEG MQSPQRAARR 840
TDREERSEGR GSGEEGRGER ERGEGTCQDE ETYRDWDAKE DWKGAASSVA PSVVTIPHDE 900
MYAVDSTEEF NEMSMVNHFQ TVDLFGGRKC RINAEWQVVY VDAWPHIVLT SIPGVAINPG 960
EELLADFGFD WFAQINAECM RYCRRELQAL RLRDKLSPQA LRAFLREQEA ADDEEDEAFS 1020
SDQLCFLCYA NQKPCKVALT GEAQNKKKVK RSESASGAST LHDTRDNTQK SREEEATEKA 1080
VGEKEQETTG SAASVPPGGG GREDAHGSER EEKTGKSGSA GDQQKWLHAT NRQSVKRTWE 1140
ERRVGETRGT ERREKEERGI VCDGCNRAYH LECIHRTQDP PADEFEWFCP LCILFAERVN 1200
AARQARKDEE REAESSALKM SEDAEETRRA SRDEAAERET HLGNNQHSPC ATDGRILFSH 1260
GSRSSSPSSS SASSSLPSSP CCLSSSSSSC SSASLITGAG SSQPSSHEAN VENGSVRLVF 1320
AEASRLCPPS AADAESSPGP DMPNSLSCLS SGAAVPLSSS SLSSSLSSSS LSSSSLSSAS 1380
LSSASLSSSS LPVISSPEHE RKPQAVLVEE EPGEPADISP PHAAPCVASK EARTNAKAFQ 1440
APSCSVSSPP LPCLPSPSSP VDRHRGEAEN VCVIPQSLSD VSSSVSSSSS SSSSLSSSSS 1500
SSSSLSSSSP SSSSFSSSSS SSSSSFSSSS SSSSVSSSSS SSLSSWVSPT KAEDVGESPR 1560
VVKPRVVFSS VAEAFASQAK KIRVRPPPSV NEMGAGRQRG KLTDQPTLPE SPHLSTLLPC 1620
RACRSRFGDD ASGVTCRIFK LHLAKNFSDP GEEPATETPL EICHDIITAM RNVVMDYRRE 1680
ELEAAKAAAA PSPSNECALG HTKTGEETED GSVEGRAFPE LVSGEDLLAR MSRPSHRFVP 1740
LLGVYLGETK LERRRPTGVH VGKVAGLIRD AEAGAPKIVV KYEDGDQEVF SPKFFMTELL 1800
CQALRPRQHQ GASPFEVLFR ADLYPYISRE TRRHTRLPLP QSSLCSEEGG CGEETPRVAR 1860
RRASVENEVL VALALSGALS EAGAQFSEGR SAVDCFLGVD EEEEEGEAEG VAEAFEERHR 1920
RVERGAEHDW GSLFCFLSAC LPLLVVEPGE KRSRRLFSLP GGAVETLALL RETKRRGAGQ 1980
DLAHLPFLEL LSSFVARVSS NGDSPSGPSD ATASPKRRRL GAKAEDGARG DKSRERRPRD 2040
HAVALKDWLS ELEKSSDSSR SDAPAALWGP TGVLPPLGLY LSPDEEIELD SDTDVHSALP 2100
SSSLSSSRWP LRASCVSPPC LPVFPRPHTR AVAAAVASAA AEFVAMVLKS RLGSPAVDEK 2160
SEKPRDVASR FLARSVNLSL RNKELEETTV RAWLDCTSHL ATATLARAWV DDGAKECARG 2220
DTAGSREEAA ASKALWVDHG RTPGRDRAFR DRRAGKDSSE PFADSAKGAA RQSQRFLRAG 2280
RSESPSNRAH TFSSSESRRS RFPDPSASCT SPSSFSSAAS SPLSSARGPC AGFPPLALPS 2340
SLLSSPGVAF PPPNQPGDRR ASSALGPSSL RRSLSCQEGV WGPHAPANKV SSQGRETLGS 2400
SVGRAWYEAA TDAWVAEFVR ENGKLGRKRF LCEKLGAARA ERLAKIKARL LAAEGEGPLS 2460
RHELLLMQRH QQLISLHREG EARLKRERRQ QASADSADSD ERGEGLGAWG AGKRRRARCG 2520
PGGEVAPDAF VSAPSVSSFS SASTEESSET ASLLGEKNVE RRDRYGSLSG ESFPPVSSGC 2580
PTLAPLPGTR PGSLGFPPPR EEPRGPCSPG AGPGRPPVSP ALGPLPWASY TPLQTSQSFP 2640
PSYSSGRNGP SGYSYASGVS SVRDAARVGF PSPFPGETSF ASGGSPFLTA LPSPSVSFAS 2700
PPLQPAPLTF AAQQSICVDA LLALARCRAS AGGTAAGAAG PGSWAGFSAL FAFLSERGMT 2760
PNSLAPFWAA IQRLFLSVGC VVSSADLPRV HGLVKAAVTR NPSLGEAVAK ELDAALEQSQ 2820
RKSERAEVLR TERGIRGDGP RQAEGREQGD RAQDEVETGK KDGEELRGED KGAETRRQDA 2880
AGDGRHVGTG SGLQAREGPG VSLRESTTHL FVDGEDDEGL EGGDTFPSRP TPTPGLLETT 2940
PPAGASCPVG VVSCSPAGDA SCSALSSPRF VPAASSALPE ESTRSAGWFF PSQNGRAFDS 3000
CLLSPPAGAA LQTHALVGLD GDVPACDDQV ERRPNAELLL PSCEDLGSRR EDASPRYLLH 3060
PANLPSLSPA QLEDASLLRL TCSGASAGDA FLAPEKKKAD EEEAAAVCGL FS 3112 
Gene Ontology
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR001214; SET_dom.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF00628; PHD 
SMART
 SM00249; PHD 
PROSITE
 PS50280; SET 
PRINTS