CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-031795
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_047450, TGVEG_004550 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position    Peptide            Type           References
2251        AKAAVAQKALLAARE    acetylation    [1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2303 AA 
Protein Sequence
MDDFSPGRRS PRGISREAAD RRRGSPRWPS RDREERRSPW RSSSSRGRGE EIFPSRGGVR 60
REGASPEPKS GRDLPRANSV QRDDELASRR RASGRRDSRE GVGRDRRTST QRDARGPLSP 120
PYGSQRRFER RSPSPYRGGE REARSRGEND RKRGRSRESS EGGRQVRRES RRIEEGEEGG 180
FSHFGYRGDE GRRDEPRGDR GGRGGEDWRR EDPGRRVLSE RDEEFASRDN GAGRDREARE 240
TDERESSLSR PLPRDKRSRD DSSERREDRQ GSRRGTVASL EDRSRRRVSR SPGQGKEKQE 300
RGSLSSFYSV REREEREGRE EREEKGERGS EKADELPRGE RRGGSVAEAS RRAEGEKDRF 360
SGGREETSRR RESPTGRRRL SSDSLRLPET GRDRSSDSIG ERGRRSPPRG KDDRDEPRRE 420
DDSGRGDNPP VGRRERSGSA QKNEARRSAT REEERTKTRR NRVSSLSPET DKGEPASVSF 480
QEAGKKPPRR LPLPPPPLVS RGGRGERDGE EDKAGPRPRR GSEDNSRENH KRMDDRNRDH 540
APQSPVAARR RQTSFCLRIE NLPSSPSRSE IADLLRDCCG IRVPIDMVEL FPADARAVVY 600
LQSQQQAESA QYRLHRRTLR NCRLDVFLGD RAAGRDSRDD DIPQRDKARE DSRGSRSPRR 660
RGASPGRGEE ARERCASPAG SSFRILRPRS PPGPTGGRAS SPVRRGRRSP PRPLIIRPGP 720
SGGAFPGGIE VLPTDFARGD RRRDEDDERR GQRGADRRGY GRGSLSPRGR SACGAREAGS 780
SRSPKRRRRD EDLSSDSRRR RTSRSLSRSP MNGDDKGPGL DRGRTRDDRS ASREGRKASV 840
SQSSRRGREA GEKEKGEKGS SLSPEGRGGR LSSSRPYTGP EETSREPKAG ACSNLGPRPR 900
SRSPPQGGSR RFGGLGGPHG STPGISIRLA GRGGSSRDYM DGRGPGSGGR GRSPSPHSRR 960
RGGSPRRGWR PPAPRFGLGS VSRSRSRASH SLTASEKSDR RERDEDTKGE KRRDSVVDGK 1020
SRRRSLSRRS QEREGGARSD TDERNKARGP SPLNRERASD VPGGERRQER DRSSGRLGAE 1080
RRDRSVKQNP QEKAMLRLHE RGRCTPCRQI IKGLECFRIS TCPYCHHAEH DPMYNRPGVE 1140
EPELEASSLL SLASAGLQAL RDGVDAGAGQ QGEEESRPRG RSASLSPSAK SQTKPGGGNS 1200
ALSPMVLKQH LAGTCLPCSE YKQDRCRRGD SCPFCHHSDH LPKGEKAKGE PKKKEAKKLD 1260
SKEQQERYAL ERHEKGWCRP CKKYFAKNAE CPAVKKNQPC LYCHHEDHRD DMPSRIQSRQ 1320
PSPERASPPA EPLKMEEAPP QDTSAPLLAS PPHSLHSPSS LCSPTAQGAM GEQGNAGPAI 1380
LRPGDPATRL ENFVFADPAQ QRKHEKGLCL PCKFYVSGRQ CYKVEQGCVY CHHPSHLSLA 1440
VQGLSQHADG TGLPCLQKVR EQHEQGVCRP CVHHFVPGLT CAWGVQCLLC HHPQHGDRSS 1500
PSFYLKWEHD RNVCLPCAKA HVSGHGGCPE SDNCGYCHDP VHSDPTSELH FARVLHARGV 1560
CRPCQQFLQG TCPLEQLCCP FCHNPQHLGG FAAEKEGERE RGAEGAGPPG RGAVVPVSKI 1620
GPAAPPGHAL FMRPPGTDGR TAPGLPFPPQ QPAGNVPLSH PGLPGPPMGH PPPLAHPAGV 1680
GAAGMFPGPP HHPVRPAFVY GGVPPPSGTA VAGSPFPGQP PMFAGAGAGA ALPGNPTKPQ 1740
KEEREGAGEA GPGEGVKESL GLPVGVIAAP PVLGGAPKEE EQGAKEEEEL DPVAAAAAAY 1800
AKEQEARQVL LQNQSVMQTP QSMTGPSPPG GSPWGVPYMQ PPPPGAAPWG APGAPHPPSG 1860
SMPAGTPQGP HPIGVGPPPP GPPGTAARPS HLLPTPPGAP GVPPPGSRQQ AQMGPPLAMG 1920
GDRAMPSMPG APPPGAAPPA PMYGMAPAMG RPPAPAQAPG SFCGPIGSPA GIRPPQGPPG 1980
AVGTGGPPGY MGSESSFGVS HLAPGQRPAP PHPHGSPAPF ASPSAQKPSP PPLMGWGAQP 2040
LVASQPPPPG GHLTPQGDAG QGVGLAGPPP GMSPAMGTGA PPPPPPPPSG SPSDGFGDSS 2100
AQASGSSFQA SLAAKNPAAA AMLEAAQQAA QAAASKIGSS NSSPPPPPSQ SFLNQSSRVP 2160
GMSPAAAAAA AAAAALAAAA MAAKKKGEEA QKVRDMEQPQ GSKPSQAMAG LPAGHATMGT 2220
PLPSPFGFSV GSRSGGLASV VEAAKAAVAQ KALLAAREAQ EAPTAAPDNP QNLPAGVLAA 2280
AEAAKAAAAA ISASLAGVGG FTG 2303 
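The reported modification site can be cross-checked against the sequence listing. The following is a minimal sketch, not part of the CPLM record: it assumes the layout used above (residue groups separated by spaces, each line ending in a running residue count) and uses only the last two sequence lines, which cover residues 2221 to 2303. The helper names `parse_block` and `window` and the constant `TAIL_START` are hypothetical, introduced only for illustration.

```python
def parse_block(lines):
    """Join sequence lines written as space-separated residue groups
    followed by a trailing running residue count."""
    residues = []
    for line in lines:
        tokens = line.split()
        # Drop the trailing running count (e.g. "2280") if present.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        residues.append("".join(tokens))
    return "".join(residues)


def window(seq, position, start=1, flank=7):
    """Return the residues within `flank` positions of the 1-based
    `position`, where `start` is the position of seq[0]."""
    idx = position - start
    return seq[max(0, idx - flank): idx + flank + 1]


# Last two lines of the sequence listing (residues 2221-2303).
tail_lines = [
    "PLPSPFGFSV GSRSGGLASV VEAAKAAVAQ KALLAAREAQ EAPTAAPDNP QNLPAGVLAA 2280",
    "AEAAKAAAAA ISASLAGVGG FTG 2303",
]
TAIL_START = 2221  # assumption: position of the first residue in tail_lines
tail = parse_block(tail_lines)

site = 2251
print(tail[site - TAIL_START])                 # residue at the reported site
print(window(tail, site, start=TAIL_START))    # 15-residue window around it
```

Under these assumptions the residue at position 2251 is a lysine (K), and the 15-residue window centred on it matches the peptide listed in the Lysine Modification table.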
Gene Ontology
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR000571; Znf_CCCH. 
Pfam
  
SMART
  
PROSITE
 PS50103; ZF_C3H1 
PRINTS