CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031795
UniProt Accession
B6KGY9_TOXGO
;
B6KGY9
;
B9QDH0
Genbank Protein ID
DS984732
;
EQ970684
Genbank Nucleotide ID
EEA99971.1
;
EEE31849.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_047450, TGVEG_004550
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
2251
AKAAVAQ
K
ALLAARE
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2303 AA
Protein Sequence
MDDFSPGRRS PRGISREAAD RRRGSPRWPS RDREERRSPW RSSSSRGRGE EIFPSRGGVR 60
REGASPEPKS GRDLPRANSV QRDDELASRR RASGRRDSRE GVGRDRRTST QRDARGPLSP 120
PYGSQRRFER RSPSPYRGGE REARSRGEND RKRGRSRESS EGGRQVRRES RRIEEGEEGG 180
FSHFGYRGDE GRRDEPRGDR GGRGGEDWRR EDPGRRVLSE RDEEFASRDN GAGRDREARE 240
TDERESSLSR PLPRDKRSRD DSSERREDRQ GSRRGTVASL EDRSRRRVSR SPGQGKEKQE 300
RGSLSSFYSV REREEREGRE EREEKGERGS EKADELPRGE RRGGSVAEAS RRAEGEKDRF 360
SGGREETSRR RESPTGRRRL SSDSLRLPET GRDRSSDSIG ERGRRSPPRG KDDRDEPRRE 420
DDSGRGDNPP VGRRERSGSA QKNEARRSAT REEERTKTRR NRVSSLSPET DKGEPASVSF 480
QEAGKKPPRR LPLPPPPLVS RGGRGERDGE EDKAGPRPRR GSEDNSRENH KRMDDRNRDH 540
APQSPVAARR RQTSFCLRIE NLPSSPSRSE IADLLRDCCG IRVPIDMVEL FPADARAVVY 600
LQSQQQAESA QYRLHRRTLR NCRLDVFLGD RAAGRDSRDD DIPQRDKARE DSRGSRSPRR 660
RGASPGRGEE ARERCASPAG SSFRILRPRS PPGPTGGRAS SPVRRGRRSP PRPLIIRPGP 720
SGGAFPGGIE VLPTDFARGD RRRDEDDERR GQRGADRRGY GRGSLSPRGR SACGAREAGS 780
SRSPKRRRRD EDLSSDSRRR RTSRSLSRSP MNGDDKGPGL DRGRTRDDRS ASREGRKASV 840
SQSSRRGREA GEKEKGEKGS SLSPEGRGGR LSSSRPYTGP EETSREPKAG ACSNLGPRPR 900
SRSPPQGGSR RFGGLGGPHG STPGISIRLA GRGGSSRDYM DGRGPGSGGR GRSPSPHSRR 960
RGGSPRRGWR PPAPRFGLGS VSRSRSRASH SLTASEKSDR RERDEDTKGE KRRDSVVDGK 1020
SRRRSLSRRS QEREGGARSD TDERNKARGP SPLNRERASD VPGGERRQER DRSSGRLGAE 1080
RRDRSVKQNP QEKAMLRLHE RGRCTPCRQI IKGLECFRIS TCPYCHHAEH DPMYNRPGVE 1140
EPELEASSLL SLASAGLQAL RDGVDAGAGQ QGEEESRPRG RSASLSPSAK SQTKPGGGNS 1200
ALSPMVLKQH LAGTCLPCSE YKQDRCRRGD SCPFCHHSDH LPKGEKAKGE PKKKEAKKLD 1260
SKEQQERYAL ERHEKGWCRP CKKYFAKNAE CPAVKKNQPC LYCHHEDHRD DMPSRIQSRQ 1320
PSPERASPPA EPLKMEEAPP QDTSAPLLAS PPHSLHSPSS LCSPTAQGAM GEQGNAGPAI 1380
LRPGDPATRL ENFVFADPAQ QRKHEKGLCL PCKFYVSGRQ CYKVEQGCVY CHHPSHLSLA 1440
VQGLSQHADG TGLPCLQKVR EQHEQGVCRP CVHHFVPGLT CAWGVQCLLC HHPQHGDRSS 1500
PSFYLKWEHD RNVCLPCAKA HVSGHGGCPE SDNCGYCHDP VHSDPTSELH FARVLHARGV 1560
CRPCQQFLQG TCPLEQLCCP FCHNPQHLGG FAAEKEGERE RGAEGAGPPG RGAVVPVSKI 1620
GPAAPPGHAL FMRPPGTDGR TAPGLPFPPQ QPAGNVPLSH PGLPGPPMGH PPPLAHPAGV 1680
GAAGMFPGPP HHPVRPAFVY GGVPPPSGTA VAGSPFPGQP PMFAGAGAGA ALPGNPTKPQ 1740
KEEREGAGEA GPGEGVKESL GLPVGVIAAP PVLGGAPKEE EQGAKEEEEL DPVAAAAAAY 1800
AKEQEARQVL LQNQSVMQTP QSMTGPSPPG GSPWGVPYMQ PPPPGAAPWG APGAPHPPSG 1860
SMPAGTPQGP HPIGVGPPPP GPPGTAARPS HLLPTPPGAP GVPPPGSRQQ AQMGPPLAMG 1920
GDRAMPSMPG APPPGAAPPA PMYGMAPAMG RPPAPAQAPG SFCGPIGSPA GIRPPQGPPG 1980
AVGTGGPPGY MGSESSFGVS HLAPGQRPAP PHPHGSPAPF ASPSAQKPSP PPLMGWGAQP 2040
LVASQPPPPG GHLTPQGDAG QGVGLAGPPP GMSPAMGTGA PPPPPPPPSG SPSDGFGDSS 2100
AQASGSSFQA SLAAKNPAAA AMLEAAQQAA QAAASKIGSS NSSPPPPPSQ SFLNQSSRVP 2160
GMSPAAAAAA AAAAALAAAA MAAKKKGEEA QKVRDMEQPQ GSKPSQAMAG LPAGHATMGT 2220
PLPSPFGFSV GSRSGGLASV VEAAKAAVAQ KALLAAREAQ EAPTAAPDNP QNLPAGVLAA 2280
AEAAKAAAAA ISASLAGVGG FTG 2303
Gene Ontology
GO:0003676
; F:nucleic acid binding; IEA:InterPro.
GO:0008270
; F:zinc ion binding; IEA:InterPro.
Interpro
IPR000571
; Znf_CCCH.
Pfam
SMART
PROSITE
PS50103
; ZF_C3H1
PRINTS