CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032662
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_095170 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
2105GGDGNALKKGGTVDTacetylation[1]
2677RQKAGTKKETKSEKCacetylation[2]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
 [2] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2855 AA 
Protein Sequence
MQNEALARGS LPCPPRHKPA GFAACEGAAQ EDLAAPDSTG CSQASVSTLS TTAATTASPV 60
TDHLGNLCAS GVSFSPPPSR PFLRVDGSSS MPACSLSAAP GCVEGCASGL TSPVTQRLLF 120
PPFSASRLPS REGATCASAT GELGAPPGKA REEGGACHFE PGERGGLSTP VPPLTEPPAF 180
AVMQANRVSQ ERPFSFHPYG EAGRRDLVAA LPCQASSSAF LPFADRPGLV QMAGEVYVQP 240
TFAPARLPLA TPNRPAEVGE ESAHATQATH AGPPSTVEGD GAAVAALAEG RQAGRLQASL 300
GAAVCHPPQP GFQPPGLPPH VPGQTTQGSG SFSSFSSFAL PRPDPSVGVR GEGRGEQEDL 360
GSGVRTPSSV LRSSSRTSSG LATLRSSSVA LRGVQGCAPS AAFQEVGCLR HAPCLSPSLA 420
VGSKAVASAP FPKGAESGAA GPGSAYPPSA SLHPTTAGSC GPQPVQGAPS ASAPASMASG 480
TPPLFVPASA PFSSVCTSGS DQAGSCGFPP STTSRSASGH PPGVSPSFGL YPPCFLSKLS 540
PSAAANPLPP HLLLAHPPGS EATVPLSASG ASLAQSAAPA PPFLPPPSFT ATLAPPNLPA 600
PSPASSAHAP HALRAQFPPA PIQNSPPAGP PSLPAASARP GVCTPGGPEH AASSALGTPG 660
PATGLATSLH VGSSVAAVWS ALCSAGLNPG CDLWHAKQVL CQRESRPLRL SRFEMVEAYA 720
KSTLCLSCDS MEHKITNCPF GEFVCPNCHR SSHRGEHCPL PCRFCFECHS GISVNECIRR 780
TVRQPLERLL GVKMPLDAAL CRASGSSPFA SSVGDGGKAN GRWTVDDLSL HVADRPNTAH 840
GRSVYVSNMV PGTTKEALRT AINLLLEHGC VLSVEMRERS NLQPYAFVEL STLQAAYELV 900
QQKKTALVIR DQQLKVQFKK IGLTCTSTAR LRLTSDASHM GLEDDAGHGK PGVSGVCTPE 960
TGRDSATSEA SVERDRAQLA QVLLQSTEPF LPTGQSVHEV CRLVAVKLAK DYGLSSPLLL 1020
DAVPSAFFPR GPGAAHALPY YHLQNSLNAA HPQVFASCVS REETPGEDRF GRVPRGSVGS 1080
LGGQPRNEFG EKQGPFPSDS PFSSAASSSS FSSSAGAPGV RRVSEQAGRQ SDPLSLSGGS 1140
GRGSLTHFRA GQEPVPESSA ASAACGLSSA LSGVRTPLET SANAHFSSGG HPLERGAGDA 1200
GGAETQGRSE MSLSDSLERQ SSRTEIAGQS LGMPRPSREV PSRFWGADDG GALCLSDLET 1260
ENQRVHDGDR LGPRTVTGYP YPPFRANGQN REEETRRRFD AEFLPAHLGA TLAEGSNRHQ 1320
LAVHADRGDT QPERREALGS ILGRDADRRC EAARQGQRDG NRDSLSKGYG EAQGKCGDEC 1380
GTAGGSLNRT RYACDGDESP QRSTSRHDVD GGCLAAAAKN GVSKTSLTEH EFLSLKGCID 1440
GNVALQVGFG VSGSSWGRIE DRKLEGDASL SSFASAPSAL LSAPSTSGVT PAGGAPHLGR 1500
DAGSEAPSRQ VYAHPTPGVA PLSVHPKDFA DVCDRQGERG MPANEGLQEF SAGDRPAAFQ 1560
ARVASGCANS VARCPGLLPP GLSPLFGRHD VEHLQRRGAS CEGRTSSFDV EDLSRQRTSS 1620
FFSSLFSFSE NEGSGLAEAR RMETTETVAE LLGSADDGRQ EAQREKREDE KGQHTHGSRQ 1680
RGKGEAEIPK CGDPDAETAR EQGITSRSSL FLSSSAGSTL SSASGGAGTT PGTANDTPLS 1740
FVSSSSSSSA SSLSSSLASS LSSGPILTAS SPPSGGGRTP QHDVDGLAAF SGVMTPRRLS 1800
EDPAVALAAE PSQPQPVSFL SSWGGWMTRS RDSDPSRVDA TIYAKDEHLC DSTRRLSSSL 1860
SSLSSSFSSF ASPLSASHAS HTSACLSAPD GGAQRASLTN DGTLFGGGSS GDAEAGPSEG 1920
PRQADSGRPC LHEASRETLS ARTRGCAPPA GPRAWEEESK RETRDDTAGD AGRDMRREGD 1980
ARLFRFPFLA RSETEAAADF APGPSCGVSP FSSERGALAR GSSLTLPLSL STRSTEEEDE 2040
GREPERRTKL KTGEKAWMRS SSAREASETE VDANSTLVPL SQTFAKSREE RFRREEEGGD 2100
GNALKKGGTV DTATVRTGSP SLSSASTACL RKQVEFKSVY AEGQSRSLRF CLATAEALAA 2160
RDEEASPFFF FPLSQTPVEA PASTATDVSG LLEGKDARAG RAEATARNGE RFEGERGACW 2220
PGEGREASAA SLLLRSFANS TEEPESLPVS SSSLFPAGRE STSSLLLSSA GGAEARRLGA 2280
PEKSCEREEA PRSPEGDEHP TLSHLAAHQQ GLPPLVGAGL DSSLGVSFPP LLSPSSPLLR 2340
CQQTGEHSIS SPEEKAARCG SGRTADGQAA ARRGQQRGEE TPRNGGEHSG EAAAQLASFS 2400
LHAATPSEKD KRSRFRDDGG RIHRREGREE ASQKLERRRE GDATSGDEET KGEETRERGE 2460
AREAERTGNF STEREAKFED AEERPALSGW QTQRTSLRER EEAALSVDFA SASGKGAPWR 2520
REQGEDSARG SVAVKKGQVA GDENAFLLFT SQEQERERVV CAEDERQRIT TKVAREREAE 2580
GERVRKELAS EKARAPNDAG SPGSSPEGGC QVSKDAKAEE WNAKTGSGGI SLTATVGMKT 2640
GKETLEARGC LSRSFAGGDV SRDTDFQSLR QKAGTKKETK SEKCLPRCAS LVAEAPKATA 2700
GEQLSDESRN LREEPTLEKA RETNEMRTEE TVGQSPAQGG AAEGRRIERE ETLEDGEEKT 2760
RLCEDGSTGE WRRNERQEEG QKKEADDGLG SNGKDTSKKQ EPGRVKTVDF SGDHGSNVDV 2820
ESLLERIDGI TRQLPPSVLD AVLSSLASAR SPAER 2855 
Gene Ontology
 GO:0003676; F:nucleic acid binding; IEA:InterPro.
 GO:0000166; F:nucleotide binding; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR012677; Nucleotide-bd_a/b_plait.
 IPR000504; RRM_dom.
 IPR001878; Znf_CCHC. 
Pfam
  
SMART
 SM00360; RRM
 SM00343; ZnF_C2HC 
PROSITE
 PS50102; RRM 
PRINTS