CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032734
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 SET domain-containing protein, putative 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_009460 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1738DPEQGILKWPLSVMSacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1906 AA 
Protein Sequence
MPLRTTVSPS SAETVQRVDG DELEITLQNE VDDEASANMA SARSALSRAE PGSGSCGSPY 60
AADAAERQSR ASLSIGDDST RKEGLTQVGV GPEEKGMQEA TAPLQPWGRG LSESRPPMNR 120
DGIATISATV ASLSVSPEDG TFSTGPRNVA GERANTSSYL TASDVGGCPL SQGLKDKQEG 180
TSSVAALTLV DGGAASLRSV RAAAKSLVRC SGRQLASSSR TPAGTKAEGV SWPSAPPGHA 240
GELAENAGPT QDVPGPLDYA AGVGEPCELS TRGDESSEAS APSCFGVLCA EGASETGALR 300
EKNAKMKDRR VNVPPTDKAS PQSSLVKKRT KASGAVSVCS GRSPCLASLS RGNDATKKRQ 360
GTTSPLPVRR VACKRPSVPK CRKGASAAGR RGREEDDCDR QFLRGQVKRA RRNMGDGQAS 420
AALSSHHPTL SPFQRSRAHN SPYHWLPLPH SSGSAVAGRG SLGGHCGCGG VSGGATGGRS 480
GGLSERVRRL VEQTMSEKHR VVWIDRGEAK GFLPLPGMQE ERVYLVWTEK GIDILVDVQA 540
YIPWTHCPVL MQTKYRRNQI PFLILSSTSL ALLFSHVCRR NATAVFRLPL ARTPQAYRKH 600
VLNLLRTGEA EKRSVQCSKS QRRRSPCSSP SPNARTKTAF PQGDKSPQLA DAASASVGDS 660
SDAPGNSPSA SRSASSPPPH SSTASEEQAT HSTSSSVSAA CVSLPDEGRD AKVTGDQNRP 720
VSLPSSLTAA SSLASSVGSS SPSQAESPPL ENALCTSVNE NNEKSASVEV VAEGSADQKG 780
QDQQTKGEKD AERRDSNAVC VGGICAAPRE RAAAGTVESS VTQTLRDETI SPVMRETEQA 840
HGLAATEAET KQEAGTGNGE HATSANIRSN DHEQGTTESP AIPHSSPCGG DQPASHSATA 900
WSSGSPSPGD RGYLHGSPGA SKDGSMSSID ADAASPSLQD RGEEEAEDAL DEDLFWANGG 960
KKGYPLTAVD MRDFYCCSQG KVIGEDYIYF RQYLPDDEGA DIYDLNVSVR KRPKKLLLKK 1020
SRAFANGGYS AGASALRRED RDSSPPSFLT KRSSFAASSS LASDNFAGTG SGSGDVSVHD 1080
SVHGENGPFG VGKAHAAGDG EGGEIELEAP GVVEVVNKDQ GACGGAGGRG QANKHHPAHL 1140
LRLLMAPNKG RSEADEAGTK EAEKADGNGR GGASGCKNWT YVDDYEDDGM DPALDLPYSF 1200
TSEVRVLLGL APSPTPHRCN MSLASHRVSG AVQRRFRIYQ CSPDGERREE EVDPFESALP 1260
SLQGAASDDE DEEKEETSEE RADDEKENEK TENKHWKDRG TCSVKGGEQG EKARDETPVK 1320
SERGSRSPTL PEDRKETGDG SLKREKGDAS GNVASLTENA LDEKPPPASV SLPAGRAEER 1380
IPSLEDGGKK GTEERGKKES CVGETDDATE KTRHSKDHDG GEATAVAEKK EQGLEEQDGN 1440
EEESGGQSRS KASSEHDDSE VNSDDDGPKG KARNKSPSFA LSSHDLPNGS LAEVDYPTEF 1500
SQFRESDRIR IYDKTFLSRA LQLIVDVAIV SRGADLSAVN VAQYAPNLFW NIIRFSRGNK 1560
DIDEVVRTIA PSVTSLRSAR GRRKRGEMEN SSFLSLAAGR FSASRRTGEF LRDAQAPSRW 1620
LKRSKTGQDD GAFCLETWLA GAGDDAAGGE RGRDREGAAD KAKQREERRQ KELEERFEEM 1680
KVEFEEKAQR MIARRAALTG EIYSDGKGSK KPRVPSLPEN DDDALIEIII DPEQGILKWP 1740
LSVMSIRQRT VIYQECLRRD LTACIHLTKV PGKGRAVFAA DTILKDDFVV EYKGELCSER 1800
EAREREQRYN RSKVPMGSFM FYFKNGSRMM AIDATDEKQD FGPARLINHS RRNPNMTPRA 1860
ITLGDFNSEP RLIFVARRNI EKGEELLVDY GERDPDVIKE HPWLNS 1906 
Gene Ontology
  
Interpro
 IPR001214; SET_dom. 
Pfam
 PF00856; SET 
SMART
 SM00317; SET 
PROSITE
 PS50280; SET 
PRINTS