CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032734
UniProt Accession
B9QI05_TOXGO
;
B9QI05
Genbank Protein ID
EQ970689
Genbank Nucleotide ID
EEE29919.1
Protein Name
SET domain-containing protein, putative
Protein Synonyms/Alias
Gene Name
TGVEG_009460
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
1738
DPEQGIL
K
WPLSVMS
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1906 AA
Protein Sequence
MPLRTTVSPS SAETVQRVDG DELEITLQNE VDDEASANMA SARSALSRAE PGSGSCGSPY 60
AADAAERQSR ASLSIGDDST RKEGLTQVGV GPEEKGMQEA TAPLQPWGRG LSESRPPMNR 120
DGIATISATV ASLSVSPEDG TFSTGPRNVA GERANTSSYL TASDVGGCPL SQGLKDKQEG 180
TSSVAALTLV DGGAASLRSV RAAAKSLVRC SGRQLASSSR TPAGTKAEGV SWPSAPPGHA 240
GELAENAGPT QDVPGPLDYA AGVGEPCELS TRGDESSEAS APSCFGVLCA EGASETGALR 300
EKNAKMKDRR VNVPPTDKAS PQSSLVKKRT KASGAVSVCS GRSPCLASLS RGNDATKKRQ 360
GTTSPLPVRR VACKRPSVPK CRKGASAAGR RGREEDDCDR QFLRGQVKRA RRNMGDGQAS 420
AALSSHHPTL SPFQRSRAHN SPYHWLPLPH SSGSAVAGRG SLGGHCGCGG VSGGATGGRS 480
GGLSERVRRL VEQTMSEKHR VVWIDRGEAK GFLPLPGMQE ERVYLVWTEK GIDILVDVQA 540
YIPWTHCPVL MQTKYRRNQI PFLILSSTSL ALLFSHVCRR NATAVFRLPL ARTPQAYRKH 600
VLNLLRTGEA EKRSVQCSKS QRRRSPCSSP SPNARTKTAF PQGDKSPQLA DAASASVGDS 660
SDAPGNSPSA SRSASSPPPH SSTASEEQAT HSTSSSVSAA CVSLPDEGRD AKVTGDQNRP 720
VSLPSSLTAA SSLASSVGSS SPSQAESPPL ENALCTSVNE NNEKSASVEV VAEGSADQKG 780
QDQQTKGEKD AERRDSNAVC VGGICAAPRE RAAAGTVESS VTQTLRDETI SPVMRETEQA 840
HGLAATEAET KQEAGTGNGE HATSANIRSN DHEQGTTESP AIPHSSPCGG DQPASHSATA 900
WSSGSPSPGD RGYLHGSPGA SKDGSMSSID ADAASPSLQD RGEEEAEDAL DEDLFWANGG 960
KKGYPLTAVD MRDFYCCSQG KVIGEDYIYF RQYLPDDEGA DIYDLNVSVR KRPKKLLLKK 1020
SRAFANGGYS AGASALRRED RDSSPPSFLT KRSSFAASSS LASDNFAGTG SGSGDVSVHD 1080
SVHGENGPFG VGKAHAAGDG EGGEIELEAP GVVEVVNKDQ GACGGAGGRG QANKHHPAHL 1140
LRLLMAPNKG RSEADEAGTK EAEKADGNGR GGASGCKNWT YVDDYEDDGM DPALDLPYSF 1200
TSEVRVLLGL APSPTPHRCN MSLASHRVSG AVQRRFRIYQ CSPDGERREE EVDPFESALP 1260
SLQGAASDDE DEEKEETSEE RADDEKENEK TENKHWKDRG TCSVKGGEQG EKARDETPVK 1320
SERGSRSPTL PEDRKETGDG SLKREKGDAS GNVASLTENA LDEKPPPASV SLPAGRAEER 1380
IPSLEDGGKK GTEERGKKES CVGETDDATE KTRHSKDHDG GEATAVAEKK EQGLEEQDGN 1440
EEESGGQSRS KASSEHDDSE VNSDDDGPKG KARNKSPSFA LSSHDLPNGS LAEVDYPTEF 1500
SQFRESDRIR IYDKTFLSRA LQLIVDVAIV SRGADLSAVN VAQYAPNLFW NIIRFSRGNK 1560
DIDEVVRTIA PSVTSLRSAR GRRKRGEMEN SSFLSLAAGR FSASRRTGEF LRDAQAPSRW 1620
LKRSKTGQDD GAFCLETWLA GAGDDAAGGE RGRDREGAAD KAKQREERRQ KELEERFEEM 1680
KVEFEEKAQR MIARRAALTG EIYSDGKGSK KPRVPSLPEN DDDALIEIII DPEQGILKWP 1740
LSVMSIRQRT VIYQECLRRD LTACIHLTKV PGKGRAVFAA DTILKDDFVV EYKGELCSER 1800
EAREREQRYN RSKVPMGSFM FYFKNGSRMM AIDATDEKQD FGPARLINHS RRNPNMTPRA 1860
ITLGDFNSEP RLIFVARRNI EKGEELLVDY GERDPDVIKE HPWLNS 1906
Gene Ontology
Interpro
IPR001214
; SET_dom.
Pfam
PF00856
; SET
SMART
SM00317
; SET
PROSITE
PS50280
; SET
PRINTS