CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031782
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 KH domain-containing protein 
Protein Synonyms/Alias
 KH domain-containing protein, putative 
Gene Name
 TGME49_041170, TGVEG_090080 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
960EAAQAALKKLQEMQRacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Hydrolase; Reference proteome; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1111 AA 
Protein Sequence
MQRDYRRPLF SDPAPGASDS SSACIQESCL KTEKRQSSLH RGLPPPQPLL PRQRCAAGQH 60
KVREAKETED TKGREEKDRK DRQSGGGQDR RRDSDKHRER GRGSDRGDRG QANRDDSRGD 120
EKDRDKKQGS KKKESKKRGE SKKSRRREES KSDRESRSRS RHRSSSRRRE KRKREEAKGE 180
VNGSTQKEES RRRSSVMSVE SLASKKKSRE RSQDRRAKKH RHSSRDRKAK KHRHHNSSRR 240
HRSRSSSDSE RRRSSSRSSS PPRRTAKNTT DPTRNSENDP FPADLQGPCF VKVLPQARDP 300
PVVLGIDNRG VLNLAKAHGC KLKLSAVSDL YPQTERRFLL VYGAEIGACV AALKGWVAKT 360
ADASDPKRSA AITFLVPDAA MSSVEVPDGE RKARTTTLED LRKLCGSRVK ISSRREMKKA 420
HGRERLVTLQ GSIESVQSGV EALATALQSF PELRDYMNVQ YVAYSQERQR RKRSPSPLPT 480
IPRVVIREPS MPSATVPGLQ FQRALQGVSV QDELVKITNL RGILDPTTPH MHGPAYAKIV 540
ISDLVTTLLL GTTTKEPNHS CPLRLIEATF KVAAKIMDPE SPGILERVLV LSGEPSDVDK 600
ATLAVLEQVY AACIMAGQPS QVTWRMCASN SAASLIIGTG GHRVKQLRTL SNTRIQINTR 660
DNVPNIDRFE RVIAVTGSFD SVVSVTKAML PFMHADTNHA SHVHQCYGTG RKLEMPDWVE 720
GVIHRDGMDE EACRRPPVPD KREVVLEGSC FMKLLIDNGL ANALIGEDSA NIQKLSEATQ 780
CNMKFADPDN VFPGCPGERI LMLAGSGDAM NAATIAIIEK CKEVHPNLSY DQMYGKVIIP 840
QSCCSAVVGH GGLKIREIRD ATLTRVEISK KGLLTSERLV TIFGMPQGVH TALITVCGLI 900
QFDPAVKAFL EVVYPPEVLE QQRKLEEERL SANVIGAVME GVNAVISKVG GDEAAQAALK 960
KLQEMQRAAA LAAGEAPPET YDFRGPLGPM GQIRQDNAAG PDYSAGISFV PASGASRPSP 1020
PGQSLFPPPP PPPGAPPGAL PSDGSAFFPP PEPPVPPLSR QELRQKILDA SLNRVATAAE 1080
AMHANPGYAN AYPDDDLEED PSLAGLMDEK F 1111 
Gene Ontology
 GO:0003723; F:RNA binding; IEA:InterPro.
 GO:0016740; F:transferase activity; IEA:UniProtKB-KW.
 GO:0004221; F:ubiquitin thiolesterase activity; IEA:EC. 
Interpro
 IPR004087; KH_dom.
 IPR004088; KH_dom_type_1. 
Pfam
 PF00013; KH_1 
SMART
 SM00322; KH 
PROSITE
 PS50084; KH_TYPE_1 
PRINTS