CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
 CPLM-032709 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_046360 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
760       VGDRGMGKLTSSSSQ  acetylation  [1, 2]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1915 AA 
Protein Sequence
MGTLKAPDRL RGRKAASSVP SPSRPSVCSS SDDAALATLL KERQVFKRTE ELLRKRAQEA 60
ETLQDEGEDM ENSASSRKRD EEDDFLLSNS EGSAAGSPLL LSRCSPFDAD AAVDCLHLLR 120
QGLDSQTGVP AVQGKGDQTN LLSTSLHTLL MCVCKEAASC SEAPDNERTR RITTGLSLLF 180
ALLVDEGLPQ SPAERHRKCH EDAALLDNLL KAIEAVSSFM QAASPRPLLL LLHALRRMYV 240
PSQGADEQEQ TTCGCLPTRA VWTCLLHLLT EDPRASVRWL ARQVALHFLR LATQQVTAQT 300
SNVSQALGDR KKKKARDSLQ TCLYVLDRWL RHLDFSVVPD SVASRASASS ALTAGDEGSS 360
QKASSVPASM RLQRSLPFFH RCLPPLLCLE SVQSEDEKRS VTADSRACLP QQPHKAAEAL 420
CIHLAFLSRK AGRTPAAADA LRCVAAVIRD LRRFRGNLEQ TNNGDSSASV SSKETNRRTG 480
DDRSGGRQAT TGSAFSLTVQ LLPALLQQQR AQGAQRGLWD ELGGEKGKGS GAVLQGAVQY 540
ATAWLDAVTA TTAALMHWAG EAREAWTPPQ AVGEAPLDGC NRGETSGVTN QEEENLDETL 600
LLQQLEKLSL GGSGSDAQIK TLIKDSEGFP LSSSSAASVS LSSYRLSICM QASERVFRSL 660
RRVLESETDP SILGAASVAG RRLIEASGAA SVPAFPAAVP FCLSLLHFRH KRRLAEGLEA 720
SRALFAALEA VACKQFLVCS LHPELEHFRR PLVGDRGMGK LTSSSSQEFD GKCLPQFLHK 780
FLDIYRPCFT PLLRQIVSML AVVETGEEPE QVTERQDEKK RVERKTKTVE EIFGLAKDNR 840
ARTDLRQHDV LPYRGRIRVA FSAALRAFKA RTVLTDPHLL PIRPRIPLSR DLSPQQLGFL 900
LERCPVTNAW ALPLLPRLLG RDELHFFSSH FLQLARLLQQ QTAEKAKTSP LEARELCRVG 960
QQLWSLLPGF SYDPIDLATG VAVEDFSLMK NMIQFLHDPV LREEICSTIR NLSNAAISES 1020
DVASIRDDDE DDDKQQRLER EERDDAEKDV MSVLCRVTKP CGKESMQACG SQLMGFLVVR 1080
FLQVQKVSLK NDEEAAGERA LAAMLLGEKE DLEQLTQSAK MKATQHLLDA IKAYAPLCPP 1140
ELLHANVRRF TSAFLRRANE AAGLSAPSAA APVTMGVAEC SALVDITDAL HPFLPSSVGL 1200
EVVDSLRTLV AVCAQGLEKA GSGYSEQSID LVETEKRGKN SDACRTEDNG AVLRALLRRG 1260
YKALKHVFER SASSSRASSL ADGSASESCC FLRRQELEIL WEILAKTRTV SCGTAAEKPR 1320
LGCLRAFVEC LTLSAQTAED SEDASTQAFW ESFLVDRVFP TVVPEIILSL KSLNRLVRET 1380
AFCSLSGLCD ACDGDSEKLS RLVILIATGL GGGTGVTARG GTAEPPLLKT AAVLALSRVI 1440
FSYAEQMDGA LIKQIAEVVL LLLQDRDKQV FLAALRFARV IAHVFDGQAL ERFLPAMLCA 1500
LNNKHAIRAK MKIRRLVEKL LKKLGEAAVK EAFPSEHLPL LRHLQKSVRK QQLRELLVNK 1560
ALQEGLSWDE LVFKPGDDGE DENKNAGRRQ GKKENNAFSK MMEEDDEEDE DDEDRPSAAK 1620
KAAARKRGAN AELQGEEEEV AEEQTENKDG STSFSAMTQL LDAFEEEDDS DTEGRRNRNR 1680
KRRRATSGDQ AMESDVMLLE DNAGELPLDF CSPAAAHRII LNQPLSKHRR RLTTEVERKT 1740
VANLTWNEEG RLVIPEDEDE ESDKDPGFTI GKVSSVSKRS AARRVEGGSK SSSTSLLAKN 1800
RKAPETGGQR KNLSCLAARR AAMKTAKKER RRGHFVQKSG DEFKAKKARG DVKQNNKVEP 1860
FAYIRLNRVM LREKHRPQAL QSLATLVKKR RGGDDKGKSK TGVRKPMNKA RKGRR 1915 
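The acetylation site listed under Lysine Modification (position 760, peptide VGDRGMGKLTSSSSQ) can be cross-checked against the sequence above. The following is a minimal sketch, not part of the CPLM record: it copies only the sequence block that ends at the 780 marker (residues 721-780), and the helper names `residue_at` and `window` are hypothetical. The 7-residue flank is an assumption inferred from the 15-residue peptide length.

```python
# Residues 721-780 of the protein sequence, copied from the block
# that ends at the "780" marker in this record.
SEGMENT_START = 721
SEGMENT = (
    "SRALFAALEA" "VACKQFLVCS" "LHPELEHFRR"
    "PLVGDRGMGK" "LTSSSSQEFD" "GKCLPQFLHK"
)


def residue_at(position):
    """Return the residue at a 1-based protein position within the segment."""
    return SEGMENT[position - SEGMENT_START]


def window(position, flank=7):
    """Return the peptide spanning `flank` residues on each side of a position.

    The flank of 7 is an assumption based on the 15-residue peptide
    shown in the Lysine Modification table.
    """
    i = position - SEGMENT_START
    return SEGMENT[i - flank : i + flank + 1]


site = 760
print(residue_at(site))  # residue at the annotated position
print(window(site))      # 15-residue window centered on the site
```

Under these assumptions, the residue at position 760 is a lysine (K) and the surrounding window matches the peptide reported in the table.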
Gene Ontology
  
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR012978; Uncharacterised_NUC173. 
Pfam
 PF08161; NUC173 
SMART
  
PROSITE
  
PRINTS