CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-031737
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGME49_112630, TGVEG_096740 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position    Peptide            Type           References
2035        NYDGSEEKTAIRCLE    acetylation    [1, 2]
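
A row of the Lysine Modification table can be split into its four fields and mapped back to protein coordinates. The sketch below is illustrative only: `ModSite`, `parse_row`, and `peptide_span` are hypothetical names, not part of any CPLM tool, and the code assumes that the peptide is an odd-length window centred on the modified lysine and that Position is 1-based. Both assumptions hold for this entry, but they may not hold for sites close to a protein terminus.

```python
import re
from typing import List, NamedTuple, Tuple


class ModSite(NamedTuple):
    """One row of the modification table (hypothetical container)."""
    position: int          # 1-based position of the modified lysine
    peptide: str           # flanking peptide, assumed centred on the lysine
    mod_type: str          # e.g. "acetylation"
    references: List[int]  # reference numbers from the Reference list


# Digits, then uppercase residues, then a lowercase type, then "[n, m]".
# Whitespace between fields is optional, so fused rows also parse.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[([^\]]*)\]\s*$")


def parse_row(row: str) -> ModSite:
    match = ROW.match(row)
    if match is None:
        raise ValueError(f"unrecognised modification row: {row!r}")
    position, peptide, mod_type, refs = match.groups()
    ref_numbers = [int(r) for r in refs.split(",") if r.strip()]
    return ModSite(int(position), peptide, mod_type, ref_numbers)


def peptide_span(site: ModSite) -> Tuple[int, int]:
    """Return the 1-based (start, end) of the peptide in the protein."""
    center = len(site.peptide) // 2
    if site.peptide[center] != "K":
        # Assumption violated: the window is not centred on a lysine.
        raise ValueError("central residue of the peptide is not K")
    start = site.position - center
    return start, start + len(site.peptide) - 1


site = parse_row("2035 NYDGSEEKTAIRCLE acetylation [1, 2]")
print(site.position, site.mod_type, peptide_span(site))
```

Under these assumptions the peptide for this entry covers residues 2028 to 2042 of the 2638 AA protein.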
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907]
 [2] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2638 AA 
Protein Sequence
MYLFIWVKRN PDAGVATAVQ SVIQHQTFKR MLLFGLRSLA DFCSPSNQLY QENALDALDR 60
GVLSAIQTAV TTFSDDDDLM LCASRVLWAM SVAIKEEMDP AHIARVHSEG SPVIVAVVNS 120
SPTDPQTIED SMNFVDNLKR AGAPVDGASL AGGMLSIFTK TALDMKTAKR VTAALAIAAE 180
TAEGSTALYN AGGTSVLLTY CLDQGDLSDA GVEMVEGAFD TVRYMAGYQC TDATTLPQCI 240
ALMDKYRGRK SASAKGSSAL AAMIGPEQLQ KCLNTLKTAE AGSAEYDEAL VTLGSMSYIS 300
SFTDEIVRAG GVPLLIELIN SGLPQMEGNP EKIASMISGA AKMLARIASN PVNVDAIVQA 360
GGVATLCTAV SYCTESMEAL GALCMALVPL ASRESLAHEI VQYQTFATVL PILYQNVESP 420
EIAALAMELV ATGSQHEEIQ EHMLQNQAAE ICSLCCQYHT ADASYQQHAI SALNRLVPRL 480
TTLHGVSEYG GIQGVIASLN ANVNNEQVAL LAVQLLDNFS EVSDAKTYMS DGTCVDAVLA 540
AMLEHEGNDL LISAGVHCLA RIATEDDCAR HLNVLDTAIQ TARGNPDGVY RVLAAISGLS 600
RVPSLRQIFE EKNASDTILA GISSWIECSR FEGQNRIIKA ALKTVKNMKI SGDGDLTSCF 660
AAMCDVACLP QVKRVVELEE PDNNILVADT AAFRDLAATM RITGAENLER CIESVLRVMR 720
KYPDSRRAQL NCLETLNYLA QCDGGEGVAI LSRTGGLNAV VQYLTRAPMY LDAQIAGFTV 780
LATSAKIDSN VGETLRKCNC LQALKVAMRT HAKSKELKRT IAPLVALLMP TDALETEIQE 840
LLNECASACE KNNFPHLHEN LAALNELLIS SEGAKIAARL GIGAHMCKYQ EYISAHEQDA 900
LAVTDYDILG KDLFDATVSE CAHAMEQVAS TRSGRNALIK AGNVATLISL YESLKAPQSQ 960
YSEEAAIHCL EALRILLKSD KRSAELAFER NFVSTLCVGI DSFPHSAPVL GATCACLAAM 1020
ATTPERVQML TAQPAFESLL QKLVFVIQND PSKDNKLVAM RALQELVEIT NDATMANKIA 1080
EAGAVTALFR IIDEYGDDEQ LTVQAAEVLA LLGAFEDLRR FYDNDVRFPA QVLTAALTKQ 1140
KNNETAVVHL LDVLNKLATS EDRAVLRELG VMEQVADAMR VHSESEAVTR LGGELFAKMG 1200
ADEQIKSLML QIIETVESGA EDTAQTVDIL CGRLAVFLAA PLEDPRDALQ HTEKCLGSLV 1260
ATLQTYPGSE RLEGNVALVC RRLCDRCFDD ADDPYGAWAV AASGMLAQFA GMVAGETVLA 1320
NKKFLGPAYR TFTACCANAY CMPTMVEVAP SFLPQTYTLL EMHKNDAETV ARVLEFLRYF 1380
AEDPTACGLI VQNMSGSSGD VVALTVLLMQ QHQNNDAVVC AGMEFLGALA YTLSQAGYEP 1440
LPTLADGSVL RDCDALMGSN SSSARQLAHM HMIEKMLLSK AYNDALIQEQ ALKKLTMSLK 1500
AEDDKKRFSD EERLYAAMAC VLLAAGGAGL TGEMEKFNGF EVVLQAIEEF GENPTVIKEV 1560
NRALQGLSMA DVNMTARTVK EAVPKLCTEA TTAIQTDAEC ADTFCDLMLQ LVSQEGNGRQ 1620
LLQVYGLEET LQGVENLAAY YGEDFGTQLS EKVAMIRQAM EDDQPREKTC KDVYDLLNSR 1680
VQQGLSVAIS EVAILQEEVE FLVSQMGMYN QEQLDHQTAM GADHQYGNMA FELLAATSAN 1740
VKLLQANEFS KMELALIKGQ ADPEIVLYAV KALTAFCKFP PAAQDTARIQ GCPALVTEAC 1800
SKINKSGLPN ERKEEHLCAR YFLVERTAIN RNLYNKTPIM TELINSWNDY DKGAYTTTLL 1860
RFVFRAMRRV VSDAHVEELL KANVLQRLIG IISDVNADMA LLPDVLFLLG SLAVVPEIKT 1920
KIGELNGIAA CTDLLQRALP KPNTAPVVTN VCLAFANICI GHKKNTEIFS KLGGPALNVK 1980
VLNDRGHEYD VCNAASVLLC NLLYKNESMK KLLGTNGAPA ALVKGLSNYD GSEEKTAIRC 2040
LESVFKAISN LSLYTPNIQP FLDAGIENAY STWLSNLSET FPDAQLETGC RTLVNLVMEN 2100
EENNMRKFGV CLLPCMAVAK QGRTDTKALL LLLDIEASLC RLKENAEAFA ANGGIETTIR 2160
LIHQFDYDVG LLTLGIHLLG IQSAVKDSIQ RMMDADVFSI LVGCVEVDAE GNEVTDLVVG 2220
GLRCTRRIVR SEELAFEYCN AGGIATIANV ICKSINQPMV MLEACRVLLG LLFYTTRSQA 2280
DRQAAVEALH AQCQQRAEQM HAQAQADYEA GVVSEPPPEE MEVPEPDPDE LANAAYGGWY 2340
QMGMDEVMID AILQAVCACA AVEAHAKQLR LQRVCLGLAA YFASEQMGTS SLVGSGIEQV 2400
LTQIMTNFAG EGTTMQLSCV IINSIAMTSG DMYEEIKTSA LLSALKTSVG KMATKKPEEK 2460
ALKETCAATL EAASSGEDPF DAFSKTVTEL DFKFTEWNVD PYPNGVHDLP SNVKEALRKG 2520
GKLKVFLPEK EKEEIRWRSS QDLNVFEWCM GNDQDYNNRI PIVRIRNVAK GLVHPALKAA 2580
AKKEPRKVAA KFTMCLFGPP NDDFPEGVEL PMVAKSQKER DAFVEMMVQW RDAATYNF 2638 
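
The sequence is printed in blocks of 10 residues, 60 per line, with the running residue count at the end of each line. A minimal sketch of reading that layout and looking up a residue by its 1-based position follows. The function names and the `first_position` parameter are hypothetical, introduced here only so that an excerpt which does not start at residue 1 can be checked against its own line count.

```python
from typing import Iterable


def parse_sequence_lines(lines: Iterable[str], first_position: int = 1) -> str:
    """Join 10-residue blocks and check each line's trailing running count."""
    seq = ""
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        # The last token is the running count when it is purely numeric.
        expected_end = int(tokens.pop()) if tokens[-1].isdigit() else None
        seq += "".join(tokens)
        actual_end = first_position + len(seq) - 1
        if expected_end is not None and actual_end != expected_end:
            raise ValueError(
                f"line ends at residue {actual_end}, expected {expected_end}"
            )
    return seq


def residue_at(seq: str, position: int, first_position: int = 1) -> str:
    """Return the residue at a 1-based protein position."""
    index = position - first_position
    if not 0 <= index < len(seq):
        raise IndexError(f"position {position} is outside the parsed range")
    return seq[index]


# The sequence line of this entry that ends at residue 2040 starts at 1981.
excerpt = parse_sequence_lines(
    ["VLNDRGHEYD VCNAASVLLC NLLYKNESMK KLLGTNGAPA ALVKGLSNYD GSEEKTAIRC 2040"],
    first_position=1981,
)
print(residue_at(excerpt, 2035, first_position=1981))
```

The same lookup on the full sequence, with the default `first_position=1`, is one way to confirm that a reported modification position falls on a lysine.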
Gene Ontology
  
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR000225; Armadillo. 
Pfam
  
SMART
  
PROSITE
 PS50176; ARM_REPEAT 
PRINTS