CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032717
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_035630 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
469GGDSGIGKESGKDAEacetylation[1]
Reference
 [1] Lysine acetylation is widespread on proteins of diverse function and localization in the protozoan parasite Toxoplasma gondii.
 Jeffers V, Sullivan WJ Jr.
 Eukaryot Cell. 2012 Jun;11(6):735-42. [PMID: 22544907
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3744 AA 
Protein Sequence
MTAPPVSVAP GAPSPPAPYF PASVVPATAA SHLQAAVRLI QEAQALHQLP ALSAQDNVGK 60
EKEKVHSQSN EMLPVRPNHA GASARLVKVL LLLDASVLSH ARMSSSSMPI SLSLPPSPSV 120
SSHEEDPRGV FTLLTFLYNI ILAVDEALEV DSGTAAPFSN PPATPALCRM VTVSRASSPL 180
PVPKLVERQL SEATRQLAEI FSPFYSGLSR AATSSVSPGG GCSSPCPAAV PPPRPPAKAL 240
ASRQGGENEG FTHSGLPQVL EEKGKRGACV PESPQTSLAF PSSGNSSRKR RDTHVGRAQH 300
QQRQLLLLQQ QQQRSRNRLR QEKPRRLKFL HECDTAVSQL PFAFWKAFGR LMRVPHTHVP 360
SCMPFASYEH MDALEQVWGD AWALSQLRSS GLSYKILHAN ATAAAFALAG KSGSGERGES 420
RSKTGGRRDG RGSGSAKAKD RSKAEWTKGV GTVCGEGVAA SGGDSGIGKE SGKDAEDEIE 480
SKNEKGKEGL AQERGEPRGT KVNEKTGSVA TPSQDPAGPE PGENGASTQA HNAQDNVKTA 540
AKTSSATAAA RGALQGVIAA ATGVAGAASG APSLQWALWQ LQSAMEGGTQ NSLCESSGTT 600
TPATREESDA LYLCAGDTTQ EGKDARKSAG EAGTEKERGE RKEMGPGLGL LLTKRAQEQV 660
LDAIEKALGG VDASEQGRVT KEVPLLVVLL RRLSALSPSK GVSSAVRETA RVLLVDWLGV 720
PVPIEDSLKE NQTPWKKNSG KDRKKPVGSP HLCLQIAACL LSAAAGQAPY VWSDVPDNDA 780
EVQEKGDTYP VSPLSKSLSP SVSSCHPFGV KIFHLSQLKE LLDVAGECLY TAAASQEMEE 840
RANVNDESGG RRKEGDARCG DAGKEGEKAK QDASLRFYQL LKKLLAWLAL LQRQLGAALG 900
ASGQTESGEK LLYSLPGDSG EGATGALAAI MLLADLHDEE ERPWGDVVGR YHEREQADSP 960
GNQDGKEEEN TGASDVVDRK RTAAALKQAT TLSHPALSEQ KVDLGDVPPL PVLRRKAIAA 1020
AVAEGMDVEL LESLVGPFAS RSFARFVSSH RRLNGQIDGH KEGQAAEEAK KGEAQVGGAV 1080
VSPTFRASAS LPNTGHEETT GASELKSTSL SSSSCFSNLQ FSRACVLDAL VDTLREAESQ 1140
LFSLSLRLHV QARQHAAEET SAAISGTTSA AQGGDKEDES CLSSGSAKAI LSSDLQADLL 1200
FFLHSTIPFI SHITRQLSAL SASSLSLSSL PASLSKHTGK QGGKEVMIPL LVFVTPVLES 1260
FFLRLTSLYG DLFTLLALTA SRSLDAPLVS SLGAALDPSF SMEEIRKQRY GVHDRFVVAL 1320
CSSLPVFLSS LGVFRRHSRG VSQREVSPGG CFEGTAEWWG SSCLDEVVGR QERGGGEDEQ 1380
VSREDSTAGA DERSGRNEGK TQEEEGSRPR GSASGSPLVP PVAALFPVAH KIDQPGGPQP 1440
QMDGKGCQFP TPTSQQVLPS LDTPFCMWGV AVPSKRSTAE WLLKGRELAQ AEEAEDDDDT 1500
LRRLLSPLAF RSASRNDTSS SSKTSNSASP AQSSFSVPAS AFSSSSPASP APLPADGCYL 1560
HSSAARLCVA LAAFTLIELV AVSTFCAACT SFLEKKKLPS LPTVAGPMAV AAGGGAGGAG 1620
AVFSAGVALV TIARKRGEAW KDFCFSGQLV RWTLHALLYL VAAPLEASAV AASITTHFPN 1680
VGHGDRKKGG RQGKRKRDDE GGEAQEADGS NENPDDERNA ERDESLLWQL HPLTSILSVK 1740
EDDDGRAPIF LDGAYHDLNL GRGSTPISDT SPLAFMESVA EASAVPLSSP SSLSAVFPLS 1800
SASPHSPSSL SSTRHGSPPV SSSSSASPSS SFTLSLLGAL GDVLGRNEER GVLDGGEYAS 1860
RLAEVIRLSL EAEMVVRELA GGAGSSQLDG LPGFSGASPL SGAGASGAGG GMSKVNAGAL 1920
LLIGEAKVKK GERGERAARA ASAAPAAIAL LPEWLTSAYR QEEELPWEKK IDEIWMKGTA 1980
PASAYADTPP TSILGGVNSD VRSRQRQLPG LELYLPPGMC TASLPLAALQ VALRSIRPSP 2040
IRGFEREMLV LASSCSPSSL PCRPDADAAR AAVREVLTMR PGRGGDETGA KSGEEGERGE 2100
GVRCQWIERA GTRLAPFATM SGSSTFLIVP LAVRLLRLHL CATAQCQSDL LLAASGSCEE 2160
GVVEGTGRQH KSGVSEQGND GGVQEMKIGS TETTAEFANE RVKETNATMT PGEAGLACCS 2220
NKLSFCWAAS EVPDRLGDLA HSIACCYFLV YGQPVLPRPA SPLDKLVLKT TKEQTRNNFI 2280
EMSLPATLLG VYIHRAVQKC GPWATEIQSK EMLLQQAQRS AASEPEALER AGFAIRAAAC 2340
APHNPVLWMA VGCIGDRAAP NEETLLQIFA HTALVINVCG EIWLAALRRF CPHVCVEEPD 2400
LPQDEQVEEE AEQERREAKK VKRRADEENK VRGGDMSLEV MWERMNVKKL SETWWIQMCR 2460
SGKKIEDPCD LQWRYLSTRL DVLLLLVLLW KQFRHSSSSL SAAPTAAGGM SAPTADPTER 2520
HRRYWMCLDA ANEAFFAKAN LGSRSEGLTP RQKEILELCG AASVDGQKSH ADSAPSSSSQ 2580
SHGAVENNSE KVVQSSLLSF AKHPSSSSQQ PSRDEKVGET RMTDGSVEDV QRLRSFILEV 2640
STLLRFVFNR SNLPLVVASS LCLEEEQLGG KTEDRDIWNW ALVASLNRCR TWVEAETFLW 2700
HLPLMRSKWL ARLARWGVRL PLDRKRAQMT EAGGNRTPVS LTPSVTATSF APPDVAAMAR 2760
AIGLEAARAI ELPPGGAREQ EKQHPSDATS SNSASLNGGR ARNSSENDDA SFFSSALFSS 2820
ALCDAVDALM ISCIQAFGAE AMLDPAAGDK SQEQTKDNIQ SKGKRSEGKE QSHSVRSHRG 2880
EGATTSRDTE KGDNASLLSS PRSRSELKDD IEGKQGETEV EGCASSTSSV RLQGGVHFVS 2940
SRVSLSLFYS CLGDMEELVE AEGDVLPLAM PLYHLHALRL KIVMLSRGAL WRLAALFPWR 3000
CSPAGVSVKS QDKGGKKDNC IEALQYLALK SRQAGLWTCI PRLLLVNASL ASGQPTLALR 3060
HLRLLFSRGA TKLLPPDAWL SNHYQALKQR RPAYERSKFR LFCATVDTAI LLTQAILQRT 3120
EKAKEEQLLA LRNEDVGNSE AEEETKQNNV GLGFSSSSSV AASPAGEDER EGKRQSIPKQ 3180
VAGEGWIFGG LEKVEHLDDL VQGVKFLVEQ LELQVKLLRR LQLNELGTYL QLAFFQDPSP 3240
QNTKPGTPFY TLQTQIAQLC QSPNAGGLIA EGRGDKGTGG SNSGTSASGD RTGVQGAANF 3300
LMLMGSLLGS GGVGGQTGTL QGTATDEDGQ QANKRMQREI VNILLKGTID VFEYLVTYCP 3360
DVLDMREMHR AITQLFPEVG VPAVEPQNTY VATTASLDGR LLRLLRDAVT CVRNASPSAT 3420
SGLLKQLKQL IREKAKEVKD VGAREEIDMN VEMVSVESGP QNENVIETRD ARVPDQDDED 3480
DGVICKFVER LENIFVAACR KLLYYNPPSP AEDLAASYCS EGTQTHPTPP PAVAVMRLSL 3540
GLPVQCGMEV QEIEKVTEAL AAYAKATRAG APSVELFLKS GRVTATPSGR GGKGHGTAGV 3600
HGAGGGETGR KERGRGGRGG SASVAQARGV AGGGNANAET SVKADGAGST SPAETPSLSP 3660
PGQNWVGKAA EGDAGIKDRG GDEKRRKTEE RGDGGQPNKT PREEPESPPA GESADRGVSS 3720
TPDCPVPVIE SSMPTENRLQ HRGP 3744 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS