CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031756
UniProt Accession
B6KA57_TOXGO
;
B6KA57
;
B9Q8M2
Genbank Protein ID
DS984727
;
EQ970682
Genbank Nucleotide ID
EEA97094.1
;
EEE32705.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_109890, TGVEG_093950
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
2015
LVQAHAA
K
NLGEKED
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
3122 AA
Protein Sequence
MGTSYSNPLV DSTLVFPAPP SSYGLDTPGL LLLRSKYIPS REVPAFLIRP RLTPTRRGSA 60
PFAGSLLSGG FPSRSHLSPG VCAAPWASTP STPRPCSSPC VSTQISVETS VSRPPISRSV 120
LAPARQKDSG SDTPDTGNSG SAAAGQPSRG KKGASHAAGE EGAVGAEALL GRSSASSDHS 180
GAEIEASSAS SLPDFVFPRS SVSHETAPSS PKAAQALGGR GATFFEECSG RSFDAHGLDH 240
AARAGQTACG ASCGGAEGRR KDGKGFCLLY FHGNACDANM MRDWLQIVAD ELGVTVLIFE 300
YPGYGLLEDY DKSSRGIDLC ARIAFEFLVD KLLFPIERII LCGRSIGTGA AAWLASQLAQ 360
CNVQVGGLVL VSPYVSLAAV ASDWADAPLV LTEVLVHHHW NNEAAIASIP TVPLCIIHGK 420
EDDVIPVAHA KRLWQAARQP PSLRVARFAD GNHNVGMNLE SFYGDILVPL QVFFRKVSSN 480
LRTKKAASQT RQRAVETASA AHLASRAHQQ PLGDASPHLS GGSWKSAKAG GGRGRLDRTV 540
SGVRHQGRRQ VSPRAVQGRG SGCQGVGKSG SRMLLSHRLG TGPHAKSDED CEAGIEGELK 600
NSLPSLASLG SRRRAVRVIQ GKKESTEEAG SSDLAKALGA MDMASLETRA SSSPSLHSFL 660
LADAAGPRDS LRKTDSNLSS CADQSPIRSD HPPLLPTLKK RVPWRPTTVS LNRAIGSAPS 720
PGETRTPGDP CFSKKLPRLR KPGSLPPPVF VLNDVKTLAF PRATTDGAFA GSPVSEWDAG 780
ARRRSSSSEG SCEGRQEPGR LLGHRQASRR SSWTAGDSMC MSHSPLSKEG ANVFIGDKHS 840
KFLCRAMPAR TVSVAGLLHR GKHRDMGGRR EESRGRRLGK EENDRERRLS EAGYWPSTDQ 900
SVHPAELLVG PAERGATAET QDEPWDDEEG EAEEEGDERE VDDEGEDEEE FVGTDGSPKS 960
RARLGSCDSL LAASSLLEPE TSRLCFCEDS YDGLQTDVED GDLFSDDAFA SADSTPFHAP 1020
HPVSALLFEG ADPVVDMPLR CDTEATGCGA SDENPGNAQK RPWKVEGEVC DFSGRLAPDP 1080
AQERESPWQS DPTLLPSAFR RRFFLRGKKG FDRFLPQQGP FSSSPFFPSL VKAPASPPEA 1140
PSTKALLAPL SAPSISGRRF QRPSLCQLDT PSCERTPDAG KGSGAEGFQE PTAPKDFKER 1200
TCPRSAATAA LAFFAPDARR HSSLLAAFLQ QETSRELQRQ RETLARTETT DGAGVRGLRS 1260
LRLQRSSLEL AGAPAPGHSP ISSSGASSKP GDASLVSRAS VSSSFLSSPL RTVASWKFPG 1320
ANKSPRKAEG KSDTTREGDE DGHETRGSGM GAVSSDRRQR PPSTGDLSEG KAIRVHSLAG 1380
VTPGLGSPSL VSPATRLDRG FFAKTSERPV CWGRQRRGRD QGTQARCDRA EAKREGEYRL 1440
DDDTTDSPLL GYKAPHDRVT ESDAGAVFGF SRELKVREKR RKSIGGGGQV GPMTAAEDLN 1500
VSLGEKTFLR ACLRPHRPLA QPSSTSVLNK KAGTVWTYTE QIQSPTIRGP PFLGARSLTS 1560
SSPPGVALES LRPAAPAEKV DSSKAAADNE KTSARVGEVD LSDAWTPPQC GLLSRSCSFG 1620
PDASGGFSGP FESPSSGGFP RKLHSDLGFA ASSRPVDDAS RVSRRPASVS APPRSNSCSP 1680
PRSSPSCPIL LSMHAPSASP VSRTKSLVIM NPANCPSVVL SPSSPLDFLS RDMSGHDVGL 1740
QSAAQKIPSE AFSPYASCSL ASSSFASASS LSCSPLSCGA SSSPSAASVS AASSQLLPTC 1800
PSSTTVLWSS NLSVDASASF PASSSSCSPS ASIDPASYCS ASPSPSCSPS HITSSSHLIS 1860
FSPDLPSAFH VAAPHLPFCG SASYSELDSS SPLSSTPDLL WPLSSQPVPS SCAALSLSSS 1920
CPRSPRPVSF ASCSLPASAA RLGLPSLGSR ETVGGRPAPA DLEISVAPTL GLPAQTSELL 1980
RFLRCDEGGA SIRQCEAAGE DSATPGDLVQ AHAAKNLGEK EDEKHQGEKA TEESDGCEKS 2040
EPGFYGEIHS DELSAASASG VSIARTSFFS SPCISPATCD LSLPVSLANA PPLSSSFLPG 2100
REEKGGYLLA SIASVKGRGL SLLDAEGEEK SMEQAVSLLQ RHPVFAPSRT TELSLHLHAS 2160
TELGSGSCPM TEPALFLRDR EESTTETNDV ALLLPGQETR RMDSFDLEKD QVREAAEFHF 2220
SSPYSPLVLS GGPPAGLRNS TAQPALPKAV GDAESRREAR EESSKEDEDG GETGEQTASP 2280
RILDHAEGEH SRRQMECQGY RGRMTPQRLA WSAQSLLSHT HLRPVTGIGR EADHERRQNS 2340
DKENVPFDHI ACFSSPAVRA CGGVISSPQS PVSPDLPEAS SLPLDAQVEA TKDQGDLSEQ 2400
EETEAKTPNS TPQTPVEARA AGATMSSGAS SAPDETVEAL QLGLSLPRGH ETNPGLLEGD 2460
SLSRKSEEET CVMKQLGSER CSSERDSQEG EDASDERSAV AGFSPSGIFV FGRGDRAGRP 2520
CPPAWPRDDL FCLCSEEDSP QITFVGETEE TRGGRCEEPE NHEQVELAKP GEKRKRGSPS 2580
LCLDGDLIQV DTDPPFRQLR ARVDKENESQ ENRLEVTPQR HAASLSEECR ETQVPFSCST 2640
EKETLNLNLE AAQMTRQTTC SGFDFLFPQN LDPNHAETEE FLETKQAEMG TFSSPTTFAR 2700
HARFFADETK AVCSLGRVRI FKDGTEREDE EDGFQAHDSF LSETEGSEWM SGIPSCDHRE 2760
DKLFEPDSPS FSSSSTVGPL TRRHLHLHAR RREGDWRGRE TARPFPGPLS SAAIAFSQRR 2820
RVNRYIKREE ESAALSMRDF AALPPIGGHE GVRVPQTCPD TRRRRGEMEA LDGCERAAQR 2880
CLGEKDPRFS SAPLSPTSRA TSAALTAAAH AAAAAAAYAE AARTIASSDK VQPTAVVSLR 2940
SAGSIPSVGT GPSVGNGSSV GTVSGASRSD ALAFLEGRLP LPGKQEVVQQ FLQARRRVSR 3000
HCSVACSRAD SPVGERSRRL QKSEEAVPPA KEEREDQKGQ NAVWILEDRQ GDDFETRHER 3060
LILEQKTLGF GARVCAEKKC QHNSGETITS VSYSSSLSCL LSSFLPTHFS LSVSLGGIQK 3120
EG 3122
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS