CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032789
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_036800 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
2878AAASLAAKRTAALKRacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3717 AA 
Protein Sequence
MSKFVRRPPV SFPSAGSPRP SSSAKAEPCR LLVLSHRRRR LLSDSRKSTS FLFFLDGQIC 60
GSSECLHCFS PPHCASRCAS ESPSGSSPSL SSGLVSGSAA SSFSSRSFSS SSFVSASVAS 120
SSLCSRISAL PGTKPASRAS QKSVSEPDPR CSLPQRARRG RRDPRKKETT RRAKRRSEIL 180
VAPSKPSSQQ RSNSEETGAR DSFPWIRFLP RFNRSPDKPF PSASCSCPDA ASSAPSSSSS 240
FRTYLPSPSP PSVDRLRRVP VLRDKRPVRR SVANWKRAFS SLLSLLSLLC LFVFLPSPPA 300
SRRLSPGLLS PAQALSPFLE PRGGGTPASA PTAVAKPANG EKGDTLPAVH VSPSLETERQ 360
RNEAGSTAGG QEGGGEQVGE GTEREGSKSG GEESKTDAQG QESLFRLDAK PSKETTDTSF 420
SVEAMAKDPF PLFSSFSADE GDSEEEENEG YEREQARGLK EGEGSVEKRR RTASGGKAGG 480
EGFSSLDDQT WGTTEDRQKK REEKGDSDVN IDSEREIERS GDGDGNSTSS SSASPDTNRN 540
RPVLQRKNGA PEFSGLRETV DNDEAAAVHA ADANGATPHE RRTAEAKEEA SAQGEAEKET 600
AKETEKSREN ETEKGNRKEG EGWEARRDAS ALPPEEHQQN PRESGEAWAD GADVSEASLK 660
RNEQHKETEG ATSKDAPSQR KNESSCSACS MEESCENRRK NRSQDAGQTS KQTTCEAVDT 720
SSSFLSFPSS PSSSCSSSSS PSSALSSSPF TSPCSNPFPS SPSSSFPFSS SFSPSSRSSL 780
SPMRDTVLEN ETLKRVSPSS LPPSSSSSAA SYESANVCPL SFDASLSLSF PRFSSEAAVS 840
LRTPGEARDR LQASGRQSGE EGRDEGRGER RMFSRALGLI SSLTNLLSPG RLLVSSPPFP 900
GAPAALPSQG EVSAAPSPSL APLAPVSSRL FSCSRLAGER FCEVSPSEGT DKVWLSESRP 960
GSVSRSDSIG RASPESDHQT AKAYLLGSSL CCLSPHHSEA SVAAGVCDSR KASAVYPFLP 1020
SCSLRTSTEL ESCTGEAEKR LSFACVAPLS CPSVLPHSPC VSSRHTENSE RGCPAFSELG 1080
GEILFSPASA CPAVSGERTA EARNLATEHD AERSRCSPGG AWMPCTSRGD SAARAGQTVT 1140
PGKVQGESKG LANCAGRNSS LHAEAELSPA AEGECPGHGH DSSVFSSTCL REKSQEIPMS 1200
QMRDMDGHAE TRPSDTNPQR ALECADAGER EVERREERKP SSEGDNNFQD PSPGGSDTAP 1260
ASRLPFSEEQ GREDSPNRRG EDSDEDFGRS ERKGETESDD DKLSLPGASG AVNSQTGNRF 1320
FSDEKAKNEG ASDREKKRSF RDLLSPPAVV LRRFFLPLSL RRLFLAPLVK AFNRREAHPL 1380
VFPFSRDLCA GENANLSGSA CDAASQLRGR KEASDERNEP VQPRPRHVET DSEDEANKES 1440
SGSPAPLQRP PGESGSFTQQ QQETLSASLP VRSSSKEVPE AVPVSSGPSR PRKTAEFTLN 1500
GRPFLRGRRY EAEAQKLKFD FASVDAGARV VASSRGVANI KALQRNDLDS YMLVPCELHP 1560
KFFVLSFTEP IHVEQVALAS MEIYASAFRH IQLLGSDAYP TKQWRLLANL ETTASEAHEI 1620
FDVKRECSAL HEGQACWAKY LKVRLLSHHL DSPYYYCSLT SFQVFGSTGF QMLESHIHSE 1680
SAPETDSGQS DGDAAAEAKE DGPETTQGSG PESGQHQGGD ERTKGGEGDG SDATASGEGG 1740
GDEWREEGGN FVSSSGEKNA RAAEAAGKAR GENQARRNET GTGGGADEAR EEKATEEAHL 1800
PERSHGDQER GDERTREDEE ERKTRRGEEK TNSDDPNKER SAAFNGRGRD GVPTSPHRQT 1860
GTDEEDEVEE EKTARDWREQ VSRERANDAS VSPDLTDSAE ILPSPSTSPI AAVPSTEEKG 1920
ETLAAEKTTE AYVCGGVHAG AYLCDQGERK SGEKEPSRLR SPEDVEGEVT RVTEQRRSEE 1980
KGKSWTASPS RGEARGMTPE HKREDDRDPR ERRDKSEGNG NSNQEEDENI ERKRPPSAGL 2040
ASAEEEERVS PGAFRASTKS PSSPSSSSAP DSPAGAGGAQ QEERRDQATP SDSVHAGRDE 2100
KKGDGRRQET DAAGRSGGDA HWGAESHRVR TAAEEEALEE RTTVEKRRHG EKQVERRGSE 2160
TREECVSSEQ VQSEGGGLND AVKTKQDLVS CFSNCEEDRD SRPRAKRTPE GGAEGPYVKS 2220
ESQRGDEGGP SSRKKPRSAS LFPEKVPREG DAEETEREQG LLRQLHQWRG LARLPLLFSP 2280
PHGFPLGGAT AGDPAGGASL LRSVINFLFP SWDPDAGSRS PSVVFEPLTV SPSFSVFAPS 2340
FSRLSPLKRS EARPDEEDGI RRSGQGGFEG GDNRGENETQ RISPEVPGAA SRQSPGRLPA 2400
KAPTEGESEE ELSEGASPSP NGRIEAAAKA AREAEDEGVK PCGENEGVKL CEDGEKGEKP 2460
ARDGSSPSPQ TTRVAPPRAS SVSSDSQDPR LRLAGPFPGV ARLFSRRRWT PSSPSGGRLP 2520
RALSPLRTSQ ERQSTVRPHA ASASPSYSSA PPTFPAAFPF PALLPQNTIL EDLLAWAERG 2580
GQSLSVAGRE NAVAEAGGGE TVEGSASFLA ASSLSPILPL LAQRRKGEGP AGKEGAKALA 2640
PVVAATFPSG RNAGAKLPSE TDGDRLATRG NALSQMQQQD LLHLLLQNLP AAAATASASL 2700
FPSGFPAAQV VENLAGDLSA VPAVASSVQS RPAASGSVSS SSAAGSAKGG AKGSAASASQ 2760
TGDRTNSAAP PASAKESAAA STVSSKGSGH VLLTLVDRMK AAESQGAQLR EKLSEVALSL 2820
HSNQQQTLQQ VVLLQLLLEL VSFLYMRLSK FDAIVPRLPF LLDFADSGAA AAASLAAKRT 2880
AALKRVSSAS GSVVGDGDTP TGLAADAERS GCSGSDLSPL RKAPGLDTNL SFKGEGASDG 2940
GDFITSVASF FAGGREETAA GLASCCRRRP RTGTHALSPK ECDGPARKEG ASCGGSGEGG 3000
ESGADNSCES LDAQAEVKDC SGFGEAKGGG TFEGEKESFL ATVLRLITHH VVTVFSSVAY 3060
LTEEACFVIV SPLVDGLAAS VSTPQTTSTT NSQGRRFCLH ATETVAGFFS AVATLQQSVL 3120
SSLIECWREA VKWKQAGGVS PGAAPSAEKG PSWLSVFLLL LLLHMAYGVF ILRFFSRKTS 3180
EAVTRAEAAV RECRDLRLSL EFLLRAPTER VGFLGRWPTL DEIEMESSAR ALSAEDTARL 3240
EPSEGAPASV SREENGEAEP TVLSGERRDA EAKTGTLDAG ARSEDRAHGK KDTLASLASP 3300
HQRQYDGPHV SRSQPVTDRE EYVGGYERFR LASLALQRDT MKWGRHRNDG TEQGRRQSFD 3360
LSSDARDSTR SSRLHPSWQE RGKPVFPSLK SISTSQSLEM EKDPLQPCDL RTCRATARPP 3420
ERSPHLAGDV PSEPLGDFNL TPRPSSAPSG GGVSPSSGVS GGGAPVKATD DSDSRVSKTL 3480
HARIDGPFPE ETNGEKEWGE DGTDGQLVVK SDHGASSSPK ETLTASVHSS VSLSPYRGFG 3540
QERTSDGGGS ASEATGSTPA HVASPVPPFL KHKKRTNTCG NLDGNRHSGG ELQRGLSSAA 3600
GSRRGDRGAD RAGKEGTKGE EEKAPQRKSY NGQSHGNRRA FKLQRHTVKG VSGGPSGSVT 3660
TLGARRHPLV ALKETPVKDA RAGTVSVGPG SGAAPRDEAD KRGAPRVEEA AGMLPAN 3717 
Gene Ontology
  
Interpro
 IPR012919; Sad1_UNC_C. 
Pfam
 PF07738; Sad1_UNC 
SMART
  
PROSITE
 PS51469; SUN 
PRINTS