CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032789
UniProt Accession
B9QQH1_TOXGO
;
B9QQH1
Genbank Protein ID
EQ970709
Genbank Nucleotide ID
EEE27415.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_036800
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
2878
AAASLAA
K
RTAALKR
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
3717 AA
Protein Sequence
MSKFVRRPPV SFPSAGSPRP SSSAKAEPCR LLVLSHRRRR LLSDSRKSTS FLFFLDGQIC 60
GSSECLHCFS PPHCASRCAS ESPSGSSPSL SSGLVSGSAA SSFSSRSFSS SSFVSASVAS 120
SSLCSRISAL PGTKPASRAS QKSVSEPDPR CSLPQRARRG RRDPRKKETT RRAKRRSEIL 180
VAPSKPSSQQ RSNSEETGAR DSFPWIRFLP RFNRSPDKPF PSASCSCPDA ASSAPSSSSS 240
FRTYLPSPSP PSVDRLRRVP VLRDKRPVRR SVANWKRAFS SLLSLLSLLC LFVFLPSPPA 300
SRRLSPGLLS PAQALSPFLE PRGGGTPASA PTAVAKPANG EKGDTLPAVH VSPSLETERQ 360
RNEAGSTAGG QEGGGEQVGE GTEREGSKSG GEESKTDAQG QESLFRLDAK PSKETTDTSF 420
SVEAMAKDPF PLFSSFSADE GDSEEEENEG YEREQARGLK EGEGSVEKRR RTASGGKAGG 480
EGFSSLDDQT WGTTEDRQKK REEKGDSDVN IDSEREIERS GDGDGNSTSS SSASPDTNRN 540
RPVLQRKNGA PEFSGLRETV DNDEAAAVHA ADANGATPHE RRTAEAKEEA SAQGEAEKET 600
AKETEKSREN ETEKGNRKEG EGWEARRDAS ALPPEEHQQN PRESGEAWAD GADVSEASLK 660
RNEQHKETEG ATSKDAPSQR KNESSCSACS MEESCENRRK NRSQDAGQTS KQTTCEAVDT 720
SSSFLSFPSS PSSSCSSSSS PSSALSSSPF TSPCSNPFPS SPSSSFPFSS SFSPSSRSSL 780
SPMRDTVLEN ETLKRVSPSS LPPSSSSSAA SYESANVCPL SFDASLSLSF PRFSSEAAVS 840
LRTPGEARDR LQASGRQSGE EGRDEGRGER RMFSRALGLI SSLTNLLSPG RLLVSSPPFP 900
GAPAALPSQG EVSAAPSPSL APLAPVSSRL FSCSRLAGER FCEVSPSEGT DKVWLSESRP 960
GSVSRSDSIG RASPESDHQT AKAYLLGSSL CCLSPHHSEA SVAAGVCDSR KASAVYPFLP 1020
SCSLRTSTEL ESCTGEAEKR LSFACVAPLS CPSVLPHSPC VSSRHTENSE RGCPAFSELG 1080
GEILFSPASA CPAVSGERTA EARNLATEHD AERSRCSPGG AWMPCTSRGD SAARAGQTVT 1140
PGKVQGESKG LANCAGRNSS LHAEAELSPA AEGECPGHGH DSSVFSSTCL REKSQEIPMS 1200
QMRDMDGHAE TRPSDTNPQR ALECADAGER EVERREERKP SSEGDNNFQD PSPGGSDTAP 1260
ASRLPFSEEQ GREDSPNRRG EDSDEDFGRS ERKGETESDD DKLSLPGASG AVNSQTGNRF 1320
FSDEKAKNEG ASDREKKRSF RDLLSPPAVV LRRFFLPLSL RRLFLAPLVK AFNRREAHPL 1380
VFPFSRDLCA GENANLSGSA CDAASQLRGR KEASDERNEP VQPRPRHVET DSEDEANKES 1440
SGSPAPLQRP PGESGSFTQQ QQETLSASLP VRSSSKEVPE AVPVSSGPSR PRKTAEFTLN 1500
GRPFLRGRRY EAEAQKLKFD FASVDAGARV VASSRGVANI KALQRNDLDS YMLVPCELHP 1560
KFFVLSFTEP IHVEQVALAS MEIYASAFRH IQLLGSDAYP TKQWRLLANL ETTASEAHEI 1620
FDVKRECSAL HEGQACWAKY LKVRLLSHHL DSPYYYCSLT SFQVFGSTGF QMLESHIHSE 1680
SAPETDSGQS DGDAAAEAKE DGPETTQGSG PESGQHQGGD ERTKGGEGDG SDATASGEGG 1740
GDEWREEGGN FVSSSGEKNA RAAEAAGKAR GENQARRNET GTGGGADEAR EEKATEEAHL 1800
PERSHGDQER GDERTREDEE ERKTRRGEEK TNSDDPNKER SAAFNGRGRD GVPTSPHRQT 1860
GTDEEDEVEE EKTARDWREQ VSRERANDAS VSPDLTDSAE ILPSPSTSPI AAVPSTEEKG 1920
ETLAAEKTTE AYVCGGVHAG AYLCDQGERK SGEKEPSRLR SPEDVEGEVT RVTEQRRSEE 1980
KGKSWTASPS RGEARGMTPE HKREDDRDPR ERRDKSEGNG NSNQEEDENI ERKRPPSAGL 2040
ASAEEEERVS PGAFRASTKS PSSPSSSSAP DSPAGAGGAQ QEERRDQATP SDSVHAGRDE 2100
KKGDGRRQET DAAGRSGGDA HWGAESHRVR TAAEEEALEE RTTVEKRRHG EKQVERRGSE 2160
TREECVSSEQ VQSEGGGLND AVKTKQDLVS CFSNCEEDRD SRPRAKRTPE GGAEGPYVKS 2220
ESQRGDEGGP SSRKKPRSAS LFPEKVPREG DAEETEREQG LLRQLHQWRG LARLPLLFSP 2280
PHGFPLGGAT AGDPAGGASL LRSVINFLFP SWDPDAGSRS PSVVFEPLTV SPSFSVFAPS 2340
FSRLSPLKRS EARPDEEDGI RRSGQGGFEG GDNRGENETQ RISPEVPGAA SRQSPGRLPA 2400
KAPTEGESEE ELSEGASPSP NGRIEAAAKA AREAEDEGVK PCGENEGVKL CEDGEKGEKP 2460
ARDGSSPSPQ TTRVAPPRAS SVSSDSQDPR LRLAGPFPGV ARLFSRRRWT PSSPSGGRLP 2520
RALSPLRTSQ ERQSTVRPHA ASASPSYSSA PPTFPAAFPF PALLPQNTIL EDLLAWAERG 2580
GQSLSVAGRE NAVAEAGGGE TVEGSASFLA ASSLSPILPL LAQRRKGEGP AGKEGAKALA 2640
PVVAATFPSG RNAGAKLPSE TDGDRLATRG NALSQMQQQD LLHLLLQNLP AAAATASASL 2700
FPSGFPAAQV VENLAGDLSA VPAVASSVQS RPAASGSVSS SSAAGSAKGG AKGSAASASQ 2760
TGDRTNSAAP PASAKESAAA STVSSKGSGH VLLTLVDRMK AAESQGAQLR EKLSEVALSL 2820
HSNQQQTLQQ VVLLQLLLEL VSFLYMRLSK FDAIVPRLPF LLDFADSGAA AAASLAAKRT 2880
AALKRVSSAS GSVVGDGDTP TGLAADAERS GCSGSDLSPL RKAPGLDTNL SFKGEGASDG 2940
GDFITSVASF FAGGREETAA GLASCCRRRP RTGTHALSPK ECDGPARKEG ASCGGSGEGG 3000
ESGADNSCES LDAQAEVKDC SGFGEAKGGG TFEGEKESFL ATVLRLITHH VVTVFSSVAY 3060
LTEEACFVIV SPLVDGLAAS VSTPQTTSTT NSQGRRFCLH ATETVAGFFS AVATLQQSVL 3120
SSLIECWREA VKWKQAGGVS PGAAPSAEKG PSWLSVFLLL LLLHMAYGVF ILRFFSRKTS 3180
EAVTRAEAAV RECRDLRLSL EFLLRAPTER VGFLGRWPTL DEIEMESSAR ALSAEDTARL 3240
EPSEGAPASV SREENGEAEP TVLSGERRDA EAKTGTLDAG ARSEDRAHGK KDTLASLASP 3300
HQRQYDGPHV SRSQPVTDRE EYVGGYERFR LASLALQRDT MKWGRHRNDG TEQGRRQSFD 3360
LSSDARDSTR SSRLHPSWQE RGKPVFPSLK SISTSQSLEM EKDPLQPCDL RTCRATARPP 3420
ERSPHLAGDV PSEPLGDFNL TPRPSSAPSG GGVSPSSGVS GGGAPVKATD DSDSRVSKTL 3480
HARIDGPFPE ETNGEKEWGE DGTDGQLVVK SDHGASSSPK ETLTASVHSS VSLSPYRGFG 3540
QERTSDGGGS ASEATGSTPA HVASPVPPFL KHKKRTNTCG NLDGNRHSGG ELQRGLSSAA 3600
GSRRGDRGAD RAGKEGTKGE EEKAPQRKSY NGQSHGNRRA FKLQRHTVKG VSGGPSGSVT 3660
TLGARRHPLV ALKETPVKDA RAGTVSVGPG SGAAPRDEAD KRGAPRVEEA AGMLPAN 3717
Gene Ontology
Interpro
IPR012919
; Sad1_UNC_C.
Pfam
PF07738
; Sad1_UNC
SMART
PROSITE
PS51469
; SUN
PRINTS