CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032773
UniProt Accession
B9QNR2_TOXGO
;
B9QNR2
Genbank Protein ID
EQ970699
Genbank Nucleotide ID
EEE28131.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGVEG_038390
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
2158
MVSPRAF
K
QLSPGDG
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
3029 AA
Protein Sequence
MAARLRSLSL GRFCLLLLLA TWSAWSAAGS SSASFSPEEE SLVSHSRVPP EGKEASSAAA 60
FEPTFRLVAF SPPSSSQQES VPVLRGRQPI AAVFSVPVVP LGSSLFSENA SAPSEPRTPD 120
AFRLSTNSAA KTVKGRGFWV TTSIFRFDPE EPWPADLEVY VHLNAELRSA SGQLVTLPED 180
CGRGVAPQGG LKNHGMIAEK LDGEDERRSL ATSNSRSACI VHLRTDSLRV SVASVTSEFA 240
SAVTDNEWKP VLGNSADPLA ALFEVPRDGK VRVKMNSPVA LAVLKKTSLA ERLLAVGREE 300
RNGEFSFVPF DIRTCAENER EVWAASDEVD FTTDSEEGWV SWRDDSSEDS RQEGRVGGNR 360
RGEEVLCVEL SFPERPLQTR EVYELFVKKG TSYSRYSGPL GVASAFLSSS SPAPHFAFFG 420
AVRVSPAFID QSVYAVVSHR HRRLKEKKGS FTGPRPFRLH GGRRALKHVS YRRLTLVLPH 480
SLALPVARHS EANTDTLSPS PSSPSSSSSP PSSPSSSSSL SSSSTPSPPS TSSPPSTSSS 540
SRSVARLAVE QTSATLQKAA EKLMARMQLV HVASGQALSL RATMQSKVHL VLWCSQLQPG 600
ETYKLRIVGD DAVAGESDFS PLLDGFGLAL ESSEIEFSMA QLRPNLDFLR CQETLWILHD 660
RQLEHLGASH RADYGECDSG LPMAVMTVRH GKKEGEEGEG GHTATWGDVD AVALEAEPFA 720
ACIRSGDAKC VARLLPPWSH RHPPLKRQPS HELFYFDGAT GVVSHESPSQ LSTAVVAASS 780
TSSSVVAPCL TDFSSNSLYL VSTHTQHYRS ASISTDKHLL GALSISVFHS LHIHHDTNRL 840
VLNFQVLSLR TAQPISGATI RLFAASVAET REATLLECDS EEPRPREKAR GAWRSSGRKN 900
ACLAVQTDFN GFATWVSREI LEEGIEVFAT VVASIENSSP SGESATAVFV TDRLNWPWEF 960
RHKPRLGWEA WQGGPNVLAG YDRRFFVPAN EIRFFLLTDR SSFRPGEKVS LHGLLSTVNA 1020
SLCGFHGACL LHHALRLDAP EKLRVLVGAQ WPPETGGHYE SPGVHASSEV GSSHSCTSAV 1080
VSLNRFGAFS LSLKIPADAA LGSRAFLEYR IFKAGDVDEA ALSETACPET WYQLRNAGNL 1140
KHIDPAGGFS IVVEDPKPPS VVVSPLKLPA VVDPASALTV SGSVQTYAGL PVQDHRVEVR 1200
FAFQPSFGET TLRKAAARLQ DSWNREGKRA GQGASERTSD CGRDFSKFLI GSDQVGVNVV 1260
TTGEVSSRDT VHLHAEVRTD AGGQFELPLV LEKLTVAPSK CVGIETETAL GVHPLEWEEG 1320
TEISVTVRVT GLTGDVLPPQ TASVVVASSE FSLARKLSVS VRPVLPGVPF AVTAVAKPYP 1380
GVKATQLPTG SWLEVSVLRV DPQKPLGSSW EAQADNWNFW NSDQGLASCQ PSLFSEAGAS 1440
ESAERVPNTE YRAVTEEAME AFTDGAVDLA LFVEKHEDRF SIVKKCRGHG ERLSCPVTLP 1500
LDAAQYIFLT TMRVNGRRVR TCEYFLPTAG NLLTRALKPD LQLYSDVVAP GQAVSLVLPA 1560
LFRPGSSIRP RIYEVRRGGG EKTGPSRQLR GALSVWWHTQ SNRRLFFSVS LDSTDDELRI 1620
PIGRVPEDCP ASCSVRVVLV PPLSETPLPI SQVDTSSNAL MPKRDYFPIE TEVGARYGPF 1680
LIDETLSVMV RPAASPFVIP ENVIRVSFNE SGASAREPRD AELKPGKTAS VRVALLTRTL 1740
EKKSALASFF SRLTSPQRQI RAYAFVGIVD KRYFDLGPVD LPQVEEEFRQ RLVDQNVRQS 1800
VSSSFAEAAS LATYLYLFQF LKVLKERDPW VSDLHWPQAG LLGATALTSS WAPWSKTLRN 1860
VLLGRTSTLT GSFAASPRGE NLVDGVVFAQ MSREATAMPM MGAGVMAQAK RSALTASRSG 1920
DAEVSVDSGE THGVAAAAKL LLSDTPLILW KSVELSEGET PGELIGEVVV RVPDDSNRYL 1980
LRTQVVVEVE ESEAPALSFF RRLWTRAEAL KLSRVFYGQF EKEVVAKKRV KLTPFRPKLL 2040
RKGDLARVGA ILQVDETLVS EGREAVVSCW FPERNETQET GHRQRVKLTK TALPVTVQVD 2100
TDDPSLILNN RLQVNCFAQL ASDASFSHGI AFAVPVVPMA PRLSLSSVWA MVSPRAFKQL 2160
SPGDGGAGGQ QLEPPREDST SVTSVEEKIE LPLPVLEGVG GLSASVGVGY GAVLLGKIHS 2220
FLSAEICPFL PAKQAHEDLL WDVDGRRRTD LGSKKILCTC CLEAFPAAAT PFFERRSPSL 2280
SALLLVLLAR QVVKTAKLQM HGELVASARF AETKLTAYLP ADPEIFRRVG FLPRPSTEYT 2340
PSALHELRVD LDVNLLVLLV AKRQASSVVL PYVASVAETI RAFLRDAEAR FLALDSGPSP 2400
GDLWPQPVPL IAASEPGVDM PDSQVVAQAA SRVGRADAAK KKREDFTGFL AFVGPDLLAR 2460
IRYVLGSATP LNLVHPEAEE ALAFPSLLAW TLSAESRDRE SAKEQRLSPH LLWALVVGLE 2520
HREEANGRQL LALAHAVLKA AVGVLRFLPD SAAYFSRVAG GSVPASDAQH ALLIQIASFL 2580
LSATSAERPA PSTVFSTLDP ELFASGSERM SPESIVHVFS KVLLYLARGG MSGQQPLGRM 2640
VPWTGGLFRN PWEDLLVMDA VAQWHTATKS DRADLSVSVE LGSEKRSTWA DGDKRLMQPP 2700
WLSVLRGQLS LRRQTAVLEK KLSWTDLDAS EILKPVSFVS SNFMHANDAV SDDAEQCTWH 2760
GKVRVNVHGE GVALVSIAMD FVPDRSHMLP TFMGALLQKE FLPFNSATEQ CENAPVASVV 2820
RGNRVCVSLR VTVKDELRDV TIRDILPAGL ELSSRDPSMS PLAVVDDAVS SVPPSSHFPT 2880
VGKRGSASFF WFLPRCEPRL LGNAVEWKCP HLAAGTHTLR VIAIATVSGF FAVPMATVEV 2940
GTDEEGGGET LSGQKQADKV MLLGASGAAK RAFVVLEEGE RQLAANEVAI FKEAGLQPPP 3000
AWLLGSAPKG CGVCPAGTVC SPALGKCVS 3029
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS