CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-031862
UniProt Accession
B6KUG4_TOXGO
;
B6KUG4
;
B9QQ76
Genbank Protein ID
DS984755
;
EQ970707
Genbank Nucleotide ID
EEB04347.1
;
EEE27539.1
Protein Name
Putative uncharacterized protein
Protein Synonyms/Alias
Gene Name
TGME49_095380, TGVEG_050500
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Toxoplasma gondii
NCBI Taxa ID
5811
Lysine Modification
Position
Peptide
Type
References
547
WVDKAEG
K
GGGSVSA
acetylation
[1]
Reference
[1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
Xue B, Jeffers V, Sullivan WJ, Uversky VN.
Mol Biosyst. 2013 Apr 5;9(4):645-57. [
PMID: 23403842
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1979 AA
Protein Sequence
MPFSPLPAVR SSRASLASGA NALCRALSPQ GRGYAPSPQF RVRQGDACCV SLLRERTGKD 60
NASSSLRHSG RQPLPPQRRP LGLKRGVRGQ GETAERREHT TRGDARRLRR HWRGSRKAAG 120
ARDDKGDDVS LWGAVSFASS PRTRTSLGND CAGLPFPPFS TPPLSSLLAS SASSAFCPLS 180
SLEFCSSSPA TRSRPGASPF ASLMRFPASP SLSSSLSRAA IPRAVESRQE AVDTAVKLRN 240
VSEGTFYRQA RRTFAHVASL RREEPRSSPE LADSSAVESS DLSSLSPAFS CKTALSQPAS 300
RLFREGADDP SSAVSASSPT TAAAPPAASR FSASDAFRQD TLLEDWLSVR EEIRAAVERQ 360
RDSGETACAV ARLTAAIQAG QEAGASKAKS AEQALLTALR ENEESPAAAG ASGDSLLQRD 420
LCLFKAVSTN LALEHLSVRL LTLLLSHRRH VAPAASPPRS FESSPASTTN LTTLSGAGYP 480
DVPPSPQPVS KPQPVCASKE TVCPTEPASE KYVSAAAASP RPSPCVSGAV PLFPVSTRVW 540
VDKAEGKGGG SVSAWVYRQL LVLGALSLLP RASGFAVQMP VLRYLLARVF PTVGETPDAQ 600
SESAHARREE EAERTSDMDW GRESWHPAVV FRVSLQSLST CPLPSLPLEV QRKCWVVLYF 660
SLWHWDIQQH GSFLCSSSPS AAPPGERAAP TSLSLSPSTL ASPAAPRGRL DTSSSPSSGL 720
VSPAVSLSAS LSAYSPFPDP PLAALVSWTC LQSLAASSTP PPRIPGVSSS SSSPSADPAA 780
STSLGSAPET FDSVAGAAGQ AVALCSAREA VQLLQALNHL KHQLVELPFL ADDFAAETEA 840
SDTEARNPGA QPRASESLSA AAPASSDDAL GPAAALSPRR RSEAWLAGLQ DLLLRVLTGT 900
ISAVGLAETR GILGSLGRQE VRDAVFVHSL AHRVAELMGE VPERSAAPQS REGRDARAGA 960
QNGQATSGEE PPKERGVSEG RPVEQGEETS ALRPSFAYSG KRLKLSEMHN IVMLAMKAEV 1020
YDQHLIDALA RLIVGEAPPA ARQAGLAAAG TDMSARACSE GDGEALERAQ LGCVIGIVQT 1080
LGKLQARTTD SALFERMMLA LCEELHRRHA FIHDCTTGQV LLGLTRLRFV HPPLIRALAL 1140
KTTRAASRQN AQQTLQQKVE TPRLAPVSAD AVGRNSPASP DLESEEPPGL SSAAGVNTNH 1200
YVLAANMISR NGFYSFSLLQ QVIKLQLTSG ATTSVLAMTQ MLATLARFGD FQPADSHLRL 1260
LIRWCLPRFV AELLRRPASA ISPLNASQLL CSFAKLRYVD VPFVSHLLSI FTSPQPSSLR 1320
SSSSSSPSAA SSSASSNEVE RNLLGVEVEV PRSSPLAFPR LPPCPFDIRG CRRVLKTLDA 1380
AALVSIVEAV DAMNFWSAAS LHLLYQIKCI LEDSLHDLKA SQIIAICLAF RDWHYRDWEL 1440
PDAMVAGEAL DESTDEEEFL LTPERGDLSS DSSLLGASAA LLMQTASPPF QPQPLNSFSF 1500
SASLASSSSS SSASAVSPLA RGGAVGSEVR DIEWQCLHRS AWKEELVLNC VEALERHESF 1560
ALSNWLCMQK LKTLVDALHL DLFFPYFSWF LASSPSARLL PPAASRAVSA VGGEAKALSN 1620
YAEEAPGVGG SVCRASEELS PKDENSPGTH AVAGNSVGKT EKANWSGTDG APLLQYEVLD 1680
LTPPVYVHPA LETPQSSRGE RPGGAAQTVA GTQGETRAVS LLERFQVLPP WLWRQLVKAS 1740
FVSRGEVEAN RHLHDREGDD ASEESDSREA CARPCRRDSP AADQVTPAPE GNETHSPSPS 1800
SSSSSSSSSS PSSSSSSPSS SCVVRPVALR LVRGKPSSEG ASAKSSVSHL SLQTEEPPSP 1860
CSLLPSSSSS CSPSSSSSFS SFSSPRAHSA CFSERAETPL EASHPPVLHP FGHRGAGAVI 1920
EDVGPFRVFV GPFVRGMRVD VVVESLPRGP ETVPKKKEKK TKKRRKQKPR LAPEPDETT 1979
Gene Ontology
Interpro
Pfam
SMART
PROSITE
PRINTS