CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-032733
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 TGVEG_009390 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position  Peptide          Type         References
1756      SAVAGGAKRGTVELA  acetylation  [1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2810 AA 
Protein Sequence
MTRFLRRRSM AYCLRFLACL SFFLAAGPLR CFVPSCYLLY QLLWTEAATG GFPWQRPSPT 60
PLDFFGSEDS LSLFSSTSPA LPFPSANSVS PSEANVLGDA AASVAPASSP ISREAGERRQ 120
GAGPEFSSPS SAQSAPSTFA DVATRLSPSF SEGENDRQPP GRAAPSNARL SSGLAAAVSQ 180
EAGTQRRLPP SDSAKSMPPT WNDHNHGTLP ASVPSPLREH NPFSSHLEGQ HACGALFAPP 240
RGRGSDRLFS QNEAEKEKGK GASSDNLEIL VRNINMPHRS VAVGSVQVHL PVDRQRGSRT 300
DSESSGDAET EKHRNKEAAE KRTGEGSVFD AIAGHREPQI REDPAETSEE ETEREVAASE 360
HASVKLDISH FSLPTKAVLF FRWGKPPTFV NYDLRLEVAS PGTYFLPLVW PADGISPLRR 420
SVECSSPSSP LASSSSAYSP SASFPASSSS LSSTPSSLST HPSGGDALSS SLSSVSDVRP 480
CACSRPAFPP PFASASLAVS AASPPLSEAG SMSLESSFLP VGRLYYLVIS CTVAELASYR 540
QKLTPRGRDA FLAGTLSLSF HGDKFLRIFP LLTQDERENM DFSLDPGKNG NVHYFHLPST 600
VSSPATPVSE GAERAENAPD LSPLGGAFFR ALGDSARRHG DAADGAERLD SVGRELSEAE 660
KHAPDSVENL ECDKRHAKVP QAGRKHETTS GEAEGDLSSN RDRGNRVASA VEKARKGDQE 720
QVKNFEKEVD SQFTRFFFFA PKTLLQEVVV EGGGVEGRLD AFRDEGSPLP LSPPSSLSRL 780
GRTHPAGFRL RFPRKRVSPP YQLKAAPTFS FEWHQVLSLP PQVAAPSSSV SNPFSFSPSS 840
SSESSSSSGA SGSPPFPIAC IRVNFSGAPV STFQRENEER GLLTSLDTCS FFSEAVHRGG 900
VQREEKQGDV LSEQEGRGSK QEAEAQGDAR NGDALRRLST QEMEAEEFSS VSATLSRFFD 960
LRAVDILKPR QRSAKREGGR SAGEEKSAEG GRQTREWTFV VPQERLPEAI QKAAHTVTLS 1020
WRIAPLLVVH RETPSPARRE GVGPQFWDQA AANKSEREER GRRRQTEDAS VEKERTPPET 1080
EGLQRRELSG ESSGRRGGDT AAGKHGIPLP GVLSTEGVEE TDSEGMENAR REEMSKERET 1140
ERAVGGASRM REDRRKEERE TTERDTLDRT LFLLSQRITF DLPSRGPWAF TFYLNTTMPL 1200
ERVIFSSESV DLGDFGSPQN HAALTVSSPT APRSSRLSPA ASVDGEPSLE TFSKVPFFIR 1260
RHSCRARVLV SQCRDTSETA GAKEARQRQS STFPQPSFSH GRGRTGERGD SGSRAKNGGF 1320
FAETSPALDV FSPTAVRFSS LFSGMISPIL SLFSTSWSEE TEVPQAAVSS KRQSTQSASL 1380
SYPVSPPSSL SSQHGCTSCP SARSSLLFLP LVSPALPSPS FSSFLILSSV FSSRSSPPSS 1440
LPRSSPHSSP SWCPCSRSSG VCDTWCATPL LSFSRPLLPI FQPNLLQSLA LSLGPETAER 1500
PALAELFFSP SDVSGGVALL LKLDAGDIWR SPQFVCEEAD WQQEEARLRA EREDSELVKK 1560
RGQKARQQEV GDAGKPGTQR VGDSREEKGD TAFERAEGGE SQQQREEHEL SGMASATASD 1620
RLHSSPSQTW TKTDSRGAPF ASQEARVPFE RAAPTAGPVK RDLEAVERAR SRQHASGTAT 1680
EAPETDVVVS FFTVEFLLRR EAAPFRRARP DELEAAKEET RDQAFNTHWG SATDTVVAHR 1740
WRLPFACRSA VAGGAKRGTV ELAGDNSSGG TPVVMPVVME LGADEETDDV SRRRSFAVDR 1800
VRTDKREFAD EVQERSSQIL DTNRSRVSED ASSSRVSSLA AVMDSSLSGG LSASGEASSF 1860
SATSPSRTFS LAKKVGAGVE FLYEATVAPS WNFRPGRHLA SWRLLKRTVV VSANDNLKRE 1920
INRFLMMTEI DQEANRDELQ KYEARKEAVS ETTFSQHEAQ DLRWGNNSPG VFLKAQFLAD 1980
EPCEQTCHRG HCWPLGFVDG FRLNYCRCTL GVGGAACDEE LLPRWYVAFL TALLVFSNLA 2040
FLFVIRFSIR ELLDSARPWE TEANSDQGEE TLSRARWKRP AAWRPVVRLT VFGTAMVASS 2100
MYHLCFDGDR CFVLSPMDWQ NLDFLFSFFS ILLVLVALSR LPVSLEFFLV GVSFAVLSRA 2160
TLATPRRSAT VSRLFVSLSL LLLLRLFGAF PRRWKERRRQ RRLLSPAAWR CRRLLLTAMA 2220
RAIEAERASR KQQGSMETGR GAVWGGVPTE FEEGPDFFVV LGTQVVEGVA DPLSLLLGIS 2280
LAALGVSAFV FFETNENYAF VHSLWHVTIE LSPFFCLRGL APSRAAPVLA ALQTPRKKRR 2340
QEETGSEQGR EEREPELVHE ARAEGREGKG SASVTGKEER RQASLQMASE SKGSGETLAD 2400
AGVEPRQGGF FRRRFPGLTR VQAAVSEFCG SRKHEDVSGI SWKTLLTRAV AEATELKWQL 2460
ETGDAEKAAG GFAEEAGRTS YTRRGGSEMS EGDKGAGDAV HKRCLSSRLA ERFNKEKGDR 2520
EDSERGLLAG PLIEASANAS CRSQFVGTDR RNPSDPLELS SSRCGQTGEA HGGDKEASFV 2580
VNTVDDDPHA DPPTSLSADQ VNQVAFLLWR QLAFHKRCES VGRKGRVMFQ HPNGFQRTVA 2640
SYERANSHVS SSHSGNVVGL RCAASESTTV SASPSSSASP SSPAAYSPPC SSRSPRHSSL 2700
PSSSSSPPSS SLSASSSSSP PSPSSSSYAS SSLSSSSSSA PATSSAHPSG SPLLPVPVCR 2760
SSGASLGGEC LWCAVAGRLS ALDVAAIQGV MVCLRAKKMV SKLSGNVGCS 2810 
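
The acetylation site in the Lysine Modification field (position 1756, peptide SAVAGGAKRGTVELA) can be cross-checked against the sequence above. The following is a minimal sketch, not part of the CPLM record. It assumes that the number ending each sequence row is the 1-based position of that row's last residue, and that the peptide is a window of 7 residues on each side of the modified lysine (inferred from its 15-residue length). The helper names `parse_row` and `window` are hypothetical.

```python
# One 60-residue row copied from the Protein Sequence field. The trailing
# number (1800) is assumed to be the 1-based position of the last residue.
row = "WRLPFACRSA VAGGAKRGTV ELAGDNSSGG TPVVMPVVME LGADEETDDV SRRRSFAVDR 1800"


def parse_row(row):
    """Return (1-based start position, residues) for one sequence row."""
    *blocks, end = row.split()
    residues = "".join(blocks)
    start = int(end) - len(residues) + 1
    return start, residues


def window(start, residues, position, flank=7):
    """Return the residues within `flank` positions of a 1-based position."""
    i = position - start  # 0-based index inside this row
    return residues[max(0, i - flank): i + flank + 1]


start, residues = parse_row(row)
site = 1756  # position listed in the Lysine Modification field

residue = residues[site - start]
peptide = window(start, residues, site)

print(start, residue, peptide)
```

Under these assumptions the row starts at position 1741, the residue at 1756 is a lysine (K), and the extracted window matches the peptide listed in the record.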
Gene Ontology
  
Interpro
 IPR021910; DUF3522.
 IPR013032; EGF-like_CS. 
Pfam
 PF12036; DUF3522 
SMART
  
PROSITE
 PS00022; EGF_1 
PRINTS