CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031840
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Calpain family cysteine protease domain-containing protein 
Protein Synonyms/Alias
 Calpain, putative 
Gene Name
 TGME49_093820, TGVEG_013970 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
1916VVSQYSQKDEFNFTIacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Hydrolase; Protease; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2196 AA 
Protein Sequence
MKGFASSFSL LPPTFPVSSA SACLEKKQAI YPPEGAAPGG ASFFAAFPSP VFEVGTLSNF 60
FSSLSPLSPL TSGDHFYSSS HSSPRSTPSS GNAVPSLHSV PPAHAPSGSS QCTDSLPTAR 120
GSCSSLQTKV PDTPQRRRCS LWTAAPSHVW EERWRVDCQA GWERAIEESL DLRSEKEDTA 180
LYREQTPPCI ETAVSAGCEG HRRSNSSSPR GASPSVASDR PLASLCVGEE LSRAAASPIR 240
APEASCSPSS ICPSALPSTR EAGPLSPGVA REFSPSVARL RRHSRFRLQD ASGSQKGAAE 300
ARDARKLRGF FGTFLADGSG DTRVDPERDP SKPPRHLETR LFNPALRFLS ERRIQASSSS 360
CSVFLSSPSP FRGNYLERGV PDCGRDSSPA VVAFGTAPYT RRLSPWSPLR SPNSASACAQ 420
PPRRRNGIAA SSARAPSFSP SLSSHSSSLS SYRSSVPSLS PSHVVSSRLS LSSVSSAFAR 480
PFRRVLRRPE TWSPLPSEGC PAWRLSPATP EDSLPLSGSL SAASAPSGVR LTASALSPER 540
RPDSLGIASA PPNCLRKMGC QCSTQRNVRR PESPSKSQSD SVVRRSEGTR AGLSSTPATV 600
EGTPREAKRQ GSRGSEARCE AADGEPRKRG GIRAREADEP GKEEEEVSAD AASEKPVDGT 660
PGGVSSPEGR AGQGTQEPGK KCGVRGDTSH ASREAGTWRL EERENHQTEG SSPPLPWQSE 720
QKDLERSGVG EGDAPPLFSR LASSGLTALE KDVHAGHLEA MESRVALSAA GGRKRRGEEV 780
KERGDSEAHP SEQETLMTSP ARLASTGTTR ATESTREAPP AGPALGTCSG RGCPSPETAT 840
SGPGKGSLRS CASQDGSPAA IFSSPYKRRS SGRQGRGDEA GSAAPIAGAC SRPKDGKATP 900
GVASSSFASG DGCSDGGVDP DRGSAARRRD SSRSLVRRRV SSASPSRPEG PSSGRVSGSR 960
PRHTQSGCSG IVSQGSAFPA ALSQWKKGAC VSNKCADSDA LCCLSPLAYS QQKLLPSHLS 1020
GSLAVSEILL ESSIVGRYVM LPWNEEDGDI RQNFYVHCPP PVALCSPLSQ SVAMHASTPP 1080
PFPLGPSRRE AVAACPVSPP CGRRGRVNRR SEPERGDGRG RDELESGMYA GAKLAPEGLV 1140
KSTQESPVNG RHLASSACGN ARAGREEGGR TTEKSRWTKR NYPAPAQGDV TESLESLSYG 1200
FGGFPQESRE RRRRNTGGGS REACNSRAQG RETRKTEDGA VSSRFEGEAR SHPCPRPPAC 1260
CTSIPLASAS SPTGGSSFAG ASVSDPVVRT EAEGEVSPGE LGVPGDSAEA PACDLATLSP 1320
CGDPGALGRC EEWGPDGKWT DPDGVMSLSR KQQQKFAAWK RLSEIVKDPV VIQDVPNSRA 1380
IRQGFVGDCS FLSSLAVLSE FERKHNVPVL SGLLYPQGTI PAVSEAGRRP QDGGERVGPI 1440
FNPRGMYACR LYFNGVPRKV LIDDYVPVRK DNKLLAAHSS NKSELWVTLL EKAFVKLMGG 1500
SYSMQGSNPG ADLYHLTGWI PETIPFRSDV HTGSPASHKP IVTGRENEDD EVVKDPKWTQ 1560
IWTQLYLGLT AGRCVACLGT SEVSDAAPSG LDFPEGVSVS SGIVARHAYS ILNYAEIEGH 1620
RLLYVKNPWG CVRWKGKFSP SDKASWTQNM KEKLGYDPAT AAKSDCGCFW IEWLDVVRWF 1680
SHLYVCWDNS CFPYQAEIHS KWERSPFIEN SSLADDSHMV AFNPQFHLRI SPRPEDFASS 1740
ALAGPYPFLS FPSLTSPASA ASLPMPSNSP PGTPHTASLS PVPSFSQENQ PVELWILLSR 1800
HVRERQKDLA TKYLAIHLHA ASERATCPPP PAKQGVYSNG ECTLVKLRVH PHVFRDLFAR 1860
DASDEGSRFP SDNSAKREEK GKKEAAGGRP PLPGFEGHQL LDASEFVLVV SQYSQKDEFN 1920
FTIKVYGHVH SCLTQLPPLL PTDAQSIYFK NSWGPSTAGG CSNDLWRYFT NPHIRLKVPQ 1980
PCDAIFFLES PQEHSVNLRL FKNRVATARL LRTGKALSTG PYRAGCCMLK ARLEATTYTI 2040
IPSTFRPDDF DTFQLSLHVP GNVAKPRPVL LPQPYAIPPP SSLFYRVVDG RKCTRQVWAR 2100
VALQVQGPTL VALRLQMPGP PEPGDVFPSL TVYRYTETAE REREVESERS AERKNQKKLQ 2160
FVIRSDLDGG LSEDGHTASS AAQDLFQKQV RTRKRR 2196 
Gene Ontology
 GO:0005622; C:intracellular; IEA:InterPro.
 GO:0004198; F:calcium-dependent cysteine-type endopeptidase activity; IEA:InterPro.
 GO:0006508; P:proteolysis; IEA:UniProtKB-KW. 
Interpro
 IPR022684; Calpain_cysteine_protease.
 IPR022682; Calpain_domain_III.
 IPR001300; Peptidase_C2_calpain_cat. 
Pfam
 PF01067; Calpain_III
 PF00648; Peptidase_C2 
SMART
 SM00230; CysPc 
PROSITE
 PS50203; CALPAIN_CAT 
PRINTS
 PR00704; CALPAIN.