CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-031816
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Pre-mRNA splicing factor PRP8, putative 
Protein Synonyms/Alias
  
Gene Name
 TGME49_031970, TGVEG_023800 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Toxoplasma gondii 
NCBI Taxa ID
 5811 
Lysine Modification
Position
Peptide
Type
References
166GKSPFYGKDVRRMETacetylation[1]
Reference
 [1] Protein intrinsic disorder in the acetylome of intracellular and extracellular Toxoplasma gondii.
 Xue B, Jeffers V, Sullivan WJ, Uversky VN.
 Mol Biosyst. 2013 Apr 5;9(4):645-57. [PMID: 23403842
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2538 AA 
Protein Sequence
MLPNGSGGGF RPPPPGGSGV HPGPPPGHPC SPAGHLPPGL PLPPPPGFLG FPGSLGPPSS 60
SPPPPHPAGL FVPPQQGPPA PGASSVPGSS QRMPPGGMFM PPPGFAGFQQ PGGARPMHLP 120
PHAQARGGSG APLGPKGSPG GAAGAPGLNG ATPNPPASGK SPFYGKDVRR METSSPYAGA 180
PQAKIESGFG APQPSVFSRY PPPLPGPGQV SAGPPKDLRA KVKTAEELLA EKARKWQQLN 240
TKRYGEKSKL GTGQDTTKEE MPPEHLRKLI KDHGDMTSKK FRHDKRVYLG ALKYVPHAVY 300
KLLENMPMPW EQVRNVAVLY HITGAITFVN EIPWVVEPIY LAQWGTMWIM MRREKRDRRH 360
FKRMRFPPFD DEEPPLDYGD NVLDVEPLEA IQMQLDEEED APVIDWLYDS KPLQHDPRYL 420
AGPSYRHWRL EIRQLSVLYR LANQLVSDLQ DKNYFYLFNL ESFYTAKALN MAIPGGPKFE 480
PLFRDLHEED EDWNEFNDIN KIIIRQQIRT EYKIAFPYLY NNRPRKVAIG VYREPTCSFV 540
KPEDPDLPAF YYDAIVNPLP AYKSGSSTTT QQDFSVFEDF VLPREIQPLL QDAPLSTDTT 600
VDGIMLYWAC RPFNLRSGRT RRSVDVPLVQ SWYREHVPTN YPVKVRVSYQ KLLKCWVLNH 660
LHQRPPKSLK KRYLFRVFKS TKFFQCTELD WVEVGLQVAR QGYNMLNLLI HRKNLNYLHL 720
DYNFNLKPVK TLTTKERKKS RFGNAFHLCR EILRLTKLVV DSHVQYRLGN VDAFQLADGL 780
QYTFAHVGQL TGMYRYKYRL MRQVRMCKDL KHLIYYRFNT GPVGKGPGCG FWAPVWRVWL 840
FFLRGVLPLL ERWLGNLLAR QFEGRVSKGV AKTVTKQRVE SHFDLELRAA VMHDILDTMP 900
EGVKANKART ILQHLSEAWR CWKANIPWKV PGLPAPVENM ILRYVKMKAD WWTNAAYYNR 960
ERIRRGATVD KTVCKKNLGR LTRLWLKAEQ ERQHAYLKDG PYITGEEAVA IYTTAVHWLE 1020
SRKFTHIPFP PLNYKHDTKL LILALERLKE LYSVKSRLNQ VQREELGLIE QAYDNPHEAL 1080
SRIKRHLLTQ RAFKELTLEF MDLYSHLVPI YEVDPLEKIT DAYLDQYLWY EADARHLFPN 1140
WVKPADSEPP PLLVYKFCQG INNLTDVWKT SDGEAVVLLE TKYEKVYEKI DLTLLNRLLR 1200
LIVDHNIADY ITAKNNVNIN FKDMNHINSF GLIRGLQFAS FVFQYYGLIL DLLVLGLTRA 1260
TELAGPPNLP NDFLTFTDVE TETRHPIRLF CRYIDRFWIV FRFEKEEARD LVQRYLTENP 1320
DPNNENIVGY NNKTCWPRDC RMRRMKHDVN LGRAVFWEIE NRLPRSVSTL EWSNSFASVY 1380
SKDNPNLLFA MCGFEVRILP KIRTYTEEFS QREGVWKLQN EVTKEMAAQA FLKVGDEGMK 1440
HFENRVRQIL MASGATTFTK IANKWNTTLI SLMTYFREAV IHTEALLDLL VKCENKIQTR 1500
IKIGLNSKMP SRFPPVVFYT PKELGGLGML SMGHILIPQS DLRYSKQTET GITHFRSGMT 1560
HEEDQLIPNL YRYIQTWESE FIESQRVWAE YALKRSEAAA QNRRLTLEDL EDSWDRGIPR 1620
INTLFQKDRH TLAYDKGWRV RQDFKQYQQM KAHPFWWTHQ RHDGKLWNLN NYRTDMIQAL 1680
GGVEGILEHT LFKGTYFPTW EGLFWEKASG FEESMKYKKL TNAQRSGLNQ IPNRRFTLWW 1740
SPTINRANVY VGFQVQLDLT GIFMHGKIPT LKISLIQIMR AHLWQKVHES IVMDLCQVFD 1800
LELDSLEIEM VQKETIHPRK SYKMNSSCAD ILLFAAYKWQ ISKPSLLADG KDVMDGTTTS 1860
KYWLDIQLRW GDFDSHDIER YCRSKFLDYT TDNMSIYPSP TGVLLGVDLA YNLHSGFGNW 1920
FPGLKPLMQR AMNKIMKSNP ALYVLRERIR KGLQLYSSEP TEPYLTSQNY GELFSNQTIW 1980
FVDDTNVYRV TIHKTFEGNL TTKPVNGAIF IFNPRTGQLF LKIIHTSVWA GQKRLTQLAK 2040
WKTAEEVAAL IRSLPVEEQP KQLIATRKGM LDPLEVHLLD FPNIVIKGSE LNLPFQAIMK 2100
VEKFGDMILK ATQPEMVLFN MYDDWLKSIS SYTAFSRLLL LLRAMHVNTE RTKIILRPNK 2160
TTVTQSHHIW PSLTDEEWIH VEVALKDLIL ADYGKKNNVN VASLTQSEIR DIILGMEISP 2220
PSLQRQQIAE IEAQTKDVSQ VTATTTRTVN AHGDEIIVST QSPHEQQVFS SKTDWRIRAI 2280
SAASLHLRTH HIYVNSDDIK ESGYTYVLPK NLLKKFICVS DLRTQIAAYL YGVSPPDNEQ 2340
VKEVRAMVFV PQVGSHQSVS LPQALPEHTY LADLEPIGWI HTQPNENPQL SPQDVTAHAK 2400
ILNENKAWDA ASTVIITCSF TPGSCSLTAY KLTPQGYQWG KSNKDTGPNP QGYLPTHYEK 2460
VQMLLSDVFV GFFMVPEGGL WNYNFMGVKH SPSMRYNLVL GTPKEFYHEQ HRPSHYLQFT 2520
QMETATETAG ADREDLFA 2538 
Gene Ontology
 GO:0005681; C:spliceosomal complex; IEA:InterPro.
 GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro. 
Interpro
 IPR000555; JAB1_Mov34_MPN_PAD1.
 IPR012591; Pre-mRNA-splicing_factor-8.
 IPR012984; PRO_C.
 IPR012592; PROCN.
 IPR021983; PRP8_domainIV.
 IPR019581; Prp8_U5-snRNA-bd.
 IPR019580; Prp8_U6-snRNA-bd.
 IPR019582; RRM_spliceosomal_PrP8. 
Pfam
 PF01398; JAB
 PF08082; PRO8NT
 PF08083; PROCN
 PF08084; PROCT
 PF12134; PRP8_domainIV
 PF10598; RRM_4
 PF10597; U5_2-snRNA_bdg
 PF10596; U6-snRNA_bdg 
SMART
 SM00232; JAB_MPN 
PROSITE
  
PRINTS