CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-027325
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Merozoite surface protein 1 
Protein Synonyms/Alias
  
Gene Name
 MSP1 
Gene Synonyms/Alias
 PFI1475w 
Created Date
 July 27, 2013 
Organism
 Plasmodium falciparum (isolate 3D7) 
NCBI Taxa ID
 36329 
Lysine Modification
Position
Peptide
Type
References
527KFNNNFNKDVVDKIFacetylation[1]
Reference
 [1] Extensive lysine acetylation occurs in evolutionarily conserved metabolic pathways and parasite-specific functions during Plasmodium falciparum intraerythrocytic development.
 Miao J, Lawrence M, Jeffers V, Zhao F, Parker D, Ge Y, Sullivan WJ Jr, Cui L.
 Mol Microbiol. 2013 Aug;89(4):660-75. [PMID: 23796209
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Merozoite; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1720 AA 
Protein Sequence
MKIIFFLCSF LFFIINTQCV THESYQELVK KLEALEDAVL TGYSLFQKEK MVLNEEEITT 60
KGASAQSGAS AQSGASAQSG ASAQSGASAQ SGASAQSGTS GPSGPSGTSP SSRSNTLPRS 120
NTSSGASPPA DASDSDAKSY ADLKHRVRNY LFTIKELKYP ELFDLTNHML TLCDNIHGFK 180
YLIDGYEEIN ELLYKLNFYF DLLRAKLNDV CANDYCQIPF NLKIRANELD VLKKLVFGYR 240
KPLDNIKDNV GKMEDYIKKN KTTIANINEL IEGSKKTIDQ NKNADNEEGK KKLYQAQYDL 300
SIYNKQLEEA HNLISVLEKR IDTLKKNENI KKLLDKINEI KNPPPANSGN TPNTLLDKNK 360
KIEEHEEKIK EIAKTIKFNI DSLFTDPLEL EYYLREKNKK VDVTPKSQDP TKSVQIPKVP 420
YPNGIVYPLP LTDIHNSLAA DNDKNSYGDL MNPHTKEKIN EKIITDNKER KIFINNIKKK 480
IDLEEKNINH TKEQNKKLLE DYEKSKKDYE ELLEKFYEMK FNNNFNKDVV DKIFSARYTY 540
NVEKQRYNNK FSSSNNSVYN VQKLKKALSY LEDYSLRKGI SEKDFNHYYT LKTGLEADIK 600
KLTEEIKSSE NKILEKNFKG LTHSANGSLE VSDIVKLQVQ KVLLIKKIED LRKIELFLKN 660
AQLKDSIHVP NIYKPQNKPE PYYLIVLKKE VDKLKEFIPK VKDMLKKEQA VLSSITQPLV 720
AASETTEDGG HSTHTLSQSG ETEVTEETEE TEETVGHTTT VTITLPPTQP SPPKEVKVVE 780
NSIEHKSNDN SQALTKTVYL KKLDEFLTKS YICHKYILVS NSSMDQKLLE VYNLTPEEEN 840
ELKSCDPLDL LFNIQNNIPA MYSLYDSMNN DLQHLFFELY QKEMIYYLHK LKEENHIKKL 900
LEEQKQITGT SSTSSPGNTT VNTAQSATHS NSQNQQSNAS STNTQNGVAV SSGPAVVEES 960
HDPLTVLSIS NDLKGIVSLL NLGNKTKVPN PLTISTTEME KFYENILKNN DTYFNDDIKQ 1020
FVKSNSKVIT GLTETQKNAL NDEIKKLKDT LQLSFDLYNK YKLKLDRLFN KKKELGQDKM 1080
QIKKLTLLKE QLESKLNSLN NPHNVLQNFS VFFNKKKEAE IAETENTLEN TKILLKHYKG 1140
LVKYYNGESS PLKTLSEVSI QTEDNYANLE KFRVLSKIDG KLNDNLHLGK KKLSFLSSGL 1200
HHLITELKEV IKNKNYTGNS PSENNKKVNE ALKSYENFLP EAKVTTVVTP PQPDVTPSPL 1260
SVRVSGSSGS TKEETQIPTS GSLLTELQQV VQLQNYDEED DSLVVLPIFG ESEDNDEYLD 1320
QVVTGEAISV TMDNILSGFE NEYDVIYLKP LAGVYRSLKK QIEKNIFTFN LNLNDILNSR 1380
LKKRKYFLDV LESDLMQFKH ISSNEYIIED SFKLLNSEQK NTLLKSYKYI KESVENDIKF 1440
AQEGISYYEK VLAKYKDDLE SIKKVIKEEK EKFPSSPPTT PPSPAKTDEQ KKESKFLPFL 1500
TNIETLYNNL VNKIDDYLIN LKAKINDCNV EKDEAHVKIT KLSDLKAIDD KIDLFKNPYD 1560
FEAIKKLIND DTKKDMLGKL LSTGLVQNFP NTIISKLIEG KFQDMLNISQ HQCVKKQCPE 1620
NSGCFRHLDE REECKCLLNY KQEGDKCVEN PNPTCNENNG GCDADATCTE EDSGSSRKKI 1680
TCECTKPDSY PLFDGIFCSS SNFLGISFLL ILMLILYSFI 1720 
Gene Ontology
 GO:0016020; C:membrane; IEA:InterPro.
 GO:0009405; P:pathogenesis; IEA:InterPro. 
Interpro
 IPR024731; EGF_dom_MSP1-like.
 IPR010901; MSP1_C.
 IPR024730; MSP1_EGF_1. 
Pfam
 PF12947; EGF_3
 PF12946; EGF_MSP1_1
 PF07462; MSP1_C 
SMART
  
PROSITE
  
PRINTS