CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-024842
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 CPSF (Cleavage and polyadenylation specific factor), subunit A, putative 
Protein Synonyms/Alias
  
Gene Name
 PFC0780w 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Plasmodium falciparum (isolate 3D7) 
NCBI Taxa ID
 36329 
Lysine Modification
Position
Peptide
Type
References
899KEMHGLGKYCMNYNKacetylation[1]
Reference
 [1] Extensive lysine acetylation occurs in evolutionarily conserved metabolic pathways and parasite-specific functions during Plasmodium falciparum intraerythrocytic development.
 Miao J, Lawrence M, Jeffers V, Zhao F, Parker D, Ge Y, Sullivan WJ Jr, Cui L.
 Mol Microbiol. 2013 Aug;89(4):660-75. [PMID: 23796209
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2870 AA 
Protein Sequence
MSPYHFYNNV IDSKSVRSSV CCNIKGNEKK YLIYACNNHL NVCCIDKNGN TDDYSEHVLF 60
AEVLELREYV PEKLVHSTYN KEKVKSYLFV LTRKYVLLLL EYDVKENDFI TLSKINLCEL 120
NGLHLEEDVI FLLDERHKTI LFYGYKNILK YIYLDYDNFL NLNNVYTMRI DESLIIDIAF 180
LGTHTMGCNK QLRNKQDDDD KINYGDNKNN YGDNKNNYGD NNNNYGDYNN NYDDNRSNYD 240
DDKSNYDDDK SNYDDDKSNY DDDKSNYDDD KSNYDDDNKY YDRVHKSGNF LCDDIYKNEM 300
LFNKEDLFLE HQNIYKKIKK EMSTDDQDDV PLINKTLYSI GNKSCLEIDK LLKKKSYDFV 360
GTMNYDNNDC RPNSINKLKV EERGKRNYDV RNCNNFDHIS GGLLSQHPNS FESYYPKYNE 420
YNKNIRRMLD GYYNIKGRQQ DDIKLAFHDS CNSTYNKNND EYMYSTICIL YDYKKSDTEF 480
YERYIRIIPL CRMNDASIKF LGDDNSEFDF YDYSDNDKYY AYKYNKRNRD EDNGKDNHKD 540
NHKDNHKDNH KDNHKDNDKD NDRDNLKYKY DNYEYNIYSN NKNINKKNSF VNNILDKGCL 600
LPKYYKPLHV DPSINKILCL SKYKILLIGF QFINYINIVK EKRRSFFLSS EFRTITYIES 660
INMNKYILSD DYGDLFILSF LPKKKNKDIF DEEDIDVHNN MEKKNDKYED DHMKDIQDKH 720
DIEKKNEPYK GRYYDFYCKY GGNDTFMGDI SSMRVQFLGT CSRANVITRI YPDIIFLGSQ 780
ISDSYLLRMH YFPIYEREDF EPVEYSPYCQ MLKEDETVSK EMNMYDVNKY VEDIWNKNKN 840
VMVPIMNNNS RNIPHMENMK SMNNIYNVNN INNIFNFPAD LNNLYNNHYK NKEMHGLGKY 900
CMNYNKTYFC TYPYNDNNIG RNKYHRDKND YYTSIINTSQ GDVNYSPNTN YIYDYNEGNK 960
KNKIEKKKFY IEILSVIQNM GPILDMCVVK NKNNEYEIIT CNSYGRTGCV SIIQSGLKTN 1020
ITCDLNFNKL NNFFVVKYVI YLKKKKKNTH THIHTHATIT NDAMKKENDH KDCIEDNKDK 1080
VNKESLNINE NRNNILINDD EGMMNDVCND DGTNLPRKKQ KTKNKKNKSV SEEPLHFEVF 1140
VSLNNKKNIK IKNINLCFLN KYYFESEKEI NVLKYSNFIY FHIFMICVTY ANQTKIVGVS 1200
RDIFKRRKIK KDSSSETLIN NKDNKFDNGT NHYPDNIMNC FTSDSHNNNN NNNNNHMFGM 1260
QNGRDIQDDK KNNLNNPNFL NEEKKGDMKN KIPSESFLKD VCNEIFLCEY ENTDIDMYSN 1320
TLYFNIIKNH PYLIQICNNH IRLLCCLSLK LIYNLQVAYV YNYSIYNDYI YIYCKEGIKI 1380
YGIIENYIIH IYTYIIKENI SSWSLYKNLL ACVFNNNEVV IYNINMNTLK EIKEENHKRE 1440
REAIDLDINV EQGKGKKLHH IFNIVNYYKP EMSFFVYISD VELIMMNDNM YLFLGYSHGN 1500
IEYFIMCAYN KKGKKKNMGA ECMCRNDDNK NGDKKEKKMK KKEHKKKSST FSNNSDYSDN 1560
SNNSNNSDYS DYSDYSYGIN NSYEYVPENS NLKNFLIHTD YKGLVRKRKE CNTLFKLHKE 1620
YLKRKELILK YIYKVCKDNV CNEDTGSRCK YTYKLNMRKD LKKEKKIKKR KRNVLKYLSQ 1680
YNMNVFDFFD FDKISFNNMN ELYKICNRSK DDVIYIDNEY YYNVNIESDN FVSLQKFLES 1740
DNEINKMDTG SAVVDVASDD MIGSSCNNNI KENVGDNIKE NVGDNIKENV GDNIKENVGD 1800
NIKENVGDNI KENVGDNIKE NVGDNIKEYV GDNIKENVGD NIKENVGDNI KENVGDNIKE 1860
NVGDNIKENV GDNIKENVGD NIKENVGDNI KENVGDNDNV SNNSYTNYSN SNTYDISHAY 1920
NNNCSDKNTL HFSQNNKGKK KGMIKNSNSV KCNVKKEVDN NNNNNNNKKI LVKGEKNVKI 1980
HHKRARYFFN FFQVYGILSE ESSDYDSSVD IMTFKRSIKP KEKYNDDTYI HRKKNHRKRI 2040
VYNTEGTIFD ELHYGTLSPP SLYIHKYNKS IDKFSHYINK YSISDDDMLS SYDDNDDEVN 2100
PLDVQDTRKN KKNKIKKKNI KYIKNIKYIK NIKNIKNNKT HRKAGRQKRD ESITSTDSSS 2160
IYNSKMDECY QYDDDDDDDD DDNENFEEES QGDDHIFEQN NILRTFIKNE NKNKNIKKKN 2220
NNNKKKKASN DNMNVFNNTS NVNHNDIKKR KRDIKNLLYK KLKRQCKDIY LQENCKSDCS 2280
KYCNSNNMSS EIPSSFDNLN NILFDEKIFL KNNILKSEKI FLINKRKINV CNDTIKFKKF 2340
VKVFSEKKKI DINQSNIIKK YNFLFVCCES PIIIYSDLKK KINVSKLSLK NIYIVDIFND 2400
FNYLNPFHNF LSFKKKNQNN FYFIFYDGSN IHISPLNQIK KTFLKKIPFH RTVEKIAYHS 2460
DTGLLIAACP SEEKHKTNEM MKQIICFFDP YHDSIKYTYI IPSKYTVSTI IIYDNEKLMK 2520
SNFDVTSFIF VGTCNSNEKY TEPTSGHIHI FIAKKKANIF EIKHIYTHNI NYGGVTNLVP 2580
YDDKIVATIN NMVVILDINN LIIKYEAFMD PQNLQPKIEG NNAIVELVSF TPSSWIMTVD 2640
VYGDYIVVGD IMTSVTILQY DYENSQLFEV CRDYSNIWCT SLCALSKSHI VVSDMDANFI 2700
ILQKSKFKYN DEDSYKLSSV SLFNHGSIIN KMLPLSNTNL IEEDYDKRNI LTKNDGILCA 2760
SSEGSISVLI PFSSFANFKK ALCIEIAITD NISSIGNLSH NAYREYKVNF RSKHCKGIVD 2820
GELLKMFFHM SFEKQYKTFI YAKWIAKKIN CKFGSFNNFI LDLENMCSFL 2870 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0003676; F:nucleic acid binding; IEA:InterPro. 
Interpro
 IPR004871; Cleavage/polyA-sp_fac_asu_C. 
Pfam
 PF03178; CPSF_A 
SMART
  
PROSITE
  
PRINTS