CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-035330
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Pcf11 
Protein Synonyms/Alias
  
Gene Name
 Pcf11 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
425       EDVKEKRKTAEKKDK  acetylation  [1]
429       EKRKTAEKKDKDEHM  acetylation  [1]
723       PRYEDSDKPFVDGPA  acetylation  [1]
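Each listed peptide appears to be the 15-residue window centred on the modified lysine (seven residues on either side), with positions counted from 1. The Python sketch below checks two of the sites under that assumption. The `site_window` helper and the `FRAGMENT` excerpt (residues 411 to 440, copied from the Protein Sequence field of this record) are illustrative names and are not part of CPLM.

```python
# Residues 411-440 of the Protein Sequence field in this record (excerpt).
FRAGMENT = "DKAEGKDEDVKEKRKTAEKKDKDEHMKSSE"
FRAGMENT_START = 411  # 1-based position of the first residue in FRAGMENT


def site_window(seq, position, seq_start=1, flank=7):
    """Return the residues within `flank` of a 1-based `position`.

    Assumes the window lies fully inside `seq`; raises ValueError otherwise.
    """
    idx = position - seq_start          # convert to a 0-based index into seq
    lo, hi = idx - flank, idx + flank + 1
    if lo < 0 or hi > len(seq):
        raise ValueError("window extends beyond the supplied sequence")
    return seq[lo:hi]


for pos in (425, 429):
    window = site_window(FRAGMENT, pos, seq_start=FRAGMENT_START)
    # The middle residue (index 7) should be the modified lysine, K.
    print(pos, window, window[7])
```

Run against the full 1551-residue sequence with `seq_start=1`, the same helper would be expected to reproduce the peptide for position 723 as well.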
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1551 AA 
Protein Sequence
MSEQTPAEAG AAGAREDACR DYQSSLEDLT FNSKPHINML TILAEENLPF AKEIVSLIEA 60
QTAKAPSSEK LPVMYLMDSI VKNVGREYLT AFTKNLVATF ICVFEKVDEN TRKSLFKLRS 120
TWDEIFPLKK LYALDVRVNS LDPAWPIKPL PPNVNTSSIH VNPKFLNKSP DEPSTPGTVV 180
SSPNISTPPI VPDIQKNLTQ EQLIRQQLLA KQKQLLELQQ KKLELELEQA KAQLAVSLSV 240
QQETANLGPG SAPSKLHVPQ IPTMAVKTPH QVPVQPDKSR VGPSLQMQDL KGTNRDPRLN 300
RMSQHSSHGK EQSHRKEFVM NTINQSDIKT SKTVPSEKLN SSKQEKSKSG ERITKKELDQ 360
LDSKSKSKSK SPSPLKNKLS HTKDLKNQES ESMRLSDMNK RDPRLKKHLQ DKAEGKDEDV 420
KEKRKTAEKK DKDEHMKSSE HRLIGSRSKI INGVVQKQDM VTEELEKQGT KPGRSSTRKR 480
SRSRSPKSRS PIVHSPKRRD RRSPKRRQRS MSPTLAPKAG KMRQSGLKQS HMDEFPPPSR 540
EERNIKRSAK QDVRDPRRLK KMDEDRPQET AGQHSMKSGG DPKENIENWQ SSKSAKRWKS 600
GWEENKSLQQ GDEHSKPPHL RHRESWSSTK GILSPRAPKQ QHRLSVDANL QIPKELTLAS 660
KRELLQKTSE RLASGEITQD EFLVVVHQIR QLFQYQEGVR EEQRSPFNDR FPLKRPRYED 720
SDKPFVDGPA SRFAGLDTNQ RLTALAEDRP LFDGPGRPSV TRDGPTKMIF EGPNKLSPRI 780
DGPPTPGSLR FDGSPGQIGG GGPMRFEGPQ GQLGGGCPLR FEGPPGPVGT PLRFEGPIGQ 840
GGGGGFRFEG SPSLRFEGSA GGLRFEGPGG QPVGGLRFEG HRGQPVGGLR FEGPHGQPVG 900
SLRFDNPRGQ PVGGLRFEGG HGPSGAAIRF DGPHGQPGGG IRFEGPLLQQ GVGMRFEGPH 960
GQSVAGLRFE GHNQLGGNLR FEGPHGQPGV GIRFEGPLVQ QGGGMRFEGP VPGGGLRIEG 1020
PLGQGGPRFE GCHSLRFDGQ PGQPSLLPRF DGLHGQPGPR FERTGQPGPQ RFDGPPGQQV 1080
QPRFDGVPQR FDGPQHQQAS RFDIPLGLQG TRFDNHPSQR IESFNQTGPY SDPPGNAFNV 1140
PSQGLQFQRH EQIFDTPQGP NFNGPHGPGN QNFPNPINRP SGHYFDEKNL QSSQFGNFGN 1200
LPTPMSVGNI QTSQQVLTGV AQPVAFGQGQ QFLPVHPQNP GAFIQNPSGG LPKAYPDNHL 1260
SQVDVNELFS KLLKTGILKL SQPDSATAQV NEAVAQPPPE EEEDQNEDQD VPDLTNFTIE 1320
ELKQRYDSVI NRLYTGIQCY SCGMRFTTSQ TDVYADHLDW HYRQNRTEKD VSRKVTHRRW 1380
YYSLTDWIEF EEIADLEERA KSQFFEKVHE EVVLKTQEAA KEKEFQSVPA GPAGAVESCE 1440
ICQEQFEQYW DEEEEEWHLK NAIRVDGKIY HPSCYEDYQN TSSFDCTPSP SKTPVENPLN 1500
IMLNIVKNEL QEPCENPKVK EEQIDAPPAC SEESVATPTE IKTESDTVES V 1551 
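The sequence above is printed in groups of ten residues with a running count at the end of each line. A minimal Python sketch for turning such a listing into one continuous string follows, assuming only letters are residues and everything else (spaces, counters, line breaks) can be discarded. `parse_sequence_block` is a hypothetical helper name, not a CPLM or UniProt function.

```python
import re


def parse_sequence_block(text):
    """Join a grouped sequence listing into one string of residue letters.

    Assumes each line holds space-separated residue groups followed by a
    running residue count, as in the Protein Sequence field above.
    """
    # Keep alphabetic runs only; digits and whitespace are dropped.
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


first_line = "MSEQTPAEAG AAGAREDACR DYQSSLEDLT FNSKPHINML TILAEENLPF AKEIVSLIEA 60"
last_line = "IMLNIVKNEL QEPCENPKVK EEQIDAPPAC SEESVATPTE IKTESDTVES V 1551"

print(len(parse_sequence_block(first_line)))
print(len(parse_sequence_block(last_line)))
```

The line before the last one ends at residue 1500, so the letters counted on the last line can be checked against the stated protein length of 1551 AA.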
Gene Ontology
  
Interpro
 IPR006569; CID_dom.
 IPR008942; ENTH_VHS.
 IPR006903; RNA_pol_II-bd. 
Pfam
 PF04818; CTD_bind 
SMART
 SM00582; RPR 
PROSITE
 PS51391; CID 
PRINTS