CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038001
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Cenpf 
Protein Synonyms/Alias
  
Gene Name
 Cenpf 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
81EICENLEKTRQKLSHacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2985 AA 
Protein Sequence
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK QKQKVEDGKT 60
EGADLKRENQ RLMEICENLE KTRQKLSHEL QVKESQVNFQ ESQLSSCKKQ IEKLEQELKR 120
CKSEFERSQQ SAEVSLNPCS TPQKLFTTPL TPSQYYICST YEDLKEKYNK EVEDRKRLEA 180
EVKALHAKKA SLPVSQATMN HRDIARHQAS SSVFSWQQDK TPSRLSSCSL RTPLRRDVSA 240
AHFLGEEVTP NKSSVQIGRG DCSGLPDDPH CSQPLHQAKA QNQELKSKMN ELELRLRGQE 300
KEMKDQVNKF QELQLQLEKT KVDLIEKEKI LNKTRDEVVR TTAQYEQAAA KCTALEQKLK 360
NLTEELSCHR QNAESAKRSL EQRVKEKEKE LREELSRQHQ SFQALDHEYT QMKTRLTQEL 420
QQAKHSLSVL QLELEKVTSV KQQLERSLEE IRHKFSRAEQ ALQASQLTEN ELRRSSEEMK 480
KENSLIRSQS EQRTREACRL EDELGKVKVC LSQSQSFAEE MRAKNTSQEI MLRDLQEKLN 540
QQENSLTLEK LKLALADLEG QRDCSQDLLK KREHHIEQLN DKLNKIEKEF ETLLSALELK 600
KKECEELKEE KNQISCWKSE NEKLINQIES EKEILLGKVN HLEASLKTQQ ISHDYSERVR 660
TLEMERENLT VEIRNLHSLL DSKMVEIETQ KQAYLELQQQ SESSDQKHQK EMENMCLKAN 720
KLTGQVESLE CKLQSLSGEV ETKDQQYQDL RMEYETMRDS LQARGSSLVT DEKNQRSSSA 780
FEEQPAVSHS FANLVGEKGS IYSERSDSSV DRGQSPENVA VLQSRVTLLE SSLESQNQMN 840
SDLQKQCEEL LQIKGEIEEN LIKADQIHQN FVAETNQRIG KLQEDAAVHQ NIVAETLATL 900
ESREKELQLL KEQLEAQQTE VQKLEKNNCL LEGALKELQL LTDTLSSEKK EMNSIISLRK 960
KDIEELTQAN GALKEVNEAL RQEKMNLLQQ HEEITRCVAE GERSIAELSG QYKQERLLLL 1020
QRCEETETVL EGLRGDYKAA QENNAKLECM LSECTALCEN KKNELEQLKE TFAKEQQEFL 1080
TKLAFTEEQN RKLVLELETE PQTVRSEITN INKHPVSETD ALGQESWSSK EEQKEKQKEV 1140
SNLTPENEQL MELTQTKHDY YHLEVEPVEN SVKATEEEIR KSSSQYQMDI DTKDISLDSY 1200
KAQLVQLEAL IKVMEVKLDR SEEEKNSLRQ ELQTIREELG TKTSQDTQSQ ARVGLKDCEV 1260
EAEEKYVSVL QELSTSQNEN VHLQCSLQTA MNKLNELGKM CEVLRVEKLQ LESELNDSRS 1320
ECITATSQMA AEVEKLVNEM KMLNHENALS HGELMKDTAD VEFDDKPNHT SVFLTPLDNS 1380
EQMTSSNKEV RVHFAELQEK FSCLQSEHKI LHDQHCEVSS KMSALRSYVD TLKAENSVLS 1440
MNLRTLQGDL VKEKEPGAED GHILPLSFCR TDSPTLTNFG ENSFYKDVLE QTGDTSHLSL 1500
EGNASANPCD VDEVSYSSLE EENLTEKEIP FASLRTVEEL EILCQVYLQS IKNLEEKIES 1560
QRIMKNKEIE ELEQLLSSER KELSCLRKQY LSEKEQWQQK LTSVTLEMES KLAEEKQQTK 1620
NLSLELEVAR LQLQELDLSS RSLLGTDLEG AVRGPNDGYA IKESEVYISE TTEKTPKQDT 1680
NQTCDNDVQQ DLCLETSVTE TETTRLTGDG CEEQPSKISC EAPAEDRTQD YSECISELFS 1740
TPSVLVPMDV LEDQGSIQNL HLQKDTSNEN LRLLPEVEDW DKKVESLLNE IKEADSKLSL 1800
QELQLKIKIA TCIELEKIVK HLKKEETDLS EKLESLPCNQ EVCPRVERSD LDFNLDMGDD 1860
ELFRESTKDD AANTEDNYKE KFLDMERELT RIKSEKANIE HHIVSVKANL EVVQAEKLCL 1920
ERDTESKQKV IVDLKGELFT VISERNRLRE ELDNVSKESK ALDQMSKKMK EKIEELESHQ 1980
RESLHHIGAV ESEVKDKAEL IQTLSFSVDE LTKDKAHLQE QLQNLQNDSQ GLSLAIGELE 2040
IQIGQLNKEK ESLVKESQNF QVKLTESECE KQTISKALEV ALMEKGEFAM RLSSTQEEVH 2100
QLRQGIEKLS VRIEADEKKH LSAVAKLKES QRESDSLKDK VENLERELEM SEENQELAIL 2160
DSENLKAEVE TLKAQMDEMA KSLRVFELDL VNVRSERENL AKQLQEKQSR VSELDELCSS 2220
LRSLSEEKEQ ARVQMERDSK SAMLMLQTQL KELWEEVAAL YNDQETLKAQ EQSLDQPGEE 2280
VHLLKSSIQK LKVHIDADKK KQCHILEQLK ESKHHADLFK DRVENLEQEL MLSEKNKEHL 2340
IFQAENSKAE IQTLKTEIQT MDQNLQDLEL ELTNTRSEKE NLMKELKNEQ EQISKLETIN 2400
SSIERLLKDK EAEXVQVKEE ARITVEMLQT QLKGSNETGG SLCNDQEAYK TKEQNLGSRV 2460
QTLELEKAQL LQDLGEAKNK YIIFQSSVNA LTQEVEAGKQ KLEKEEEEVR TLKEQLKGQE 2520
QLVCKLARVE GEQQLCQKQK LELRSLTMEL EQKVKVLQSE NDALQTTYEA LQSSYKSLEG 2580
ELGLIKMEKM ALVERVNTMT GKEAELQREL HDVEQKSTQL KEEYSKEKTR LTEDLEVVME 2640
ELKNTKVAHL KNVNHLEKEF QRAQGKIKLL LKSCKQLEGE KKMLQKELSQ LEAAQKQRAG 2700
SLVDSNVDEL MTENKELKET LEEKIKEADK YLDKYCSLLI SHEELEKTKE ILEIQVARLS 2760
SRQNKQDLQS SPLLDSSVAG PSPSTSVSER KSTSGLNKTL GKRQRSSGIG ENGNGTAPST 2820
PETFSKKSRK SVSNSAHPAE DEEETEFEPE GLPEVVKKGF ADIPTGKTSP YILRRTTMAT 2880
RTSPRLAAQK LLGSSPTLGK ENLVESSKPT AGGSRSQKVK VVQESSMDAH AVFQELPAKS 2940
LTANNMPGRN STESPREGLR AKRACPASSP AAGPDPMNNE NCRVQ 2985 
Gene Ontology
  
Interpro
 IPR019513; Centromere_CenpF_leu-rich_rpt.
 IPR018463; Centromere_CenpF_N.
 IPR018302; Centromere_CenpF_Rb-prot-bd. 
Pfam
 PF10473; Cenp-F_leu_zip
 PF10481; Cenp-F_N
 PF10490; Rb-bdg_C_Cenp-F 
SMART
  
PROSITE
  
PRINTS