CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-035603
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Prr12 
Protein Synonyms/Alias
  
Gene Name
 Prr12 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
1217GTRLEPLKPLKIKLSacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2032 AA 
Protein Sequence
MDRNYPSAGF GDPLGAGAGW SYERSAKASL VYGSSRTSHP ETDILHRQAY AAPHPLQSYA 60
TNHHPAGLSG LFDTGLHHAG SAGPDASVMN LISALESRGP QPGPSASSLL SQFRSPSWQT 120
AMHTPGPTEL FISGALPGSS TFPSSSALSA YQHPASFGSR PFPVPSSLSL QDPPFSPPAN 180
GLLSPHDVLH LKPSQAPTVP SSLGFERLAG GGVLGPAGLG PAQTPPYRPG PPDPPPPPRH 240
LPTQFNLLAS SSAATAAEPS SPQLYNFSGA APGPPPERAL PRQDTVIKHY QRPASAQPPP 300
PPPPAHSLQH YLSCGGSYPS MGHRASLACS PLGGGEPSPG AGEPSKGGPS GATAGAAGRA 360
AGPETAGGGA AAGGGGYRPI IQSPGYKTSK GGYGPATGGA TRPPPPRSTA TPKCQSLGGP 420
AAAYATGKAS GAGGAGSQAY SPGQPQGLLG PQAYGQGFGG GQAQDLSKGP NYSGGPPQPP 480
SGPPPPGLAT CQSYSPDQLQ GQLYGVQSEP YPGPAAHSQG LPTASPSLSY STGHSPALSG 540
HGGGWGPSSL GGGGEASPSH IIRPLQSPPA TGRPPGVGSP GAPGKYLSSV LASAPFLAPP 600
GASSYAAGAG GYKGKGDGSE LLAGPGGAAA ERTEDEEFLI QHLLQAPSPP RTSGADGLVG 660
EDGPADASKG LGGSGGAGGP PGTPYELAKE DPQRYHLQSV IRTSASLDEG ATAALELGLG 720
RMKDKKKGPE RGGETPEGLA TSVVHYGAGA KELGAFLQKS PPPPPPTAQA TQPTPHGLLL 780
EAGGPDLPMV LPPPPPQLLP SVLSHAPSPS SSAPKVGVHL LEPGTRDGAP QPPPPPPPPP 840
MPLQLEAHLR GHGLEPTAPS PRLRPEESLE PPGAMQELLG ALEPLPPGPG DTGVGPPNPE 900
GKDPTGAYRS PSPQGTKAPR FVPLTSICFP DSLLQDEERS FFPTMEEMFG GGAADDYGKA 960
GQTEDDGDPK TGAGPPPGPT AYDPYGPYCG GRASGTGPET PGLGLDPNKP PELPSTVNAE 1020
PLGLIQSGPH QSAPPPPPPP PPPPPVSEPK GGLTSPIFCS TKPKKLLKTS SFHLLRRRDP 1080
PFQTPKKLYA QEYEFEADED KADVPADIRL NPRRLPDLVS SCRSRPALSP LGDIDFCPPN 1140
PGPDGPRRRG RKPTKAKRDG PPRPRGRPRI RPLEGPAMAG PASASITTDG AKKPRGRGRG 1200
RGRKAEEMGG TRLEPLKPLK IKLSVPKVGE GLGAPSNDVI SGVDHNSLDS NLTREKIEAK 1260
IKEVEEKQPE MKSGFMASFL DFLKSGKRHP PVYQAGLTPP LSPPKSVPAS VPTRGLQPPP 1320
PTVPTVPHPA PSGPFGLGGA LEAAESEGLG LGCPSPCKRL DEELKRNLET LPSFSSDEED 1380
SVAKNRDLQE SISSAISALD DPPLTGPKDT STPEEPPLDT GPTASGPPPL PSLPSTNSNG 1440
TPEPPLLEEK PPPTPPPAPT PQPAPPPPPP PVPALPSPAP LVTPVASSPP PPQPPPPPAL 1500
PSPPPPPPPA PTTVPPVAPP EEPAAPSPED PEPPDARPLH LAKKQETAAV CGETDEEAGE 1560
SGGEGIFRER DEFVIRAEDI PSLKLALQTG REPPPIWRVQ KALLQKFTPE IKDGQRQFCA 1620
TSNYLGYFGD AKNRYQRLYV KFLENVNKKD YVRVCARKPW HRPPLPVRRS GQTKGPTPVG 1680
GNAAPPSKVP APPPKPETPE KMTSEKPPEP TPEPAVPEPP APEKPSPPRP AEKEKEKEKE 1740
RVTRGGDRPL RSERAASGRQ MRTDRSLAPG QSTTSRLPKA RPSKVKAEPP PKKRKKWLKE 1800
AVGSASAGDG PGCSSSDSES SPGAPSEDER AVPGRLLKTR AMREMYRSYV EMLVSTALDP 1860
DMIQALEDTH DELYLPPMRK IDGLLNEHKK KVLKRLSLSP ALQDALHTFP QLQVEQTGEG 1920
SPEEGAVRLR PAGEPYNRKT LSKLKRSVVR AQEFKVELEK SGYYTLYHSL HHYKYHTFLR 1980
CRDQTLAIEG GAEDLGQEEV VQQCMRNQPW LEQLFDSFSD LLAQAQAHSR CG 2032 
Gene Ontology
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif.
 IPR025451; DUF4211. 
Pfam
 PF13926; DUF4211 
SMART
 SM00384; AT_hook 
PROSITE
  
PRINTS