CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-035633
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Urb1 
Protein Synonyms/Alias
  
Gene Name
 Urb1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
2014      HQAKDLMKMLKDKHR  acetylation  [1]
2017      KDLMKMLKDKHRPLG  acetylation  [1]
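The two peptides listed for positions 2014 and 2017 appear to be 15-residue windows centred on the modified lysine (7 residues on each side). That window size is inferred from the two entries in this record, not stated by CPLM. A minimal sketch, assuming that convention, that re-derives both peptides from residues 1981-2040 of the Protein Sequence block below; the names `SEGMENT`, `SEGMENT_START` and `window` are illustrative only:

```python
# Residues 1981-2040, copied from the Protein Sequence block of this record.
SEGMENT_START = 1981
SEGMENT = (
    "LVLLHKWSLI" "ERDTKLQGEL" "KAIIEQHQAK"
    "DLMKMLKDKH" "RPLGAARAKG" "PRGREKRRRE"
)

def window(position, flank=7):
    """Return residues position-flank .. position+flank (1-based positions).

    The default flank of 7 is an assumption inferred from the record.
    """
    idx = position - SEGMENT_START  # 0-based index within SEGMENT
    if idx - flank < 0 or idx + flank >= len(SEGMENT):
        raise ValueError("window extends beyond the copied segment")
    return SEGMENT[idx - flank: idx + flank + 1]

for pos in (2014, 2017):
    peptide = window(pos)
    # The centre residue (index 7) should be the modified lysine, K.
    print(pos, peptide, peptide[7])
```

Under this assumption the windows match the listed peptides, and the centre residue of each is a lysine.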
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2278 AA 
Protein Sequence
MGVPKRKASE SPGGAASPAG TSKRTRAEEF TGVRFKTQLK DAQGPGPALE AFVSAAKQLP 60
QEDMCDVVEG YIKISMECAE IFQLLSGEKR PESEVLLIFQ AFEAILLRTA SDLTHFHVVG 120
TNIVKKLLNN HMKLLCESLY ASGYRMARAC LDLMTAVVTQ GPEAARDVCS CFDLNKKALY 180
ALVTKRDSKG VHDVRLAYIQ FALSFLIAGD DNTIGQVLEI KEFIPCIFSS GIKEDKISTI 240
NILLSTLKTK VIHNKNITKT QKVRFFTGQL LNHIAALYNW NGVSDIVPEN PESTEMSAEE 300
AGKAMVRDLV HNFLIDLCCS RKHGISFYDA SLGTSGRGGN LTLLHFLLSL KTAAGDELVA 360
SLVVSILKVC PDLLTKYFRE VTFSFLPRVK STWLNNVKLL NKIYEAQPEI SPAFWTREFI 420
PLPRLLAMVM VTTVPLVCNK IMFTQALNLD SIPVKQSALS LISIILKRAL KTVDHCLDKE 480
IWQDSDVYTA EMMEEFVQLF REALSKILPD LNTVIWVWQS FKKQEIKENH EKGKKSSSKT 540
PAASKAVHRD DAETILLKSV LLQVICLYQQ VVPHVVTQYN FDFSKLLKGI ISEQGPAEEA 600
PPILQHHMLK VALELPANKF LWLKAQEGPE AEIVGGERSV FYLLMKMFVN SSHLQLKLST 660
QRLIMKILRD TGVFEHTWRE LELWLERLDS TAEKHKETVI QFLERILLTL VVNPYSYTDK 720
ASEFVQEAST LQGAVGKQDA DDVSIPISHI DDVLDMVDVL VEGSEGLDED IGFLLNEDMI 780
LLTFPFSALV PAALEARNKL LLGTECEAGE SIVAYMTAVL TDILHTQRDP LALCLLLQSY 840
DKFEPASLLC CQQLAQFHRY YSLWIPEQAQ EALPLQVSGT SGPYTPPSSA CFSTLLQTAY 900
ESQTLGDKNV QAKLLAAVPC LPLQHMLRSA KQVLLYLKST VENFSQLGKS VGPTLLQFLL 960
GLLKHLVIHS EQLDTQNQQK LEAARAESDL FLDMESVASL ELATDKTVEE LLVAILKHPT 1020
LETWFLALEQ KALPPHTLSP VLVKLLAAQF SAGVLQLLVA SSPILHKLGQ LGLLAKYSEA 1080
ITQSVLRELR TRTVNSAMTP KTLPQLEALR ELHPYMEGVQ IREVTLALLG LPEAHLLTQQ 1140
GTQCPGKERQ LSSLGKTLVQ LLTSSHQNRL QSSELLWCAE YVRGLGALLP TLAEQELDTV 1200
FLQTLQKDPV LAPVVPADLL EYCLVRRTKA ALDIASLLLQ YSSTHLLKFE LWCGQPGVGP 1260
SLQEHLDDFF PLIHVYLQHR MQGSFMRPTE VSSAVTPVLR ALWRQVRDRF FHIAGPSKHA 1320
LHLEALAQLI PFARTKDLRV LMDHLPNLLR TLSNHKSWIV ADSVSAALAE SAEELASWRK 1380
TLLRSCIQWL AVSFSGREPE NENTQEQEKT MLLRLSELLH AVKEVDPGDW QQFVKTGLKF 1440
RYQDLTFLRT LLDATKLLYS PESSGHTKLV QLSVMHMMLT QHSLFLPTML TSKEEETPDS 1500
PVKETLLDLM STVVRMCPSV CESSHFAVLL GTYSATLSIL DQKILLLLRA YEQHSLSLIS 1560
FRVLLWGPAA VEHHKTCRSL GKSLWQQPSV GDILRLLDPD RMMQTILHFP QYRRLLHTED 1620
TGEPQVFKDK TARVDLSGLY DPCFLLHLFG ELTRPEFVVD CRKFLDSNAL GLTVAALSSY 1680
DPQMRAAAYY VLAAYYSHLE GARFREQSQV LYLLDVVRNG IQAPNLRLPF TVALFIAKAA 1740
AQILRPEEHM YWKISKFLLS HENLNMNKLP GFYQFFYSSD FQQKTEQEWV LEILRQGIRD 1800
KHCYELCSRR GVFHIILSFF SSPLCDEVAQ NWILEILQNV AHIPRSAYEV IRDYSLLTWI 1860
LHILESRFIE TQLLSNVISL LHTLWVTNLG SRAAEERSQL PCQTDCHESE KTLALHMVNE 1920
FLYVLVALTK HLRPTLASTQ LMDFFWTLES VLSYRATVIK IFKDMGRFTV NQVTLSTKDV 1980
LVLLHKWSLI ERDTKLQGEL KAIIEQHQAK DLMKMLKDKH RPLGAARAKG PRGREKRRRE 2040
REEETAEPQL EASTLEKCKD LLRATLTHWG PAVPLPGPTQ ESVGQAIPKS KALGSAHAAV 2100
SLVAGWVLRS LAERPLSRAE VTRLLDWLKS HILPQPMVVA DLLRDSAVKT GIFKLYSRHC 2160
NAQGLVGPAQ DVACKFSMVM LQLLVAQGRT PSPFHSVAEA LCLDSLNEKD EAKRAPAAFL 2220
VSLYIKDIWL GAQQPDTFLA HIQTVCEASK DMALGESEAL VVLCRDVGSS AQSLTHSC 2278 
Gene Ontology
 GO:0005730; C:nucleolus; IEA:Compara. 
Interpro
 IPR021714; Npa1_N. 
Pfam
 PF11707; Npa1 
SMART
  
PROSITE
  
PRINTS