CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-035547
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Rai1 
Protein Synonyms/Alias
  
Gene Name
 Rai1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
77VATAAADKYHRGNKSacetylation[1]
657PEPLQLDKGGSTKDFacetylation[1]
932ESSLSHMKPGEGPELacetylation[1]
1064TTPTPPDKLGGKQRAacetylation[1]
1068PPDKLGGKQRAAFKSacetylation[1]
1074GKQRAAFKSGKRVGKacetylation[1]
1077RAAFKSGKRVGKPSPacetylation[1]
1120DSPGMPGKDQRSMVLacetylation[1]
1194PKPGTGSKLSDRPLHacetylation[1]
1380SSRSLKGKLLNSKKLacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1887 AA 
Protein Sequence
MQSFRERCGF HGKQQNYPQT SQETSRLENY RQPGQAGLSC DRQRLLAKDY YNPQPYTGYE 60
GGTGTPAGTV ATAAADKYHR GNKSLQGRPA FPSYVQDSSP YPGRYSGEEG LQTWGSPQPP 120
PPQPQPLPGA VSKYEENLMK KTVVPPNRQY PEQGPQLPFR THGLHVPPPQ PQQPLAYPKL 180
QRQKPQNDLA SPLPFPQGSH FPQHSQSFPT SSTYAPTVQG GGQGAHSYKS CTAPSAQPHD 240
RPMSANASLA PGQRVQNLHA YQPGRLGYEQ QQQALQGRHH TQETLHYQNL AKYQHYGQQG 300
QGYCPADTAV RTPEQYYQTF SPSSSHSPAR SVGRSPSYSS TPSPLMPNLE NFPYSQQPLS 360
TGAFPTGITD HSHFMPLLNP SPTDAASSVD PQVSNCKPLQ KEKLPDNLLS DLSLQSLTAL 420
TSQVENISNT VQQLLLSKAT IPQKKGVKNL VSRTPEQHKS QHCSPEGSGY SAEPAGTPLS 480
EPPSSTPQST HAEPQDTDYL SGSEDPLERS FLYCSQARGS PARVNSNSKA KPESVSTCSV 540
TSPDDMSTKS DDSFQSLHST LPLDSFSKFV AGERDCPRLL LSALAQEDLA SEILGLQEAI 600
VEKADKAWAE ASSLPKDNGK PPFSLENHST CLDTVAKTSW SQPGEPEALP EPLQLDKGGS 660
TKDFSPGLFE DPSVAFATTD PKKTTSPLSF GTKPLLGTAT PDPTTAAFDC FPDTTTASSV 720
DGANPFAWPE ENLGDACPRW GLHPGELTKG LEQGAKASDS VGKADAHETS ACLGFQEEHA 780
IGKPAAALSG DFKQQEADGV KEEVGGLLQC PEVGKADQWL EDSRHCCSSA DFGDLPLLPP 840
PGRKEDLEAE EEYSSLCELL GSPEQRPSLQ DPLSPKAPLI CTKEEAEEVL DTKAGWASPC 900
HLSGEPAVLL GPSVGAESKV QSWFESSLSH MKPGEGPELE RAPGSASTSQ GSLAPKPNTP 960
AVPEGPIAKK EPVPRGKSLR SRRVHRGLPE AEDSPCRAPA LPKDLLLPES CTGPPQGQAE 1020
GAGAPGRGLS EGLPRMCTRS LTALSEPQTP GPPGLTTTPT PPDKLGGKQR AAFKSGKRVG 1080
KPSPKAASSP SNPAALPVAS DSSPMGSKTK EPDSPGMPGK DQRSMVLRSR TKPQQVFHAK 1140
RRRPSESRIP DCRPTKKPPA NNHLPTAFKV SSGPQKEGRM SQRGKVPKPG TGSKLSDRPL 1200
HTLKRKSAFM APVPAKKRSL ILRSNNGSGV DGREERAESS PGLLRRMASP QRARPRGSGE 1260
PPPPPPLEPP AACLGLATQS SLPSAVRTKV LPPRKGRGLK LEAIVQKITS PGLKKLACRV 1320
AGAPPGTPRS PALPEKRSGG SPAGAEEGVG GMGAGQMLPA ASGADPLCRN PASSRSLKGK 1380
LLNSKKLSSA ADCPKAEAFM SPETLPSLGT ARAPKKRSRK GRTGALGPSK GPLEKRPCPG 1440
QALILAPHDR ASSTQGGGED NSSGGGKKPK TEELGLASQP PEGRPCQPQT RAQKQPGQAS 1500
YSSYSKRKRL SRGRGKATHA SPCKGRATRR RQQQVLPLDP AEPEIRLKYI SSCKRLRADS 1560
RTPAFSPFVR VEKRDAYTTI CTVVNSPGDE PKPHWKPSSV ASSSTSSSEP AGASLTTFPG 1620
GSVLQLRPSL PLSSTMHLGP VVSKALSTSC LVCCLCQNPA NFKDLGDLCG PYYPEHCLPK 1680
KKPKLKEKVR LEGTLEEASL PLERTLKGLE CAASTTAATP TTTTITTTTT LGRLSRPDGP 1740
ADPAKQGSLR TSARGLSRRL QSCYCCDGQG DGGEEVAPAD KSRKHECSKE APAEPGGDTQ 1800
EHWVHEACAV WTSGVYLVAG KLFGLQEAMK VAVDMPCTSC HESGATISCS YEGCTHTYHY 1860
PCANDTGCTF IEENFTLKCP KHKALPL 1887 
Gene Ontology
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR001965; Znf_PHD. 
Pfam
  
SMART
 SM00249; PHD 
PROSITE
  
PRINTS