CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-015947
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Retinoic acid-induced protein 1 
Protein Synonyms/Alias
  
Gene Name
 RAI1 
Gene Synonyms/Alias
 KIAA1820 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
466VPQKKGVKNLVSRTPubiquitination[1]
774KGLEQGGKASDGISKacetylation[2]
Reference
 [1] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
 [2] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861
Functional Description
 May function as a transcriptional regulator. Regulates transcription through chromatin remodeling by interacting with other proteins in chromatin as well as proteins in the basic transcriptional machinery. May be important for embryonic and postnatal development. May be involved in neuronal differentiation (By similarity). 
Sequence Annotation
 ZN_FING 1856 1903 PHD-type.
 MOTIF 1160 1177 Nuclear localization signal (Potential).
 MOTIF 1223 1240 Nuclear localization signal (Potential).
 MOD_RES 568 568 Phosphoserine.
 MOD_RES 683 683 Phosphoserine.
 MOD_RES 696 696 Phosphothreonine.
 MOD_RES 880 880 Phosphoserine.
 MOD_RES 892 892 Phosphoserine.
 MOD_RES 1064 1064 Phosphoserine.
 MOD_RES 1068 1068 Phosphothreonine.
 MOD_RES 1122 1122 Phosphoserine.
 MOD_RES 1352 1352 Phosphoserine.
 MOD_RES 1358 1358 Phosphoserine.
 MOD_RES 1374 1374 Phosphoserine.
 MOD_RES 1431 1431 Phosphoserine.  
Keyword
 Alternative splicing; Complete proteome; Cytoplasm; Metal-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Triplet repeat expansion; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1906 AA 
Protein Sequence
MQSFRERCGF HGKQQNYQQT SQETSRLENY RQPSQAGLSC DRQRLLAKDY YNPQPYPSYE 60
GGAGTPSGTA AAVAADKYHR GSKALPTQQG LQGRPAFPGY GVQDSSPYPG RYAGEESLQA 120
WGAPQPPPPQ PQPLPAGVAK YDENLMKKTA VPPSRQYAEQ GAQVPFRTHS LHVQQPPPPQ 180
QPLAYPKLQR QKLQNDIASP LPFPQGTHFP QHSQSFPTSS TYSSSVQGGG QGAHSYKSCT 240
APTAQPHDRP LTASSSLAPG QRVQNLHAYQ SGRLSYDQQQ QQQQQQQQQQ QALQSRHHAQ 300
ETLHYQNLAK YQHYGQQGQG YCQPDAAVRT PEQYYQTFSP SSSHSPARSV GRSPSYSSTP 360
SPLMPNLENF PYSQQPLSTG AFPAGITDHS HFMPLLNPSP TDATSSVDTQ AGNCKPLQKD 420
KLPENLLSDL SLQSLTALTS QVENISNTVQ QLLLSKAAVP QKKGVKNLVS RTPEQHKSQH 480
CSPEGSGYSA EPAGTPLSEP PSSTPQSTHA EPQEADYLSG SEDPLERSFL YCNQARGSPA 540
RVNSNSKAKP ESVSTCSVTS PDDMSTKSDD SFQSLHGSLP LDSFSKFVAG ERDCPRLLLS 600
ALAQEDLASE ILGLQEAIGE KADKAWAEAP SLVKDSSKPP FSLENHSACL DSVAKSAWPR 660
PGEPEALPDS LQLDKGGNAK DFSPGLFEDP SVAFATPDPK KTTGPLSFGT KPTLGVPAPD 720
PTTAAFDCFP DTTAASSADS ANPFAWPEEN LGDACPRWGL HPGELTKGLE QGGKASDGIS 780
KGDTHEASAC LGFQEEDPPG EKVASLPGDF KQEEVGGVKE EAGGLLQCPE VAKADRWLED 840
SRHCCSTADF GDLPLLPPTS RKEDLEAEEE YSSLCELLGS PEQRPGMQDP LSPKAPLICT 900
KEEVEEVLDS KAGWGSPCHL SGESVILLGP TVGTESKVQS WFESSLSHMK PGEEGPDGER 960
APGDSTTSDA SLAQKPNKPA VPEAPIAKKE PVPRGKSLRS RRVHRGLPEA EDSPCRAPVL 1020
PKDLLLPESC TGPPQGQMEG AGAPGRGASE GLPRMCTRSL TALSEPRTPG PPGLTTTPAP 1080
PDKLGGKQRA AFKSGKRVGK PSPKAASSPS NPAALPVASD SSPMGSKTKE TDSPSTPGKD 1140
QRSMILRSRT KTQEIFHSKR RRPSEGRLPN CRATKKLLDN SHLPATFKVS SSPQKEGRVS 1200
QRARVPKPGA GSKLSDRPLH ALKRKSAFMA PVPTKKRNLV LRSRSSSSSN ASGNGGDGKE 1260
ERPEGSPTLF KRMSSPKKAK PTKGNGEPAT KLPPPETPDA CLKLASRAAF QGAMKTKVLP 1320
PRKGRGLKLE AIVQKITSPS LKKFACKAPG ASPGNPLSPS LSDKDRGLKG AGGSPVGVEE 1380
GLVNVGTGQK LPTSGADPLC RNPTNRSLKG KLMNSKKLSS TDCFKTEAFT SPEALQPGGT 1440
ALAPKKRSRK GRAGAHGLSK GPLEKRPYLG PALLLTPRDR ASGTQGASED NSGGGGKKPK 1500
MEELGLASQP PEGRPCQPQT RAQKQPGHTN YSSYSKRKRL TRGRAKNTTS SPCKGRAKRR 1560
RQQQVLPLDP AEPEIRLKYI SSCKRLRSDS RTPAFSPFVR VEKRDAFTTI CTVVNSPGDA 1620
PKPHRKPSSS ASSSSSSSSF SLDAAGASLA TLPGGSILQP RPSLPLSSTM HLGPVVSKAL 1680
STSCLVCCLC QNPANFKDLG DLCGPYYPEH CLPKKKPKLK EKVRPEGTCE EASLPLERTL 1740
KGPECAAAAT AGKPPRPDGP ADPAKQGPLR TSARGLSRRL QSCYCCDGRE DGGEEAAPAD 1800
KGRKHECSKE APAEPGGEAQ EHWVHEACAV WTGGVYLVAG KLFGLQEAMK VAVDMMCSSC 1860
QEAGATIGCC HKGCLHTYHY PCASDAGCIF IEENFSLKCP KHKRLP 1906 
Gene Ontology
 GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0040015; P:negative regulation of multicellular organism growth; IEA:Compara.
 GO:0006357; P:regulation of transcription from RNA polymerase II promoter; IEA:Compara.
 GO:0001501; P:skeletal system development; IEA:Compara. 
Interpro
 IPR001965; Znf_PHD. 
Pfam
  
SMART
 SM00249; PHD 
PROSITE
 PS01359; ZF_PHD_1
 PS50016; ZF_PHD_2 
PRINTS