CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID
 CPLM-038446 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Arid1b 
Protein Synonyms/Alias
  
Gene Name
 Arid1b 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
401       PMPTVNRKAQEAAAA  ubiquitination  [1]
1142      QRRKITSKDIVTPEA  acetylation     [2]
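Each modification entry above pairs a 1-based position with a 15-residue peptide that appears to be the modified lysine plus seven flanking residues on each side. The following is a minimal sketch of that relationship, not part of CPLM itself: the function name `peptide_window`, its parameters, and the choice to work on a fragment of the sequence (residues 361-420, copied from this record with the spaces removed) are assumptions made for illustration.

```python
def peptide_window(fragment: str, fragment_start: int, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` positions of a 1-based site.

    `fragment` is a stretch of the protein sequence whose first residue
    has the 1-based protein coordinate `fragment_start` (hypothetical helper).
    """
    idx = position - fragment_start  # 0-based index inside the fragment
    if not 0 <= idx < len(fragment):
        raise ValueError("position lies outside the supplied fragment")
    # The window is clipped at the fragment edges rather than padded.
    return fragment[max(0, idx - flank): idx + flank + 1]


# Residues 361-420 of the sequence in this record, spaces removed.
FRAGMENT_361_420 = (
    "SAGMQNRPFPGTMSSVTPSSPGMSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSR"
)

site = 401
window = peptide_window(FRAGMENT_361_420, 361, site)
central = FRAGMENT_361_420[site - 361]
print(site, central, window)
```

For position 401 the central residue is a lysine and the window reproduces the peptide listed for the ubiquitination site.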
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
 [2] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1762 AA 
Protein Sequence
MSPGTPGPTM GRSQGSPMDP MVMKRPQLYG MGTHPHSQPQ QSSPYPGGSY GPPGAQRYPL 60
GMQGRAPGAL GGLQYPQQQM PPQYGQQAVS GYCQQGQQPY YNQQPQPSHL PPQAQYLQPA 120
AAQSQQRYQP QQDMSQEGYG TRSQPPLAPG KSNHEDLNLI QQERPSSLPD LSGSIDDLPT 180
GTEATLSSAV SASGSTSSQG DQSNPAQSPF SPHASPHLSS IPGGPSPSPV GSPVGSNQSR 240
SGPISPASIP GFMTGTQRNP QMSQYGPQQT GPSMSPHPSP GGQMHPGISN FQQSNSSGTY 300
GPQMSQYGPQ GNYSRTPTYS GVPSASYSGP GPGMGINANN QMHGQGPAQP CGAMPLGRMP 360
SAGMQNRPFP GTMSSVTPSS PGMSQQGGPG MGPPMPTVNR KAQEAAAAVM QAAANSAQSR 420
QGSFPGMNQS GLVASSSPYS QSMNNNSSLM STQAQPYSMT PTMVNSSTAS MGLADMMSPS 480
ESKLSVPLKA DGKEEGVSQP ESKSKDSYGS QGISQPPTPG NLPVPSPMSP SSASISSFHG 540
DESDSISSPG WPKTPSSPKS SSSSTTGEKI TKVYELGNEP ERKLWVDRYL TFMEERGSPV 600
SSLPAVGKKP LDLFRLYVCV KEIGGLAQVN KNKKWRELAT NLNVGTSSSA ASSLKKQYIQ 660
YLFAFECKTE RGEEPPPEVF STGDSKKQPK LQPPSPANSG SLQGPQTPQS TGSNSMAEVP 720
GDLKPPTPAS TPHGQMTPMQ SGRSSTVSVH DPFSDVSDSA YPKRNSMTPN APYQQGMGMP 780
DMMGRMPYEP NKDPFSGMRK VPGSSEPFMT QGQVPNSGMQ DMYNQSPSGA MSNLGMGQRQ 840
QFPYGTSYDR RHEAYGQQYP GQGPPTGQPP YGGHQPGLYP QQPNYKRHMD GMYGPPAKRH 900
EGDMYNMQYG SQQQEMYNQY GGSYSGPDRR PIQGQYPYPY NRERMQGPGQ MQPHGIPPQM 960
MGGPMQSSSS EGPQQNMWAT RNDMPYPYQS RQGPGGPAQA PPYPGMNRTD DMMVPEQRIN 1020
HESQWPSHVS QRQPYMSSSA SMQPITRPPQ SSYQTPPSLP NHISRAPSPA SFQRSLESRM 1080
SPSKSPFLPT MKMQKVMPTV PTSQVTGPPP QPPPIRREIT FPPGSVEASQ PILKQRRKIT 1140
SKDIVTPEAW RVMMSLKSGL LAESTWALDT INILLYDDST VATFNLSQLS GFLELLVEYF 1200
RKCLIDIFGI LMEYEVGDPS QKALDHRSGK KDDSQSLEDD SGKEDDDAEC LVEEEEEEEE 1260
EEEDSEKIES EGKSSPALAA PDASVDPKET PKQASKFDKL PIKIVKKNKL FVVDRSDKLG 1320
RVQEFSSGLL HWQLGGGDTT EHIQTHFESK MEIPPRRRPP PPLSSTGKKK ELEGKGDSEE 1380
QPEKSIIATI DDVLSARPGA LPEDTNPGPQ TDSGKFPFGI QQAKSHRNIR LLEDEPRSRD 1440
ETPLCTIAHW QDSLAKRCIC VSNIVRSLSF VPGNDAEMSK HPGLVLILGK LILLHHEHPE 1500
RKRAPQTYEK EEDEDKGVAC SKDEWWWDCL EVLRDNTLVT LANISGQLDL SAYTESICLP 1560
ILDGLLHWMV CPSAEAQDPF PTVGPNSVLS PQRLVLETLC KLSIQDNNVD LILATPPFSR 1620
QEKFYATLVR YVGDRKNPVC REMSMALLSN LAQGDTLAAR AIAVQKGSIG NLISFLEDGV 1680
TMAQYQQSQH NLMHMQPPPL EPPSVDMMCR AAKALLAMAR VDENRSEFLL HEGRLLDISI 1740
SAVLNSLVAS VICDVLFQIG QL 1762 
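The sequence above is laid out in groups of ten residues, with a running residue count at the end of each line. A rough sketch of how such a block could be turned back into a contiguous string follows; the function name `parse_sequence_block` is hypothetical, and only the last two lines of the block are used here to keep the example short.

```python
import re


def parse_sequence_block(lines):
    """Join a grouped sequence listing into one residue string (hypothetical helper)."""
    residues = []
    for line in lines:
        # Drop the trailing running residue count, then remove all whitespace.
        body = re.sub(r"\s*\d+\s*$", "", line)
        residues.append("".join(body.split()))
    return "".join(residues)


# Last two lines of the sequence block in this record (residues 1681-1762).
LAST_TWO_LINES = [
    "TMAQYQQSQH NLMHMQPPPL EPPSVDMMCR AAKALLAMAR VDENRSEFLL HEGRLLDISI 1740",
    "SAVLNSLVAS VICDVLFQIG QL 1762 ",
]

tail = parse_sequence_block(LAST_TWO_LINES)
print(len(tail), tail[-12:])
```

Applied to the whole block, the length of the joined string can be compared against the stated protein length of 1762 AA as a basic consistency check.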
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR001606; ARID/BRIGHT_DNA-bd.
 IPR011989; ARM-like.
 IPR021906; DUF3518. 
Pfam
 PF01388; ARID
 PF12031; DUF3518 
SMART
 SM00501; BRIGHT 
PROSITE
 PS51011; ARID 
PRINTS