CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041876
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 AT rich interactive domain 1B (SWI1-like), isoform CRA_a 
Protein Synonyms/Alias
 AT-rich interactive domain-containing protein 1B 
Gene Name
 ARID1B 
Gene Synonyms/Alias
 hCG_2031127 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1611QRRKITSKDIVTPEAacetylation[1]
1772KFDKLPIKIVKKNNLacetylation[1]
1772KFDKLPIKIVKKNNLubiquitination[2]
2092PPFSRQEKFYATLVRubiquitination[3]
2182DMMCRAAKALLAMARubiquitination[3]
Reference
 [1] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2231 AA 
Protein Sequence
METGLLPNHK LKTVGEAPAA PPHQQHHHHH HAHHHHHHAH HLHHHHALQQ QLNQFQQQQQ 60
QQQQQQQQQQ QQQHPISNNN SLGGAGGGAP QPGPDMEQPQ HGGAKDSAAG GQADPPGPPL 120
LSKPGDEDDA PPKMGEPAGG RYEHPGLGAL GTQQPPVAVP GGGGGPAAVP EFNNYYGSAA 180
PASGGPGGRA GPCFDQHGGQ QSPGMGMMHS ASAAAAGAPG SMDPLQNSHE GYPNSQCNHY 240
PGYSRPGAGG GGGGGGGGGG GSGGGGGGGG AGAGGAGAGA VAAAAAAAAA AAGGGGGGGY 300
GGSSAGYGVL SSPRQQGGGM MMGPGGGGAA SLSKAAAGSA AGGFQRFAGQ NQHPSGATPT 360
LNQLLTSPSP MMRSYGGSYP EYSSPSAPPP PPSQPQSQAA AAGAAAGGQQ AAAGMGLGKD 420
MGAQYAAASP AWAAAQQRSH PAMSPGTPGP TMGRSQGSPM DPMVMKRPQL YGMGSNPHSQ 480
PQQSSPYPGG SYGPPGPQRY PIGIQGRTPG AMAGMQYPQQ QMPPQYGQQG VSGYCQQGQQ 540
PYYSQQPQPP HLPPQAQYLP SQSQQRYQPQ QDMSQEGYGT RSQPPLAPGK PNHEDLNLIQ 600
QERPSSLPDL SGSIDDLPTG TEATLSSAVS ASGSTSSQGD QSNPAQSPFS PHASPHLSSI 660
PGGPSPSPVG SPVGSNQSRS GPISPASIPG SQMPPQPPGS QSESSSHPAL SQSPMPQERG 720
FMAGTQRNPQ MAQYGPQQTG PSMSPHPSPG GQMHAGISSF QQSNSSGTYG PQMSQYGPQG 780
NYSRPPAYSG VPSASYSGPG PGMGISANNQ MHGQGPSQPC GAVPLGRMPS AGMQNRPFPG 840
NMSSMTPSSP GMSQQGGPGM GPPMPTVNRK AQEAAAAVMQ AAANSAQSRQ GSFPGMNQSG 900
LMASSSPYSQ PMNNSSSLMN TQAPPYSMAP AMVNSSAASV GLADMMSPGE SKLPLPLKAD 960
GKEEGTPQPE SKSKDSYSSQ GISQPPTPGN LPVPSPMSPS SASISSFHGD ESDSISSPGW 1020
PKTPSSPKSS SSTTTGEKIT KVYELGNEPE RKLWVDRYLT FMEERGSPVS SLPAVGKKPL 1080
DLFRLYVCVK EIGGLAQVNK NKKWRELATN LNVGTSSSAA SSLKKQYIQY LFAFECKIER 1140
GEEPPPEVFS TGDTKKQPKL QPPSPANSGS LQGPQTPQST GSNSMAEVPG DLKPPTPAST 1200
PHGQMTPMQG GRSSTISVHD PFSDVSDSSF PKRNSMTPNA PYQQGMSMPD VMGRMPYEPN 1260
KDPFGGMRKV PGSSEPFMTQ GQMPNSSMQD MYNQSPSGAM SNLGMGQRQQ FPYGASYDRR 1320
HEPYGQQYPG QGPPSGQPPY GGHQPGLYPQ QPNYKRHMDG MYGPPAKRHE GDMYNMQYSS 1380
QQQEMYNQYG GSYSGPDRRP IQGQYPYPYS RERMQGPGQI QTHGIPPQMM GGPLQSSSSE 1440
GPQQNMWAAR NDMPYPYQNR QGPGGPTQAP PYPGMNRTDD MMVPDQRINH ESQWPSHVSQ 1500
RQPYMSSSAS MQPITRPPQP SYQTPPSLPN HISRAPSPAS FQRSLENRMS PSKSPFLPSM 1560
KMQKVMPTVP TSQVTGPPPQ PPPIRREITF PPGSVEASQP VLKQRRKITS KDIVTPEAWR 1620
VMMSLKSGLL AESTWALDTI NILLYDDSTV ATFNLSQLSG FLELLVEYFR KCLIDIFGIL 1680
MEYEVGDPSQ KALDHNAARK DDSQSLADDS GKEEEDAECI DDDEEDEEDE EEDSEKTESD 1740
EKSSIALTAP DAAADPKEKP KQASKFDKLP IKIVKKNNLF VVDRSDKLGR VQEFNSGLLH 1800
WQLGGGDTTE HIQTHFESKM EIPPRRRPPP PLSSAGRKKE QEGKGDSEEQ QEKSIIATID 1860
DVLSARPGAL PEDANPGPQT ESSKFPFGIQ QAKSHRNIKL LEDEPRSRDE TPLCTIAHWQ 1920
DSLAKRCICV SNIVRSLSFV PGNDAEMSKH PGLVLILGKL ILLHHEHPER KRAPQTYEKE 1980
EDEDKGVACS KDEWWWDCLE VLRDNTLVTL ANISGQLDLS AYTESICLPI LDGLLHWMVC 2040
PSAEAQDPFP TVGPNSVLSP QRLVLETLCK LSIQDNNVDL ILATPPFSRQ EKFYATLVRY 2100
VGDRKNPVCR EMSMALLSNL AQGDALAARA IAVQKGSIGN LISFLEDGVT MAQYQQSQHN 2160
LMHMQPPPLE PPSVDMMCRA AKALLAMARV DENRSEFLLH EGRLLDISIS AVLNSLVASV 2220
ICDVLFQIGQ L 2231 
Gene Ontology
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR001606; ARID/BRIGHT_DNA-bd.
 IPR011989; ARM-like.
 IPR021906; DUF3518. 
Pfam
 PF01388; ARID
 PF12031; DUF3518 
SMART
 SM00501; BRIGHT 
PROSITE
 PS51011; ARID 
PRINTS