CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-032967
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Nuclear receptor corepressor 2 
Protein Synonyms/Alias
  
Gene Name
 NCOR2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
137       AGSEDLTKDRSLTGK  ubiquitination  [1, 2]
335       KVREYYEKQFPEIRK  ubiquitination  [2]
941       RANASPQKPLDLKQL  acetylation     [3, 4]
1200      VPGGSITKGIPSTRV  ubiquitination  [2]
1499      ACYEESLKSRPGTAS  ubiquitination  [1, 2]
1525      VIVPELGKPRQSPLT  acetylation     [2, 4]
1568      EGSLSSSKASQDRKL  ubiquitination  [1, 2]
1777      GGPTHLTKPTTTSSS  acetylation     [3]
1949      TGHAFLAKPPARSGL  acetylation     [2, 3, 4]
2016      HREKTQSKPFSIQEL  acetylation     [2, 3, 4, 5]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [3] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773]
 [4] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [5] Monoclonal antibody cocktail as an enrichment tool for acetylome analysis.
 Shaw PG, Chaerkady R, Zhang Z, Davidson NE, Pandey A.
 Anal Chem. 2011 May 15;83(10):3623-6. [PMID: 21466224]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2504 AA 
Protein Sequence
MSGSTQPVAQ TWRATEPRYP PHSLSYPVQI ARTHTDVGLL EYQHHSRDYA SHLSPGSIIQ 60
PQRRRPSLLS EFQPGNERSQ ELHLRPESHS YLPELGKSEM EFIESKRPRL ELLPDPLLRP 120
SPLLATGQPA GSEDLTKDRS LTGKLEPVSP PSPPHTDPEL ELVPPRLSKE ELIQNMDRVD 180
REITMVEQQI SKLKKKQQQL EEEAAKPPEP EKPVSPPPIE SKHRSLVQII YDENRKKAEA 240
AHRILEGLGP QVELPLYNQP SDTRQYHENI KINQAMRKKL ILYFKRRNHA RKQWEQKFCQ 300
RYDQLMEAWE KKVERIENNP RRRAKESKVR EYYEKQFPEI RKQRELQERM QRVGQRGSGL 360
SMSAARSEHE VSEIIDGLSE QENLEKQMRQ LAVIPPMLYD ADQQRIKFIN MNGLMADPMK 420
VYKDRQVMNM WSEQEKETFR EKFMQHPKNF GLIASFLERK TVAECVLYYY LTKKNENYKS 480
LVRRSYRRRG KSQQQQQQQQ QQQQQQQQQP MPRSSQEEKD EKEKEKEAEK EEEKPEVEND 540
KEDLLKEKTD DTSGEDNDEK EAVASKGRKT ANSQGRRKGR ITRSMANEAN SEEAITPQQS 600
AELASMELNE SSRWTEEEME TAKKGLLEHG RNWSAIARMV GSKTVSQCKN FYFNYKKRQN 660
LDEILQQHKL KMEKERNARR KKKKAPAAAS EEAAFPPVVE DEEMEASGVS GNEEEMVEEA 720
EATVNNSSDT ESIPSPHTEA AKDTGQNGPK PPATLGADGP PPGPPTPPPE DIPAPTEPTP 780
ASEATGAPTP PPAPPSPSAP PPVVPKEEKE EETAAAPPVE EGEEQKPPAA EELAVDTGKA 840
EEPVKSECTE EAEEGPAKGK DAEAAEATAE GALKAEKKEG GSGRATTAKS SGAPQDSDSS 900
ATCSADEVDE AEGGDKNRLL SPRPSLLTPT GDPRANASPQ KPLDLKQLKQ RAAAIPPIQV 960
TKVHEPPRED AAPTKPAPPA PPPPQNLQPE SDAPQQPGSS PRGKSRSPAP PADKEAEKPV 1020
FFPAFAAEAQ KLPGDPPCWT SGLPFPVPPR EVIKASPHAP DPSAFSYAPP GHPLPLGLHD 1080
TARPVLPRPP TISNPPPLIS SAKHPSVLER QIGAISQGMS VQLHVPYSEH AKAPVGPVTM 1140
GLPLPMDPKK LAPFSGVKQE QLSPRGQAGP PESLGVPTAQ EASVLRGTAL GSVPGGSITK 1200
GIPSTRVPSD SAITYRGSIT HGTPADVLYK GTITRIIGED SPSRLDRGRE DSLPKGHVIY 1260
EGKKGHVLSY EGGMSVTQCS KEDGRSSSGP PHETAAPKRT YDMMEGRVGR AISSASIEGL 1320
MGRAIPPERH SPHHLKEQHH IRGSITQGIP RSYVEAQEDY LRREAKLLKR EGTPPPPPPS 1380
RDLTEAYKTQ ALGPLKLKPA HEGLVATVKE AGRSIHEIPR EELRHTPELP LAPRPLKEGS 1440
ITQGTPLKYD TGASTTGSKK HDVRSLIGSP GRTFPPVHPL DVMADARALE RACYEESLKS 1500
RPGTASSSGG SIARGAPVIV PELGKPRQSP LTYEDHGAPF AGHLPRGSPV TTREPTPRLQ 1560
EGSLSSSKAS QDRKLTSTPR EIAKSPHSTV PEHHPHPISP YEHLLRGVSG VDLYRSHIPL 1620
AFDPTSIPRG IPLDAAAAYY LPRHLAPNPT YPHLYPPYLI RGYPDTAALE NRQTIINDYI 1680
TSQQMHHNAA TAMAQRADML RGLSPRESSL ALNYAAGPRG IIDLSQVPHL PVLVPPTPGT 1740
PATAMDRLAY LPTAPQPFSS RHSSSPLSPG GPTHLTKPTT TSSSERERDR DRERDRDRER 1800
EKSILTSTTT VEHAPIWRPG TEQSSGSSGG GGGSSSRPAS HSHAHQHSPI SPRTQDALQQ 1860
RPSVLHNTGM KGIITAVEPS TPTVLRSTST SSPVRPAATF PPATHCPLGG TLDGVYPTLM 1920
EPVLLPKEAP RVARPERPRA DTGHAFLAKP PARSGLEPAS SPSKGSEPRP LVPPVSGHAT 1980
IARTPAKNLA PHHASPDPPA PPASASDPHR EKTQSKPFSI QELELRSLGY HGSSYSPEGV 2040
EPVSPVSSPS LTHDKGLPKH LEELDKSHLE GELRPKQPGP VKLGGEAAHL PHLRPLPESQ 2100
PSSSPLLQTA PGVKGHQRVV TLAQHISEVI TQDYTRHHPQ QLSAPLPAPL YSFPGASCPV 2160
LDLRRPPSDL YLPPPDHGAP ARGSPHSEGG KRSPEPNKTS VLGGGEDGIE PVSPPEGMTE 2220
PGHSRSAVYP LLYRDGEQTE PSRMGSKSPG NTSQPPAFFS KLTESNSAMV KSKKQEINKK 2280
LNTHNRNEPE YNISQPGTEI FNMPAITGTG LMTYRSQAVQ EHASTNMGLE AIIRKALMGK 2340
YDQWEESPPL SANAFNPLNA SASLPAAMPI TAADGRSDHT LTSPGGGGKA KVSGRPSSRK 2400
AKSPAPGLAS GDRPPSVSSV HSEGDCNRRT PLTNRVWEDR PSSAGSTPFP YNPLIMRLQA 2460
GVMASPPPPG LPAGSGPLAG PHHAWDEEPK PLLCSQYETL SDSE 2504 
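Each peptide listed under Lysine Modification appears to be a 15-residue window with the modified lysine at its centre (7 flanking residues on either side). The following is a minimal sketch, not part of CPLM itself, of how a listed position could be checked against the numbered sequence block. The helper names `parse_block` and `window`, the `flank=7` default, and the choice to work on a single excerpted line are assumptions made for illustration.

```python
import re

# One line copied from the Protein Sequence block of this record (residues 121-180).
SEQUENCE_LINE = "SPLLATGQPA GSEDLTKDRS LTGKLEPVSP PSPPHTDPEL ELVPPRLSKE ELIQNMDRVD 180"
LINE_START = 121  # 1-based position of the first residue in the excerpt


def parse_block(text):
    """Drop spaces and the trailing position counter, keeping residue letters only."""
    return re.sub(r"[^A-Z]", "", text)


def window(residues, position, start=1, flank=7):
    """Return `flank` residues either side of a 1-based sequence position.

    `start` is the 1-based position of residues[0]; `flank=7` is an assumed
    convention inferred from the 15-residue peptides in the table.
    """
    i = position - start
    return residues[max(0, i - flank): i + flank + 1]


residues = parse_block(SEQUENCE_LINE)
peptide = window(residues, 137, start=LINE_START)
print(peptide)
```

For position 137 the extracted window can be compared with the peptide in the first row of the table; the same check applies to the other rows once the full sequence is loaded with `start=1`.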
Gene Ontology
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro. 
Interpro
 IPR009057; Homeodomain-like.
 IPR001005; SANT/Myb.
 IPR017884; SANT_dom. 
Pfam
 PF00249; Myb_DNA-binding 
SMART
 SM00717; SANT 
PROSITE
 PS51293; SANT 
PRINTS