CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-017349 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Integrator complex subunit 1 
Protein Synonyms/Alias
 Int1 
Gene Name
 INTS1 
Gene Synonyms/Alias
 KIAA1440; UNQ1821/PRO3434 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
17        RRPSAAAKPSGHPPP  acetylation     [1]
47        KTASTLLKPAPSGLP  acetylation     [2]
305       ELLIAEEKLSPEQEG  ubiquitination  [3]
681       ELADHLVKRAAAVQA  ubiquitination  [3]
695       ADDVEVLKVGRTQLI  ubiquitination  [3, 4]
854       PHILDQVKSLNQSLR  ubiquitination  [3]
962       QDLLLGPKADEQTTC  ubiquitination  [5]
992       ASRVLAMKGLSLVLS  ubiquitination  [3, 4, 5, 6, 7]
1009      SLRDGEEKEPPMEED  ubiquitination  [3, 4]
1135      VRRMRQSKEGEEVYS  ubiquitination  [3]
1325      STEAPKPKSSPEQPI  ubiquitination  [3]
1543      VEPDLISKVLQGLIE  ubiquitination  [3, 5]
1868      DASMACRKLAVAHPL  ubiquitination  [8]
2052      AEMAPYMKRLSRGQT  ubiquitination  [4]
Reference
 [1] Aspirin acetylates multiple cellular proteins in HCT-116 colon cancer cells: Identification of novel targets.
 Marimuthu S, Chivukula RS, Alfonso LF, Moridani M, Hagen FK, Bhat GJ.
 Int J Oncol. 2011 Nov;39(5):1273-83. [PMID: 21743961]
 [2] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
 [3] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [4] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661]
 [5] Ubiquitin ligase substrate identification through quantitative proteomics at both the protein and peptide levels.
 Lee KA, Hammerle LP, Andrews PS, Stokes MP, Mustelin T, Silva JC, Black RA, Doedens JR.
 J Biol Chem. 2011 Dec 2;286(48):41530-8. [PMID: 21987572]
 [6] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094]
 [7] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [8] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
Functional Description
 Component of the Integrator complex, which is involved in transcription of the U1 and U2 small nuclear RNAs (snRNAs) and in their 3'-box-dependent processing. The Integrator complex associates with the C-terminal domain (CTD) of the largest subunit of RNA polymerase II (POLR2A) and is recruited to the U1 and U2 snRNA genes. 
Sequence Annotation
 MOD_RES 47 47 N6-acetyllysine.
 MOD_RES 307 307 Phosphoserine.
 MOD_RES 1318 1318 Phosphoserine.
 MOD_RES 1326 1326 Phosphoserine (By similarity).
 MOD_RES 1327 1327 Phosphoserine (By similarity).  
Keyword
 Acetylation; Complete proteome; Membrane; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2190 AA 
Protein Sequence
MNRAKPTTVR RPSAAAKPSG HPPPGDFIAL GSKGQANESK TASTLLKPAP SGLPSERKRD 60
AAAALSSASA LTGLTKRPKL SSTPPLSALG RLAEAAVAEK RAISPSIKEP SVVPIEVLPT 120
VLLDEIEAAE LEGNDDRIEG VLCGAVKQLK VTRAKPDSTL YLSLMYLAKI KPNIFATEGV 180
IEALCSLLRR DASINFKAKG NSLVSVLACN LLMAAYEEDE NWPEIFVKVY IEDSLGERIW 240
VDSPHCKTFV DNIQTAFNTR MPPRSVLLQG EAGRVAGDLG AGSSPHPSLT EEEDSQTELL 300
IAEEKLSPEQ EGQLMPRYEE LAESVEEYVL DMLRDQLNRR QPIDNVSRNL LRLLTSTCGY 360
KEVRLLAVQK LEMWLQNPKL TRPAQDLLMS VCMNCNTHGS EDMDVISHLI KIRLKPKVLL 420
NHFMLCIREL LSAHKDNLGT TIKLVIFNEL SSARNPNNMQ VLYTALQHSS ELAPKFLAMV 480
FQDLLTNKDD YLRASRALLR EIIKQTKHEI NFQAFCLGLM QERKEPQYLE MEFKERFVVH 540
ITDVLAVSMM LGITAQVKEA GIAWDKGEKR NLEVLRSFQN QIAAIQRDAV WWLHTVVPSI 600
SKLAPKDYVH CLHKVLFTEQ PETYYKWDNW PPESDRNFFL RLCSEVPILE DTLMRILVIG 660
LSRELPLGPA DAMELADHLV KRAAAVQADD VEVLKVGRTQ LIDAVLNLCT YHHPENIQLP 720
PGYQPPNLAI STLYWKAWPL LLVVAAFNPE NIGLAAWEEY PTLKMLMEMV MTNNYSYPPC 780
TLTDEETRTE MLNRELQTAQ REKQEILAFE GHLAAASTKQ TITESSSLLL SQLTSLDPQG 840
PPRRPPPHIL DQVKSLNQSL RLGHLLCRSR NPDFLLHIIQ RQASSQSMPW LADLVQSSEG 900
SLDVLPVQCL CEFLLHDAVD DAASGEEDDE GESKEQKAKK RQRQQKQRQL LGRLQDLLLG 960
PKADEQTTCE VLDYFLRRLG SSQVASRVLA MKGLSLVLSE GSLRDGEEKE PPMEEDVGDT 1020
DVLQGYQWLL RDLPRLPLFD SVRSTTALAL QQAIHMETDP QTISAYLIYL SQHTPVEEQA 1080
QHSDLALDVA RLVVERSTIM SHLFSKLSPS AASDAVLSAL LSIFSRYVRR MRQSKEGEEV 1140
YSWSESQDQV FLRWSSGETA TMHILVVHAM VILLTLGPPR ADDSEFQALL DIWFPEEKPL 1200
PTAFLVDTSE EALLLPDWLK LRMIRSEVLR LVDAALQDLE PQQLLLFVQS FGIPVSSMSK 1260
LLQFLDQAVA HDPQTLEQNI MDKNYMAHLV EVQHERGASG GQTFHSLLTA SLPPRRDSTE 1320
APKPKSSPEQ PIGQGRIRVG TQLRVLGPED DLAGMFLQIF PLSPDPRWQS SSPRPVALAL 1380
QQALGQELAR VVQGSPEVPG ITVRVLQALA TLLSSPHGGA LVMSMHRSHF LACPLLRQLC 1440
QYQRCVPQDT GFSSLFLKVL LQMLQWLDSP GVEGGPLRAQ LRMLASQASA GRRLSDVRGG 1500
LLRLAEALAF RQDLEVVSST VRAVIATLRS GEQCSVEPDL ISKVLQGLIE VRSPHLEELL 1560
TAFFSATADA ASPFPACKPV VVVSSLLLQE EEPLAGGKPG ADGGSLEAVR LGPSSGLLVD 1620
WLEMLDPEVV SSCPDLQLRL LFSRRKGKGQ AQVPSFRPYL LTLFTHQSSW PTLHQCIRVL 1680
LGKSREQRFD PSASLDFLWA CIHVPRIWQG RDQRTPQKRR EELVLRVQGP ELISLVELIL 1740
AEAETRSQDG DTAACSLIQA RLPLLLSCCC GDDESVRKVT EHLSGCIQQW GDSVLGRRCR 1800
DLLLQLYLQR PELRVPVPEV LLHSEGAASS SVCKLDGLIH RFITLLADTS DSRALENRGA 1860
DASMACRKLA VAHPLLLLRH LPMIAALLHG RTHLNFQEFR QQNHLSCFLH VLGLLELLQP 1920
HVFRSEHQGA LWDCLLSFIR LLLNYRKSSR HLAAFINKFV QFIHKYITYN APAAISFLQK 1980
HADPLHDLSF DNSDLVMLKS LLAGLSLPSR DDRTDRGLDE EGEEESSAGS LPLVSVSLFT 2040
PLTAAEMAPY MKRLSRGQTV EDLLEVLSDI DEMSRRRPEI LSFFSTNLQR LMSSAEECCR 2100
NLAFSLALRS MQNSPSIAAA FLPTFMYCLG SQDFEVVQTA LRNLPEYALL CQEHAAVLLH 2160
RAFLVGMYGQ MDPSAQISEA LRILHMEAVM 2190 
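
The peptides listed under Lysine Modification appear to be 15-residue windows centered on the modified lysine (7 residues on each side). The following minimal sketch illustrates how such a window could be recovered from the block-formatted sequence above. It is not part of the CPLM record: the helper names `parse_sequence` and `site_window` are hypothetical, and only the first two sequence rows (residues 1-120) are embedded to keep the example short.

```python
import re

# First two rows of the "Protein Sequence" block, copied verbatim (residues 1-120).
SEQUENCE_BLOCK = """\
MNRAKPTTVR RPSAAAKPSG HPPPGDFIAL GSKGQANESK TASTLLKPAP SGLPSERKRD 60
AAAALSSASA LTGLTKRPKL SSTPPLSALG RLAEAAVAEK RAISPSIKEP SVVPIEVLPT 120
"""


def parse_sequence(block: str) -> str:
    """Strip spaces, line breaks, and running residue counts from the block."""
    return re.sub(r"[^A-Z]", "", block)


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return residues position-flank .. position+flank (1-based, inclusive).

    Assumption: windows near either terminus are simply truncated.
    """
    start = max(position - 1 - flank, 0)
    return sequence[start:position + flank]


sequence = parse_sequence(SEQUENCE_BLOCK)

# Positions 17 and 47 are the two acetylation sites that fall within residues 1-120.
for pos in (17, 47):
    print(pos, sequence[pos - 1], site_window(sequence, pos))
```

Pasting the full 2190-residue block into `SEQUENCE_BLOCK` would allow the same check for the remaining sites.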
Gene Ontology
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0032039; C:integrator complex; IDA:HGNC.
 GO:0031965; C:nuclear membrane; IEA:UniProtKB-SubCell.
 GO:0001833; P:inner cell mass cell proliferation; IEA:Compara.
 GO:0043066; P:negative regulation of apoptotic process; IEA:Compara.
 GO:0043154; P:negative regulation of cysteine-type endopeptidase activity involved in apoptotic process; IEA:Compara.
 GO:0016180; P:snRNA processing; IDA:HGNC.
 GO:0034474; P:U2 snRNA 3'-end processing; IEA:Compara. 
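
Each Gene Ontology line above packs three fields separated by semicolons: the GO identifier, an aspect letter with the term name (C for cellular component, P for biological process, F for molecular function), and an evidence code with its source. A minimal sketch of splitting one line into those fields follows. The names `GoAnnotation` and `parse_go_line` are hypothetical, and the sketch assumes that term names contain neither semicolons nor colons, which holds for the lines in this record.

```python
from typing import NamedTuple

# Aspect letters as used in UniProt-style GO cross-references.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line: str) -> GoAnnotation:
    """Split a line such as 'GO:0032039; C:integrator complex; IDA:HGNC.'"""
    # Drop surrounding whitespace and the trailing period, then split on ';'.
    parts = [p.strip() for p in line.strip().rstrip(".").split(";")]
    go_id, aspect_term, evidence_source = parts
    aspect_code, _, term = aspect_term.partition(":")
    evidence, _, source = evidence_source.partition(":")
    return GoAnnotation(go_id, ASPECTS[aspect_code], term, evidence, source)


annotation = parse_go_line(" GO:0032039; C:integrator complex; IDA:HGNC.")
print(annotation)
```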
Interpro
 IPR022145; DUF3677. 
Pfam
 PF12432; DUF3677 
SMART
  
PROSITE
  
PRINTS