CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-015656
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Integrator complex subunit 8 
Protein Synonyms/Alias
 Int8; Protein kaonashi-1 
Gene Name
 INTS8 
Gene Synonyms/Alias
 C8orf52 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
165SFPVKQAKPGPPQLSubiquitination[1]
189ELTENILKVLKEQAAubiquitination[1, 2]
210EAALKLNKDLYVHTMubiquitination[2]
242ESSTAGLKVKTEEMQubiquitination[1]
369VLRELFKKAQQGNEAubiquitination[2]
430SRSVNLEKASESLKGubiquitination[1, 2]
436EKASESLKGNMAAFLubiquitination[1, 2]
475LLKDEERKLLVDQMRubiquitination[2]
702QLLAATCKELPGPKEubiquitination[2]
743DGRVSLIKQRESTLGubiquitination[2]
762SELLSFIKKLREPLVubiquitination[1]
910FLREIDYKTAFKSLQubiquitination[2]
959DKRQIAIKAIGQTELubiquitination[1]
985AAQRRKKKFLQAMAKubiquitination[2]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
 Component of the Integrator complex, a complex involved in the small nuclear RNAs (snRNA) U1 and U2 transcription and in their 3'-box-dependent processing. The Integrator complex is associated with the C-terminal domain (CTD) of RNA polymerase II largest subunit (POLR2A) and is recruited to the U1 and U2 snRNAs genes. 
Sequence Annotation
 REPEAT 250 288 TPR 1.
 REPEAT 320 356 TPR 2.
 REPEAT 570 603 TPR 3.
 REPEAT 833 866 TPR 4.
 MOD_RES 18 18 Phosphothreonine.  
Keyword
 Alternative splicing; Complete proteome; Nucleus; Phosphoprotein; Reference proteome; Repeat; TPR repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 995 AA 
Protein Sequence
MSAEAADREA ATSSRPCTPP QTCWFEFLLE ESLLEKHLRK PCPDPAPVQL IVQFLEQASK 60
PSVNEQNQVQ PPPDNKRNRI LKLLALKVAA HLKWDLDILE KSLSVPVLNM LLNELLCISK 120
VPPGTKHVDM DLATLPPTTA MAVLLYNRWA IRTIVQSSFP VKQAKPGPPQ LSVMNQMQQE 180
KELTENILKV LKEQAADSIL VLEAALKLNK DLYVHTMRTL DLLAMEPGMV NGETESSTAG 240
LKVKTEEMQC QVCYDLGAAY FQQGSTNSAV YENAREKFFR TKELIAEIGS LSLHCTIDEK 300
RLAGYCQACD VLVPSSDSTS QQLTPYSQVH ICLRSGNYQE VIQIFIEDNL TLSLPVQFRQ 360
SVLRELFKKA QQGNEALDEI CFKVCACNTV RDILEGRTIS VQFNQLFLRP NKEKIDFLLE 420
VCSRSVNLEK ASESLKGNMA AFLKNVCLGL EDLQYVFMIS SHELFITLLK DEERKLLVDQ 480
MRKRSPRVNL CIKPVTSFYD IPASASVNIG QLEHQLILSV DPWRIRQILI ELHGMTSERQ 540
FWTVSNKWEV PSVYSGVILG IKDNLTRDLV YILMAKGLHC STVKDFSHAK QLFAACLELV 600
TEFSPKLRQV MLNEMLLLDI HTHEAGTGQA GERPPSDLIS RVRGYLEMRL PDIPLRQVIA 660
EECVAFMLNW RENEYLTLQV PAFLLQSNPY VKLGQLLAAT CKELPGPKES RRTAKDLWEV 720
VVQICSVSSQ HKRGNDGRVS LIKQRESTLG IMYRSELLSF IKKLREPLVL TIILSLFVKL 780
HNVREDIVND ITAEHISIWP SSIPNLQSVD FEAVAITVKE LVRYTLSINP NNHSWLIIQA 840
DIYFATNQYS AALHYYLQAG AVCSDFFNKA VPPDVYTDQV IKRMIKCCSL LNCHTQVAIL 900
CQFLREIDYK TAFKSLQEQN SHDAMDSYYD YIWDVTILEY LTYLHHKRGE TDKRQIAIKA 960
IGQTELNASN PEEVLQLAAQ RRKKKFLQAM AKLYF 995 
Gene Ontology
 GO:0032039; C:integrator complex; IDA:HGNC.
 GO:0016180; P:snRNA processing; IDA:HGNC. 
Interpro
  
Pfam
  
SMART
  
PROSITE
 PS50005; TPR
 PS50293; TPR_REGION 
PRINTS