CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-021395
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 HEAT repeat-containing protein 1 
Protein Synonyms/Alias
 Protein BAP28 
Gene Name
 HEATR1 
Gene Synonyms/Alias
 BAP28 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 34        ASLLFDPKEAATIDR  ubiquitination  [1]
 84        LERSVQTKAVNKQLD  ubiquitination  [2]
 201       SLVTKSVKVFAEYPG  ubiquitination  [2]
 248       KLFPYIQKGLKSSLP  ubiquitination  [3, 4, 5]
 251       PYIQKGLKSSLPDYR  ubiquitination  [2]
 385       HLEAILTKISLKNNL  ubiquitination  [2]
 435       LIRLLESKYPRTLDV  ubiquitination  [2, 6]
 456       KEIADLKKQELFHQF  ubiquitination  [4]
 503       ILAMNHLKKIMKTSK  ubiquitination  [4]
 504       LAMNHLKKIMKTSKE  ubiquitination  [4]
 510       KKIMKTSKEGVDESF  ubiquitination  [2, 4]
 519       GVDESFIKEAVLARL  ubiquitination  [2, 4]
 568       FQRAELSKNGEWYEV  ubiquitination  [2, 4]
 627       KIAIYLSKSGICSLH  ubiquitination  [4]
 663       LIGVANQKMIELLAD  ubiquitination  [2, 4]
 699       EEESFNLKQKVTFHV  ubiquitination  [2]
 735       RVFSLLQKKIKKLES  ubiquitination  [2]
 801       KKFIYALKAPKSFPK  ubiquitination  [2, 4]
 909       SSQKTQCKHQLASIS  ubiquitination  [4]
 932       INLGSPVKEVRRAAI  ubiquitination  [2]
 1122      FAAISDEKVQQKLLR  ubiquitination  [2, 4, 7]
 1189      RQKMQQKKSQDLESV  ubiquitination  [4]
 1272      KLSPDGGKIPKDILD  ubiquitination  [4]
 1348      YSFQVINKTVKMVIP  ubiquitination  [2, 4]
 1351      QVINKTVKMVIPALI  ubiquitination  [2]
 1483      EKEETIPKAVSFNKS  ubiquitination  [3, 5]
 1508      NVETHTSKQLRHFKF  ubiquitination  [2, 4]
 1544      SGGPEILKGLEERLL  ubiquitination  [2]
 1575      NADKLTVKFWRALLS  ubiquitination  [2, 3, 5]
 1583      FWRALLSKAYDLLDK  ubiquitination  [2]
 1618      PLPSVRRKALDLLNN  ubiquitination  [4]
 1626      ALDLLNNKLQQNISW  ubiquitination  [4]
 1634      LQQNISWKKTIVTRF  ubiquitination  [2, 4]
 1635      QQNISWKKTIVTRFL  ubiquitination  [4]
 1643      TIVTRFLKLVPDLLA  ubiquitination  [8]
 1675      QTALYTLKLLCKNFG  ubiquitination  [4]
 1679      YTLKLLCKNFGAENP  ubiquitination  [4]
 1811      IRLTSLKKTLATTLA  ubiquitination  [4]
 1828      VLLPAIKKTYKQIEK  ubiquitination  [4]
 2036      NRLGGEEKFQERVTK  ubiquitination  [4]
Reference
 [1] Proteome-wide identification of ubiquitylation sites by conjugation of engineered lysine-less ubiquitin.
 Oshikawa K, Matsumoto M, Oyamada K, Nakayama KI.
 J Proteome Res. 2012 Feb 3;11(2):796-807. [PMID: 22053931]
 [2] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [4] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [5] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [6] Methods for quantification of in vivo changes in protein ubiquitination following proteasome and deubiquitinase inhibition.
 Udeshi ND, Mani DR, Eisenhaure T, Mertins P, Jaffe JD, Clauser KR, Hacohen N, Carr SA.
 Mol Cell Proteomics. 2012 May;11(5):148-59. [PMID: 22505724]
 [7] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094]
 [8] Ubiquitin ligase substrate identification through quantitative proteomics at both the protein and peptide levels.
 Lee KA, Hammerle LP, Andrews PS, Stokes MP, Mustelin T, Silva JC, Black RA, Doedens JR.
 J Biol Chem. 2011 Dec 2;286(48):41530-8. [PMID: 21987572]
Functional Description
 Involved in nucleolar processing of pre-18S ribosomal RNA. Involved in ribosome biosynthesis (By similarity). 
Sequence Annotation
 REPEAT 2106 2142 HEAT.
 MOD_RES 1190 1190 Phosphoserine.  
Keyword
 Complete proteome; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Ribonucleoprotein; Ribosome biogenesis; rRNA processing. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2144 AA 
Protein Sequence
MTSLAQQLQR LALPQSDASL LSRDEVASLL FDPKEAATID RDTAFAIGCT GLEELLGIDP 60
SFEQFEAPLF SQLAKTLERS VQTKAVNKQL DENISLFLIH LSPYFLLKPA QKCLEWLIHR 120
FHIHLYNQDS LIACVLPYHE TRIFVRVIQL LKINNSKHRW FWLLPVKQSG VPLAKGTLIT 180
HCYKDLGFMD FICSLVTKSV KVFAEYPGSS AQLRVLLAFY ASTIVSALVA AEDVSDNIIA 240
KLFPYIQKGL KSSLPDYRAA TYMIICQISV KVTMENTFVN SLASQIIKTL TKIPSLIKDG 300
LSCLIVLLQR QKPESLGKKP FPHLCNVPDL ITILHGISET YDVSPLLHYM LPHLVVSIIH 360
HVTGEETEGM DGQIYKRHLE AILTKISLKN NLDHLLASLL FEEYISYSSQ EEMDSNKVSL 420
LNEQFLPLIR LLESKYPRTL DVVLEEHLKE IADLKKQELF HQFVSLSTSG GKYQFLADSD 480
TSLMLSLNHP LAPVRILAMN HLKKIMKTSK EGVDESFIKE AVLARLGDDN IDVVLSAISA 540
FEIFKEHFSS EVTISNLLNL FQRAELSKNG EWYEVLKIAA DILIKEEILS ENDQLSNQVV 600
VCLLPFMVIN NDDTESAEMK IAIYLSKSGI CSLHPLLRGW EEALENVIKS TKPGKLIGVA 660
NQKMIELLAD NINLGDPSSM LKMVEDLISV GEEESFNLKQ KVTFHVILSV LVSCCSSLKE 720
THFPFAIRVF SLLQKKIKKL ESVITAVEIP SEWHIELMLD RGIPVELWAH YVEELNSTQR 780
VAVEDSVFLV FSLKKFIYAL KAPKSFPKGD IWWNPEQLKE DSRDYLHLLI GLFEMMLNGA 840
DAVHFRVLMK LFIKVHLEDV FQLFKFCSVL WTYGSSLSNP LNCSVKTVLQ TQALYVGCAM 900
LSSQKTQCKH QLASISSPVV TSLLINLGSP VKEVRRAAIQ CLQALSGVAS PFYLIIDHLI 960
SKAEEITSDA AYVIQDLATL FEELQREKKL KSHQKLSETL KNLLSCVYSC PSYIAKDLMK 1020
VLQGVNGEMV LSQLLPMAEQ LLEKIQKEPT AVLKDEAMVL HLTLGKYNEF SVSLLNEDPK 1080
SLDIFIKAVH TTKELYAGMP TIQITALEKI TKPFFAAISD EKVQQKLLRM LFDLLVNCKN 1140
SHCAQTVSSV FKGISVNAEQ VRIELEPPDK AKPLGTVQQK RRQKMQQKKS QDLESVQEVG 1200
GSYWQRVTLI LELLQHKKKL RSPQILVPTL FNLLSRCLEP LPQEQGNMEY TKQLILSCLL 1260
NICQKLSPDG GKIPKDILDE EKFNVELIVQ CIRLSEMPQT HHHALLLLGT VAGIFPDKVL 1320
HNIMSIFTFM GANVMRLDDT YSFQVINKTV KMVIPALIQS DSGDSIEVSR NVEEIVVKII 1380
SVFVDALPHV PEHRRLPILV QLVDTLGAEK FLWILLILLF EQYVTKTVLA AAYGEKDAIL 1440
EADTEFWFSV CCEFSVQHQI QSLMNILQYL LKLPEEKEET IPKAVSFNKS ESQEEMLQVF 1500
NVETHTSKQL RHFKFLSVSF MSQLLSSNNF LKKVVESGGP EILKGLEERL LETVLGYISA 1560
VAQSMERNAD KLTVKFWRAL LSKAYDLLDK VNALLPTETF IPVIRGLVGN PLPSVRRKAL 1620
DLLNNKLQQN ISWKKTIVTR FLKLVPDLLA IVQRKKKEGE EEQAINRQTA LYTLKLLCKN 1680
FGAENPDPFV PVLNTAVKLI APERKEEKNV LGSALLCIAE VTSTLEALAI PQLPSLMPSL 1740
LTTMKNTSEL VSSEVYLLSA LAALQKVVET LPHFISPYLE GILSQVIHLE KITSEMGSAS 1800
QANIRLTSLK KTLATTLAPR VLLPAIKKTY KQIEKNWKNH MGPFMSILQE HIGVMKKEEL 1860
TSHQSQLTAF FLEALDFRAQ HSENDLEEVG KTENCIIDCL VAMVVKLSEV TFRPLFFKLF 1920
DWAKTEDAPK DRLLTFYNLA DCIAEKLKGL FTLFAGHLVK PFADTLNQVN ISKTDEAFFD 1980
SENDPEKCCL LLQFILNCLY KIFLFDTQHF ISKERAEALM MPLVDQLENR LGGEEKFQER 2040
VTKHLIPCIA QFSVAMADDS LWKPLNYQIL LKTRDSSPKV RFAALITVLA LAEKLKENYI 2100
VLLPESIPFL AELMEDECEE VEHQCQKTIQ QLETVLGEPL QSYF 2144 
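The modification positions listed in this record can be cross-checked against the sequence block. The Python sketch below is illustrative only and not part of CPLM: the helper names `parse_sequence_block` and `window_around` are hypothetical. It assumes that each listed peptide is the 15-residue window centred on the modified lysine, which is how the peptides in this record appear to be built. Only the first two lines of the sequence are embedded, so only positions 34 and 84 are checked.

```python
import re

# First two lines of the "Protein Sequence" block, copied from this record.
# The trailing numbers (60, 120) are running residue counts, not sequence.
RAW_BLOCK = """
MTSLAQQLQR LALPQSDASL LSRDEVASLL FDPKEAATID RDTAFAIGCT GLEELLGIDP 60
SFEQFEAPLF SQLAKTLERS VQTKAVNKQL DENISLFLIH LSPYFLLKPA QKCLEWLIHR 120
"""


def parse_sequence_block(block: str) -> str:
    """Strip spaces, newlines and residue counters; return the bare sequence."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def window_around(seq: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position.

    The window is truncated at the sequence ends (assumed behaviour).
    """
    start = max(0, position - 1 - flank)
    end = min(len(seq), position + flank)
    return seq[start:end]


seq = parse_sequence_block(RAW_BLOCK)

# (position, peptide) pairs taken from the Lysine Modification table.
sites = [(34, "ASLLFDPKEAATIDR"), (84, "LERSVQTKAVNKQLD")]

for position, peptide in sites:
    residue = seq[position - 1]          # convert 1-based to 0-based index
    matches = window_around(seq, position) == peptide
    print(position, residue, matches)
```

With the full 2144-residue sequence in `RAW_BLOCK`, the same loop could be run over every row of the modification table.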
Gene Ontology
 GO:0005739; C:mitochondrion; IDA:HPA.
 GO:0005730; C:nucleolus; IDA:HPA.
 GO:0030529; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
 GO:0006364; P:rRNA processing; IEA:UniProtKB-KW. 
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR012954; BP28_C_dom.
 IPR022125; U3snoRNP10. 
Pfam
 PF08146; BP28CT
 PF12397; U3snoRNP10 
SMART
 SM01036; BP28CT 
PROSITE
 PS50077; HEAT_REPEAT 
PRINTS