CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID
 CPLM-023404 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 HEAT repeat-containing protein 1 homolog 
Protein Synonyms/Alias
  
Gene Name
 l(2)k09022 
Gene Synonyms/Alias
 CG10805 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position  Peptide          Type         References
35        ASILFDPKEAATKDR  acetylation  [1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
 Involved in nucleolar processing of pre-18S ribosomal RNA. Involved in ribosome biosynthesis (By similarity). 
Sequence Annotation
 REPEAT 2058 2094 HEAT.  
Keyword
 Complete proteome; Nucleus; Reference proteome; Ribonucleoprotein; Ribosome biogenesis; rRNA processing. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2096 AA 
Protein Sequence
MSTALAQQLQ KLAAPQSSVT LADARSRASI LFDPKEAATK DRRSIYEIGL TGLQELTDFN 60
PAFKEFQLTL FDEATLTLER SVELPEINKM LDAAIAKFLR LLSPYLLLRP AHMAFEWLLR 120
RFQVHEYNRS EVMALILPYH ETMIFVQIVK TMRLRSSDGD WYWLRPLQRP GVPLAKTAII 180
NRAASNPAFL GFICQSTQKA VKELGPRAHQ LQAQINFYAT VVVGALQTAK PLQDWHITTI 240
LESLLRGLIS DNIDFMAAAY VIVAQLVSRT KLKSKVCNAL LERVANCPFE RLHSESLLLL 300
VCIYGKQQAA LPHFKPETIL NLVGKKWLIS TLSSLAKGNI AIQSICMPLM TGAVAAIRDD 360
DASSNSCKLF LDNLLSEVPM PKPTAQQLIN CFLDTYVETA IDAPEPMETN SNEDDDTIVI 420
DSDDEIETEK TTFQAWYSTY LEKLERRYPE AFDLSVKEAL RSKSSTSNRQ KALKLALGFR 480
LNTTDEKAKH AYEKLYHYSA DWRLSAVQKL LQNLNVTKKR ERSVKLLQEC LPDRINDDSG 540
AVVSTLLSLP TEELAEMLGP LPLAQTLCHL LYRAQSEKDE EWQPVVPLAV RHLTSALVSG 600
SYDTNLVLLA LMPLLFPGEA LAEHQHKALR ILLGSDFVSK VPFLAELKVS NKFSDFNVGE 660
HRQHFLDIIA SSNQELSSQE RALLQSVEDH GGELYIQKAS QLTHLLLLLT AYAKRELQPR 720
ESLHMLEKIG LYSRRLQFRV VNGSQNTQNC APLQLYVDFL LTLVKNTKWT ALASTPWNQM 780
TDELRLCLRL LEIICAQVFS EKADQPERQE WTRALQQSLQ LILPEAQDRL EVLSNFYVFE 840
RLPELWPRDS DYAVFRLQGF IILEAVLSNP KSQIDCGLVH VLRVANACGS PLQTLRVQAI 900
NILQLISNRK LVSHVEQLVR SLLQRKSELS MDHEQYALIL YTILEPEKAT AKERLVLSKL 960
KRSVLALASD PKQSPICTAS LLAALKHVND ENFLNELLPL GLDSLKTITA GEDNQNIKQL 1020
PWPHSEIYKS VIERFEGRVA LNVLLRKDLA WKLFEDSFAQ YDTYVQLEQK LQPLPCVLLN 1080
SLTPETFEQM HAKHKIALIK LIVESATNSD NDSIFLASHR LLKRCRLDCQ PLVPILLEMA 1140
NTKVEKKQPV KRRSVQATQL DLTSPYWKQG MTLLELLEHK KQLVGAELLI PPLFELLQAC 1200
LTMEEHSAAE YPKQLILSSL LHCCQTAQSA GVQLVKAMPE SSFRIELVVQ SLRNTRNPQT 1260
QQHALLFLTH CAGMYPQQVL HKIVEIFTFV GSTVARHDDA FSLHIIHNVV ESIIPILLLN 1320
TGHNELVIPV LKVFADICTD VPVHRRLPLY ATLFRVLEPK EHLWQFLCII FESQVLLEQV 1380
PQKVSTDKSR LDFARELTLM FEDPTVAIQT CIRLLDYLAK LPATKSSLSG GSGSSVLSTE 1440
QQLFDVRTRT FKQLRHYKYL IMDFLSGISS CNEWEKKMKR PDPNELLPYY QEFILKTLAY 1500
VGVLNGALEA ASETPSLEKF WRVLANHAHD VLDNAIGLLA PQHFISVITE LLKHDHVYVR 1560
IKVMDLLVTK LSPSSDYFQQ SNAEHFGVLF APLQEIINGI LEGSSNSAQQ AKLQQTALHA 1620
LQLLALRHGR DYIEECRSLL ATLTKITKRR ANVPKAVVGN VVLTLVEICA SLKAHALAQL 1680
PKFAPQLTEL LKEQVHQMAS LKQGPDYVCS TLVTALHKLF KALPLFLGPY LVDIIGGLAR 1740
LSVQLENPQL LQDKRTQVLK QKLADVWSAV AQGVEVRILV PSCAKAFSSL LEQQAYDELG 1800
HLMQQLLLQS VRHNSAAQLQ PVQDPLSELF LQALNFRLQV RGLGLQRQLV SDVEASITET 1860
FVTWILKLSE TSFRPMYSRV HKWALESTSR ETRLTYFLLT NRIAEALKSL FVLFASDFVE 1920
DSSRLLTEHN SIRPEFEVEE REDDVDLLMA ILNTLHHVFL YCSEDFINDH RFNVLMPPLV 1980
NQLENDLVLG NESLQQVLSN CIAQFAVATN DVMWKQLNSQ VLLKTRTSNP EVRILAFNSC 2040
VAIARKLGES YAALLPETVP FIAELLEDEH QRVEKNTRTG VQELETILGE SVQKYL 2096 
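The sequence above is laid out in groups of 10 residues with a running residue count at the end of each line. The following is a minimal sketch, not part of the CPLM record, of how such a block could be flattened into a plain string and how the annotated acetylation site (lysine at position 35 within peptide ASILFDPKEAATKDR) could be checked against it. The function names `parse_sequence_block` and `site_matches` are hypothetical, and only the first two lines of the sequence are used as sample input.

```python
# Hypothetical helpers for illustration; not part of CPLM or UniProt tooling.

# First two lines of the record's sequence block, copied verbatim.
SAMPLE = """
MSTALAQQLQ KLAAPQSSVT LADARSRASI LFDPKEAATK DRRSIYEIGL TGLQELTDFN 60
PAFKEFQLTL FDEATLTLER SVELPEINKM LDAAIAKFLR LLSPYLLLRP AHMAFEWLLR 120
"""


def parse_sequence_block(text):
    """Join the residue groups and verify each line's running count."""
    sequence = ""
    for line in text.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        running_count = None
        # Assumption: a trailing all-digit token is the cumulative residue count.
        if tokens[-1].isdigit():
            running_count = int(tokens.pop())
        sequence += "".join(tokens)
        if running_count is not None and running_count != len(sequence):
            raise ValueError(
                f"running count {running_count} != parsed length {len(sequence)}"
            )
    return sequence


def site_matches(sequence, position, peptide, residue="K"):
    """Check that the 1-based position holds `residue` and lies inside `peptide`."""
    if not 1 <= position <= len(sequence):
        return False
    if sequence[position - 1] != residue:
        return False
    start = sequence.find(peptide)
    while start != -1:
        # The peptide covers 1-based positions start+1 .. start+len(peptide).
        if start < position <= start + len(peptide):
            return True
        start = sequence.find(peptide, start + 1)
    return False


if __name__ == "__main__":
    seq = parse_sequence_block(SAMPLE)
    print(len(seq), site_matches(seq, 35, "ASILFDPKEAATKDR"))
```

Run on the full block, the same parser should yield a string whose length can be compared with the Protein Length field (2096 AA); the running-count check flags lines where residues were lost in extraction.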
Gene Ontology
 GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
 GO:0030529; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
 GO:0022008; P:neurogenesis; IMP:FlyBase.
 GO:0006364; P:rRNA processing; IEA:UniProtKB-KW. 
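Each Gene Ontology line above follows the pattern "GO ID; aspect:term; evidence:source.", where the aspect letter is C (cellular component), F (molecular function) or P (biological process). As a rough sketch under that assumption, the lines could be split into structured fields as follows. The names `GoAnnotation` and `parse_go_line` are hypothetical and belong to no existing library.

```python
from typing import NamedTuple

# Aspect letters as used in UniProt-style GO cross-references.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line):
    """Split one 'GO:...; X:term; EVIDENCE:source.' line into its fields."""
    fields = [field.strip() for field in line.strip().rstrip(".").split(";")]
    if len(fields) != 3:
        raise ValueError(f"expected 3 ';'-separated fields, got {len(fields)}")
    go_id, aspect_term, evidence_source = fields
    # Split only on the first colon so terms containing ':' stay intact.
    aspect_code, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return GoAnnotation(
        go_id, ASPECTS.get(aspect_code, aspect_code), term, evidence, source
    )


if __name__ == "__main__":
    print(parse_go_line(" GO:0022008; P:neurogenesis; IMP:FlyBase."))
```

Keeping the evidence code separate makes it easy to distinguish the experimentally supported annotation (IMP, from FlyBase) from the electronically inferred ones (IEA).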
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR012954; BP28_C_dom.
 IPR022125; U3snoRNP10. 
Pfam
 PF08146; BP28CT
 PF12397; U3snoRNP10 
SMART
 SM01036; BP28CT 
PROSITE
 PS50077; HEAT_REPEAT 
PRINTS