CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-041834
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 HEAT repeat containing 1 
Protein Synonyms/Alias
 Protein Heatr1 
Gene Name
 Heatr1 
Gene Synonyms/Alias
 mCG_119362 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide  Type  References
 693  DLVCAGEKESYSLKQ  acetylation  [1]
Reference
 [1] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2143 AA 
Protein Sequence
MTSLAQQLQR LALPQTDPSL LSRREVASLL FDPKEAATID RDTAFAIGCT GLEELLGIDP 60
AFEQFEAPLF SQLAKGLERS VQTKAVNKQL DENISSFLLH LSPYFLLKPA QKCLEWLIHR 120
FHIHLYNADS LIACVLPYHE TRVFVRVIQL LKISNPKHKW FWLSPVKHSG VPLARGTLVT 180
HCYKDLGFMD FICSLVTRSV KAFAEDPGSS TRLRVLLAFY ASTIVSALVA AENLSDNVVA 240
KLFPYIQKGL KSSLPDYRAA TYMIICQISV KVTMEDTFVK SLASQLIKTL TKVPSQVNDG 300
LGCLIILLQR QKPENLGEKP FLHLCGVPDL IGLLHGISES YDVSPLLRCM LPHLVASVVQ 360
HIAGEEAEGI DGQIYKNHLE EILIKIPLTN NLDHLLASHL FEEYISYSSQ EETQANGVAL 420
LNEQFLPLIR LLESKYPRAL DAVLEEHLKE ITGLKKQELF HQFISLSTSG GKYQFLEDSD 480
TSLMLSLNHP LAPVRLLAVN HLKTFMKTSK EGIDETFIKE AILTRLGDDN VDVVLATLSA 540
FEIFQQHFGV EETVSSLLNL FQRADLSKNE GWFRVLELAA NILIKEEILS KNDQLANQVV 600
VQLLPFMVIT SNDIESPDMK IAIHLSKSGI CSLHPLLRGW KEALENVIKS RKSREIIGVG 660
NQKMVQLLGS NLSLGERSTV LKLVEDLVCA GEKESYSLKQ KVAFHVTVSV LISCCSSFQE 720
TCFPFALRVF SLLQKKIRKL KSVITAVEIP SEWHLELMLN RGLPEELWVR YVQELHGAQR 780
VVMEDAILLV FSMKCFIFAM KAPKSFPTGA MWWNPEQLDE DSRHYLHLLI GIFEMLLEVS 840
DAMHFRVLIR LIMKVHLQDV LQLFKFFCVL WTYGSSLSNP LNCTVKSELQ TQALYIGSAM 900
LSSQNTQYKQ KLASTASPVV MSLLLNLGSH IKEVRRAAVQ CLQALRGVPS KFELVIDHLI 960
PKAEEITSDA TYVLQDLATL FDELQNEEKQ KSHQKLSETL RSLLHCVYGC PSYIAKGLMK 1020
VLQGVNSEMV LAQLLPMVEQ LLEKVEKEPT AVLKDEAVVL HLTLGKYNEY SASLLQKDPK 1080
SLDLFIKAMH TTKELHPGMP TVRITALEKI TKPFFAAVSD GQVQQKLLCV LFDLLVNCKD 1140
AHCVQTVGSV FKGISVDAEQ IRIELEPRDK AKSLGTIQQT RRQKMQQKKS QDVESVQEVE 1200
GPYWQRVTLI LELLQHKKKL KCPQILIPPL FNLLSRCLEP LSSEQGNMEY TKQLILSCLL 1260
NICQKLSPDG GRIPKDVVDE EKFNVELIVQ CIRLSEMPQT HHHALLLLGT VAGIFPDKVL 1320
HNIMSIFTFM GANVMRLDDA YSFQVISKTV KMVIPALIQS DTGDSVEVTR NVEQIVVKII 1380
GVFVDALPHV PEHRRLPILV QLVTTLSAKK FLWILLVLLF EQYVTKTVLV AAYGEKDAIL 1440
EADTEFWISV CCEFSVQHQV QSLMHILHYL EKLPEEKEEA TSKTVSTKSE VQDEMLPVFK 1500
VDAHTSKQLR HFKYLSVSFM SQLLASNHFL KKVVGSGGPK SLHGLEQGLL ETVLSYINTV 1560
AQSMEKNADK LTGKFWRALL SKAYDMLDKV NALLPTETFI SVIRGLVGNP LPSVRRKALD 1620
LLNNKLQQHT FWRKKMVHRF LKLVPVLLAI VQHKKREAED EQAINRQTAL YTLKLLCKNF 1680
GAQNREPFIP VLSTAVKLIE PEKKEEKNVL GSALLCIAEV TSTLEALAIP QLPSLMPSLL 1740
TAMKSTSELV HSEVCLLSAL AALHKVVETL PHFISPYLEG LLTQVIHLEK ITREMGSASQ 1800
ANIRLTALKK TLATELSPRV LLPAISKTFK QIQKNWKNHM GPFMSILQEH IGVMKKEELL 1860
SHQSQLTTFF LEALDFRAQH SEDDLEEVGK TEGWIIDCLV AMVVKLSEVT FRPLFFKLFD 1920
WAKTEDAPKD RLLTFYNLAD CIAEKLKGLF TLFAGHLVKP FADTLNQVNI SKTDEAFFDS 1980
ERDPEKCCLL LQFILNCLYK VFLFDTQNFM SRERAEALMM PLVDQLENRL GGEERFQERV 2040
TKYLVPCIAQ FSVAMADDSM WKPLNYQILL KTRDSSPKVR FAALITVLAL AEKLRENYIV 2100
LLPESIPFLA ELMEDECEEV EHQCQKTIQQ LEAVLGEPLQ SYF 2143 
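The modification site listed above (position 693) can be cross-checked against the numbered sequence. The following is a minimal sketch, not part of CPLM: the helper names `parse_line` and `window` are hypothetical, and it assumes each sequence line ends with the 1-based position of its last residue, as the lines in this record appear to do.

```python
def parse_line(line):
    """Split one numbered sequence line into (start_position, residues).

    Assumption: the trailing integer is the 1-based position of the
    last residue on the line, so the start is derived from the length.
    """
    *blocks, end = line.split()
    residues = "".join(blocks)
    return int(end) - len(residues) + 1, residues


def window(line, position, flank=7):
    """Return the residues within `flank` of `position` on this line."""
    start, residues = parse_line(line)
    i = position - start  # 0-based index within this line
    if not 0 <= i < len(residues):
        raise ValueError(f"position {position} is not on this line")
    return residues[max(0, i - flank): i + flank + 1]


# The sequence line of this record that ends at residue 720.
line = "NQKMVQLLGS NLSLGERSTV LKLVEDLVCA GEKESYSLKQ KVAFHVTVSV LISCCSSFQE 720"

start, residues = parse_line(line)
print(start)                  # 661
print(residues[693 - start])  # K
print(window(line, 693))      # DLVCAGEKESYSLKQ
```

Under that assumption, residue 693 is a lysine and the 15-residue window centred on it matches the peptide given in the Lysine Modification table. Because the start is derived from the trailing number and the line length, the same approach also handles the shorter final line.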
Gene Ontology
 GO:0005739; C:mitochondrion; IEA:Compara.
 GO:0005730; C:nucleolus; IEA:Compara. 
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR012954; BP28_C_dom.
 IPR022125; U3snoRNP10. 
Pfam
 PF08146; BP28CT
 PF12397; U3snoRNP10 
SMART
 SM01036; BP28CT 
PROSITE
  
PRINTS