CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-034999
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 HEAT repeat containing 1 (Predicted) 
Protein Synonyms/Alias
 Protein Heatr1 
Gene Name
 Heatr1 
Gene Synonyms/Alias
 Heatr1_predicted; rCG_30537 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
699       EKESYTVKQKVAFHV  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2143 AA 
Protein Sequence
MTSLAQQLQR LALPQTDSSL LSRREVASLL FDPKEAATID RDTAFAIGCT GLEELLGIDP 60
AFEQFEAPLF SQLAKTLERS VQTKAVNKQL DENISLFLLH LSPYFLLKPA QKCLEWLIHR 120
FHIHLYNPDS LIACVLPYHE TRVFVRVIQL LKIGNPKNKW FWLFPVKQSG VPLARGTLVT 180
HCYKDLGFMD FICSLVTRSV KAFAECPGSS AQLRVLLAFY ASTIVSALVA AENLSDNVVA 240
KLFPYIQKGL KSSLADYRAA TYMIICQISV KVTMEDTFVN SLASQLIKTL TKVPSQVKDG 300
LGCLIILLQR QKPETLGKKP FPHLCGVPDL IGLLHGISEN YDVSPLLRYM LPHLVASLVQ 360
HVAGEETEGI DGQIYKKHLE AILTKIPLTN NLDHLLASRL FEEYISYSSQ EETQANRVAL 420
LNEQFLPLVR LLESKYPKTL DVVLEEHLKE ITGLKKQELF HQFISLSTSG GKYQFLEDSD 480
TSLMLSLNHP LAPVRLLAVN HLKNIMKTSK EGIDETFIKE AILTRLGDDN VDVVSAALSA 540
FGSFQQHFGV EETVSNILNL FQRAELSKNK GWYSVLELAA NILVREEILS KNDQLSNQVV 600
AQLLPFMVII DNDVESPDTK MAIHLSRSGI CSLHPLLRGW KEALENVIKS RKSREIIGVG 660
NQKMVQLLGS NLSSGDRSSM LKLVEDLVCA GEKESYTVKQ KVAFHVIVSV LVSCCSSLRE 720
TCFPFAIKVF SLLQKKIKKL KSIITAVEVP SEWHLELMLN RGLPEELWVH YVQQLHGAQR 780
IATEDSVLLV FSLKSFIFAL KAPKSFPTGA MWWNPERLDE DSRHYLCLLI GLFEMLLEVS 840
GAMHFRVLMR LIVKVHLQDV LQLFKFFCIL WTYGSSLSNP LSCTVKSELQ TQALYIGSAM 900
LSSQSTQCKQ NLAHPASPVV MSLLLNLGSP VKEVRRAAVQ CLQALSGVAS KFQLVIDHLV 960
PKAEEITSDA TYVVQDLATL FGELQNEEKQ KSHHKLSETL RSLLHCVYDC PSYIAKDLMK 1020
VLQDVNSEVV LAQLLPMVEQ LLEKVEKEPT AVLKDEAIVL HLTLGKYNEC SASLLQKDSK 1080
SLDLFIKAMH TTKELHPGMP TVQITALEKI TKPFFAAVSD GKVQQKLLCV LFDLLVNCKN 1140
SHCVQTVGSV FKGISVDAEQ IRIELEPRDK AKSLGTIQQT RRQKMQQKKS QDVEAVQEVE 1200
GPYWQRVTLI LELLQHKKKL KCPQILIPTL FNLLSRCLEP LSSEQGNMEY TKQLILSCLL 1260
NICQKLSPDG GKIPKDVVDE EKFNVELIVQ CIRLSEMPQT HHHALLLLGT VAGIFPDKVL 1320
HNIMSIFTFM GANVMRLDDA YSFQVISKTV KMVIPALIQS DNGDSIEVTR NVEEIVVKII 1380
GVFVDALPHV PEHRRLPILV QLINTLGAEK FLWILLALLF EQYVTKTVLV AAYGEKDAIL 1440
EADTEFWISV CCEFSVQHQV QSLMHILQYL EKLPEEKEEA TSKAVSAKIE VQDEMLPVFK 1500
VDTHTSKQLR HFKYLSVSFM AQLLASNHFL KKVVESGGPK SLHGLEQSLL ETVLGYINTV 1560
AQSMEKNADK LTGKFWRALL SKAYDMLDKV NALLPTETFI SVIKGLVGNP LPSVRRKALD 1620
LLNNKLQHST FWKKKMVHRF LKLVPVLLAI VQHKKKEAED EQAINRQTAL YTLKLLCKNF 1680
GAQNREPFIP VLSTAVKLIA PEKKEEKNVL GSALLCIAEV TSTLEALAIP QLPSLMPSLL 1740
TAIKSTSELV RSEVCLLSAL TALHKVVETL PHFISPYLEG LLTQVIHLEK ITSEMGSASQ 1800
ANIRLTSLKK TLATGLSPRV LLPAISKTFK QIQKNWKNLM GPFMSILQEH IGVMKKEELL 1860
SHQSQLTTFF LEALDFRAQH SEDDLEEVGK TESWIIDCLV AMVVKLSEVT FRPLFFKLFD 1920
WAKTEDAPKD RLLTFYNLAD CIAEKLKGLF TLFAGHLVKP FADTLNQVNI SKTDEAFFDS 1980
EHDPEKCCLL LQFILNCLYK IFLFDTQNFM SKERAEALMM PLVDQLENRL GGEDRFQERV 2040
TKHLVPCIAQ FSVAMADDSM WKPLNYQILL KTRDSSPKVR FAALITVLAL AEKLRENYIV 2100
LLPESIPFLA ELMEDECEEV EHQCQTTIQQ LEAVLGEPLQ SYF 2143 
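
The numbered sequence block above can be cross-checked against the reported modification site. The following is a minimal Python sketch, not part of the CPLM record: the function names `parse_sequence_line` and `site_window` are hypothetical, and the flank of 7 residues is an assumption inferred from the 15-residue peptide listed for position 699. For brevity it uses only the one sequence line that ends at residue 720; a fuller version would concatenate every line before indexing.

```python
import re


def parse_sequence_line(line):
    """Split one numbered sequence line into (start_position, residues).

    The trailing integer is taken to be the 1-based position of the
    last residue on the line, as in the block above.
    """
    match = re.fullmatch(r"\s*([A-Z ]+?)\s+(\d+)\s*", line)
    if match is None:
        raise ValueError("unrecognized sequence line: %r" % line)
    residues = match.group(1).replace(" ", "")
    end = int(match.group(2))
    start = end - len(residues) + 1
    return start, residues


def site_window(start, residues, position, flank=7):
    """Return (residue, peptide window) for a 1-based protein position.

    flank=7 is an assumption (15-residue window centered on the site).
    """
    index = position - start
    if not 0 <= index < len(residues):
        raise ValueError("position %d is not on this line" % position)
    low = max(0, index - flank)
    return residues[index], residues[low:index + flank + 1]


# Sequence line copied from the record (residues 661 to 720).
LINE = ("NQKMVQLLGS NLSSGDRSSM LKLVEDLVCA GEKESYTVKQ "
        "KVAFHVIVSV LVSCCSSLRE 720")

start, residues = parse_sequence_line(LINE)
residue, peptide = site_window(start, residues, 699)
print(start, residue, peptide)
```

Under these assumptions the residue at position 699 is a lysine and the extracted window equals the peptide listed in the Lysine Modification table, which is a quick consistency check on the record.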
Gene Ontology
  
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR012954; BP28_C_dom.
 IPR022125; U3snoRNP10. 
Pfam
 PF08146; BP28CT
 PF12397; U3snoRNP10 
SMART
 SM01036; BP28CT 
PROSITE
  
PRINTS