CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-041834
UniProt Accession
G3X9B1_MOUSE; G3X9B1
Genbank Protein ID
AC154221; CH466588
Genbank Nucleotide ID
EDL32306.1
Protein Name
HEAT repeat containing 1
Protein Synonyms/Alias
Protein Heatr1
Gene Name
Heatr1
Gene Synonyms/Alias
mCG_119362
Created Date
July 27, 2013
Organism
Mus musculus (Mouse)
NCBI Taxa ID
10090
Lysine Modification
Position  Peptide            Type         References
693       DLVCAGE K ESYSLKQ  acetylation  [1]
Reference
[1] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
PLoS One. 2012;7(12):e50545. [PMID: 23236377]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2143 AA
Protein Sequence
MTSLAQQLQR LALPQTDPSL LSRREVASLL FDPKEAATID RDTAFAIGCT GLEELLGIDP 60
AFEQFEAPLF SQLAKGLERS VQTKAVNKQL DENISSFLLH LSPYFLLKPA QKCLEWLIHR 120
FHIHLYNADS LIACVLPYHE TRVFVRVIQL LKISNPKHKW FWLSPVKHSG VPLARGTLVT 180
HCYKDLGFMD FICSLVTRSV KAFAEDPGSS TRLRVLLAFY ASTIVSALVA AENLSDNVVA 240
KLFPYIQKGL KSSLPDYRAA TYMIICQISV KVTMEDTFVK SLASQLIKTL TKVPSQVNDG 300
LGCLIILLQR QKPENLGEKP FLHLCGVPDL IGLLHGISES YDVSPLLRCM LPHLVASVVQ 360
HIAGEEAEGI DGQIYKNHLE EILIKIPLTN NLDHLLASHL FEEYISYSSQ EETQANGVAL 420
LNEQFLPLIR LLESKYPRAL DAVLEEHLKE ITGLKKQELF HQFISLSTSG GKYQFLEDSD 480
TSLMLSLNHP LAPVRLLAVN HLKTFMKTSK EGIDETFIKE AILTRLGDDN VDVVLATLSA 540
FEIFQQHFGV EETVSSLLNL FQRADLSKNE GWFRVLELAA NILIKEEILS KNDQLANQVV 600
VQLLPFMVIT SNDIESPDMK IAIHLSKSGI CSLHPLLRGW KEALENVIKS RKSREIIGVG 660
NQKMVQLLGS NLSLGERSTV LKLVEDLVCA GEKESYSLKQ KVAFHVTVSV LISCCSSFQE 720
TCFPFALRVF SLLQKKIRKL KSVITAVEIP SEWHLELMLN RGLPEELWVR YVQELHGAQR 780
VVMEDAILLV FSMKCFIFAM KAPKSFPTGA MWWNPEQLDE DSRHYLHLLI GIFEMLLEVS 840
DAMHFRVLIR LIMKVHLQDV LQLFKFFCVL WTYGSSLSNP LNCTVKSELQ TQALYIGSAM 900
LSSQNTQYKQ KLASTASPVV MSLLLNLGSH IKEVRRAAVQ CLQALRGVPS KFELVIDHLI 960
PKAEEITSDA TYVLQDLATL FDELQNEEKQ KSHQKLSETL RSLLHCVYGC PSYIAKGLMK 1020
VLQGVNSEMV LAQLLPMVEQ LLEKVEKEPT AVLKDEAVVL HLTLGKYNEY SASLLQKDPK 1080
SLDLFIKAMH TTKELHPGMP TVRITALEKI TKPFFAAVSD GQVQQKLLCV LFDLLVNCKD 1140
AHCVQTVGSV FKGISVDAEQ IRIELEPRDK AKSLGTIQQT RRQKMQQKKS QDVESVQEVE 1200
GPYWQRVTLI LELLQHKKKL KCPQILIPPL FNLLSRCLEP LSSEQGNMEY TKQLILSCLL 1260
NICQKLSPDG GRIPKDVVDE EKFNVELIVQ CIRLSEMPQT HHHALLLLGT VAGIFPDKVL 1320
HNIMSIFTFM GANVMRLDDA YSFQVISKTV KMVIPALIQS DTGDSVEVTR NVEQIVVKII 1380
GVFVDALPHV PEHRRLPILV QLVTTLSAKK FLWILLVLLF EQYVTKTVLV AAYGEKDAIL 1440
EADTEFWISV CCEFSVQHQV QSLMHILHYL EKLPEEKEEA TSKTVSTKSE VQDEMLPVFK 1500
VDAHTSKQLR HFKYLSVSFM SQLLASNHFL KKVVGSGGPK SLHGLEQGLL ETVLSYINTV 1560
AQSMEKNADK LTGKFWRALL SKAYDMLDKV NALLPTETFI SVIRGLVGNP LPSVRRKALD 1620
LLNNKLQQHT FWRKKMVHRF LKLVPVLLAI VQHKKREAED EQAINRQTAL YTLKLLCKNF 1680
GAQNREPFIP VLSTAVKLIE PEKKEEKNVL GSALLCIAEV TSTLEALAIP QLPSLMPSLL 1740
TAMKSTSELV HSEVCLLSAL AALHKVVETL PHFISPYLEG LLTQVIHLEK ITREMGSASQ 1800
ANIRLTALKK TLATELSPRV LLPAISKTFK QIQKNWKNHM GPFMSILQEH IGVMKKEELL 1860
SHQSQLTTFF LEALDFRAQH SEDDLEEVGK TEGWIIDCLV AMVVKLSEVT FRPLFFKLFD 1920
WAKTEDAPKD RLLTFYNLAD CIAEKLKGLF TLFAGHLVKP FADTLNQVNI SKTDEAFFDS 1980
ERDPEKCCLL LQFILNCLYK VFLFDTQNFM SRERAEALMM PLVDQLENRL GGEERFQERV 2040
TKYLVPCIAQ FSVAMADDSM WKPLNYQILL KTRDSSPKVR FAALITVLAL AEKLRENYIV 2100
LLPESIPFLA ELMEDECEEV EHQCQKTIQQ LEAVLGEPLQ SYF 2143
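The reported modification site can be cross-checked against the sequence block above. The following is a minimal sketch, assuming each sequence line consists of ten-residue groups followed by the running count of the last residue on that line. The function names `parse_block_line` and `site_window` are illustrative and not part of CPLM.

```python
# One line of the sequence block above (the line ending at residue 720).
BLOCK_LINE = "NQKMVQLLGS NLSLGERSTV LKLVEDLVCA GEKESYSLKQ KVAFHVTVSV LISCCSSFQE 720"


def parse_block_line(line):
    """Return (start_position, residues) for one numbered sequence line."""
    *groups, end = line.split()
    residues = "".join(groups)
    # The trailing number is the 1-based position of the last residue.
    start = int(end) - len(residues) + 1
    return start, residues


def site_window(line, position, flank=7):
    """Return (left flank, residue, right flank) around a 1-based position."""
    start, residues = parse_block_line(line)
    i = position - start  # 0-based index within this line
    if not 0 <= i < len(residues):
        raise ValueError("position is not on this line")
    left = residues[max(0, i - flank):i]
    right = residues[i + 1:i + 1 + flank]
    return left, residues[i], right


left, site, right = site_window(BLOCK_LINE, 693)
print(left, site, right)  # DLVCAGE K ESYSLKQ
```

Residue 693 is a lysine, and the seven residues on each side match the peptide listed in the Lysine Modification table.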
Gene Ontology
GO:0005739; C:mitochondrion; IEA:Compara.
GO:0005730; C:nucleolus; IEA:Compara.
Interpro
IPR011989; ARM-like.
IPR016024; ARM-type_fold.
IPR012954; BP28_C_dom.
IPR022125; U3snoRNP10.
Pfam
PF08146; BP28CT
PF12397; U3snoRNP10
SMART
SM01036; BP28CT
PROSITE
PRINTS