CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038839
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Myosin-4 
Protein Synonyms/Alias
  
Gene Name
 Myh4 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
30ERIEAQNKPFDAKSSacetylation[1]
35QNKPFDAKSSVFVVDacetylation[1]
44SVFVVDAKESYVKATacetylation[1]
49DAKESYVKATVQSREacetylation[1]
63EGGKVTAKTEGGATVacetylation[1]
73GGATVTVKEDQVFSMacetylation[1]
84VFSMNPPKYDKIEDMacetylation[1]
237LEAFGNAKTVRNDNSacetylation[1]
273IETYLLEKSRVTFQLacetylation[1]
281SRVTFQLKAERSYHIacetylation[1]
366VMHYGNMKFKQKQREacetylation[1]
386DGTEVADKAAYLTSLacetylation[1]
400LNSADLLKALCYPRVacetylation[1]
408ALCYPRVKVGNEYVTacetylation[1]
505VLEQEEYKKEGIEWEacetylation[1]
568GKSNNFQKPKPAKGKacetylation[1]
570SNNFQKPKPAKGKAEacetylation[1]
601IGWLDKNKDPLNETVacetylation[1]
614TVVGLYQKSGLKTLAacetylation[1]
618LYQKSGLKTLAFLFSacetylation[1]
661LFRENLNKLMTNLKSacetylation[1]
667NKLMTNLKSTHPHFVacetylation[1]
723RILYADFKQRYKVLNacetylation[1]
727ADFKQRYKVLNASAIacetylation[1]
743EGQFIDSKKASEKLLacetylation[1]
744GQFIDSKKASEKLLGacetylation[1]
748DSKKASEKLLGSIDIacetylation[1]
761DIDHTQYKFGHTKVFacetylation[1]
784LEEMRDEKLAQLITRacetylation[1]
829VRAFMNVKHWPWMKLacetylation[1]
839PWMKLYFKIKPLLKSacetylation[1]
845FKIKPLLKSAETEKEacetylation[1]
851LKSAETEKEMATMKEacetylation[1]
857EKEMATMKEDFEKAKacetylation[1]
862TMKEDFEKAKEDLAKacetylation[1]
864KEDFEKAKEDLAKSEacetylation[1]
869KAKEDLAKSEAKRKEacetylation[1]
880KRKELEEKMVALMQEacetylation[1]
914ERCDQLIKTKIQLEAacetylation[1]
916CDQLIKTKIQLEAKIacetylation[1]
922TKIQLEAKIKELTERacetylation[1]
924IQLEAKIKELTERAEacetylation[1]
943INAELTAKKRKLEDEacetylation[1]
946ELTAKKRKLEDECSEacetylation[1]
955EDECSELKKDIDDLEacetylation[1]
967DLELTLAKVEKEKHAacetylation[1]
970LTLAKVEKEKHATENacetylation[1]
978EKHATENKVKNLTEEacetylation[1]
980HATENKVKNLTEEMAacetylation[1]
995GLDENIAKLTKEKKAacetylation[1]
1020DLQAEEDKVNTLTKAacetylation[1]
1026DKVNTLTKAKTKLEQacetylation[1]
1030TLTKAKTKLEQQVDDacetylation[1]
1046EGSLEQEKKLRMDLEacetylation[1]
1077TMDIENDKQQLDEKLacetylation[1]
1083DKQQLDEKLKKKEFEacetylation[1]
1086QLDEKLKKKEFEMSNacetylation[1]
1087LDEKLKKKEFEMSNLacetylation[1]
1097EMSNLQSKIEDEQALacetylation[1]
1110ALGMQLQKKIKELQAacetylation[1]
1169SAQIEMNKKREAEFQacetylation[1]
1170AQIEMNKKREAEFQKacetylation[1]
1177KREAEFQKMRRDLEEacetylation[1]
1198ATAAALRKKHADSVAacetylation[1]
1199TAAALRKKHADSVAEacetylation[1]
1218IDNLQRVKQKLEKEKacetylation[1]
1225KQKLEKEKSELKMEIacetylation[1]
1229EKEKSELKMEIDDLAacetylation[1]
1244SNMETVSKAKGNLEKacetylation[1]
1251KAKGNLEKMCRTLEDacetylation[1]
1266QLSEVKTKEEEQQRLacetylation[1]
1281INELSAQKARLHTESacetylation[1]
1298FSRQLDEKDAMVSQLacetylation[1]
1309VSQLSRGKQAFTQQIacetylation[1]
1320TQQIEELKRQLEEESacetylation[1]
1358YEEEQEAKAELQRAMacetylation[1]
1367ELQRAMSKANSEVAQacetylation[1]
1378EVAQWRTKYETDAIQacetylation[1]
1414HVEAVNSKCASLEKTacetylation[1]
1455KKQRNFDKVLAEWKQacetylation[1]
1461DKVLAEWKQKYEETQacetylation[1]
1463VLAEWKQKYEETQAEacetylation[1]
1476AELEASQKESRSLSTacetylation[1]
1487SLSTELFKVKNAYEEacetylation[1]
1489STELFKVKNAYEESLacetylation[1]
1503LDQLETLKRENKNLQacetylation[1]
1507ETLKRENKNLQQEISacetylation[1]
1532KHIHELEKIKKQIDQacetylation[1]
1535HELEKIKKQIDQEKSacetylation[1]
1541KKQIDQEKSELQASLacetylation[1]
1561SLEHEEGKILRIQLEacetylation[1]
1573QLELNQVKSEIDRKIacetylation[1]
1583IDRKIAEKDEEIDQLacetylation[1]
1591DEEIDQLKRNHLRVVacetylation[1]
1620NDALRIKKKMEGDLNacetylation[1]
1621DALRIKKKMEGDLNEacetylation[1]
1655RNTQGMLKDTQLHLDacetylation[1]
1672LRGQDDLKEQLAMVEacetylation[1]
1731NTSLINTKKKLETDIacetylation[1]
1733SLINTKKKLETDISQacetylation[1]
1795KNMEQTVKDLQHRLDacetylation[1]
1810EAEQLALKGGKKQIQacetylation[1]
1813QLALKGGKKQIQKLEacetylation[1]
1818GGKKQIQKLEARVREacetylation[1]
1835NEVENEQKRNIEAVKacetylation[1]
1842KRNIEAVKGLRKHERacetylation[1]
1852RKHERRVKELTYQTEacetylation[1]
1863YQTEEDRKNVLRLQDacetylation[1]
1874RLQDLVDKLQTKVKAacetylation[1]
1899QSNVNLAKFRKIQHEacetylation[1]
1923IAESQVNKLRVKSREacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Actin-binding; ATP-binding; Coiled coil; Complete proteome; Motor protein; Myosin; Nucleotide-binding; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1939 AA 
Protein Sequence
MSSDAEMAVF GEAAPYLRKS EKERIEAQNK PFDAKSSVFV VDAKESYVKA TVQSREGGKV 60
TAKTEGGATV TVKEDQVFSM NPPKYDKIED MAMMTHLHEP AVLYNLKERY AAWMIYTYSG 120
LFCVTVNPYK WLPVYNPEVV AAYRGKKRQE APPHIFSISD NAYQFMLTDR ENQSILITGE 180
SGAGKTVNTK RVIQYFATIA VTGDKKKEEA PSGKMQGTLE DQIISANPLL EAFGNAKTVR 240
NDNSSRFGKF IRIHFGATGK LASADIETYL LEKSRVTFQL KAERSYHIFY QVMSNKKPEL 300
IEMLLITTNP YDFAYVSQGE ITVPSIDDQE ELMATDTAVD ILGFTADEKV AIYKLTGAVM 360
HYGNMKFKQK QREEQAEPDG TEVADKAAYL TSLNSADLLK ALCYPRVKVG NEYVTKGQTV 420
QQVYNSVGAL AKAMYEKMFL WMVTRINQQL DTKQPRQYFI GVLDIAGFEI FDFNTLEQLC 480
INFTNEKLQQ FFNHHMFVLE QEEYKKEGIE WEFIDFGMDL AACIELIEKP MGIFSILEEE 540
CMFPKATDTS FKNKLYEQHL GKSNNFQKPK PAKGKAEAHF SLVHYAGTVD YNIIGWLDKN 600
KDPLNETVVG LYQKSGLKTL AFLFSGGQAA EAEGGGGKKG GKKKGSSFQT VSALFRENLN 660
KLMTNLKSTH PHFVRCLIPN ETKTPGAMEH ELVLHQLRCN GVLEGIRICR KGFPSRILYA 720
DFKQRYKVLN ASAIPEGQFI DSKKASEKLL GSIDIDHTQY KFGHTKVFFK AGLLGTLEEM 780
RDEKLAQLIT RTQAVCRGYL MRVEFRKMME RRESIFCIQY NVRAFMNVKH WPWMKLYFKI 840
KPLLKSAETE KEMATMKEDF EKAKEDLAKS EAKRKELEEK MVALMQEKND LQLQVQAEAD 900
GLADAEERCD QLIKTKIQLE AKIKELTERA EDEEEINAEL TAKKRKLEDE CSELKKDIDD 960
LELTLAKVEK EKHATENKVK NLTEEMAGLD ENIAKLTKEK KALQEAHQQT LDDLQAEEDK 1020
VNTLTKAKTK LEQQVDDLEG SLEQEKKLRM DLERAKRKLE GDLKLAQEST MDIENDKQQL 1080
DEKLKKKEFE MSNLQSKIED EQALGMQLQK KIKELQARIE ELEEEIEAER ASRAKAEKQR 1140
SDLSRELEEI SERLEEAGGA TSAQIEMNKK REAEFQKMRR DLEEATLQHE ATAAALRKKH 1200
ADSVAELGEQ IDNLQRVKQK LEKEKSELKM EIDDLASNME TVSKAKGNLE KMCRTLEDQL 1260
SEVKTKEEEQ QRLINELSAQ KARLHTESGE FSRQLDEKDA MVSQLSRGKQ AFTQQIEELK 1320
RQLEEESKAK NALAHALQSA RHDCDLLREQ YEEEQEAKAE LQRAMSKANS EVAQWRTKYE 1380
TDAIQRTEEL EEAKKKLAQR LQDAEEHVEA VNSKCASLEK TKQRLQNEVE DLMIDVERSN 1440
AACAALDKKQ RNFDKVLAEW KQKYEETQAE LEASQKESRS LSTELFKVKN AYEESLDQLE 1500
TLKRENKNLQ QEISDLTEQI AEGGKHIHEL EKIKKQIDQE KSELQASLEE AEASLEHEEG 1560
KILRIQLELN QVKSEIDRKI AEKDEEIDQL KRNHLRVVES MQSTLDAEIR SRNDALRIKK 1620
KMEGDLNEME IQLNHANRQA AEAIRNLRNT QGMLKDTQLH LDDALRGQDD LKEQLAMVER 1680
RANLMQAEIE ELRASLEQTE RSRRVAEQEL LDASERVQLL HTQNTSLINT KKKLETDISQ 1740
IQGEMEDIVQ EARNAEEKAK KAITDAAMMA EELKKEQDTS AHLERMKKNM EQTVKDLQHR 1800
LDEAEQLALK GGKKQIQKLE ARVRELENEV ENEQKRNIEA VKGLRKHERR VKELTYQTEE 1860
DRKNVLRLQD LVDKLQTKVK AYKRQAEEAE EQSNVNLAKF RKIQHELEEA EERADIAESQ 1920
VNKLRVKSRE VHTKVISEE 1939 
Gene Ontology
 GO:0005925; C:focal adhesion; IEA:Compara.
 GO:0030016; C:myofibril; IEA:Compara.
 GO:0016459; C:myosin complex; IEA:UniProtKB-KW.
 GO:0005730; C:nucleolus; IEA:Compara.
 GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
 GO:0003774; F:motor activity; IEA:InterPro.
 GO:0006936; P:muscle contraction; IEA:Compara.
 GO:0014823; P:response to activity; IEA:Compara. 
Interpro
 IPR000048; IQ_motif_EF-hand-BS.
 IPR027401; Myosin-like_IQ_dom.
 IPR015650; Myosin_1/23/4/7/8/13/15.
 IPR001609; Myosin_head_motor_dom.
 IPR004009; Myosin_N.
 IPR002928; Myosin_tail.
 IPR027417; P-loop_NTPase. 
Pfam
 PF00063; Myosin_head
 PF02736; Myosin_N
 PF01576; Myosin_tail_1 
SMART
 SM00015; IQ
 SM00242; MYSc 
PROSITE
 PS50096; IQ 
PRINTS
 PR00193; MYOSINHEAVY.