CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041544
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Myosin, heavy polypeptide 9, non-muscle 
Protein Synonyms/Alias
 Protein LOC100911597 
Gene Name
 LOC100911597 
Gene Synonyms/Alias
 Myh9; rCG_59912 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
8MAQQAADKYLYVDKNacetylation[1]
74VNKDDIQKMNPPKFSacetylation[1]
102ASVLHNLKERYYSGLacetylation[1]
139EEIVDMYKGKKRHEMacetylation[1]
299LLLEPYNKYRFLSNGacetylation[1]
355QLGNIVFKKERNTDQacetylation[1]
435WLVLRINKALDKTKRacetylation[1]
545TDKSFVEKVVQEQGTacetylation[1]
555QEQGTHPKFQKPKQLacetylation[1]
580YAGKVDYKADEWLMKacetylation[1]
613KFVSELWKDVDRIIGacetylation[1]
651RTVGQLYKEQLAKLMacetylation[1]
656LYKEQLAKLMATLRNacetylation[1]
679IIPNHEKKAGKLDPHacetylation[1]
682NHEKKAGKLDPHLVLacetylation[1]
833QWWRLFTKVKPLLNSacetylation[1]
972EKVTTEAKLKKLEEDacetylation[1]
1014NLMEEEEKSKSLAKLacetylation[1]
1020EKSKSLAKLKNKHEAacetylation[1]
1022SKSLAKLKNKHEAMIacetylation[1]
1024SLAKLKNKHEAMITDacetylation[1]
1048KQRQELEKTRRKLEGacetylation[1]
1173EQEVSILKKTLEDEAacetylation[1]
1181KTLEDEAKTHEAQIQacetylation[1]
1209AEQLEQTKRVKATLEacetylation[1]
1212LEQTKRVKATLEKAKacetylation[1]
1234GELANEVKALLQGKGacetylation[1]
1248GDSEHKRKKVEAQLQacetylation[1]
1249DSEHKRKKVEAQLQEacetylation[1]
1277ELADKVSKLQVELDSacetylation[1]
1301SKSSKLTKDFSALESacetylation[1]
1330QKLSLSTKLKQMEDEacetylation[1]
1332LSLSTKLKQMEDEKNacetylation[1]
1338LKQMEDEKNSFREQLacetylation[1]
1352LEEEEEAKRNLEKQIacetylation[1]
1357EAKRNLEKQIATLHAacetylation[1]
1370HAQVTDMKKKMEDGVacetylation[1]
1392EAKRRLQKDLEGLSQacetylation[1]
1404LSQRLEEKVAAYDKLacetylation[1]
1410EKVAAYDKLEKTKTRacetylation[1]
1441QSVSNLEKKQKKFDQacetylation[1]
1454DQLLAEEKTISAKYAacetylation[1]
1459EEKTISAKYAEERDRacetylation[1]
1492LEEAMEQKAELERLNacetylation[1]
1500AELERLNKQFRTEMEacetylation[1]
1513MEDLMSSKDDVGKSVacetylation[1]
1518SSKDDVGKSVHELEKacetylation[1]
1525KSVHELEKSKRALEQacetylation[1]
1613SIAMAARKKLEMDLKacetylation[1]
1614IAMAARKKLEMDLKDacetylation[1]
1620KKLEMDLKDLEAHIDacetylation[1]
1638KNREEAIKQLRKLQAacetylation[1]
1648RKLQAQMKDCMRELDacetylation[1]
1669EEILAQAKENEKKLKacetylation[1]
1724GALALEEKRRLEARIacetylation[1]
1793QNKELKAKLQEMESAacetylation[1]
1802QEMESAVKSKYKASIacetylation[1]
1815SIAALEAKIAQLEEQacetylation[1]
1828EQLDNETKERQAASKacetylation[1]
1843QVRRAEKKLKDVLLQacetylation[1]
1845RRAEKKLKDVLLQVEacetylation[1]
1862RRNAEQFKDQADKASacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Actin-binding; ATP-binding; Coiled coil; Complete proteome; Motor protein; Myosin; Nucleotide-binding; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1960 AA 
Protein Sequence
MAQQAADKYL YVDKNFINNP LAQADWAAKK LVWVPSTKNG FEPASLKEEV GEEAIVELVE 60
NGKKVKVNKD DIQKMNPPKF SKVEDMAELT CLNEASVLHN LKERYYSGLI YTYSGLFCVV 120
INPYKNLPIY SEEIVDMYKG KKRHEMPPHI YAITDTAYRS MMQDREDQSI LCTGESGAGK 180
TENTKKVIQY LAHVASSHKS KKDQGELERQ LLQANPILEA FGNAKTVKND NSSRFGKFIR 240
INFDVNGYIV GANIETYLLE KSRAIRQAKE ERTFHIFYYL LSGAGEHLKT DLLLEPYNKY 300
RFLSNGHVTI PGQQDKDMFQ ETMEAMRIMG IPEDEQMGLL RVISGVLQLG NIVFKKERNT 360
DQASMPDNTA AQKVSHLLGI NVTDFTRGIL TPRIKVGRDY VQKAQTKEQA DFAIEALAKA 420
TYERMFRWLV LRINKALDKT KRQGASFIGI LDIAGFEIFD LNSFEQLCIN YTNEKLQQLF 480
NHTMFILEQE EYQREGIEWN FIDFGLDLQP CIDLIEKPAG PPGILALLDE ECWFPKATDK 540
SFVEKVVQEQ GTHPKFQKPK QLKDKADFCI IHYAGKVDYK ADEWLMKNMD PLNDNIATLL 600
HQSSDKFVSE LWKDVDRIIG LDQVAGMSET ALPGAFKTRK GMFRTVGQLY KEQLAKLMAT 660
LRNTNPNFVR CIIPNHEKKA GKLDPHLVLD QLRCNGVLEG IRICRQGFPN RVVFQEFRQR 720
YEILTPNSIP KGFMDGKQAC VLMIKALELD SNLYRIGQSK VFFRAGVLAH LEEERDLKIT 780
DVIIGFQACC RGYLARKAFA KRQQQLTAMK VLQRNCAAYL RLRNWQWWRL FTKVKPLLNS 840
IRHEDELLAK EAELTKVREK HLAAENRLTE METMQSQLMA EKLQLQEQLQ AETELCAEAE 900
ELRARLTAKK QELEEICHDL EARVEEEEER CQYLQAEKKK MQQNIQELEE QLEEEESARQ 960
KLQLEKVTTE AKLKKLEEDQ IIMEDQNCKL AKEKKLLEDR VAEFTTNLME EEEKSKSLAK 1020
LKNKHEAMIT DLEERLRREE KQRQELEKTR RKLEGDSTDL SDQIAELQAQ IAELKMQLAK 1080
KEEELQAALA RVEEEAAQKN MALKKIRELE TQISELQEDL ESERACRNKA EKQKRDLGEE 1140
LEALKTELED TLDSTAAQQE LRSKREQEVS ILKKTLEDEA KTHEAQIQEM RQKHSQAVEE 1200
LAEQLEQTKR VKATLEKAKQ TLENERGELA NEVKALLQGK GDSEHKRKKV EAQLQELQVK 1260
FSEGERVRTE LADKVSKLQV ELDSVTGLLN QSDSKSSKLT KDFSALESQL QDTQELLQEE 1320
NRQKLSLSTK LKQMEDEKNS FREQLEEEEE AKRNLEKQIA TLHAQVTDMK KKMEDGVGCL 1380
ETAEEAKRRL QKDLEGLSQR LEEKVAAYDK LEKTKTRLQQ ELDDLLVDLD HQRQSVSNLE 1440
KKQKKFDQLL AEEKTISAKY AEERDRAEAE AREKETKALS LARALEEAME QKAELERLNK 1500
QFRTEMEDLM SSKDDVGKSV HELEKSKRAL EQQVEEMKTQ LEELEDELQA TEDAKLRLEV 1560
NLQAMKAQFE RDLQGRDEQS EEKKKQLVRQ VREMEAELED ERKQRSIAMA ARKKLEMDLK 1620
DLEAHIDTAN KNREEAIKQL RKLQAQMKDC MRELDDTRAS REEILAQAKE NEKKLKSMEA 1680
EMIQLQEELA AAERAKRQAQ QERDELADEI ANSSGKGALA LEEKRRLEAR IAQLEEELEE 1740
EQGNTELIND RLKKANLQID QINTDLNLER SHAQKNENAR QQLERQNKEL KAKLQEMESA 1800
VKSKYKASIA ALEAKIAQLE EQLDNETKER QAASKQVRRA EKKLKDVLLQ VEDERRNAEQ 1860
FKDQADKAST RLKQLKRQLE EAEEEAQRAN ASRRKLQREL EDATETADAM NREVSSLKNK 1920
LRRGDMPFVV TRRIVRKGTG DCSDEEVDGK ADGADAKAAE 1960 
Gene Ontology
 GO:0016459; C:myosin complex; IEA:UniProtKB-KW.
 GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
 GO:0003774; F:motor activity; IEA:InterPro. 
Interpro
 IPR000048; IQ_motif_EF-hand-BS.
 IPR027401; Myosin-like_IQ_dom.
 IPR001609; Myosin_head_motor_dom.
 IPR004009; Myosin_N.
 IPR002928; Myosin_tail.
 IPR027417; P-loop_NTPase. 
Pfam
 PF00612; IQ
 PF00063; Myosin_head
 PF02736; Myosin_N
 PF01576; Myosin_tail_1 
SMART
 SM00015; IQ
 SM00242; MYSc 
PROSITE
 PS50096; IQ 
PRINTS
 PR00193; MYOSINHEAVY.