CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041614
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Myomesin 2 
Protein Synonyms/Alias
 Protein Myom2 
Gene Name
 Myom2 
Gene Synonyms/Alias
 rCG_43076 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
11VVVPFYQKRHKHFDQacetylation[1]
14PFYQKRHKHFDQSYRacetylation[1]
100AASYGEAKRQRFLSEacetylation[1]
125ARTQARNKLDKYFLEacetylation[1]
128QARNKLDKYFLEQTVacetylation[1]
327DVLLKESKWTKMFFGacetylation[1]
579VFDLAEGKSYVFRVLacetylation[1]
735TLGWKVPKFSGGSAIacetylation[1]
749IIGYYLDKREVHHKNacetylation[1]
755DKREVHHKNWHEINSacetylation[1]
873ATPNRYLKVCDLHQGacetylation[1]
881VCDLHQGKTYVFRVRacetylation[1]
1098QIQDGKAKNQSSLVLacetylation[1]
1193LLIPKLSKKDHGEYKacetylation[1]
1204GEYKATLKDDRGQDVacetylation[1]
1268MKVSWYHKEAKISSSacetylation[1]
1301PTEKDKGKYTFEIFDacetylation[1]
1310TFEIFDGKDNHQRSLacetylation[1]
1342KAAAFAEKNRGKVIGacetylation[1]
1400SVKVEQSKYVSLTIKacetylation[1]
1453ISAPQQAKPKLIPASacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1464 AA 
Protein Sequence
MSLVVVPFYQ KRHKHFDQSY RNIQTRYLLD RYASKKQASS QSSSQRSLTE RSSSKRASSQ 60
SSAEGMTCRL CAKRMSASEE EEVENENRYR SLAASYGEAK RQRFLSELAQ LEENVQLART 120
QARNKLDKYF LEQTVADNLA LERSSFEDRL SRAPEILVRL RSHTIWERMS VRLCFTVQGF 180
PTPVVQWYKD GSLICQAGEP GKYRIESRYG VHTLEINRAD FEDTATYSAV ATNPHGQVST 240
NAAVVVRRYR GDEEPYHSVG LPIGLPLSSV IPYTHFDVQF LEKFGVTFRR EGETVTLKCT 300
LLVTPDLKRV QPRAEWYRDD VLLKESKWTK MFFGEGQASL SFSHLNKDDE GLYTLRIVSR 360
GGVSDHSAFM FVRDADPLVT GAPGAPMDLQ CHDANRDYVI VTWKPPNTTT ESPVIGYFID 420
KCEVGTNNWV QCNDAPVKIC KYPVTGLFEG RSYVFRVRAV NNAGISRPSR ISDAVAALDP 480
VDLRRLQAIH LEGEKEIVIY QDDLEGDVQI PGPPTNVQAS EVSRNYVVLS WDPPSPRGKE 540
PLMYFIEKSA VGSGSWQRVN AQTAVRSPRY AVFDLAEGKS YVFRVLSANK HGLSDPSEIT 600
PPIQAQDMIV VPSAPGRVLA SRNTKTSVVV QWDRPKHEED LLGYYVDCCV AGTNMWEPCN 660
HKPIGYNRFV VHGLTTGEQY IFRVKAVNAV GTSENSQESE VIKVQAALTV PSHPYGITLL 720
NCDGHSMTLG WKVPKFSGGS AIIGYYLDKR EVHHKNWHEI NSSPVKERIL TVEGLTEGSL 780
YEFKIAATNL AGIGQPSDPS EHFKCEAWTA PEPGPAYDLT FCEVRDTSLV ILWKAPVYSG 840
SSPVSGYFVD FKEEDSGEWK TTSEAATPNR YLKVCDLHQG KTYVFRVRAV NASGPGKPSD 900
TSEPVLVEAR PGTKEISAGV DEEGNIYLGF DCQEMTDASQ FTWCKAYEEI ADEERFEVHT 960
EGDHSKLYFK NPDKIDIGTY SVSVSDTDGV SSSFVLDEEE LERLMALSNE IKNPTIPLKS 1020
ELAYEIFDKG QVRFWLQAEH LSPDANFRFI INDREVSDSE THRIKCDRST GMIEMVMDRF 1080
TIENEGTYTV QIQDGKAKNQ SSLVLIGDAF RAVLEEAEFQ RKEFLRKQGP HFAEYLHWDV 1140
TEECEVRLIC KVANTKRETV FKWLKDDVLY ETETPPPDLE KGICELLIPK LSKKDHGEYK 1200
ATLKDDRGQD VSVLEVAGKV YEDMILAMSR VCGASASPLK VLCTPEGIRL QCFMKYFTEE 1260
MKVSWYHKEA KISSSEHMRI GGSEEMAWLQ ICEPTEKDKG KYTFEIFDGK DNHQRSLDLS 1320
GQAFDEAYAE FQQLKAAAFA EKNRGKVIGG LPDVVTIMEG KTLNLTCTVF GNPDPEVVWF 1380
KNDKDIELSE HFSVKVEQSK YVSLTIKGVT AEDSGKYSIN VKNKYGGEKI DVTVSVYKHG 1440
EKIPDISAPQ QAKPKLIPAS TSAD 1464 
Gene Ontology
 GO:0031430; C:M band; IEA:Compara.
 GO:0008307; F:structural constituent of muscle; TAS:RGD.
 GO:0006936; P:muscle contraction; IEA:Compara. 
Interpro
 IPR003961; Fibronectin_type3.
 IPR007110; Ig-like_dom.
 IPR013783; Ig-like_fold.
 IPR013098; Ig_I-set.
 IPR003599; Ig_sub.
 IPR003598; Ig_sub2. 
Pfam
 PF00041; fn3
 PF07679; I-set 
SMART
 SM00060; FN3
 SM00409; IG
 SM00408; IGc2 
PROSITE
 PS50853; FN3
 PS50835; IG_LIKE 
PRINTS