CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-042161
UniProt Accession
H0V975_CAVPO; H0V975
Genbank Protein ID
AAKN02019132
Genbank Nucleotide ID
Protein Name
Uncharacterized protein
Protein Synonyms/Alias
Gene Name
LOC100727418
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Cavia porcellus (Guinea pig)
NCBI Taxa ID
10141
Lysine Modification
Position  Peptide              Type         References
43        DVFVPDD K EEFVKAK    acetylation  [1]
67        TAETEHG K TVTVRED    acetylation  [1]
278       SRVIFQL K AERDYHI    acetylation  [1]
383       DGTEEAD K SAYLMGL    acetylation  [1]
405       GLCHPRV K VGNEYVT    acetylation  [1]
413       VGNEYVT K GQNVQQV    acetylation  [1]
434       LAKAVYE K MFNWMVG    acetylation  [1]
565       GKSSNFQ K PRNIKGK    acetylation  [1]
744       DSRKGAE K LLGSLDI    acetylation  [1]
757       DIDHNQY K FGHTKVF    acetylation  [1]
1016      DLQAEED K VNTLTKA    acetylation  [1]
1026      TLTKAKV K LEQQVDD    acetylation  [1]
1042      EGSLEQE K KVRMDLE    acetylation  [1]
1106      ALGSQLQ K KLKELQA    acetylation  [1]
1247      KAKANLE K MCRTLED    acetylation  [1]
1279      DLTSQRA K LQTENGE    acetylation  [1]
1294      LSRQLDE K EALISQL    acetylation  [1]
1316      TQQLEDL K RQLEEEV    acetylation  [1]
1354      YEEETEA K AELQRVL    acetylation  [1]
1444      AAAAALD K KQRNFDK    acetylation  [1]
1445      AAAALDK K QRNFDKI    acetylation  [1]
1459      ILAEWKQ K YEESQSE    acetylation  [1]
1499      LEHLETF K RENKNLQ    acetylation  [1]
1528      KSIHELE K IRKQLEA    acetylation  [1]
1531      HELEKIR K QLEAEKL    acetylation  [1]
1569      QLEFNQI K AEIERKL    acetylation  [1]
1579      IERKLAE K DEEMEQA    acetylation  [1]
1587      DEEMEQA K RNHLRVV    acetylation  [1]
1641      RLAAEAQ K QVKSLQS    acetylation  [1]
1727      NTSLINQ K KKMDADL    acetylation  [1]
1791      KNMEQTI K DLQHRLD    acetylation  [1]
1930      KSRDIGT K GLNEE**    acetylation  [1]
Reference
[1] The cardiac acetyl-lysine proteome.
Foster DB, Liu T, Rucker J, O'Meally RN, Devine LR, Cole RN, O'Rourke B.
PLoS One. 2013;8(7):e67513. [PMID: 23844019]
Functional Description
Sequence Annotation
Keyword
Actin-binding; ATP-binding; Coiled coil; Complete proteome; Motor protein; Myosin; Nucleotide-binding; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
1935 AA
Protein Sequence
MGDSEMAAFG AAAPYLRKSE KERLEAQTRP FDLKKDVFVP DDKEEFVKAK IVSREGGKVT 60
AETEHGKTVT VREDQIMQQN PPKFDKIEDM AMLTFLHEPA VLYNLKERYA SWMIYTYSGL 120
FCVTVNPYKW LPVYNAEVVA AYRGKKRSEA PPHIFSISDN AYQYMLTDRE NQSILITGES 180
GAGKTVNTKR VIQYFAVIAA IGDRSKKEQT PGKGTLEDQI IQANPALEAF GNAKTVRNDN 240
SSRFGKFIRI HFGATGKLAS ADIETYLLEK SRVIFQLKAE RDYHIFYQIL SNKKPELLDM 300
LLITNNPYDY AFISQGETTV ASIDDSEELM ATDNAFDVLG FTPEEKNSMY KLTGAIMHFG 360
NMKFKQKQRE EQAEPDGTEE ADKSAYLMGL NSADLLKGLC HPRVKVGNEY VTKGQNVQQV 420
AYATGALAKA VYEKMFNWMV GRINATLETK QPRQYFIGVL DIAGFEIFDF NSFEQLCINF 480
TNEKLQQFFN HHMFVLEQEE YKKEGIEWEF IDFGMDLQAC IDLIEKPMGI MSILEEECMF 540
PKATDMTFKA KLFDNHLGKS SNFQKPRNIK GKPEAHFSLI HYAGTVDYNI IGWLQKNKDP 600
LNETVVALYQ KSSLKLLSNL FANYAGADAP VEKGKGKAKK GSSFQTVSAL HRENLNKLMT 660
NLRSTHPHFV RCIIPNETKS PGVIDNPLVM HQLRCNGVLE GIRICRKGFP NRILYGDFRQ 720
RYRILNPAAI PEGQFIDSRK GAEKLLGSLD IDHNQYKFGH TKVFFKAGLL GLLEEMRDER 780
LSRIITRIQA QSRGVLSRME YKKLLERRDS LLIIQWNIRS FMSVKNWPWM KLFFKIKPLL 840
KSAETEKEMA TMKEEFARIK EALEKSEARR KELEEKMVSL LQEKNDLQLQ VQTEQDNLAD 900
AEERCDQLIK NKIQLEAKVK EMNERLEDEE EMNAELTAKK RKLEDECSEL KRDIDDLELT 960
LAKVEKEKHA TENKVKNLTE EMAGLDEIIV KLTKEKKALQ EAHQQALDDL QAEEDKVNTL 1020
TKAKVKLEQQ VDDLEGSLEQ EKKVRMDLER AKRKLEGDLK LTQESIMDLE NDKQQLDERL 1080
KKKDFELNAL NARIEDEQAL GSQLQKKLKE LQARIEELEE ELEAERTARA KVEKLRSDLS 1140
RELEEISERL EEAGGATSVQ IEMNKKREAE FQKMRRDLEE ATLQHEATAA ALRKKHADSV 1200
AELSEQIDNL QRVKQKLEKE KSEFKLELDD VTSNMEQIIK AKANLEKMCR TLEDQMNEHR 1260
SKAEETQRSV NDLTSQRAKL QTENGELSRQ LDEKEALISQ LTRGKLTYTQ QLEDLKRQLE 1320
EEVKAKNALA HALQSARHDC DLLREQYEEE TEAKAELQRV LSKANSEVAQ WRTKYETDAI 1380
QRTEELEEAK KKLAQRLQDA EEAVEAVNAK CSSLEKTKHR LQNEIEDLMV DVERSNAAAA 1440
ALDKKQRNFD KILAEWKQKY EESQSELESS QKEARSLSTE LFKLKNAYEE SLEHLETFKR 1500
ENKNLQEEIS DLTEQLGSSG KSIHELEKIR KQLEAEKLEL QSALEEAEAS LEHEEGKILR 1560
AQLEFNQIKA EIERKLAEKD EEMEQAKRNH LRVVDSLQTS LDAETRSRNE ALRVKKKMEG 1620
DLNEMEIQLS HANRLAAEAQ KQVKSLQSLL KDTQIQLDDA VRANDDLKEN IAIVERRNNL 1680
LQAELEELRA VVEQTERSRK LAEQELIETS ERVQLLHSQN TSLINQKKKM DADLSQLQTE 1740
VEEAVQECRN AEEKAKKAIT DAAMMAEELK KEQDTSAHLE RMKKNMEQTI KDLQHRLDEA 1800
EQIALKGGKK QLQKLEARVR ELENELEAEQ KRNAESVKGM RKSERRIKEL TYQTEEDRKN 1860
LLRLQDLVDK LQLKVKAYKR QAEEAEEQAN TNLSKFRKVQ HELDEAEERA DIAESQVNKL 1920
RAKSRDIGTK GLNEE 1935
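The positions in the Lysine Modification table are 1-based indices into the sequence above, and each peptide shows the modified lysine with seven residues on either side. The Python sketch below is one way to check that correspondence; it is not part of CPLM. The helper names `parse_numbered_sequence` and `site_window` are hypothetical, only the first 120 residues are embedded to keep the example short, and padding with `*` on the N-terminal side is an assumption (the record only shows `*` padding at the C-terminus, for position 1930).

```python
import re

def parse_numbered_sequence(block: str) -> str:
    """Strip spaces, line breaks and trailing position counters from a sequence block."""
    return re.sub(r"[^A-Za-z]", "", block).upper()

def site_window(seq: str, pos: int, flank: int = 7, pad: str = "*"):
    """Return (left flank, residue, right flank) around the 1-based position `pos`.

    Flanks that run past either end of the sequence are padded with `pad`.
    Left-side padding is an assumption; only right-side padding appears in the record.
    """
    i = pos - 1  # convert to 0-based index
    left = seq[max(0, i - flank):i].rjust(flank, pad)
    right = seq[i + 1:i + 1 + flank].ljust(flank, pad)
    return left, seq[i], right

# First two lines of the Protein Sequence block (residues 1-120).
HEAD = """
MGDSEMAAFG AAAPYLRKSE KERLEAQTRP FDLKKDVFVP DDKEEFVKAK IVSREGGKVT 60
AETEHGKTVT VREDQIMQQN PPKFDKIEDM AMLTFLHEPA VLYNLKERYA SWMIYTYSGL 120
"""
seq_head = parse_numbered_sequence(HEAD)

print(site_window(seq_head, 43))  # ('DVFVPDD', 'K', 'EEFVKAK')
print(site_window(seq_head, 67))  # ('TAETEHG', 'K', 'TVTVRED')
```

Run against the full 1935-residue sequence, the same function can be looped over every listed position to confirm that the central residue is `K` and that the flanks match the table.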
Gene Ontology
GO:0005925; C:focal adhesion; IEA:Compara.
GO:0032982; C:myosin filament; IEA:Compara.
GO:0005730; C:nucleolus; IEA:Compara.
GO:0001725; C:stress fiber; IEA:Compara.
GO:0030018; C:Z disc; IEA:Compara.
GO:0030898; F:actin-dependent ATPase activity; IEA:Compara.
GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
GO:0003774; F:motor activity; IEA:InterPro.
GO:0008307; F:structural constituent of muscle; IEA:Compara.
GO:0007512; P:adult heart development; IEA:Compara.
GO:0030049; P:muscle filament sliding; IEA:Compara.
GO:0002027; P:regulation of heart rate; IEA:Compara.
GO:0055010; P:ventricular cardiac muscle tissue morphogenesis; IEA:Compara.
Interpro
IPR000048; IQ_motif_EF-hand-BS.
IPR027401; Myosin-like_IQ_dom.
IPR001609; Myosin_head_motor_dom.
IPR004009; Myosin_N.
IPR002928; Myosin_tail.
IPR027417; P-loop_NTPase.
Pfam
PF00063; Myosin_head
PF02736; Myosin_N
PF01576; Myosin_tail_1
SMART
SM00015; IQ
SM00242; MYSc
PROSITE
PS50096; IQ
PRINTS
PR00193; MYOSINHEAVY.