CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-038471
UniProt Accession
E9Q5A7_MOUSE; E9Q5A7
Genbank Protein ID
AC115031; AC118246
Genbank Nucleotide ID
Protein Name
Zinc finger homeobox protein 4
Protein Synonyms/Alias
Gene Name
Zfhx4
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Mus musculus (Mouse)
NCBI Taxa ID
10090
Lysine Modification
Position  Peptide          Type         References
2960      KFKINIGKPFMINQS  acetylation  [1]
Reference
[1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
Functional Description
Sequence Annotation
Keyword
Complete proteome; DNA-binding; Homeobox; Nucleus; Reference proteome.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
3581 AA
Protein Sequence
METCDSPPIS RQENGQSTSK LCGMTQLDNE VPEKVAGIEP DRENSSSHDN LKTDERKSEV 60
LLGFSIENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DESETSELED 120
SDVENLTGEI VYQPDGSAYI IEDSKESGQN AQTGANSKLF STAMFLDSLA SAGEKSDQSS 180
TAPVSFYPQI INTFHIASSL GKPFTADPAF PNTSALAGVG PVLHSFRVYD LRHKREKDYL 240
TSDGSAKNSC VSKDVPNNVD LSKFDGCVSD GKRKPVLMCF LCKLSFGYIR SFVTHAVHDH 300
RMTLNDEEQR LLSNKCVSAI IQGIGKDKEP LISFLEPKKS TSVYPNFSTT NLIGPDPTFR 360
GLWSAFHVEN GDSLQAGFAF LKGSASPSSS AEQPLGITHM PKAEVNLGGL SSLVVNTPIT 420
SVSLSHLSSE SSKMSESKDQ ENNCERPKES TILHPNVGCP VKSEPTEPGD EDEEDAYSNE 480
LDDEEVLGEL TDSIGNKDFP LLNQSISPLS SSVLKFIEKG TSSSSGTIAE DTEKKKQAAA 540
AGRSNGNVTN SYSIGGKDFA DGSISRDGTT AAPSETTHGD EDSSTTHQHG FTPSTPGTPG 600
PGGDGSPGNG IECPKCDTVL GSSRSLGGHM TMMHSRNSCK TLKCPKCNWH YKYQQTLEAH 660
MKEKHPEPGG SCVYCKTGQP HPRLARGESY TCGYKPFRCE VCNYSTTTKG NLSIHMQSDK 720
HLNNVQNLQN GNGEQVFGHS APTPNTSLSG CGTPSPSKPK QKPTWRCEVC DYETNVARNL 780
RIHMTSEKHM HNMMLLQQNM KQIQHNLHLG LAPAEAELYQ YYLAQNIGLT GMKLENPAET 840
QLLLNPFQFD SATAAALAPG LGELSPYISD PALKLFQCAV CNKFTSDSLE ALSVHVNSER 900
SLPEEEWRAV IGDIYQCKLC NYNTQLKANF QLHCKTDKHM QKYQLVAHIK EGGKSNEWRL 960
KCIAIGNPVH LKCNACDYYT NSVDKLRLHT TNHRHEAALK LYKHLQKQEG AVNSESCYYY 1020
CAVCDYSSKI KLNLVQHVRS VKHQQTEGLR KLQLHQQGLP SEEDNLSEIF FVKECPANEL 1080
ETASLGARNG EDELIEQQLK AASEEPSEDA GDPLKPPTVA EDDEKEAHKR DNSEGKISTK 1140
DPEVIVPEKE LKVVTGATQP LLLAKEDNTG TKRSKPTEDN KFCPEQFYQC PYCNYNSRDQ 1200
SRIQMHVLSQ HSVQPVICCP LCQDVLSNKM HLQLHLTHLH SVSPDCVEKL LMTVPVPDVM 1260
MPNSMLLPAA APEKSEQDPP TALTAEGSGK YSGDSPVDDK SMSGLEDSKV GVEIKNEEQK 1320
PAKEPVEASE WNKTSSKDVN ISDALQDQLN EQQKRQPLSV SDRHVYKYRC NHCSLAFKTM 1380
QKLQIHSQYH AIRAATMCTL CQRSFRTFQA LKKHLEAGHP ELSEAELQQL YASLPMNGEL 1440
WAESETMTQD DHGIDQEMER EYEVDHEGKA SPVESDSSSI PDDLGLEPKR TLPFRKGPNF 1500
TMEKFLDPSR PYKCTVCKES FTQKNILLVH YNSVSHLHKL KKVLQEASSP VPQEANSSTD 1560
NKPYKCSTCS VAYSQSSTLE IHMRSVLHQT KARAAKLEPS RHLPSGHSIT AAVNSPGQGM 1620
LESMSLASVN SKDTHLDAKE LNKKQTPELI SAQPTHHPPP RSPAQIQMQL QHELQQQAAF 1680
FQPQFLNPAF LPHFPMTPEA LLQFQQPQFL FPFYIPGAEF SLGPDLGLPT STTFGVPGMT 1740
GMAGSLLEDL KQQIQTQHHV GQTQLQFLQQ AQQYQAVQPQ LQPQNQQPPL PQQQQPQQQP 1800
SKLLKQEQGS LASTDCQLMK DMPSYKEAEE VTEKQEKPKQ EFINDTEGLK DSKDIKKQKS 1860
LEPCIPPPRI ASGARGNAAK ALLENFGFEL VIQYNENRQK VQKKGKSGEG ENSDKLECGI 1920
CGKLFSNVLI LKSHQEHVHG QFFPYGALEK FARQYREAYD KLYPISPSSP ETPPPPPPPP 1980
PLPPAPPQPS TLGPVKIPNT VSAPLQAPPP TPPPPPPPPP PPPPPPPPPP PPSAPQQVQL 2040
PVSLDLPLFP SIMMQPVQHP ALPPQLALQL PQMDTLSADL TQLCQQQLGI DPNFLRHSQF 2100
KRPRTRITDD QLKILRAYFD INNSPSEEQI QEMAEKSGLS QKVIKHWFRN TLFKERQRNK 2160
DSPYNFSNPP ITVLEDIRID PQPTSLEHYK SDAAFSKRSS RTRFTDYQLR VLQDFFDTNA 2220
YPKDDEIEQL STVLNLPTRV IVVWFQNARQ KARKSYENQA EAKDNEKREL TNERYIRTSN 2280
MQYQCKKCNV VFPRIFDLIT HQKKQCYKDE DDDAQDESQT EDSMDATDQV LYKHCMVSGQ 2340
TDTAKSTATL VASSGSGTST PLIPSPKPEP EKNSPKTEYP GEKTKQSDPS LPQGTKSAPS 2400
SVLTSSEPQQ ASIPQPPTQP PKQPQLIGRP PSASQTPIPS SPLQISMTSL QNSLPPQLLQ 2460
YPCDQCTIAF PTLELWKEHQ HMHFLAAQNQ FLHSPFLERP MDMPYMIFDP NNPLMTGQLL 2520
GSSLTQMPPQ TSTAHTTAPA SVAASLKRKL EDKEDNNCSE KEGGNSGEDQ HRDKRLRTTI 2580
TPEQLEILYE KYLLDSNPTR KMLDHIAREV GLKKRVVQVW FQNTRARERK GQFRAVGPAQ 2640
SHKRCPFCRA LFKAKSALES HIRSRHWNEG KQAGYSLPPS PLISTEDGGE SPQKYIYFDY 2700
PSLPLTKIDL STENELASTV STPVSKTAEL SPKNLLSPSS FKAECPEDVE NLNAPSADAG 2760
YDQSKTDFDE TSSINTAISD ATTGDEGAAD MENTGGSGEV KPALSPKETK TLDSLQKPAT 2820
TPTTEVCDDK FLFSLTSPSI HFNDKDGDHD QSFYITDDPD DNADRSETSS IADPSSPNPF 2880
GSSNPFKSKS NDRPGHKRFR TQMSNLQLKV LKACFSDYRT PTMQECEMLG NEIGLPKRVV 2940
QVWFQNARAK EKKFKINIGK PFMINQSGTD GTKPECTLCG VKYSARLSIR DHIFSKQHIS 3000
KVRETVGSQL DREKDYLAPT TVRQLMAQQE LDRIKKASDV LGLTVQQQGI TDNCSLHGIS 3060
LQAAYPGLPG LPPVILPGMN GPSSLPGFPQ NSNTLTSPGT GMLGFPSSAT SSPALSLSSG 3120
PTKSLLQTPP PPPPPPPPPS SLSGQQTEPQ NKESEKKQTK PNKVKKIKEE ESEAIKPEKH 3180
PKKEEKISSA LTVLGKVVGE THMDPTQLQA LQNAIAGDPA SFIGGQFLPY FIPGFASYFS 3240
PQLPGTVQGG YLPPICGMES LFPYGPAVPQ TLAGLSPGAL LQQYQQYQQS LQDSLQKQQK 3300
QQQEQQQKPV PAKTAKGEGD QPQSSNEASE TKEEKSTAPE STKEEVQLDS KSAEFSDTCI 3360
VPFVKYEFVC RKCQMMFTDE DATVNHQKSF CYFGQPLIDP QETVLRIPVS KYQCLACDLA 3420
LSGNEALSQH LQSSLHKEKT IKQAMRNAKE HVRLLPHSVC SPPPNTSSTS PSAASSNNTY 3480
PHLSCFSMKS WPNILFQASA RKAASSPSSP PSLSLPSTVT SSLCSTSGVQ TSLPTESCSD 3540
ESDSELSQKL QDLDNSLEVK AKPASGLDGN FNSVRMDMFS V 3581
Gene Ontology
GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
GO:0008270; F:zinc ion binding; IEA:InterPro.
Interpro
IPR017970; Homeobox_CS.
IPR001356; Homeodomain.
IPR009057; Homeodomain-like.
IPR007087; Znf_C2H2.
IPR015880; Znf_C2H2-like.
IPR013087; Znf_C2H2/integrase_DNA-bd.
IPR003604; Znf_U1.
Pfam
PF00046; Homeobox
SMART
SM00389; HOX
SM00355; ZnF_C2H2
SM00451; ZnF_U1
PROSITE
PS00027; HOMEOBOX_1
PS50071; HOMEOBOX_2
PS00028; ZINC_FINGER_C2H2_1
PS50157; ZINC_FINGER_C2H2_2
PRINTS