CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-038471
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Zinc finger homeobox protein 4 
Protein Synonyms/Alias
  
Gene Name
 Zfhx4 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position  Peptide          Type         References
 2960      KFKINIGKPFMINQS  acetylation  [1]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; DNA-binding; Homeobox; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3581 AA 
Protein Sequence
METCDSPPIS RQENGQSTSK LCGMTQLDNE VPEKVAGIEP DRENSSSHDN LKTDERKSEV 60
LLGFSIENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DESETSELED 120
SDVENLTGEI VYQPDGSAYI IEDSKESGQN AQTGANSKLF STAMFLDSLA SAGEKSDQSS 180
TAPVSFYPQI INTFHIASSL GKPFTADPAF PNTSALAGVG PVLHSFRVYD LRHKREKDYL 240
TSDGSAKNSC VSKDVPNNVD LSKFDGCVSD GKRKPVLMCF LCKLSFGYIR SFVTHAVHDH 300
RMTLNDEEQR LLSNKCVSAI IQGIGKDKEP LISFLEPKKS TSVYPNFSTT NLIGPDPTFR 360
GLWSAFHVEN GDSLQAGFAF LKGSASPSSS AEQPLGITHM PKAEVNLGGL SSLVVNTPIT 420
SVSLSHLSSE SSKMSESKDQ ENNCERPKES TILHPNVGCP VKSEPTEPGD EDEEDAYSNE 480
LDDEEVLGEL TDSIGNKDFP LLNQSISPLS SSVLKFIEKG TSSSSGTIAE DTEKKKQAAA 540
AGRSNGNVTN SYSIGGKDFA DGSISRDGTT AAPSETTHGD EDSSTTHQHG FTPSTPGTPG 600
PGGDGSPGNG IECPKCDTVL GSSRSLGGHM TMMHSRNSCK TLKCPKCNWH YKYQQTLEAH 660
MKEKHPEPGG SCVYCKTGQP HPRLARGESY TCGYKPFRCE VCNYSTTTKG NLSIHMQSDK 720
HLNNVQNLQN GNGEQVFGHS APTPNTSLSG CGTPSPSKPK QKPTWRCEVC DYETNVARNL 780
RIHMTSEKHM HNMMLLQQNM KQIQHNLHLG LAPAEAELYQ YYLAQNIGLT GMKLENPAET 840
QLLLNPFQFD SATAAALAPG LGELSPYISD PALKLFQCAV CNKFTSDSLE ALSVHVNSER 900
SLPEEEWRAV IGDIYQCKLC NYNTQLKANF QLHCKTDKHM QKYQLVAHIK EGGKSNEWRL 960
KCIAIGNPVH LKCNACDYYT NSVDKLRLHT TNHRHEAALK LYKHLQKQEG AVNSESCYYY 1020
CAVCDYSSKI KLNLVQHVRS VKHQQTEGLR KLQLHQQGLP SEEDNLSEIF FVKECPANEL 1080
ETASLGARNG EDELIEQQLK AASEEPSEDA GDPLKPPTVA EDDEKEAHKR DNSEGKISTK 1140
DPEVIVPEKE LKVVTGATQP LLLAKEDNTG TKRSKPTEDN KFCPEQFYQC PYCNYNSRDQ 1200
SRIQMHVLSQ HSVQPVICCP LCQDVLSNKM HLQLHLTHLH SVSPDCVEKL LMTVPVPDVM 1260
MPNSMLLPAA APEKSEQDPP TALTAEGSGK YSGDSPVDDK SMSGLEDSKV GVEIKNEEQK 1320
PAKEPVEASE WNKTSSKDVN ISDALQDQLN EQQKRQPLSV SDRHVYKYRC NHCSLAFKTM 1380
QKLQIHSQYH AIRAATMCTL CQRSFRTFQA LKKHLEAGHP ELSEAELQQL YASLPMNGEL 1440
WAESETMTQD DHGIDQEMER EYEVDHEGKA SPVESDSSSI PDDLGLEPKR TLPFRKGPNF 1500
TMEKFLDPSR PYKCTVCKES FTQKNILLVH YNSVSHLHKL KKVLQEASSP VPQEANSSTD 1560
NKPYKCSTCS VAYSQSSTLE IHMRSVLHQT KARAAKLEPS RHLPSGHSIT AAVNSPGQGM 1620
LESMSLASVN SKDTHLDAKE LNKKQTPELI SAQPTHHPPP RSPAQIQMQL QHELQQQAAF 1680
FQPQFLNPAF LPHFPMTPEA LLQFQQPQFL FPFYIPGAEF SLGPDLGLPT STTFGVPGMT 1740
GMAGSLLEDL KQQIQTQHHV GQTQLQFLQQ AQQYQAVQPQ LQPQNQQPPL PQQQQPQQQP 1800
SKLLKQEQGS LASTDCQLMK DMPSYKEAEE VTEKQEKPKQ EFINDTEGLK DSKDIKKQKS 1860
LEPCIPPPRI ASGARGNAAK ALLENFGFEL VIQYNENRQK VQKKGKSGEG ENSDKLECGI 1920
CGKLFSNVLI LKSHQEHVHG QFFPYGALEK FARQYREAYD KLYPISPSSP ETPPPPPPPP 1980
PLPPAPPQPS TLGPVKIPNT VSAPLQAPPP TPPPPPPPPP PPPPPPPPPP PPSAPQQVQL 2040
PVSLDLPLFP SIMMQPVQHP ALPPQLALQL PQMDTLSADL TQLCQQQLGI DPNFLRHSQF 2100
KRPRTRITDD QLKILRAYFD INNSPSEEQI QEMAEKSGLS QKVIKHWFRN TLFKERQRNK 2160
DSPYNFSNPP ITVLEDIRID PQPTSLEHYK SDAAFSKRSS RTRFTDYQLR VLQDFFDTNA 2220
YPKDDEIEQL STVLNLPTRV IVVWFQNARQ KARKSYENQA EAKDNEKREL TNERYIRTSN 2280
MQYQCKKCNV VFPRIFDLIT HQKKQCYKDE DDDAQDESQT EDSMDATDQV LYKHCMVSGQ 2340
TDTAKSTATL VASSGSGTST PLIPSPKPEP EKNSPKTEYP GEKTKQSDPS LPQGTKSAPS 2400
SVLTSSEPQQ ASIPQPPTQP PKQPQLIGRP PSASQTPIPS SPLQISMTSL QNSLPPQLLQ 2460
YPCDQCTIAF PTLELWKEHQ HMHFLAAQNQ FLHSPFLERP MDMPYMIFDP NNPLMTGQLL 2520
GSSLTQMPPQ TSTAHTTAPA SVAASLKRKL EDKEDNNCSE KEGGNSGEDQ HRDKRLRTTI 2580
TPEQLEILYE KYLLDSNPTR KMLDHIAREV GLKKRVVQVW FQNTRARERK GQFRAVGPAQ 2640
SHKRCPFCRA LFKAKSALES HIRSRHWNEG KQAGYSLPPS PLISTEDGGE SPQKYIYFDY 2700
PSLPLTKIDL STENELASTV STPVSKTAEL SPKNLLSPSS FKAECPEDVE NLNAPSADAG 2760
YDQSKTDFDE TSSINTAISD ATTGDEGAAD MENTGGSGEV KPALSPKETK TLDSLQKPAT 2820
TPTTEVCDDK FLFSLTSPSI HFNDKDGDHD QSFYITDDPD DNADRSETSS IADPSSPNPF 2880
GSSNPFKSKS NDRPGHKRFR TQMSNLQLKV LKACFSDYRT PTMQECEMLG NEIGLPKRVV 2940
QVWFQNARAK EKKFKINIGK PFMINQSGTD GTKPECTLCG VKYSARLSIR DHIFSKQHIS 3000
KVRETVGSQL DREKDYLAPT TVRQLMAQQE LDRIKKASDV LGLTVQQQGI TDNCSLHGIS 3060
LQAAYPGLPG LPPVILPGMN GPSSLPGFPQ NSNTLTSPGT GMLGFPSSAT SSPALSLSSG 3120
PTKSLLQTPP PPPPPPPPPS SLSGQQTEPQ NKESEKKQTK PNKVKKIKEE ESEAIKPEKH 3180
PKKEEKISSA LTVLGKVVGE THMDPTQLQA LQNAIAGDPA SFIGGQFLPY FIPGFASYFS 3240
PQLPGTVQGG YLPPICGMES LFPYGPAVPQ TLAGLSPGAL LQQYQQYQQS LQDSLQKQQK 3300
QQQEQQQKPV PAKTAKGEGD QPQSSNEASE TKEEKSTAPE STKEEVQLDS KSAEFSDTCI 3360
VPFVKYEFVC RKCQMMFTDE DATVNHQKSF CYFGQPLIDP QETVLRIPVS KYQCLACDLA 3420
LSGNEALSQH LQSSLHKEKT IKQAMRNAKE HVRLLPHSVC SPPPNTSSTS PSAASSNNTY 3480
PHLSCFSMKS WPNILFQASA RKAASSPSSP PSLSLPSTVT SSLCSTSGVQ TSLPTESCSD 3540
ESDSELSQKL QDLDNSLEVK AKPASGLDGN FNSVRMDMFS V 3581 
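The modification site listed above can be cross-checked against the sequence block. The following is a minimal sketch, not part of the CPLM record: it assumes each sequence line ends with the 1-based index of its last residue and that the reported peptide is the site residue plus 7 residues on each side. The helper names `parse_block` and `site_window` are hypothetical. Only the two lines covering residues 2881-3000 are copied in, to keep the example short.

```python
import re

# Two lines copied from the Protein Sequence block of this record.
# Assumption: the trailing number is the 1-based index of the last residue on the line.
BLOCK = """\
GSSNPFKSKS NDRPGHKRFR TQMSNLQLKV LKACFSDYRT PTMQECEMLG NEIGLPKRVV 2940
QVWFQNARAK EKKFKINIGK PFMINQSGTD GTKPECTLCG VKYSARLSIR DHIFSKQHIS 3000
"""


def parse_block(block):
    """Return (start_position, sequence) for a numbered sequence block."""
    pieces = []
    last_index = None
    for line in block.splitlines():
        line = line.strip()
        if not line:
            continue
        # Residue groups separated by spaces, then the running residue count.
        match = re.fullmatch(r"([A-Z ]+?)\s+(\d+)", line)
        if match is None:
            raise ValueError(f"unexpected sequence line: {line!r}")
        pieces.append(match.group(1).replace(" ", ""))
        last_index = int(match.group(2))
    if last_index is None:
        raise ValueError("empty sequence block")
    sequence = "".join(pieces)
    start = last_index - len(sequence) + 1
    return start, sequence


def site_window(start, sequence, position, flank=7):
    """Return the residues from position - flank to position + flank (1-based)."""
    offset = position - start
    if not 0 <= offset < len(sequence):
        raise ValueError("position lies outside the parsed block")
    return sequence[max(0, offset - flank): offset + flank + 1]


start, sequence = parse_block(BLOCK)
window = site_window(start, sequence, 2960)
print(start, sequence[2960 - start], window)
```

With the full 3581-residue block pasted into `BLOCK`, the same helpers would apply unchanged, with `start` expected to come out as 1.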
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR007087; Znf_C2H2.
 IPR015880; Znf_C2H2-like.
 IPR013087; Znf_C2H2/integrase_DNA-bd.
 IPR003604; Znf_U1. 
Pfam
 PF00046; Homeobox 
SMART
 SM00389; HOX
 SM00355; ZnF_C2H2
 SM00451; ZnF_U1 
PROSITE
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2
 PS00028; ZINC_FINGER_C2H2_1
 PS50157; ZINC_FINGER_C2H2_2 
PRINTS