CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-037178
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Zinc finger homeobox protein 4 
Protein Synonyms/Alias
  
Gene Name
 ZFHX4 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type         References
 2967      KFKINIGKPFMINQG  acetylation  [1]
Reference
 [1] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773]
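
The Position and Peptide values in the Lysine Modification entry appear to follow a 1-based residue index, with the peptide being a 15-residue window centered on the modified lysine. That convention is inferred from this record, not stated by it. A minimal Python sketch under that assumption, using only the sequence line that ends at residue 3000 (the names `peptide_window`, `FRAGMENT`, and `FRAGMENT_START`, and the default `flank=7`, are illustrative):

```python
# Residues 2941-3000 of the protein sequence, copied from this record.
FRAGMENT = (
    "GLPKRVVQVW" "FQNARAKEKK" "FKINIGKPFM"
    "INQGGTEGTK" "PECTLCGVKY" "SARLSIRDHI"
)
FRAGMENT_START = 2941  # 1-based position of the first residue in FRAGMENT


def peptide_window(fragment, fragment_start, position, flank=7):
    """Return residues position - flank .. position + flank (1-based, inclusive).

    The 7-residue flank is an assumption inferred from the 15-residue peptide.
    """
    center = position - fragment_start  # 0-based index within the fragment
    if center - flank < 0 or center + flank >= len(fragment):
        raise ValueError("window extends beyond the supplied fragment")
    return fragment[center - flank:center + flank + 1]


window = peptide_window(FRAGMENT, FRAGMENT_START, 2967)
print(window[7], window)  # prints: K KFKINIGKPFMINQG
```

The window matches the listed peptide, and its central residue is a lysine, which is consistent with the reported acetylation site at position 2967.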
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; DNA-binding; Homeobox; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3590 AA 
Protein Sequence
METCDSPPIS RQENGQSTSK LCGTTQLDNE VPEKVAGMEP DRENSSTDDN LKTDERKSEA 60
LLGFSVENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DNESEISELE 120
DSDVENLTGE IVYQPDGSAY IIEDSKESGQ NAQTGANSKL FSTAMFLDSL ASAGEKSDQS 180
ASAPMSFYPQ IINTFHIASS LGKPFTADQA FPNTSALAGV GPVLHSFRVY DLRHKREKDY 240
LTSDGSAKNS CVSKDVPNNV DLSKFDGCVS DGKRKPVLMC FLCKLSFGYI RSFVTHAVHD 300
HRMTLNDEEQ KLLSNKCVSA IIQGIGKDKE PLISFLEPKK STSVYPHFST TNLIGPDPTF 360
RGLWSAFHVE NGDSLPAGFA FLKGSASTSS SAEQPLGITQ MPKAEVNLGG LSSLVVNTPI 420
TSVSLSHSSS ESSKMSESKD QENNCERPKE SNVLHPNGEC PVKSEPTEPG DEDEEDAYSN 480
ELDDEEVLGE LTDSIGNKDF PLLNQSISPL SSSVLKFIEK GTSSSSATVS DDTEKKKQTA 540
AVRASGSVAS NYGISGKDFA DASASKDSAT AAHPSEIARG DEDSSATPHQ HGFTPSTPGT 600
PGPGGDGSPG SGIECPKCDT VLGSSRSLGG HMTMMHSRNS CKTLKCPKCN WHYKYQQTLE 660
AHMKEKHPEP GGSCVYCKTG QPHPRLARGE SYTCGYKPFR CEVCNYSTTT KGNLSIHMQS 720
DKHLNNVQNL QNGNGEQVFG HSAPAPNTSL SGCGTPSPSK PKQKPTWRCE VCDYETNVAR 780
NLRIHMTSEK HMHNMMLLQQ NMKQIQHNLH LGLAPAEAEL YQYYLAQNIG LTGMKLENPA 840
DPQLMINPFQ LDPATAAALA PGLGELSPYI SDPALKLFQC AVCNKFTSDS LEALSVHVSS 900
ERSLPEEEWR AVIGDIYQCK LCNYNTQLKA NFQLHCKTDK HMQKYQLVAH IKEGGKSNEW 960
RLKCIAIGNP VHLKCNACDY YTNSVDKLRL HTTNHRHEAA LKLYKHLQKQ EGAVNPESCY 1020
YYCAVCDYTT KVKLNLVQHV RSVKHQQTEG LRKLQLHQQG LAPEEDNLSE IFFVKDCPPN 1080
ELETASLGAR TCDDDLTEQQ LRSTSEEQSE EAEGAIKPTA VAEDDEKDTS ERDNSEGKNS 1140
NKDSGIITPE KELKVSVAGG TQPLLLAKEE DVATKRSKPT EDNKFCHEQF YQCPYCNYNS 1200
RDQSRIQMHV LSQHSVQPVI CCPLCQDVLS NKMHLQLHLT HLHSVSPDCV EKLLMTVPVP 1260
DVMMPNSMLL PAAASEKSER DTPAAVTAEG SGKYSGESPM DDKSMAGLED SKANVEVKNE 1320
EQKPTKEPLE VSEWNKNSSK DVKIPDTLQD QLNEQQKRQP LSVSDRHVYK YRCNHCSLAF 1380
KTMQKLQIHS QYHAIRAATM CNLCQRSFRT FQALKKHLEA GHPELSEAEL QQLYASLPVN 1440
GELWAESETM SQDDHGLEQE MEREYEVDHE GKASPVGSDS SSIPDDMGSE PKRTLPFRKG 1500
PNFTMEKFLD PSRPYKCTVC KESFTQKNIL LVHYNSVSHL HKLKKVLQEA SSPVPQETNS 1560
NTDNKPYKCS ICNVAYSQSS TLEIHMRSVL HQTKARAAKL EPSGHVAGGH SIAANVNSPG 1620
QGMLDSMSLA AVNSKDTHLD AKELNKKQTP DLISAQPAHH PPQSPAQIQM QLQHELQQQA 1680
AFFQPQFLNP AFLPHFPMTP EALLQFQQPQ FLFPFYIPGT EFSLGPDLGL PGSATFGMPG 1740
MTGMAGSLLE DLKQQIQTQH HVGQTQLQIL QQQAQQYQAT QPQLQPQKQQ QQPPPPQQQQ 1800
QQQASKLLKQ EQSNIVSADC QIMKDVPSYK EAEDISEKPE KPKQEFISEG EGLKEGKDTK 1860
KQKSLEPSIP PPRIASGARG NAAKALLENF GFELVIQYNE NRQKVQKKGK SGEGENTDKL 1920
ECGTCGKLFS NVLILKSHQE HVHGQFFPYA ALEKFARQYR EAYDKLYPIS PSSPETPPPP 1980
PPPPPLPPAP PQPSSMGPVK IPNTVSTPLQ APPPTPPPPP PPPPPPPPPP PPPPPSAPPQ 2040
VQLPVSLDLP LFPSIMMQPV QHPALPPQLA LQLPQMDALS ADLTQLCQQQ LGLDPNFLRH 2100
SQFKRPRTRI TDDQLKILRA YFDINNSPSE EQIQEMAEKS GLSQKVIKHW FRNTLFKERQ 2160
RNKDSPYNFS NPPITVLEDI RIDPQPTSLE HYKSDASFSK RSSRTRFTDY QLRVLQDFFD 2220
TNAYPKDDEI EQLSTVLNLP TRVIVVWFQN ARQKARKSYE NQAETKDNEK RELTNERYIR 2280
TSNMQYQCKK CNVVFPRIFD LITHQKKQCY KDEDDDAQDE SQTEDSMDAT DQVVYKHCTV 2340
SGQTDAAKNA AAPAASSGSG TSTPLIPSPK PEPEKTSPKP EYPAEKPKQS DPSPPSQGTK 2400
PALPLASTSS DPPQASTAQP QPQPQPPKQP QLIGRPPSAS QTPVPSSPLQ ISMTSLQNSL 2460
PPQLLQYQCD QCTVAFPTLE LWQEHQHMHF LAAQNQFLHS PFLERPMDMP YMIFDPNNPL 2520
MTGQLLGSSL TQMPPQASSS HTTAPTTVAA SLKRKLDDKE DNNCSEKEGG NSGEDQHRDK 2580
RLRTTITPEQ LEILYEKYLL DSNPTRKMLD HIAREVGLKK RVVQVWFQNT RARERKGQFR 2640
AVGPAQSHKR CPFCRALFKA KSALESHIRS RHWNEGKQAG YSLPPSPLIS TEDGGESPQK 2700
YIYFDYPSLP LTKIDLSSEN ELASTVSTPV SKTAELSPKN LLSPSSFKAE CSEDVENLNA 2760
PPAEAGYDQN KTDFDETSSI NTAISDATTG DEGNTEMEST TGSSGDVKPA LSPKEPKTLD 2820
TLPKPATTPT TEVCDDKFLF SLTSPSIHFN DKDGDHDQSF YITDDPDDNA DRSETSSIAD 2880
PSSPNPFGSS NPFKSKSNDR PGHKRFRTQM SNLQLKVLKA CFSDYRTPTM QECEMLGNEI 2940
GLPKRVVQVW FQNARAKEKK FKINIGKPFM INQGGTEGTK PECTLCGVKY SARLSIRDHI 3000
FSKQHISKVR ETVGSQLDRE KDYLAPTTVR QLMAQQELDR IKKASDVLGL TVQQPGMMDS 3060
SSLHGISLPT AYPGLPGLPP VLLPGMNGPS SLPGFPQNSN TLTPPGAGML GFPTSATSSP 3120
ALSLSSAPTK PLLQTPPPPP PPPPPPPSSS LSGQQTEQQN KESEKKQTKP NKVKKIKEEE 3180
LEATKPEKHP KKEEKISSAL SVLGKVVGET HVDPIQLQAL QNAIAGDPAS FIGGQFLPYF 3240
IPGFASYFTP QLPGTVQGGY FPPVCGMESL FPYGPTMPQT LAGLSPGALL QQYQQYQQNL 3300
QESLQKQQKQ QQEQQQKPVQ AKTSKVESDQ PQNSNDASET KEDKSTATES TKEEPQLESK 3360
SADFSDTYVV PFVKYEFICR KCQMMFTDED AAVNHQKSFC YFGQPLIDPQ ETVLRVPVSK 3420
YQCLACDVAI SGNEALSQHL QSSLHKEKTI KQAMRNAKEH VRLLPHSVCS PNPNTTSTSQ 3480
SAASSNNTYP HLSCFSMKSW PNILFQASAR RAASPPSSPP SLSLPSTVTS SLCSTSGVQT 3540
SLPTESCSDE SDSELSQKLE DLDNSLEVKA KPASGLDGNF NSIRMDMFSV 3590 
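
The sequence above is laid out in blocks of 10 residues, 60 per line, with the running residue count at the end of each line. A small Python sketch for turning lines in that layout back into a plain string follows. `parse_sequence_lines` is a hypothetical helper, and the sample is just the first two lines of this record:

```python
def parse_sequence_lines(lines):
    """Join block-formatted sequence lines, checking each trailing residue count."""
    blocks = []
    for line in lines:
        # The last whitespace-separated token is the cumulative residue count.
        *line_blocks, count = line.split()
        blocks.extend(line_blocks)
        total = sum(len(block) for block in blocks)
        if total != int(count):
            raise ValueError(f"expected {count} residues, found {total}")
    return "".join(blocks)


SAMPLE = [
    "METCDSPPIS RQENGQSTSK LCGTTQLDNE VPEKVAGMEP DRENSSTDDN LKTDERKSEA 60",
    "LLGFSVENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DNESEISELE 120",
]

sequence = parse_sequence_lines(SAMPLE)
print(len(sequence), sequence[:10])  # prints: 120 METCDSPPIS
```

Run over every line of the block, the same helper should return a string whose length can be compared against the Protein Length field.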
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro. 
Interpro
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR007087; Znf_C2H2.
 IPR015880; Znf_C2H2-like.
 IPR013087; Znf_C2H2/integrase_DNA-bd.
 IPR003604; Znf_U1. 
Pfam
 PF00046; Homeobox
 PF00096; zf-C2H2 
SMART
 SM00389; HOX
 SM00355; ZnF_C2H2
 SM00451; ZnF_U1 
PROSITE
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2
 PS00028; ZINC_FINGER_C2H2_1
 PS50157; ZINC_FINGER_C2H2_2 
PRINTS