Tag | Content |
---|
CPLM ID | CPLM-012653 |
UniProt Accession | |
Genbank Protein ID | |
Genbank Nucleotide ID | |
Protein Name | Zinc finger homeobox protein 3 |
Protein Synonyms/Alias | AT motif-binding factor; AT-binding transcription factor 1; Alpha-fetoprotein enhancer-binding protein; Zinc finger homeodomain protein 3; ZFH-3 |
Gene Name | ZFHX3 |
Gene Synonyms/Alias | ATBF1 |
Created Date | July 27, 2013 |
Organism | Homo sapiens (Human) |
NCBI Taxa ID | 9606 |
Lysine Modification | Position: 2013; Peptide: HQNYFPFKQLERFAK; Type: ubiquitination; References: [1] |
Reference | [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments. Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA. Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961] |
Functional Description | Transcriptional repressor. It inhibits the enhancer element of the AFP gene by binding to its AT-rich core sequence. Regulates myoblast differentiation by binding to the AT-rich sequence of the MYF6 promoter and repressing that promoter. Down-regulates the MUC5AC promoter in gastric cancer. |
Sequence Annotation | ZN_FING 282 305 C2H2-type 1. ZN_FING 640 663 C2H2-type 2. ZN_FING 671 694 C2H2-type 3. ZN_FING 726 750 C2H2-type 4. ZN_FING 804 828 C2H2-type 5; atypical. ZN_FING 945 968 C2H2-type 6; degenerate. ZN_FING 984 1008 C2H2-type 7; atypical. ZN_FING 1040 1064 C2H2-type 8; atypical. ZN_FING 1088 1112 C2H2-type 9; atypical. ZN_FING 1223 1246 C2H2-type 10; atypical. ZN_FING 1252 1275 C2H2-type 11. ZN_FING 1360 1385 C2H2-type 12. ZN_FING 1401 1423 C2H2-type 13. ZN_FING 1429 1452 C2H2-type 14. ZN_FING 1545 1569 C2H2-type 15. ZN_FING 1596 1620 C2H2-type 16. ZN_FING 1983 2006 C2H2-type 17. DNA_BIND 2145 2204 Homeobox 1. DNA_BIND 2242 2301 Homeobox 2. ZN_FING 2328 2351 C2H2-type 18; atypical. ZN_FING 2530 2552 C2H2-type 19. DNA_BIND 2641 2700 Homeobox 3. ZN_FING 2711 2734 C2H2-type 20. DNA_BIND 2944 3003 Homeobox 4. ZN_FING 3024 3048 C2H2-type 21. ZN_FING 3529 3553 C2H2-type 22. MOD_RES 533 533 Phosphoserine. MOD_RES 571 571 Phosphoserine. MOD_RES 1197 1197 Phosphoserine. MOD_RES 1590 1590 Phosphoserine (By similarity). |
Keyword | 3D-structure; Activator; Alternative splicing; Complete proteome; DNA-binding; Homeobox; Metal-binding; Myogenesis; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Transcription; Transcription regulation; Zinc; Zinc-finger. |
Sequence Source | UniProt (SWISSPROT/TrEMBL); GenBank; EMBL |
Protein Length | 3703 AA |
Protein Sequence | MEGCDSPVVS GKDNGCGIPQ HQQWTELNST HLPDKPSSME QSTGESHGPL DSLRAPFNER 60 LAESTASAGP PSEPASKEVT CNECSASFAS LQTYMEHHCP SARPPPPLRE ESASDTGEEG 120 DEESDVENLA GEIVYQPDGS AYIVESLSQL TQGGGACGSG SGSGPLPSLF LNSLPGAGGK 180 QGDPSCAAPV YPQIINTFHI ASSFGKWFEG PDQAFPNTSA LAGLSPVLHS FRVFDVRHKS 240 NKDYLNSDGS AKSSCVSKDV PNNVDLSKFD GFVLYGKRKP ILMCFLCKLS FGYVRSFVTH 300 AVHDHRMTLS EDERKILSNK NISAIIQGIG KDKEPLVSFL EPKNKNFQHP LVSTANLIGP 360 GHSFYGKFSG IRMEGEEALP AGSAAGPEQP QAGLLTPSTL LNLGGLTSSV LKTPITSVPL 420 GPLASSPTKS SEGKDSGAAE GEKQEVGDGD CFSEKVEPAE EEAEEEEEEE EAEEEEEEEE 480 EEEEEEEDEG CKGLFPSELD EELEDRPHEE PGAAAGSSSK KDLALSNQSI SNSPLMPNVL 540 QTLSRGTAST SSNSASSFVV FDGANRRNRL SFNSEGVRAN VAEGGRRLDF ADESANKDNA 600 TAPEPNESTE GDDGGFVPHH QHAGSLCELG VGECPSGSGV ECPKCDTVLG SSRSLGGHMT 660 MMHSRNSCKT LKCPKCNWHY KYQQTLEAHM KEKHPEPGGS CVYCKSGQPH PRLARGESYT 720 CGYKPFRCEV CNYSTTTKGN LSIHMQSDKH LNNMQNLQNG GGEQVFSHTA GAAAAAVAAA 780 AAAANISSSC GAPSPTKPKT KPTWRCEVCD YETNVARNLR IHMTSEKHMH NMMLLQQNMT 840 QIQHNRHLGL GSLPSPAEAE LYQYYLAQNM NLPNLKMDSA ASDAQFMMSG FQLDPAGPMA 900 AMTPALVGGE IPLDMRLGGG QLVSEELMNL GESFIQTNDP SLKLFQCAVC NKFTTDNLDM 960 LGLHMNVERS LSEDEWKAVM GDSYQCKLCR YNTQLKANFQ LHCKTDKHVQ KYQLVAHIKE 1020 GGKANEWRLK CVAIGNPVHL KCNACDYYTN SLEKLRLHTV NSRHEASLKL YKHLQQHESG 1080 VEGESCYYHC VLCNYSTKAK LNLIQHVRSM KHQRSESLRK LQRLQKGLPE EDEDLGQIFT 1140 IRRCPSTDPE EAIEDVEGPS ETAADPEELA KDQEGGASSS QAEKELTDSP ATSKRISFPG 1200 SSESPLSSKR PKTAEEIKPE QMYQCPYCKY SNADVNRLRV HAMTQHSVQP MLRCPLCQDM 1260 LNNKIHLQLH LTHLHSVAPD CVEKLIMTVT TPEMVMPSSM FLPAAVPDRD GNSNLEEAGK 1320 QPETSEDLGK NILPSASTEQ SGDLKPSPAD PGSVREDSGF ICWKKGCNQV FKTSAALQTH 1380 FNEVHAKRPQ LPVSDRHVYK YRCNQCSLAF KTIEKLQLHS QYHVIRAATM CCLCQRSFRT 1440 FQALKKHLET SHLELSEADI QQLYGGLLAN GDLLAMGDPT LAEDHTIIVE EDKEEESDLE 1500 DKQSPTGSDS GSVQEDSGSE PKRALPFRKG PNFTMEKFLD PSRPYKCTVC KESFTQKNIL 1560 LVHYNSVSHL HKLKRALQES ATGQPEPTSS PDNKPFKCNT CNVAYSQSST LEIHMRSVLH 1620 QTKARAAKLE AASGSSNGTG NSSSISLSSS TPSPVSTSGS NTFTTSNPSS AGIAPSSNLL 1680 
SQVPTESVGM PPLGNPIGAN IASPSEPKEA NRKKLADMIA SRQQQQQQQQ QQQQQQQQQQ 1740 QAQTLAQAQA QVQAHLQQEL QQQAALIQSQ LFNPTLLPHF PMTTETLLQL QQQQHLLFPF 1800 YIPSAEFQLN PEVSLPVTSG ALTLTGTGPG LLEDLKAQVQ VPQQSHQQIL PQQQQNQLSI 1860 AQSHSALLQP SQHPEKKNKL VIKEKEKESQ RERDSAEGGE GNTGPKETLP DALKAKEKKE 1920 LAPGGGSEPS MLPPRIASDA RGNATKALLE NFGFELVIQY NENKQKVQKK NGKTDQGENL 1980 EKLECDSCGK LFSNILILKS HQEHVHQNYF PFKQLERFAK QYRDHYDKLY PLRPQTPEPP 2040 PPPPPPPPPP LPAAPPQPAS TPAIPASAPP ITSPTIAPAQ PSVPLTQLSM PMELPIFSPL 2100 MMQTMPLQTL PAQLPPQLGP VEPLPADLAQ LYQHQLNPTL LQQQNKRPRT RITDDQLRVL 2160 RQYFDINNSP SEEQIKEMAD KSGLPQKVIK HWFRNTLFKE RQRNKDSPYN FSNPPITSLE 2220 ELKIDSRPPS PEPPKQEYWG SKRSSRTRFT DYQLRVLQDF FDANAYPKDD EFEQLSNLLN 2280 LPTRVIVVWF QNARQKARKN YENQGEGKDG ERRELTNDRY IRTSNLNYQC KKCSLVFQRI 2340 FDLIKHQKKL CYKDEDEEGQ DDSQNEDSMD AMEILTPTSS SCSTPMPSQA YSAPAPSANN 2400 TASSAFLQLT AEAEELATFN SKTEAGDEKP KLAEAPSAQP NQTQEKQGQP KPELQQQEQP 2460 EQKTNTPQQK LPQLVSLPSL PQPPPQAPPP QCPLPQSSPS PSQLSHLPLK PLHTSTPQQL 2520 ANLPPQLIPY QCDQCKLAFP SFEHWQEHQQ LHFLSAQNQF IHPQFLDRSL DMPFMLFDPS 2580 NPLLASQLLS GAIPQIPASS ATSPSTPTST MNTLKRKLEE KASASPGEND SGTGGEEPQR 2640 DKRLRTTITP EQLEILYQKY LLDSNPTRKM LDHIAHEVGL KKRVVQVWFQ NTRARERKGQ 2700 FRAVGPAQAH RRCPFCRALF KAKTALEAHI RSRHWHEAKR AGYNLTLSAM LLDCDGGLQM 2760 KGDIFDGTSF SHLPPSSSDG QGVPLSPVSK TMELSPRTLL SPSSIKVEGI EDFESPSMSS 2820 VNLNFDQTKL DNDDCSSVNT AITDTTTGDE GNADNDSATG IATETKSSSA PNEGLTKAAM 2880 MAMSEYEDRL SSGLVSPAPS FYSKEYDNEG TVDYSETSSL ADPCSPSPGA SGSAGKSGDS 2940 GDRPGQKRFR TQMTNLQLKV LKSCFNDYRT PTMLECEVLG NDIGLPKRVV QVWFQNARAK 3000 EKKSKLSMAK HFGINQTSYE GPKTECTLCG IKYSARLSVR DHIFSQQHIS KVKDTIGSQL 3060 DKEKEYFDPA TVRQLMAQQE LDRIKKANEV LGLAAQQQGM FDNTPLQALN LPTAYPALQG 3120 IPPVLLPGLN SPSLPGFTPS NTALTSPKPN LMGLPSTTVP SPGLPTSGLP NKPSSASLSS 3180 PTPAQATMAM GPQQPPQQQQ QQQQPQVQQP PPPPAAQPPP TPQLPLQQQQ QRKDKDSEKV 3240 KEKEKAHKGK GEPLPVPKKE KGEAPTATAA TISAPLPTME YAVDPAQLQA LQAALTSDPT 3300 ALLTSQFLPY FVPGFSPYYA PQIPGALQSG YLQPMYGMEG LFPYSPALSQ ALMGLSPGSL 3360 LQQYQQYQQS 
LQEAIQQQQQ RQLQQQQQQK VQQQQPKASQ TPVPPGAPSP DKDPAKESPK 3420 PEEQKNTPRE VSPLLPKLPE EPEAESKSAD SLYDPFIVPK VQYKLVCRKC QAGFSDEEAA 3480 RSHLKSLCFF GQSVVNLQEM VLHVPTGGGG GGSGGGGGGG GGGGGGGSYH CLACESALCG 3540 EEALSQHLES ALHKHRTITR AARNAKEHPS LLPHSACFPD PSTASTSQSA AHSNDSPPPP 3600 SAAAPSSASP HASRKSWPQV VSRASAAKPP SFPPLSSSST VTSSSCSTSG VQPSMPTDDY 3660 SEESDTDLSQ KSDGPASPVE GPKDPSCPKD SGLTSVGTDT FRL 3703 |
Gene Ontology | GO:0005739; C:mitochondrion; IEA:Compara. GO:0005667; C:transcription factor complex; IDA:MGI. GO:0003705; F:RNA polymerase II distal enhancer sequence-specific DNA binding transcription factor activity; TAS:UniProtKB. GO:0043565; F:sequence-specific DNA binding; IEA:InterPro. GO:0044212; F:transcription regulatory region DNA binding; IDA:UniProtKB. GO:0008270; F:zinc ion binding; IEA:InterPro. GO:0007517; P:muscle organ development; IEA:UniProtKB-KW. GO:0045662; P:negative regulation of myoblast differentiation; IDA:UniProtKB. GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; IGI:MGI. GO:0045892; P:negative regulation of transcription, DNA-dependent; IDA:UniProtKB. GO:0045663; P:positive regulation of myoblast differentiation; IDA:UniProtKB. |
Interpro | |
Pfam | |
SMART | |
PROSITE | |
PRINTS | |
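
The Lysine Modification entry lists position 2013 with a 15-residue peptide. A minimal sketch of how such a peptide can be checked against the Protein Sequence field follows. It assumes the peptide is a window of 7 residues on each side of the modified lysine, which is inferred from the peptide length and is not stated in the record. The helper name `site_window` and its parameters are hypothetical.

```python
def site_window(fragment: str, fragment_start: int, site: int, flank: int = 7) -> str:
    """Return the residues within `flank` positions of a 1-based site.

    `fragment` is a piece of the protein sequence whose first residue has
    the 1-based position `fragment_start` in the full protein.
    """
    idx = site - fragment_start  # 0-based index of the site inside the fragment
    if not 0 <= idx < len(fragment):
        raise ValueError("site lies outside the fragment")
    lo = max(0, idx - flank)                   # clip at the fragment start
    hi = min(len(fragment), idx + flank + 1)   # clip at the fragment end
    return fragment[lo:hi]


# Residues 2001-2030, copied from the Protein Sequence field of this record.
FRAGMENT = "HQEHVHQNYFPFKQLERFAKQYRDHYDKLY"
FRAGMENT_START = 2001

# Assumption: the record's peptide is the site plus 7 residues on each side.
peptide = site_window(FRAGMENT, FRAGMENT_START, 2013)
print(peptide)
```

With the fragment above, the window around position 2013 reproduces the listed peptide, and its central residue is a lysine (K), consistent with the ubiquitination site reported in [1].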