CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-012653
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Zinc finger homeobox protein 3 
Protein Synonyms/Alias
 AT motif-binding factor; AT-binding transcription factor 1; Alpha-fetoprotein enhancer-binding protein; Zinc finger homeodomain protein 3; ZFH-3 
Gene Name
 ZFHX3 
Gene Synonyms/Alias
 ATBF1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position    Peptide            Type              References
2013        HQNYFPFKQLERFAK    ubiquitination    [1]
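The reported position can be cross-checked against the protein sequence listed in this record. The following is a minimal sketch, not part of CPLM itself: the function name `lysine_positions` and the constants are hypothetical, and the fragment is copied from the sequence row ending at residue 2040 (residues 1981 to 2040).

```python
# Residues 1981-2040 of the protein sequence in this record
# (copied from the sequence row whose running count is 2040).
FRAGMENT = "EKLECDSCGKLFSNILILKSHQEHVHQNYFPFKQLERFAKQYRDHYDKLYPLRPQTPEPP"
FRAGMENT_START = 1981  # 1-based position of the first residue in FRAGMENT


def lysine_positions(peptide, fragment, fragment_start):
    """Return the 1-based protein positions of every lysine (K) in peptide.

    Hypothetical helper: locates the peptide inside the fragment and
    converts each K index into a protein coordinate.
    """
    offset = fragment.find(peptide)
    if offset < 0:
        raise ValueError("peptide not found in fragment")
    peptide_start = fragment_start + offset
    return [peptide_start + i for i, aa in enumerate(peptide) if aa == "K"]


positions = lysine_positions("HQNYFPFKQLERFAK", FRAGMENT, FRAGMENT_START)
print(positions)
```

The peptide contains two lysines. The internal one falls at position 2013, which agrees with the position listed in the table.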
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
 Transcriptional repressor. It inhibits the enhancer element of the AFP gene by binding to its AT-rich core sequence. Regulator of myoblast differentiation through binding to the AT-rich sequence of the MYF6 promoter and repression of that promoter. Down-regulates the MUC5AC promoter in gastric cancer. 
Sequence Annotation
 ZN_FING 282 305 C2H2-type 1.
 ZN_FING 640 663 C2H2-type 2.
 ZN_FING 671 694 C2H2-type 3.
 ZN_FING 726 750 C2H2-type 4.
 ZN_FING 804 828 C2H2-type 5; atypical.
 ZN_FING 945 968 C2H2-type 6; degenerate.
 ZN_FING 984 1008 C2H2-type 7; atypical.
 ZN_FING 1040 1064 C2H2-type 8; atypical.
 ZN_FING 1088 1112 C2H2-type 9; atypical.
 ZN_FING 1223 1246 C2H2-type 10; atypical.
 ZN_FING 1252 1275 C2H2-type 11.
 ZN_FING 1360 1385 C2H2-type 12.
 ZN_FING 1401 1423 C2H2-type 13.
 ZN_FING 1429 1452 C2H2-type 14.
 ZN_FING 1545 1569 C2H2-type 15.
 ZN_FING 1596 1620 C2H2-type 16.
 ZN_FING 1983 2006 C2H2-type 17.
 DNA_BIND 2145 2204 Homeobox 1.
 DNA_BIND 2242 2301 Homeobox 2.
 ZN_FING 2328 2351 C2H2-type 18; atypical.
 ZN_FING 2530 2552 C2H2-type 19.
 DNA_BIND 2641 2700 Homeobox 3.
 ZN_FING 2711 2734 C2H2-type 20.
 DNA_BIND 2944 3003 Homeobox 4.
 ZN_FING 3024 3048 C2H2-type 21.
 ZN_FING 3529 3553 C2H2-type 22.
 MOD_RES 533 533 Phosphoserine.
 MOD_RES 571 571 Phosphoserine.
 MOD_RES 1197 1197 Phosphoserine.
 MOD_RES 1590 1590 Phosphoserine (By similarity).  
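The annotation lines above follow a fixed layout: feature key, start, end, then a free-text note. A minimal sketch of reading them, assuming whitespace-separated fields as shown here (the names `Feature`, `parse_feature` and `overlapping` are hypothetical, and only three of the lines are reproduced):

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str    # e.g. ZN_FING, DNA_BIND, MOD_RES
    start: int  # 1-based start residue
    end: int    # 1-based end residue (inclusive)
    note: str   # free-text description without the trailing period


def parse_feature(line):
    """Split one annotation line into key, start, end and note."""
    key, start, end, note = line.split(None, 3)
    return Feature(key, int(start), int(end), note.strip().rstrip("."))


def overlapping(features, position):
    """Return the features whose range contains the given residue."""
    return [f for f in features if f.start <= position <= f.end]


# Three lines copied from the Sequence Annotation section of this record.
LINES = [
    " ZN_FING 1983 2006 C2H2-type 17.",
    " DNA_BIND 2145 2204 Homeobox 1.",
    " MOD_RES 1590 1590 Phosphoserine (By similarity).  ",
]
features = [parse_feature(line) for line in LINES]

print(overlapping(features, 2000))
print(overlapping(features, 2013))
```

With these three lines, residue 2000 falls inside zinc finger 17, while the modified lysine at 2013 lies just past its end (2006) and before Homeobox 1 (2145).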
Keyword
 3D-structure; Activator; Alternative splicing; Complete proteome; DNA-binding; Homeobox; Metal-binding; Myogenesis; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Transcription; Transcription regulation; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3703 AA 
Protein Sequence
MEGCDSPVVS GKDNGCGIPQ HQQWTELNST HLPDKPSSME QSTGESHGPL DSLRAPFNER 60
LAESTASAGP PSEPASKEVT CNECSASFAS LQTYMEHHCP SARPPPPLRE ESASDTGEEG 120
DEESDVENLA GEIVYQPDGS AYIVESLSQL TQGGGACGSG SGSGPLPSLF LNSLPGAGGK 180
QGDPSCAAPV YPQIINTFHI ASSFGKWFEG PDQAFPNTSA LAGLSPVLHS FRVFDVRHKS 240
NKDYLNSDGS AKSSCVSKDV PNNVDLSKFD GFVLYGKRKP ILMCFLCKLS FGYVRSFVTH 300
AVHDHRMTLS EDERKILSNK NISAIIQGIG KDKEPLVSFL EPKNKNFQHP LVSTANLIGP 360
GHSFYGKFSG IRMEGEEALP AGSAAGPEQP QAGLLTPSTL LNLGGLTSSV LKTPITSVPL 420
GPLASSPTKS SEGKDSGAAE GEKQEVGDGD CFSEKVEPAE EEAEEEEEEE EAEEEEEEEE 480
EEEEEEEDEG CKGLFPSELD EELEDRPHEE PGAAAGSSSK KDLALSNQSI SNSPLMPNVL 540
QTLSRGTAST SSNSASSFVV FDGANRRNRL SFNSEGVRAN VAEGGRRLDF ADESANKDNA 600
TAPEPNESTE GDDGGFVPHH QHAGSLCELG VGECPSGSGV ECPKCDTVLG SSRSLGGHMT 660
MMHSRNSCKT LKCPKCNWHY KYQQTLEAHM KEKHPEPGGS CVYCKSGQPH PRLARGESYT 720
CGYKPFRCEV CNYSTTTKGN LSIHMQSDKH LNNMQNLQNG GGEQVFSHTA GAAAAAVAAA 780
AAAANISSSC GAPSPTKPKT KPTWRCEVCD YETNVARNLR IHMTSEKHMH NMMLLQQNMT 840
QIQHNRHLGL GSLPSPAEAE LYQYYLAQNM NLPNLKMDSA ASDAQFMMSG FQLDPAGPMA 900
AMTPALVGGE IPLDMRLGGG QLVSEELMNL GESFIQTNDP SLKLFQCAVC NKFTTDNLDM 960
LGLHMNVERS LSEDEWKAVM GDSYQCKLCR YNTQLKANFQ LHCKTDKHVQ KYQLVAHIKE 1020
GGKANEWRLK CVAIGNPVHL KCNACDYYTN SLEKLRLHTV NSRHEASLKL YKHLQQHESG 1080
VEGESCYYHC VLCNYSTKAK LNLIQHVRSM KHQRSESLRK LQRLQKGLPE EDEDLGQIFT 1140
IRRCPSTDPE EAIEDVEGPS ETAADPEELA KDQEGGASSS QAEKELTDSP ATSKRISFPG 1200
SSESPLSSKR PKTAEEIKPE QMYQCPYCKY SNADVNRLRV HAMTQHSVQP MLRCPLCQDM 1260
LNNKIHLQLH LTHLHSVAPD CVEKLIMTVT TPEMVMPSSM FLPAAVPDRD GNSNLEEAGK 1320
QPETSEDLGK NILPSASTEQ SGDLKPSPAD PGSVREDSGF ICWKKGCNQV FKTSAALQTH 1380
FNEVHAKRPQ LPVSDRHVYK YRCNQCSLAF KTIEKLQLHS QYHVIRAATM CCLCQRSFRT 1440
FQALKKHLET SHLELSEADI QQLYGGLLAN GDLLAMGDPT LAEDHTIIVE EDKEEESDLE 1500
DKQSPTGSDS GSVQEDSGSE PKRALPFRKG PNFTMEKFLD PSRPYKCTVC KESFTQKNIL 1560
LVHYNSVSHL HKLKRALQES ATGQPEPTSS PDNKPFKCNT CNVAYSQSST LEIHMRSVLH 1620
QTKARAAKLE AASGSSNGTG NSSSISLSSS TPSPVSTSGS NTFTTSNPSS AGIAPSSNLL 1680
SQVPTESVGM PPLGNPIGAN IASPSEPKEA NRKKLADMIA SRQQQQQQQQ QQQQQQQQQQ 1740
QAQTLAQAQA QVQAHLQQEL QQQAALIQSQ LFNPTLLPHF PMTTETLLQL QQQQHLLFPF 1800
YIPSAEFQLN PEVSLPVTSG ALTLTGTGPG LLEDLKAQVQ VPQQSHQQIL PQQQQNQLSI 1860
AQSHSALLQP SQHPEKKNKL VIKEKEKESQ RERDSAEGGE GNTGPKETLP DALKAKEKKE 1920
LAPGGGSEPS MLPPRIASDA RGNATKALLE NFGFELVIQY NENKQKVQKK NGKTDQGENL 1980
EKLECDSCGK LFSNILILKS HQEHVHQNYF PFKQLERFAK QYRDHYDKLY PLRPQTPEPP 2040
PPPPPPPPPP LPAAPPQPAS TPAIPASAPP ITSPTIAPAQ PSVPLTQLSM PMELPIFSPL 2100
MMQTMPLQTL PAQLPPQLGP VEPLPADLAQ LYQHQLNPTL LQQQNKRPRT RITDDQLRVL 2160
RQYFDINNSP SEEQIKEMAD KSGLPQKVIK HWFRNTLFKE RQRNKDSPYN FSNPPITSLE 2220
ELKIDSRPPS PEPPKQEYWG SKRSSRTRFT DYQLRVLQDF FDANAYPKDD EFEQLSNLLN 2280
LPTRVIVVWF QNARQKARKN YENQGEGKDG ERRELTNDRY IRTSNLNYQC KKCSLVFQRI 2340
FDLIKHQKKL CYKDEDEEGQ DDSQNEDSMD AMEILTPTSS SCSTPMPSQA YSAPAPSANN 2400
TASSAFLQLT AEAEELATFN SKTEAGDEKP KLAEAPSAQP NQTQEKQGQP KPELQQQEQP 2460
EQKTNTPQQK LPQLVSLPSL PQPPPQAPPP QCPLPQSSPS PSQLSHLPLK PLHTSTPQQL 2520
ANLPPQLIPY QCDQCKLAFP SFEHWQEHQQ LHFLSAQNQF IHPQFLDRSL DMPFMLFDPS 2580
NPLLASQLLS GAIPQIPASS ATSPSTPTST MNTLKRKLEE KASASPGEND SGTGGEEPQR 2640
DKRLRTTITP EQLEILYQKY LLDSNPTRKM LDHIAHEVGL KKRVVQVWFQ NTRARERKGQ 2700
FRAVGPAQAH RRCPFCRALF KAKTALEAHI RSRHWHEAKR AGYNLTLSAM LLDCDGGLQM 2760
KGDIFDGTSF SHLPPSSSDG QGVPLSPVSK TMELSPRTLL SPSSIKVEGI EDFESPSMSS 2820
VNLNFDQTKL DNDDCSSVNT AITDTTTGDE GNADNDSATG IATETKSSSA PNEGLTKAAM 2880
MAMSEYEDRL SSGLVSPAPS FYSKEYDNEG TVDYSETSSL ADPCSPSPGA SGSAGKSGDS 2940
GDRPGQKRFR TQMTNLQLKV LKSCFNDYRT PTMLECEVLG NDIGLPKRVV QVWFQNARAK 3000
EKKSKLSMAK HFGINQTSYE GPKTECTLCG IKYSARLSVR DHIFSQQHIS KVKDTIGSQL 3060
DKEKEYFDPA TVRQLMAQQE LDRIKKANEV LGLAAQQQGM FDNTPLQALN LPTAYPALQG 3120
IPPVLLPGLN SPSLPGFTPS NTALTSPKPN LMGLPSTTVP SPGLPTSGLP NKPSSASLSS 3180
PTPAQATMAM GPQQPPQQQQ QQQQPQVQQP PPPPAAQPPP TPQLPLQQQQ QRKDKDSEKV 3240
KEKEKAHKGK GEPLPVPKKE KGEAPTATAA TISAPLPTME YAVDPAQLQA LQAALTSDPT 3300
ALLTSQFLPY FVPGFSPYYA PQIPGALQSG YLQPMYGMEG LFPYSPALSQ ALMGLSPGSL 3360
LQQYQQYQQS LQEAIQQQQQ RQLQQQQQQK VQQQQPKASQ TPVPPGAPSP DKDPAKESPK 3420
PEEQKNTPRE VSPLLPKLPE EPEAESKSAD SLYDPFIVPK VQYKLVCRKC QAGFSDEEAA 3480
RSHLKSLCFF GQSVVNLQEM VLHVPTGGGG GGSGGGGGGG GGGGGGGSYH CLACESALCG 3540
EEALSQHLES ALHKHRTITR AARNAKEHPS LLPHSACFPD PSTASTSQSA AHSNDSPPPP 3600
SAAAPSSASP HASRKSWPQV VSRASAAKPP SFPPLSSSST VTSSSCSTSG VQPSMPTDDY 3660
SEESDTDLSQ KSDGPASPVE GPKDPSCPKD SGLTSVGTDT FRL 3703 
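Each sequence row above consists of blocks of ten residues followed by a running residue count. A minimal sketch of turning such rows into a plain sequence string, assuming that layout (the function names are hypothetical, and only the first two rows are reproduced):

```python
def parse_sequence_line(line):
    """Return (residues, running_count) for one row of the sequence block."""
    *blocks, count = line.split()
    return "".join(blocks), int(count)


def join_sequence(lines):
    """Concatenate rows, checking each running count against the length so far."""
    sequence = ""
    for line in lines:
        residues, count = parse_sequence_line(line)
        sequence += residues
        if len(sequence) != count:
            raise ValueError(f"expected {count} residues, got {len(sequence)}")
    return sequence


# First two rows copied from the Protein Sequence section of this record.
ROWS = [
    "MEGCDSPVVS GKDNGCGIPQ HQQWTELNST HLPDKPSSME QSTGESHGPL DSLRAPFNER 60",
    "LAESTASAGP PSEPASKEVT CNECSASFAS LQTYMEHHCP SARPPPPLRE ESASDTGEEG 120",
]
sequence = join_sequence(ROWS)
print(len(sequence))
```

Applied to all rows, the final running count should equal the stated protein length of 3703 AA.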
Gene Ontology
 GO:0005739; C:mitochondrion; IEA:Compara.
 GO:0005667; C:transcription factor complex; IDA:MGI.
 GO:0003705; F:RNA polymerase II distal enhancer sequence-specific DNA binding transcription factor activity; TAS:UniProtKB.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0044212; F:transcription regulatory region DNA binding; IDA:UniProtKB.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0007517; P:muscle organ development; IEA:UniProtKB-KW.
 GO:0045662; P:negative regulation of myoblast differentiation; IDA:UniProtKB.
 GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; IGI:MGI.
 GO:0045892; P:negative regulation of transcription, DNA-dependent; IDA:UniProtKB.
 GO:0045663; P:positive regulation of myoblast differentiation; IDA:UniProtKB. 
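The Gene Ontology lines above pack three semicolon-separated fields: the GO identifier, an aspect letter with the term name, and an evidence code with its source. A minimal sketch of splitting them, assuming that layout holds for every line (the function name `parse_go_line` and the `ASPECTS` mapping are hypothetical helpers):

```python
# Aspect letters as used in the lines above.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Split one GO line into identifier, aspect, term, evidence and source."""
    go_id, term, evidence = [
        part.strip() for part in line.strip().rstrip(".").split(";")
    ]
    aspect, name = term.split(":", 1)
    code, source = evidence.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect],
        "term": name,
        "evidence": code,
        "source": source,
    }


# Two lines copied from the Gene Ontology section of this record.
example = parse_go_line(" GO:0008270; F:zinc ion binding; IEA:InterPro.")
print(example)
print(parse_go_line(" GO:0045892; P:negative regulation of transcription, DNA-dependent; IDA:UniProtKB."))
```

Splitting on the semicolon rather than the comma keeps term names such as "negative regulation of transcription, DNA-dependent" intact.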
Interpro
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR007087; Znf_C2H2.
 IPR015880; Znf_C2H2-like.
 IPR013087; Znf_C2H2/integrase_DNA-bd.
 IPR003604; Znf_U1. 
Pfam
 PF00046; Homeobox
 PF00096; zf-C2H2 
SMART
 SM00389; HOX
 SM00355; ZnF_C2H2
 SM00451; ZnF_U1 
PROSITE
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2
 PS00028; ZINC_FINGER_C2H2_1
 PS50157; ZINC_FINGER_C2H2_2 
PRINTS