CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-000916
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Zinc finger E-box-binding homeobox 2 
Protein Synonyms/Alias
 Smad-interacting protein 1; SMADIP1; Zinc finger homeobox protein 1b 
Gene Name
 ZEB2 
Gene Synonyms/Alias
 KIAA0569; SIP1; ZFHX1B; ZFX1B; HRIHFB2411 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
372       AITQLRNKLENGKPL  acetylation     [1]
377       RNKLENGKPLSMSEQ  acetylation     [1]
391       QTGLLKIKTEPLDFN  sumoylation     [2]
632       AGVFVDNKALLLSSV  ubiquitination  [3, 4]
866       PLNLTFIKKEFSNSN  sumoylation     [2]
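
In the rows above, each peptide appears to be a 15-residue window centred on the modified lysine, that is, 7 residues on either side of the listed position. The following is a minimal Python sketch that checks this reading against one line of the Protein Sequence block of this record. The helper names (`parse_row`, `parse_seq_line`, `window`) and the 7-residue flank are illustrative assumptions, not part of CPLM.

```python
import re

# One 60-residue line of the Protein Sequence block, copied from this record.
# The trailing number is the position of the last residue on the line.
SEQ_LINE = "PTNSAITQLR NKLENGKPLS MSEQTGLLKI KTEPLDFNDY KVLMATHGFS GTSPFMNGGL 420"

# Rows of the Lysine Modification table; whitespace between columns is optional.
ROWS = [
    "372 AITQLRNKLENGKPL acetylation [1]",
    "377RNKLENGKPLSMSEQacetylation[1]",
    "391 QTGLLKIKTEPLDFN sumoylation [2]",
]

# position, peptide (upper case), modification type (lower case), reference list
ROW_RE = re.compile(r"^(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[([\d,\s]+)\]$")


def parse_row(row):
    """Split one table row into (position, peptide, type, [reference numbers])."""
    m = ROW_RE.match(row.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {row!r}")
    pos, peptide, mod_type, refs = m.groups()
    return int(pos), peptide, mod_type, [int(r) for r in refs.split(",")]


def parse_seq_line(line):
    """Return (1-based position of the first residue, residues without spaces)."""
    *chunks, end = line.split()
    residues = "".join(chunks)
    return int(end) - len(residues) + 1, residues


def window(start, residues, pos, flank=7):
    """Return the residues from pos - flank to pos + flank (1-based positions)."""
    i = pos - start  # 0-based index into this fragment
    if i - flank < 0 or i + flank >= len(residues):
        raise ValueError(f"window around {pos} leaves the fragment")
    return residues[i - flank : i + flank + 1]


start, residues = parse_seq_line(SEQ_LINE)
for row in ROWS:
    pos, peptide, mod_type, refs = parse_row(row)
    ok = residues[pos - start] == "K" and window(start, residues, pos) == peptide
    print(pos, mod_type, refs, "consistent" if ok else "MISMATCH")
```

Only the sites at positions 372, 377 and 391 fall inside the fragment used here. Checking positions 632 and 866 the same way would need the corresponding sequence lines.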
Reference
 [1] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
 [2] Pc2-mediated sumoylation of Smad-interacting protein 1 attenuates transcriptional repression of E-cadherin.
 Long J, Zuo D, Park M.
 J Biol Chem. 2005 Oct 21;280(42):35477-89. [PMID: 16061479]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [4] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
Functional Description
 Transcriptional inhibitor that binds to DNA sequence 5'-CACCT-3' in different promoters. Represses transcription of E-cadherin. 
Sequence Annotation
 ZN_FING 211 234 C2H2-type 1.
 ZN_FING 241 263 C2H2-type 2.
 ZN_FING 282 304 C2H2-type 3.
 ZN_FING 310 334 C2H2-type 4; atypical.
 DNA_BIND 644 703 Homeobox; atypical.
 ZN_FING 999 1021 C2H2-type 5.
 ZN_FING 1027 1049 C2H2-type 6.
 ZN_FING 1055 1076 C2H2-type 7; atypical.
 REGION 437 487 SMAD-MH2 binding domain (By similarity).
 MOD_RES 36 36 Phosphoserine (By similarity).
 MOD_RES 38 38 Phosphothreonine (By similarity).
 MOD_RES 377 377 N6-acetyllysine.
 MOD_RES 778 778 Phosphoserine (By similarity).
 MOD_RES 780 780 Phosphoserine (By similarity).
 MOD_RES 782 782 Phosphothreonine (By similarity).
 MOD_RES 784 784 Phosphoserine (By similarity).
 MOD_RES 1167 1167 Phosphoserine (By similarity).
 CROSSLNK 391 391 Glycyl lysine isopeptide (Lys-Gly)
 CROSSLNK 866 866 Glycyl lysine isopeptide (Lys-Gly)  
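
Each annotation line above follows the pattern feature key, start position, end position, free-text note. The following is a small Python sketch, assuming that layout holds for every line, that parses such lines and looks up which features cover a given residue. `Feature`, `parse_feature` and `features_at` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str    # feature key, e.g. ZN_FING, MOD_RES, CROSSLNK
    start: int  # first residue covered (1-based, inclusive)
    end: int    # last residue covered (1-based, inclusive)
    note: str   # free-text description, trailing period removed


def parse_feature(line):
    """Parse one annotation line of the form 'KEY START END note'."""
    key, start, end, note = line.strip().split(None, 3)
    return Feature(key, int(start), int(end), note.rstrip("."))


def features_at(features, pos):
    """Return every feature whose range includes the given position."""
    return [f for f in features if f.start <= pos <= f.end]


# A few lines copied from the Sequence Annotation block of this record.
LINES = [
    " ZN_FING 310 334 C2H2-type 4; atypical.",
    " DNA_BIND 644 703 Homeobox; atypical.",
    " MOD_RES 377 377 N6-acetyllysine.",
    " CROSSLNK 391 391 Glycyl lysine isopeptide (Lys-Gly)",
]

features = [parse_feature(line) for line in LINES]
for pos in (377, 391, 632):
    print(pos, [f.key for f in features_at(features, pos)])
```

With only these four lines loaded, position 632 (the ubiquitination site in the modification table) matches no feature, while 377 and 391 match the MOD_RES and CROSSLNK entries respectively.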
Keyword
 3D-structure; Acetylation; Alternative splicing; Complete proteome; Disease mutation; DNA-binding; Epilepsy; Hirschsprung disease; Homeobox; Isopeptide bond; Mental retardation; Metal-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Repressor; Transcription; Transcription regulation; Ubl conjugation; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1214 AA 
Protein Sequence
MKQPIMADGP RCKRRKQANP RRKNVVNYDN VVDTGSETDE EDKLHIAEDD GIANPLDQET 60
SPASVPNHES SPHVSQALLP REEEEDEIRE GGVEHPWHNN EILQASVDGP EEMKEDYDTM 120
GPEATIQTAI NNGTVKNANC TSDFEEYFAK RKLEERDGHA VSIEEYLQRS DTAIIYPEAP 180
EELSRLGTPE ANGQEENDLP PGTPDAFAQL LTCPYCDRGY KRLTSLKEHI KYRHEKNEEN 240
FSCPLCSYTF AYRTQLERHM VTHKPGTDQH QMLTQGAGNR KFKCTECGKA FKYKHHLKEH 300
LRIHSGEKPY ECPNCKKRFS HSGSYSSHIS SKKCIGLISV NGRMRNNIKT GSSPNSVSSS 360
PTNSAITQLR NKLENGKPLS MSEQTGLLKI KTEPLDFNDY KVLMATHGFS GTSPFMNGGL 420
GATSPLGVHP SAQSPMQHLG VGMEAPLLGF PTMNSNLSEV QKVLQIVDNT VSRQKMDCKA 480
EEISKLKGYH MKDPCSQPEE QGVTSPNIPP VGLPVVSHNG ATKSIIDYTL EKVNEAKACL 540
QSLTTDSRRQ ISNIKKEKLR TLIDLVTDDK MIENHNISTP FSCQFCKESF PGPIPLHQHE 600
RYLCKMNEEI KAVLQPHENI VPNKAGVFVD NKALLLSSVL SEKGMTSPIN PYKDHMSVLK 660
AYYAMNMEPN SDELLKISIA VGLPQEFVKE WFEQRKVYQY SNSRSPSLER SSKPLAPNSN 720
PPTKDSLLPR SPVKPMDSIT SPSIAELHNS VTNCDPPLRL TKPSHFTNIK PVEKLDHSRS 780
NTPSPLNLSS TSSKNSHSSS YTPNSFSSEE LQAEPLDLSL PKQMKEPKSI IATKNKTKAS 840
SISLDHNSVS SSSENSDEPL NLTFIKKEFS NSNNLDNKST NPVFSMNPFS AKPLYTALPP 900
QSAFPPATFM PPVQTSIPGL RPYPGLDQMS FLPHMAYTYP TGAATFADMQ QRRKYQRKQG 960
FQGELLDGAQ DYMSGLDDMT DSDSCLSRKK IKKTESGMYA CDLCDKTFQK SSSLLRHKYE 1020
HTGKRPHQCQ ICKKAFKHKH HLIEHSRLHS GEKPYQCDKC GKRFSHSGSY SQHMNHRYSY 1080
CKREAEEREA AEREAREKGH LEPTELLMNR AYLQSITPQG YSDSEERESM PRDGESEKEH 1140
EKEGEDGYGK LGRQDGDEEF EEEEEESENK SMDTDPETIR DEEETGDHSM DDSSEDGKME 1200
TKSDHEEDNM EDGM 1214 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005730; C:nucleolus; IDA:HPA.
 GO:0019208; F:phosphatase regulator activity; NAS:UniProtKB.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0046332; F:SMAD binding; NAS:UniProtKB.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0021846; P:cell proliferation in forebrain; IEA:Compara.
 GO:0021766; P:hippocampus development; IEA:Compara.
 GO:0007399; P:nervous system development; NAS:UniProtKB.
 GO:0001755; P:neural crest cell migration; IEA:Compara.
 GO:0001843; P:neural tube closure; IEA:Compara.
 GO:0043507; P:positive regulation of JUN kinase activity; IEA:Compara.
 GO:0030177; P:positive regulation of Wnt receptor signaling pathway; IEA:Compara.
 GO:0001756; P:somitogenesis; IEA:Compara.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
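
The Gene Ontology lines above share one layout: GO identifier, then an aspect letter with the term name, then an evidence code with its source, separated by "; ". The following is a minimal Python sketch that splits such a line into fields, assuming the usual reading of the aspect letters (C, F, P as cellular component, molecular function, biological process). `GOTerm` and `parse_go` are illustrative names only.

```python
from typing import NamedTuple

# Aspect letters as used in the lines above.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GOTerm(NamedTuple):
    go_id: str     # e.g. GO:0005737
    aspect: str    # expanded aspect name
    term: str      # term name; may itself contain commas
    evidence: str  # evidence code, e.g. IDA, IEA, NAS
    source: str    # database or resource that assigned the annotation


def parse_go(line):
    """Parse one line of the form 'GO:nnnnnnn; A:term name; CODE:Source.'"""
    go_id, aspect_term, evidence = line.strip().rstrip(".").split("; ")
    aspect, term = aspect_term.split(":", 1)
    code, source = evidence.split(":", 1)
    return GOTerm(go_id, ASPECTS[aspect], term, code, source)


# Two lines copied from the Gene Ontology block of this record.
for line in (
    " GO:0005737; C:cytoplasm; IDA:HPA.",
    " GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. ",
):
    print(parse_go(line))
```

Splitting on "; " rather than on "," keeps term names such as "transcription, DNA-dependent" intact.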
Interpro
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR007087; Znf_C2H2.
 IPR015880; Znf_C2H2-like.
 IPR013087; Znf_C2H2/integrase_DNA-bd. 
Pfam
 PF00096; zf-C2H2 
SMART
 SM00389; HOX
 SM00355; ZnF_C2H2 
PROSITE
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2
 PS00028; ZINC_FINGER_C2H2_1
 PS50157; ZINC_FINGER_C2H2_2 
PRINTS