CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-022950
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 General transcription factor II-I repeat domain-containing protein 1 
Protein Synonyms/Alias
 GTF2I repeat domain-containing protein 1; General transcription factor III; MusTRD1/BEN; Muscle TFII-I repeat domain-containing protein 1; Slow-muscle-fiber enhancer-binding protein; USE B1-binding protein; Williams-Beuren syndrome chromosomal region 11 protein; Williams-Beuren syndrome chromosomal region 12 protein 
Gene Name
 GTF2IRD1 
Gene Synonyms/Alias
 CREAM1; GTF3; MUSTRD1; RBAP2; WBSCR11; WBSCR12 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
579LFNTRYAKAIGISEPubiquitination[1]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473
Functional Description
 May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, by preventing its nuclear residency, or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8 (By similarity). 
Sequence Annotation
 REPEAT 119 213 GTF2I-like 1.
 REPEAT 342 436 GTF2I-like 2.
 REPEAT 556 650 GTF2I-like 3.
 REPEAT 696 790 GTF2I-like 4.
 REPEAT 793 887 GTF2I-like 5.
 MOTIF 898 905 Nuclear localization signal (Potential).
 MOD_RES 448 448 Phosphoserine.  
Keyword
 3D-structure; Alternative splicing; Complete proteome; Developmental protein; DNA-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Transcription; Transcription regulation; Williams-Beuren syndrome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 959 AA 
Protein Sequence
MALLGKRCDV PTNGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE 60
SAFVVGTEKG RMFLNARKEL QSDFLRFCRG PPWKDPEAEH PKKVQRGEGG GRSLPRSSLE 120
HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE 180
YDPKALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGV SLIPKGSRDC GLHGQAPKVP 240
PQDLPPTATS SSMASFLYST ALPNHAIREL KQEAPSCPLA PSDLGLSRPM PEPKATGAQD 300
FSDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN 360
SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH 420
FIIKRMFDER IFTGNKFTKD TTKLEPASPP EDTSAEVSRA TVLDLAGNAR SDKGSMSEDC 480
GPGTSGELGG LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTD 540
KGLSEDARPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMHPEE 600
LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP IMDSQGTASS 660
LGFSPPALPP ERDSGDPLVD ESLKRQGFQE NYDARLSRID IANTLREQVQ DLFNKKYGEA 720
LGIKYPVQVP YKRIKSNPGS VIIEGLPPGI PFRKPCTFGS QNLERILAVA DKIKFTVTRP 780
FQGLIPKPDE DDANRLGEKV ILREQVKELF NEKYGEALGL NRPVLVPYKL IRDSPDAVEV 840
TGLPDDIPFR NPNTYDIHRL EKILKAREHV RMVIINQLQP FAEICNDAKV PAKDSSIPKR 900
KRKRVSEGNS VSSSSSSSSS SSSNPDSVAS ANQISLVQWP MYMVDYAGLN VQLPGPLNY 959 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005634; C:nucleus; NAS:UniProtKB.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0003705; F:RNA polymerase II distal enhancer sequence-specific DNA binding transcription factor activity; NAS:UniProtKB.
 GO:0009790; P:embryo development; IEA:Compara. 
Interpro
 IPR004212; GTF2I.
 IPR016659; TF_II-I. 
Pfam
 PF02946; GTF2I 
SMART
  
PROSITE
 PS51139; GTF2I 
PRINTS