CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-022951
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 General transcription factor II-I repeat domain-containing protein 1 
Protein Synonyms/Alias
 GTF2I repeat domain-containing protein 1; General transcription factor III; MusTRD1/BEN; Muscle TFII-I repeat domain-containing protein 1; Slow-muscle-fiber enhancer-binding protein; USE B1-binding protein; Williams-Beuren syndrome chromosomal region 11 protein; Williams-Beuren syndrome chromosomal region 12 protein 
Gene Name
 GTF2IRD1 
Gene Synonyms/Alias
 CREAM1; GTF3; MUSTRD1; RBAP2; WBSCR11; WBSCR12 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
611LFNTRYAKAIGISEPubiquitination[1]
846RPVLVPYKLIRDSPDubiquitination[2]
Reference
 [1] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
 May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, by preventing its nuclear residency, or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8 (By similarity). 
Sequence Annotation
 REPEAT 119 213 GTF2I-like 1.
 REPEAT 342 436 GTF2I-like 2.
 REPEAT 556 650 GTF2I-like 3.
 REPEAT 696 790 GTF2I-like 4.
 REPEAT 793 887 GTF2I-like 5.
 MOTIF 898 905 Nuclear localization signal (Potential).
 MOD_RES 448 448 Phosphoserine.  
Keyword
 3D-structure; Alternative splicing; Complete proteome; Developmental protein; DNA-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Transcription; Transcription regulation; Williams-Beuren syndrome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 959 AA 
Protein Sequence
MALLGKRCDV PTNGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE 60
SAFVVGTEKG RMFLNARKEL QSDFLRFCLS AAQHRAATSQ LEGRVVRRVL TVASRALCPT 120
GGPPWKDPEA EHPKKVQRGE GGGRSLPRSS LEHGSDVYLL RKMVEEVFDV LYSEALGRAS 180
VVPLPYERLL REPGLLAVQG LPEGLAFRRP AEYDPKALMA ILEHSHRIRF KLKRPLEDGG 240
RDSKALVELN GVSLIPKGSR DCGLHGQAPK VPPQDLPPTA TSSSMASFLY STALPNHAIR 300
ELKQEAPSCP LAPSDLGLSR PMPEPKATGA QDFSDCCGQK PTGPGGPLIQ NVHASKRILF 360
SIVHDKSEKW DAFIKETEDI NTLRECVQIL FNSRYAEALG LDHMVPVPYR KIACDPEAVE 420
IVGIPDKIPF KRPCTYGVPK LKRILEERHS IHFIIKRMFD ERIFTGNKFT KDTTKLEPAS 480
PPEDTSAEVS RATVLDLAGN ARSDKGSMSE DCGPGTSGEL GGLRPIKIEP EDLDIIQVTV 540
PDPSPTSEEM TDSMPGHLPS EDSGYGMEML TDKGLSEDAR PEERPVEDSH GDVIRPLRKQ 600
VELLFNTRYA KAIGISEPVK VPYSKFLMHP EELFVVGLPE GISLRRPNCF GIAKLRKILE 660
ASNSIQFVIK RPELLTEGVK EPIMDSQERD SGDPLVDESL KRQGFQENYD ARLSRIDIAN 720
TLREQVQDLF NKKYGEALGI KYPVQVPYKR IKSNPGSVII EGLPPGIPFR KPCTFGSQNL 780
ERILAVADKI KFTVTRPFQG LIPKPDEDDA NRLGEKVILR EQVKELFNEK YGEALGLNRP 840
VLVPYKLIRD SPDAVEVTGL PDDIPFRNPN TYDIHRLEKI LKAREHVRMV IINQLQPFAE 900
ICNDAKVPAK DSSIPKRKRK RVSEGNSVSS SSSSSSSSSS NPDSVASANQ ISLVQWPMYM 960
VDYAGLNVQL PGPLNY 976 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005634; C:nucleus; NAS:UniProtKB.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0003705; F:RNA polymerase II distal enhancer sequence-specific DNA binding transcription factor activity; NAS:UniProtKB.
 GO:0009790; P:embryo development; IEA:Compara. 
Interpro
 IPR004212; GTF2I.
 IPR016659; TF_II-I. 
Pfam
 PF02946; GTF2I 
SMART
  
PROSITE
 PS51139; GTF2I 
PRINTS