CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-037427
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 General transcription factor II-I repeat domain-containing protein 1 
Protein Synonyms/Alias
  
Gene Name
 GTF2IRD1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
130       SDVYLLRKMVEEVFD  ubiquitination  [1]
579       LFNTRYAKAIGISEP  ubiquitination  [2]
814       RPVLVPYKLIRDSPD  ubiquitination  [3]
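Each peptide listed above is 15 residues long with a lysine in the middle, which suggests a window of 7 flanking residues on either side of the modified site. The sketch below checks that reading for the three rows and derives the 1-based span each peptide would cover in the protein. The flank width of 7 is an inference from these rows, not a documented CPLM rule, and `SITES`, `FLANK`, and `window_bounds` are illustrative names.

```python
# Rows transcribed from the table above: (position, peptide, type, reference).
SITES = [
    (130, "SDVYLLRKMVEEVFD", "ubiquitination", 1),
    (579, "LFNTRYAKAIGISEP", "ubiquitination", 2),
    (814, "RPVLVPYKLIRDSPD", "ubiquitination", 3),
]

# Assumption: 7 residues on each side of the modified lysine (inferred from
# the three 15-residue peptides, not taken from CPLM documentation).
FLANK = 7


def window_bounds(position, peptide):
    """Return the 1-based (start, end) span of the peptide in the protein."""
    if len(peptide) != 2 * FLANK + 1:
        raise ValueError(f"expected {2 * FLANK + 1} residues, got {len(peptide)}")
    if peptide[FLANK] != "K":
        raise ValueError(f"central residue is {peptide[FLANK]!r}, not lysine")
    return position - FLANK, position + FLANK


for position, peptide, mod_type, ref in SITES:
    start, end = window_bounds(position, peptide)
    print(f"K{position}: residues {start}-{end}, {mod_type} [{ref}]")
```

Sites close to either end of a protein would not have a full window, so a general parser would need to handle shorter or padded peptides; that case does not arise in this entry.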
Reference
 [1] A data set of human endogenous protein ubiquitination sites.
 Shi Y, Chan DW, Jung SY, Malovannaya A, Wang Y, Qin J.
 Mol Cell Proteomics. 2011 May;10(5):M110.002089. [PMID: 20972266]
 [2] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [3] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 960 AA 
Protein Sequence
MALLGKRCDV PTNGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE 60
SAFVVGTEKG RMFLNARKEL QSDFLRFCRG PPWKDPEAEH PKKVQRGEGG GRSLPRSSLE 120
HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE 180
YDPKALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGV SLIPKGSRDC GLHGQAPKVP 240
PQDLPPTATS SSMASFLYST ALPNHAIREL KQEAPSCPLA PSDLGLSRPM PEPKATGAQD 300
FSDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN 360
SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH 420
FIIKRMFDER IFTGNKFTKD TTKLEPASPP EDTSAEVSRA TVLDLAGNAR SDKGSMSEDC 480
GPGTSGELGG LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTD 540
KGLSEDARPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMHPEE 600
LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP IMDSQERDSG 660
DPLVDESLKR QGFQENYDAR LSRIDIANTL REQVQDLFNK KYGEALGIKY PVQVPYKRIK 720
SNPGSVIIEG LPPGIPFRKP CTFGSQNLER ILAVADKIKF TVTRPFQGLI PKPDEDDANR 780
LGEKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY 840
DIHRLEKILK AREHVRMVII NQLQPFAEIC NDAKVPAKDS SIPKRKRKRV SEGNSVSSSS 900
SSSSSSSSNP DSVASANQIS LVVKLHRFGL RHSSLWPSPL CLAGPSTLGC GPRGRGGTGL 960 
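The sequence block above is laid out in groups of 10 residues, with each line ending in the cumulative residue count. As a minimal sketch of how that layout can be used to look up a residue by position, the code below parses the three lines that contain the reported modification sites and confirms that positions 130, 579, and 814 are lysines. The helper names `parse_line` and `residue_at` are illustrative, and only three of the sixteen lines are included to keep the example short.

```python
# Three lines copied from the Protein Sequence block (those ending at
# 180, 600 and 840), which contain the three reported sites.
SEQUENCE_LINES = [
    "HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE 180",
    "KGLSEDARPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMHPEE 600",
    "LGEKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY 840",
]


def parse_line(line):
    """Split one numbered line into (start, end, residues), 1-based inclusive."""
    *groups, counter = line.split()
    residues = "".join(groups)
    end = int(counter)
    start = end - len(residues) + 1
    return start, end, residues


def residue_at(position, lines=SEQUENCE_LINES):
    """Return the residue at a 1-based protein position, if a line covers it."""
    for line in lines:
        start, end, residues = parse_line(line)
        if start <= position <= end:
            return residues[position - start]
    raise KeyError(f"position {position} is not covered by the given lines")


for position in (130, 579, 814):
    print(position, residue_at(position))
```

Deriving each line's start from its trailing counter means the lookup does not depend on the lines being contiguous, so the same helpers would work on the full 16-line block.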
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005634; C:nucleus; IDA:HPA. 
Interpro
 IPR004212; GTF2I.
 IPR016659; TF_II-I. 
Pfam
 PF02946; GTF2I 
SMART
  
PROSITE
 PS51139; GTF2I 
PRINTS