CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-025398 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Putative uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 Gtf3c1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
218       YHRKILNKNGLITMQ  ubiquitination  [1]
724       SSTANRVKVPPAPAP  ubiquitination  [1]
1123      SSFYAHLKRNWVWTS  ubiquitination  [1]
1670      VVNSCQVKFRLRNTP  ubiquitination  [1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
  
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2101 AA 
Protein Sequence
MDALESLLDE VALEGLDGLC LPALWSRLES RSPAFPLPLE PYTQEFLWRA LATHPGISFY 60
EEPRERPDLQ LQDRYEEIDL ETGILESRRD PVTLEDVYPI HMILENKDGI QGSCRYFKER 120
KDITSSIRSK CLQPRCTMVE AFSRWGKKLI IVASQDMRYR ALIGLEGDPD LKLPDFSYCI 180
LERLGRSRWQ GELQRDLHTT AFKVDAGKLH YHRKILNKNG LITMQSHVIR LPTGAQQHSI 240
LLLLNRFHVD RRSKYDILME KLSMMLSTRS NQIETLGKLR EELGLCERTF KRLYQYMLNA 300
GLAKVVSLPL QEIHPECGPC KTKKGTDVMV RCLKLLKEFK RKMEDDHDDD DDEEVISKGV 360
PPVDIVFERD MLTQTYELIE RRGTKGISQA EIRVAMNVGK LEARMLCRLL QRFKVVKGFM 420
EDEGRQRTTK YISCVFAEES DLSRQYAREK ARGELLTTVS LASVQDESLM PEGEEAFLSD 480
SESEEESSCS GGKRRGRGSR GHARASGDAG SGSRPHHSTP AKGGWKVLNL HPLKKPKAAA 540
EERSRRSSAC RDGLDTSSSS ELNAPFDPHS MDSHSGDIAV IEEVRLDNPK EGGGSQKGGR 600
HGSSQDKPHK TYRLLKRRNL IIEAATNLRL IESLFTIQKM IMDQEKQEGV STKCCKKSII 660
RLVRNLSEEG LLRLYRTTVI QDGIKKKVDL VVHPSMDQND PLVRSAIEQV RFRISNSSTA 720
NRVKVPPAPA PQEEAEEENQ EPEVPSRSAN SDPNTSSKPE STRVKKTDEK MGITPLKNYK 780
PVIVPGLGRS IGFLPKMPRL KAMHLFLWYL VYGHPAGHTG EQPALHSERK TGKQESSRPG 840
AQPSSGDDWD TSEAKNNTES SSWESEMELS TEIVYVDEIS WMRYVPPIPI HRDFGFGWAL 900
VSDILLCLPL SIFVQLVQVS YKVDNLEDFL NDPLKKHTLI RFLPRHIRQQ LLYKRRYIFS 960
VVENLQRLCY MGLLQFGPTE KFQDKDQVFV FLKKNAVIVD TTICDPHYNL AHSSRPFERR 1020
LYVLDSMQDV ECYWFDLQCI CLNTPLGVVR CPCAQKICPD PGSDPEGSLR KEQESAMDKH 1080
NLERKCAMLE YTTGSREVVD EGLVPGDGLG AAGLDSSFYA HLKRNWVWTS YIISKARKNN 1140
TSENGLTGRL QTFLSKRPMP LGSGGSGRLP LWSEGRADAE LCADKEEQFE LDREPTPGRN 1200
RKVRGGKSQK RKRLKKEPIR KTKRRRRGEH PEAKSKKLRY QDEADQNALR MMTRLRVSWS 1260
MQEDGLLMLC RIASNVLNTK VKGPFVTWQV VRDILHATFE ESLDKTSHSV GRRARYIVKN 1320
PQAFMNYKVC LAEVYQDKAL VGDFMSRKGN YEDPKVCAKE FKEFVEKLKE KFSSGLRNPN 1380
LEIPNTLQEL FAKYRVLAIG DEKDRVRKED ELNSVEDIHF LVLQNLIQST LSLSNSQSNS 1440
CQSFQIFRLY REFREPVLVR AFMECQKRSL VNRRRVSHSQ GPKKNRAVPF VPMSYQLSQS 1500
YYKLFTWRFP TTICTESFQF YDRLRTNGML DQPDHFSFKD LDSSDPSNDL VAFSLDSPGG 1560
HCVTALALFS LGLLSVDVRI PEQIVVVDSS MVESEVMKSL GKDGGLDDDE EEEDLDEGSG 1620
TKRQGVEVKA HQASHTKYLL MRGYYTVPGM VSTRNLNPND SIVVNSCQVK FRLRNTPAST 1680
QLGPTGFTAT PLEELQAGLS CLPASFTSLV DPQLRTHCPE EFAHQMAQSG YSPEDVAASL 1740
EILQAVAAAD CFGIDKEKLS RQFSALEKIA DRRTRTFLDY IQDLLEQEQV MEVGGNTVRL 1800
VTMASAQPWL LHSMRLRDME VDTKASGDDS QSRLPEGPSI EDHTSEGAAV PPVSSHSTKK 1860
RPHCPETDAE EATRLPAKKP TLQDVRVAAS PRPGAEEQAE AQAPAQLAAP EDADAGGPRQ 1920
ENQENVGVSG LEQLGCEFQL PEGSEDPRGL TESNMAQAAW ESGCERVCFV GRPWRGVDGH 1980
LNMPVCKGML EAVLYHIMSR PGVPESCLLQ YYQGVLQPVA VLELLRGLES LGCIQKRTLR 2040
KPASVSLFSR PVVEGLGQAS EAEALSCHES TVTFYEPTLD CTIRLGRVFP HDINWNKWIH 2100
L 2101 
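
In this record, each peptide listed under Lysine Modification is a 15-residue window of the protein sequence centered on the modified lysine, with 7 residues on either side. The sketch below is one possible way to check a listed site against the sequence block. It is not part of CPLM: the function names `parse_sequence_block`, `site_window`, and `check_site` are hypothetical, and only the first four lines of the sequence (residues 1-240) are embedded to keep the example short.

```python
import re

# First four lines of the Protein Sequence block (residues 1-240),
# copied verbatim, including the spacing and the trailing running count.
SEQUENCE_BLOCK = """\
MDALESLLDE VALEGLDGLC LPALWSRLES RSPAFPLPLE PYTQEFLWRA LATHPGISFY 60
EEPRERPDLQ LQDRYEEIDL ETGILESRRD PVTLEDVYPI HMILENKDGI QGSCRYFKER 120
KDITSSIRSK CLQPRCTMVE AFSRWGKKLI IVASQDMRYR ALIGLEGDPD LKLPDFSYCI 180
LERLGRSRWQ GELQRDLHTT AFKVDAGKLH YHRKILNKNG LITMQSHVIR LPTGAQQHSI 240
"""


def parse_sequence_block(block: str) -> str:
    """Drop spaces, newlines, and running counts, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position.

    The window is clipped at the sequence ends, so it can be shorter
    than 2 * flank + 1 near either terminus.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    start = max(0, position - 1 - flank)
    end = min(len(sequence), position + flank)
    return sequence[start:end]


def check_site(sequence: str, position: int, peptide: str, residue: str = "K") -> bool:
    """True if `position` holds `residue` and its window equals `peptide`."""
    return sequence[position - 1] == residue and site_window(sequence, position) == peptide


sequence = parse_sequence_block(SEQUENCE_BLOCK)
print(site_window(sequence, 218))
print(check_site(sequence, 218, "YHRKILNKNGLITMQ"))
```

Checking the sites at positions 724, 1123, and 1670 in the same way requires passing the full 2101-residue sequence block to `parse_sequence_block`.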
Gene Ontology
 GO:0030529; C:ribonucleoprotein complex; IDA:MGI. 
Interpro
 IPR007309; TFIIIC_Bblock-bd. 
Pfam
 PF04182; B-block_TFIIIC 
SMART
  
PROSITE
  
PRINTS