CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-022403
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 UDP-glucose:glycoprotein glucosyltransferase 2 
Protein Synonyms/Alias
 UGT2; hUGT2; UDP--Glc:glycoprotein glucosyltransferase 2; UDP-glucose ceramide glucosyltransferase-like 1 
Gene Name
 UGGT2 
Gene Synonyms/Alias
 UGCGL2; UGT2 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
342ISQNFPIKARSLTRIubiquitination[1]
756FDKPSGRKLLFNALKacetylation[2]
782IIYNPTSKINEENTAubiquitination[1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Regulation of cellular metabolism by protein lysine acetylation.
 Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, Guan KL.
 Science. 2010 Feb 19;327(5968):1000-4. [PMID: 20167786
Functional Description
 Recognizes glycoproteins with minor folding defects. Reglucosylates single N-glycans near the misfolded part of the protein, thus providing quality control for protein folding in the endoplasmic reticulum. Reglucosylated proteins are recognized by calreticulin for recycling to the endoplasmic reticulum and refolding or degradation (By similarity). 
Sequence Annotation
 REGION 1220 1516 Glucosyltransferase.
 MOTIF 1513 1516 Prevents secretion from ER (Potential).
 MOD_RES 1289 1289 Phosphotyrosine.
 CARBOHYD 256 256 N-linked (GlcNAc...) (Potential).
 CARBOHYD 286 286 N-linked (GlcNAc...) (Potential).
 CARBOHYD 920 920 N-linked (GlcNAc...) (Potential).
 CARBOHYD 950 950 N-linked (GlcNAc...) (Potential).  
Keyword
 Complete proteome; Endoplasmic reticulum; Glycoprotein; Glycosyltransferase; Phosphoprotein; Polymorphism; Reference proteome; Signal; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1516 AA 
Protein Sequence
MAPAKATNVV RLLLGSTALW LSQLGSGTVA ASKSVTAHLA AKWPETPLLL EASEFMAEES 60
NEKFWQFLET VQELAIYKQT ESDYSYYNLI LKKAGQFLDN LHINLLKFAF SIRAYSPAIQ 120
MFQQIAADEP PPDGCNAFVV IHKKHTCKIN EIKKLLKKAA SRTRPYLFKG DHKFPTNKEN 180
LPVVILYAEM GTRTFSAFHK VLSEKAQNEE ILYVLRHYIQ KPSSRKMYLS GYGVELAIKS 240
TEYKALDDTQ VKTVTNTTVE DETETNEVQG FLFGKLKEIY SDLRDNLTAF QKYLIESNKQ 300
MMPLKVWELQ DLSFQAASQI MSAPVYDSIK LMKDISQNFP IKARSLTRIA VNQHMREEIK 360
ENQKDLQVRF KIQPGDARLF INGLRVDMDV YDAFSILDML KLEGKMMNGL RNLGINGEDM 420
SKFLKLNSHI WEYTYVLDIR HSSIMWINDL ENDDLYITWP TSCQKLLKPV FPGSVPSIRR 480
NFHNLVLFID PAQEYTLDFI KLADVFYSHE VPLRIGFVFI LNTDDEVDGA NDAGVALWRA 540
FNYIAEEFDI SEAFISIVHM YQKVKKDQNI LTVDNVKSVL QNTFPHANIW DILGIHSKYD 600
EERKAGASFY KMTGLGPLPQ ALYNGEPFKH EEMNIKELKM AVLQRMMDAS VYLQREVFLG 660
TLNDRTNAID FLMDRNNVVP RINTLILRTN QQYLNLISTS VTADVEDFST FFFLDSQDKS 720
AVIAKNMYYL TQDDESIISA VTLWIIADFD KPSGRKLLFN ALKHMKTSVH SRLGIIYNPT 780
SKINEENTAI SRGILAAFLT QKNMFLRSFL GQLAKEEIAT AIYSGDKIKT FLIEGMDKNA 840
FEKKYNTVGV NIFRTHQLFC QDVLKLRPGE MGIVSNGRFL GPLDEDFYAE DFYLLEKITF 900
SNLGEKIKGI VENMGINANN MSDFIMKVDA LMSSVPKRAS RYDVTFLREN HSVIKTNPQE 960
NDMFFNVIAI VDPLTREAQK MAQLLVVLGK IINMKIKLFM NCRGRLSEAP LESFYRFVLE 1020
PELMSGANDV SSLGPVAKFL DIPESPLLIL NMITPEGWLV ETVHSNCDLD NIHLKDTEKT 1080
VTAEYELEYL LLEGQCFDKV TEQPPRGLQF TLGTKNKPAV VDTIVMAHHG YFQLKANPGA 1140
WILRLHQGKS EDIYQIVGHE GTDSQADLED IIVVLNSFKS KILKVKVKKE TDKIKEDILT 1200
DEDEKTKGLW DSIKSFTVSL HKENKKEKDV LNIFSVASGH LYERFLRIMM LSVLRNTKTP 1260
VKFWLLKNYL SPTFKEVIPH MAKEYGFRYE LVQYRWPRWL RQQTERQRII WGYKILFLDV 1320
LFPLAVDKII FVDADQIVRH DLKELRDFDL DGAPYGYTPF CDSRREMDGY RFWKTGYWAS 1380
HLLRRKYHIS ALYVVDLKKF RRIGAGDRLR SQYQALSQDP NSLSNLDQDL PNNMIYQVAI 1440
KSLPQDWLWC ETWCDDESKQ RAKTIDLCNN PKTKESKLKA AARIVPEWVE YDAEIRQLLD 1500
HLENKKQDTI LTHDEL 1516 
Gene Ontology
 GO:0005788; C:endoplasmic reticulum lumen; ISS:UniProtKB.
 GO:0005793; C:endoplasmic reticulum-Golgi intermediate compartment; IEA:UniProtKB-SubCell.
 GO:0003980; F:UDP-glucose:glycoprotein glucosyltransferase activity; ISS:UniProtKB.
 GO:0043687; P:post-translational protein modification; TAS:Reactome.
 GO:0006457; P:protein folding; TAS:Reactome.
 GO:0018279; P:protein N-linked glycosylation via asparagine; TAS:Reactome. 
Interpro
 IPR002495; Glyco_trans_8.
 IPR009448; UDP-g_GGtrans. 
Pfam
 PF01501; Glyco_transf_8
 PF06427; UDP-g_GGTase 
SMART
  
PROSITE
 PS00014; ER_TARGET 
PRINTS