CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-022404
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 UDP-glucose:glycoprotein glucosyltransferase 1 
Protein Synonyms/Alias
 UGT1; hUGT1; UDP--Glc:glycoprotein glucosyltransferase; UDP-glucose ceramide glucosyltransferase-like 1 
Gene Name
 UGGT1 
Gene Synonyms/Alias
 GT; UGCGL1; UGGT; UGT1; UGTR 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
356LSQNFPTKARAITKTubiquitination[1]
378TEVEENQKYFKGTLGubiquitination[1]
1055TSDNSFAKGPIAKFLubiquitination[1]
1287KNTKTPVKFWFLKNYubiquitination[2, 3]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [3] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
 Recognizes glycoproteins with minor folding defects. Reglucosylates single N-glycans near the misfolded part of the protein, thus providing quality control for protein folding in the endoplasmic reticulum. Reglucosylated proteins are recognized by calreticulin for recycling to the endoplasmic reticulum and refolding or degradation. 
Sequence Annotation
 REGION 1244 1555 Glucosyltransferase (By similarity).
 MOTIF 1552 1555 Prevents secretion from ER (Potential).
 MOD_RES 1277 1277 Phosphoserine.
 CARBOHYD 536 536 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1228 1228 N-linked (GlcNAc...) (Potential).  
Keyword
 Alternative splicing; Complete proteome; Endoplasmic reticulum; Glycoprotein; Glycosyltransferase; Phosphoprotein; Reference proteome; Signal; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1555 AA 
Protein Sequence
MGCKGDASGA CAAGALPVTG VCYKMGVLVV LTVLWLFSSV KADSKAITTS LTTKWFSTPL 60
LLEASEFLAE DSQEKFWNFV EASQNIGSSD HDGTDYSYYH AILEAAFQFL SPLQQNLFKF 120
CLSLRSYSAT IQAFQQIAAD EPPPEGCNSF FSVHGKKTCE SDTLEALLLT ASERPKPLLF 180
KGDHRYPSSN PESPVVIFYS EIGSEEFSNF HRQLISKSNA GKINYVFRHY IFNPRKEPVY 240
LSGYGVELAI KSTEYKAKDD TQVKGTEVNT TVIGENDPID EVQGFLFGKL RDLHPDLEGQ 300
LKELRKHLVE STNEMAPLKV WQLQDLSFQT AARILASPVE LALVVMKDLS QNFPTKARAI 360
TKTAVSSELR TEVEENQKYF KGTLGLQPGD SALFINGLHM DLDTQDIFSL FDVLRNEARV 420
MEGLHRLGIE GLSLHNVLKL NIQPSEADYA VDIRSPAISW VNNLEVDSRY NSWPSSLQEL 480
LRPTFPGVIR QIRKNLHNMV FIVDPAHETT AELMNTAEMF LSNHIPLRIG FIFVVNDSED 540
VDGMQDAGVA VLRAYNYVAQ EVDDYHAFQT LTHIYNKVRT GEKVKVEHVV SVLEKKYPYV 600
EVNSILGIDS AYDRNRKEAR GYYEQTGVGP LPVVLFNGMP FEREQLDPDE LETITMHKIL 660
ETTTFFQRAV YLGELPHDQD VVEYIMNQPN VVPRINSRIL TAERDYLDLT ASNNFFVDDY 720
ARFTILDSQG KTAAVANSMN YLTKKGMSSK EIYDDSFIRP VTFWIVGDFD SPSGRQLLYD 780
AIKHQKSSNN VRISMINNPA KEISYENTQI SRAIWAALQT QTSNAAKNFI TKMAKEGAAE 840
ALAAGADIAE FSVGGMDFSL FKEVFESSKM DFILSHAVYC RDVLKLKKGQ RAVISNGRII 900
GPLEDSELFN QDDFHLLENI ILKTSGQKIK SHIQQLRVEE DVASDLVMKV DALLSAQPKG 960
DPRIEYQFFE DRHSAIKLRP KEGETYFDVV AVVDPVTREA QRLAPLLLVL AQLINMNLRV 1020
FMNCQSKLSD MPLKSFYRYV LEPEISFTSD NSFAKGPIAK FLDMPQSPLF TLNLNTPESW 1080
MVESVRTPYD LDNIYLEEVD SVVAAEYELE YLLLEGHCYD ITTGQPPRGL QFTLGTSANP 1140
VIVDTIVMAN LGYFQLKANP GAWILRLRKG RSEDIYRIYS HDGTDSPPDA DEVVIVLNNF 1200
KSKIIKVKVQ KKADMVNEDL LSDGTSENES GFWDSFKWGF TGQKTEEVKQ DKDDIINIFS 1260
VASGHLYERF LRIMMLSVLK NTKTPVKFWF LKNYLSPTFK EFIPYMANEY NFQYELVQYK 1320
WPRWLHQQTE KQRIIWGYKI LFLDVLFPLV VDKFLFVDAD QIVRTDLKEL RDFNLDGAPY 1380
GYTPFCDSRR EMDGYRFWKS GYWASHLAGR KYHISALYVV DLKKFRKIAA GDRLRGQYQG 1440
LSQDPNSLSN LDQDLPNNMI HQVPIKSLPQ EWLWCETWCD DASKKRAKTI DLCNNPMTKE 1500
PKLEAAVRIV PEWQDYDQEI KQLQIRFQKE KETGALYKEK TKEPSREGPQ KREEL 1555 
Gene Ontology
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0005793; C:endoplasmic reticulum-Golgi intermediate compartment; ISS:UniProtKB.
 GO:0003980; F:UDP-glucose:glycoprotein glucosyltransferase activity; IDA:UniProtKB.
 GO:0051082; F:unfolded protein binding; IDA:UniProtKB.
 GO:0051084; P:'de novo' posttranslational protein folding; TAS:UniProtKB.
 GO:0043687; P:post-translational protein modification; TAS:Reactome.
 GO:0018279; P:protein N-linked glycosylation via asparagine; TAS:Reactome. 
Interpro
 IPR009448; UDP-g_GGtrans. 
Pfam
 PF06427; UDP-g_GGTase 
SMART
  
PROSITE
 PS00014; ER_TARGET 
PRINTS