CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID
 CPLM-015183
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 UDP-glucose:glycoprotein glucosyltransferase 1 
Protein Synonyms/Alias
 UGT1; UDP--Glc:glycoprotein glucosyltransferase; UDP-glucose ceramide glucosyltransferase-like 1 
Gene Name
 Uggt1 
Gene Synonyms/Alias
 Gt; Ugcgl1; Uggt; Ugt1; Ugtr 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position  Peptide          Type            References
 362       TKARAITKTAVSAQL  acetylation     [1]
 362       TKARAITKTAVSAQL  succinylation   [1]
 378       AEVEENQKYFKGTIG  acetylation     [2]
 972       EYQFFEDKHSAIKLK  ubiquitination  [3]
 1034      KLSDMPLKSFYRYVL  acetylation     [2]
 1309      FIPYMAKKYNFQYEL  acetylation     [2]
 1499      LCNNPMTKEPKLEAA  acetylation     [2]
 1547      ETQEGSQKHEEL***  acetylation     [1]
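Each peptide listed for a modification site appears to be a 15-residue window: the modified lysine with seven residues on either side, padded with `*` where the window runs past the C-terminus. That convention is inferred from the entries in this record, not taken from CPLM documentation. A minimal Python sketch that reproduces two of the listed peptides from the C-terminal residues of the sequence in this record follows; `site_window`, `FLANK`, `FRAGMENT` and `FRAGMENT_START` are hypothetical names used only for illustration.

```python
# Assumption: a peptide is the modified lysine plus FLANK residues on each side,
# with '*' standing in for positions beyond the end of the protein.
FLANK = 7
PROTEIN_LENGTH = 1551

# Residues 1491-1551 copied from the protein sequence in this record.
FRAGMENT_START = 1491
FRAGMENT = (
    "DLCNNPMTKE" "PKLEAAVRIV" "PEWQDYDQEI"
    "KQLQTLFQEE" "KELGTLHTEE" "TQEGSQKHEE" "L"
)


def site_window(position: int) -> str:
    """Return the 15-residue window around a 1-based protein position."""
    residues = []
    for pos in range(position - FLANK, position + FLANK + 1):
        if pos < 1 or pos > PROTEIN_LENGTH:
            residues.append("*")  # position lies outside the protein
        elif pos < FRAGMENT_START:
            raise ValueError(f"position {pos} is not covered by FRAGMENT")
        else:
            residues.append(FRAGMENT[pos - FRAGMENT_START])
    return "".join(residues)


print(site_window(1499))
print(site_window(1547))
```

The two printed windows can be compared with the peptides listed for positions 1499 and 1547.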
Reference
 [1] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
 [2] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [3] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
 Recognizes glycoproteins with minor folding defects. Reglucosylates single N-glycans near the misfolded part of the protein, thus providing quality control for protein folding in the endoplasmic reticulum. Reglucosylated proteins are recognized by calreticulin for recycling to the endoplasmic reticulum and refolding or degradation (By similarity). 
Sequence Annotation
 REGION 1244 1551 Glucosyltransferase (By similarity).
 MOTIF 1548 1551 Prevents secretion from ER (Potential).
 MOD_RES 1277 1277 Phosphoserine (By similarity).
 CARBOHYD 269 269 N-linked (GlcNAc...) (Potential).
 CARBOHYD 536 536 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1228 1228 N-linked (GlcNAc...) (Potential).  
Keyword
 Complete proteome; Endoplasmic reticulum; Glycoprotein; Glycosyltransferase; Phosphoprotein; Reference proteome; Signal; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1551 AA 
Protein Sequence
MCSRGDANTA DAAAARRVTG LRYNMRLLIA LALPCLFSLA EANSKAITTS LTTKWFSAPL 60
LLEASEFLAE DSQEKFWSFV EATQNIGSSD HHDTDHSYYD AVLEAAFRFL SPLQQNLLKF 120
CLSLRSYSAS IQAFQQIAVD EPPPEGCKSF LSVHGKQTCD LDTLESLLLT AADRPKPLLF 180
KGDHRYPSSN PESPVVILYS EIGHEEFSNI HHQLISKSNE GKINYVFRHY ISNPSKEPVY 240
LSGYGVELAI KSTEYKAKDD TQVKGTEVNA TVIGESDPID EVQGFLFGKL RELYPALEGQ 300
LKEFRKHLVE STNEMAPLKV WQLQDLSFQT AARILAASGA LSLVVMKDIS QNFPTKARAI 360
TKTAVSAQLR AEVEENQKYF KGTIGLQPGD SALFINGLHI DLDTQDIFSL FDTLRNEARV 420
MEGLHRLGIE GLSLHNILKL NIQPSETDYA VDIRSPAISW VNNLEVDSRY NSWPSSLQEL 480
LRPTFPGVIR QIRKNLHNMV FIIDPVHETT AELISIAEMF LSNHIPLRIG FIFVVNDSED 540
VDGMQDAGVA VLRAYNYVAQ EVDGYHAFQT LTQIYNKVRT GETVKVEHVV SVLEKKYPYV 600
EVNSILGIDS AYDQNRKEAR GYYEQTGVGP LPVVLFNGMP FEKEQLDPDE LETITMHKIL 660
ETTTFFQRAV YLGELSHDQD VVEYIMNQPN VVPRINSRIL TAKREYLDLT ASNNFYVDDF 720
ARFSALDSRG KTAAIANSMN YLTKKGMSSK EIYDDSFIRP VTFWIVGDFD SPSGRQLLYD 780
AIKHQKTSNN VRISMINNPS QEISDSSTPI FRAIWAALQT QASSSAKNFI TKMAKEETAE 840
ALAAGVDIAE FSVGGMDVSL FKEVFESSRM DFILSHALYC RDVLKLKKGQ RVVISNGRII 900
GPLEDNELFN QDDFHLLENI ILKTSGQKIK SHIQQLRVEE DVASDLVMKV DALLSAQPKG 960
EARIEYQFFE DKHSAIKLKP KEGETYYDVV AVVDPVTREA QRLAPLLLVL TQLINMNLRV 1020
FMNCQSKLSD MPLKSFYRYV LEPEISFTAD SSFAKGPIAK FLDMPQSPLF TLNLNTPESW 1080
MVESVRTPYD LDNIYLEEVD SIVAAEYELE YLLLEGHCYD ITTGQPPRGL QFTLGTSANP 1140
TIVDTIVMAN LGYFQLKANP GAWILRLRKG RSDDIYRIYS HDGTDSPPDA NDVVVILNNF 1200
KSKIIKVKVQ KKADMANEDL LSDGTNENES GFWDSFKWGF SGQKAEEVKQ DKDDIINIFS 1260
VASGHLYERF LRIMMLSVLK NTKTPVKFWF LKNYLSPTFK EFIPYMAKKY NFQYELVQYK 1320
WPRWLHQQTE KQRIIWGYKI LFLDVLFPLV VDKFLFVDAD QIVRTDLKEL RDFNLDGAPY 1380
GYTPFCDSRR EMDGYRFWKS GYWASHLAGR KYHISALYVV DLKKFRKIAA GDRLRGQYQG 1440
LSQDPNSLSN LDQDLPNNMI HQVPIKSLPQ EWLWCETWCD DASKKRAKTI DLCNNPMTKE 1500
PKLEAAVRIV PEWQDYDQEI KQLQTLFQEE KELGTLHTEE TQEGSQKHEE L 1551 
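The sequence above is laid out in groups of ten residues, with a running residue count at the end of each line. To look up a residue by position, the groups need to be joined and the counts dropped. A minimal Python sketch of that step, shown on the first two lines of the sequence only, follows; `parse_sequence_block` and `FIRST_TWO_LINES` are hypothetical names, and the layout is assumed from how the sequence is printed in this record.

```python
# First two lines of the sequence block, copied from this record.
FIRST_TWO_LINES = [
    "MCSRGDANTA DAAAARRVTG LRYNMRLLIA LALPCLFSLA EANSKAITTS LTTKWFSAPL 60",
    "LLEASEFLAE DSQEKFWSFV EATQNIGSSD HHDTDHSYYD AVLEAAFRFL SPLQQNLLKF 120",
]


def parse_sequence_block(lines):
    """Join the residue groups and check each line's trailing running count."""
    sequence = ""
    for line in lines:
        # The last whitespace-separated token is the running count.
        *groups, count = line.split()
        sequence += "".join(groups)
        if len(sequence) != int(count):
            raise ValueError(
                f"expected {count} residues so far, found {len(sequence)}"
            )
    return sequence


sequence = parse_sequence_block(FIRST_TWO_LINES)
# Positions in the record are 1-based, so residue n is sequence[n - 1].
print(len(sequence), sequence[0], sequence[119])
```

Checking the running count on every line is a cheap guard against residues dropped or duplicated during copy and paste.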
Gene Ontology
 GO:0005788; C:endoplasmic reticulum lumen; ISS:UniProtKB.
 GO:0005793; C:endoplasmic reticulum-Golgi intermediate compartment; ISS:UniProtKB.
 GO:0003980; F:UDP-glucose:glycoprotein glucosyltransferase activity; ISS:UniProtKB.
 GO:0051082; F:unfolded protein binding; ISS:UniProtKB.
 GO:0006486; P:protein glycosylation; IEA:UniProtKB-UniPathway. 
Interpro
 IPR009448; UDP-g_GGtrans. 
Pfam
 PF06427; UDP-g_GGTase 
SMART
  
PROSITE
 PS00014; ER_TARGET 
PRINTS