CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-042783
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 C-terminal 80 kDa form 
Protein Synonyms/Alias
  
Gene Name
 SOGA1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
373LGSQGLSKEILLAKDubiquitination[1]
379SKEILLAKDLGSDFQubiquitination[1]
412RTGDLDSKPDPSRSFubiquitination[1, 2, 3]
625ELQQQFAKAKATWETubiquitination[1]
661ERAGPDWKAALQRERubiquitination[3, 4]
700ERNWSQEKLQLVERLubiquitination[3, 5]
719QQVEQQVKELQNRLSubiquitination[4]
730NRLSQLQKAADPWVLubiquitination[1, 2, 3, 4, 5]
738AADPWVLKHSELEKQubiquitination[1, 3, 4, 5]
774ELGGNGLKRTKSVSSubiquitination[6]
861RSYTAPDKTGIRVYYubiquitination[1, 2, 3, 5]
900GFLFTTAKPKESAEAubiquitination[1, 2, 3, 5]
902LFTTAKPKESAEADGubiquitination[3, 6]
980GSLERKVKSTSSQTVubiquitination[1]
Reference
 [1] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661]
 [2] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [3] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [4] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [5] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [6] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1275 AA 
Protein Sequence
XLHFAKEESA LMCKKLTKLA KENDSMKEEL LKYRSLYGDL DSALSAEELA DAPHSRETEL 60
KVHLKLVEEE ANLLSRRIVE LEVENRGLRA EMDDMKDHGG GCGGPEARLA FSALGGGECG 120
ESLAELRRHL QFVEEEAELL RRSSAELEDQ NKLLLNELAK FRSEHELDVA LSEDSCSVLS 180
EPSQEELAAA KLQIGELSGK VKKLQYENRV LLSNLQRCDL ASCQSTRPML ETDAEAGDSA 240
QCVPAPLGET HESHAVRLCR AREAEVLPGL REQAALVSKA IDVLVADANG FTAGLRLCLD 300
NECADFRLHE APDNSEGPRD TKLIHAILVR LSVLQQELNA FTRKADAVLG CSVKEQQESF 360
SSLPPLGSQG LSKEILLAKD LGSDFQPPDF RDLPEWEPRI REAFRTGDLD SKPDPSRSFR 420
PYRAEDNDSY ASEIKELQLV LAEAHDSLRG LQEQLSQERQ LRKEEADNFN QKMVQLKEDQ 480
QRALLRREFE LQSLSLQRRL EQKFWSQEKN MLVQESQQFK HNFLLLFMKL RWFLKRWRQG 540
KVLPSEGDDF LEVNSMKELY LLMEEEEINA QHSDNKACTG DSWTQNTPNE YIKTLADMKV 600
TLKELCWLLR DERRGLTELQ QQFAKAKATW ETERAELKGH TSQMELKTGK GAGERAGPDW 660
KAALQREREE QQHLLAESYS AVMELTRQLQ ISERNWSQEK LQLVERLQGE KQQVEQQVKE 720
LQNRLSQLQK AADPWVLKHS ELEKQDNSWK ETRSEKIHDK EAVSEVELGG NGLKRTKSVS 780
SMSEFESLLD CSPYLAGGDA RGKKLPNNPA FGFVSSEPGD PEKDTKEKPG LSSRDCNHLG 840
ALACQDPPGR QMQRSYTAPD KTGIRVYYSP PVARRLGVPV VHDKEGKIII EPGFLFTTAK 900
PKESAEADGL AESSYGRWLC NFSRQRLDGG SAGSPSAAGP GFPAALHDFE MSGNMSDDMK 960
EITNCVRQAM RSGSLERKVK STSSQTVGLA SVGTQTIRTV SVGLQTDPPR SSLHGKAWSP 1020
RSSSLVSVRS KQISSSLDKV HSRIERPCCS PKYGSPKLQR RSVSKLDSSK DRSLWNLHQG 1080
KQNGSAWARS TTTRDSPVLR NINDGLSSLF SVVEHSGSTE SVWKLGMSET RAKPEPPKYG 1140
IVQEFFRNVC GRAPSPTSSA GEEGTKKPEP LSPASYHQPE GVARILNKKA AKLGSSEEVR 1200
LTMLPQVGKD GVLRDGDGAV VLPNEDAVCD CSTQSLTSCF ARSSRSAIRH SPSKCRLHPS 1260
ESSWGGEERA LPPSE 1275 
Gene Ontology
  
Interpro
 IPR021507; DUF3166. 
Pfam
 PF11365; DUF3166 
SMART
  
PROSITE
  
PRINTS