CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-028585
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Uncharacterized protein 
Protein Synonyms/Alias
  
Gene Name
 Soga1 
Gene Synonyms/Alias
 9830001H06Rik 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
1282GFLFTTAKPKESAEAubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1654 AA 
Protein Sequence
METPAGESSA RGYGPPPAPA PAAERKKSHR APSPARPKDV AGWSLAKGRR GTGPGSATAC 60
GTASSARPDK KGRAVAPGTR GTGPRVAGVR TGVRAKGRPR PGTGPRPPPP PPSLTDSSSE 120
VSDCASEEAR QLGLELALSS DAESAAGGPA GTRTGQPPQP AQSGQQPPRP PASPDEPSVA 180
ASSVGSSRLP LSASLAFSDL TEEMLDCGPG GLVRELEELR SENDYLKDEI EELRAEMLEM 240
RDVYMEEDVY QLQELRQQLD QASKTCRILQ YRLRKAERRS LRAAQTGQVD GELIRGLEQD 300
VKVSKDISMR LHKELEVVEK KRMRLEEENE GLRQRLIETE LAKQVLQTEL DRPREHSLKK 360
RGTRSLGKTD KKPTAQEDSA DLKCQLHFAK EESALMCKKL TKLAKENDSM KEELLKYRSL 420
YGDLDAALSA EELADAPHSR ETELKVHLKL VEEEANLLSR RIVELEVENR GLRAEMDDMK 480
DHGGGGGPEA RLAFSSLGGE CGESLAELRR HLQFVEEEAE LLRRSSAELE DQNKLLLNEL 540
AKYRSEHELD VTLSEDSCSV LSEPSQEELA AAKLQIGELS GKVKKLQYEN RVLLSNLQRC 600
DLASCQSTRP MLETDAEAGD SAQCVPAPLG ETLEPHAARL CRAREAEALP GLREQAALVS 660
KAIDVLVADA NGFSVGLRLC LDNECADLRL HEAPDNSEGP RDAKLIHAIL VRLSVLQQEL 720
NAFTRKADVA LGSSGKEQPE PFPALPALGS QGPAKEIMLS KDLGSDFQPP DFRDLLEWEP 780
RIREAFRTGD LESKPDPSRN FRPYRAEDND SYASEIKDLQ LVLAEAHDSL RGLQEQLSQE 840
RQLRKEEADS FNQKMVQLKE DQQRALLRRE FELQSLSLQR RLEQKFWSQE KNILVQESQQ 900
FKHNFLLLFM KLRWFLKRWR QGKVLPSEED DFLEVNSMKE LYLLMEEEEM NAQHSDNKAC 960
TGESWTQNTP NECIKTLADM KVTLKELCWL LQDERRGLTE LQQQFAKAKA TWETERAELK 1020
GHASQMELKA GKGASERPGP DWKAALQRER EEQQHLLAES YSAVMELTRQ LQLSERHWSQ 1080
EKLQLVERLQ GEKQQVEQQV KELQNRLSQL QKAAEPWVLK HSDMEKQDNS WKEARSEKTH 1140
DKEGVSEAEL GGTGLKRTKS VSSMSEFESL LDCSPYLAGG DARNKKLPNG PAFAFVSTEP 1200
VEPEKDAKEK AGLSTRDCSH IGSLACQEPA GRQMQRSYTA PDKTGIRVYY SPPVARRLGV 1260
PVVHDKEGKI LIEPGFLFTT AKPKESAEAD GLAESSYSRW LCNFSRQRLD GGSGASTSGS 1320
GPAFPALHDF EMSGNMSDDM KEITNCVRQA MRSGSLERKV KNTSSQTVGV ATVGTQTIRT 1380
VSVGLQTDPP RSSLHSKSWS PRSSSLVSVR SKQISSSLDK VHSRIERPCC SPKYGSPKLQ 1440
RRSVSKLDST KDRSLWNLHQ GKQNGSAWAR STTTRDSPVL RNINDGLSSL FSVVEHSGST 1500
ESVWKLGMSE ARTKPEPPKY GIVQEFFRNV CGRAPSPTTA AGEESCKKPE PLSPASYHQP 1560
EGVSRILNKK AAKAGGSEEV RPTMLSQVGK DGILRDGDGS LILPSEDAVC DCSAQSLASC 1620
FIRPSRNTIR HSPSKCRLHP SESGWGGEER AAPQ 1654 
Gene Ontology
 GO:0005615; C:extracellular space; IEA:Compara. 
Interpro
 IPR021507; DUF3166. 
Pfam
 PF11365; DUF3166 
SMART
  
PROSITE
  
PRINTS