CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-017577
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Sulfatase-modifying factor 2 
Protein Synonyms/Alias
 C-alpha-formylglycine-generating enzyme 2 
Gene Name
 SUMF2 
Gene Synonyms/Alias
 PSEC0171; UNQ1968/PRO4500 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
199RTNLWQGKFPKGDKAubiquitination[1]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473
Functional Description
 Lacks formyl-glycine generating activity and is unable to convert newly synthesized inactive sulfatases to their active form. Inhibits the activation of sulfatases by SUMF1. 
Sequence Annotation
 METAL 194 194 Calcium 1; via carbonyl oxygen.
 METAL 195 195 Calcium 1; via carbonyl oxygen.
 METAL 208 208 Calcium 1.
 METAL 210 210 Calcium 1; via carbonyl oxygen.
 METAL 229 229 Calcium 2; via carbonyl oxygen.
 METAL 232 232 Calcium 2; via carbonyl oxygen.
 METAL 234 234 Calcium 2; via carbonyl oxygen.
 METAL 236 236 Calcium 2.
 CARBOHYD 191 191 N-linked (GlcNAc...).
 DISULFID 156 290  
Keyword
 3D-structure; Alternative splicing; Calcium; Complete proteome; Direct protein sequencing; Disulfide bond; Endoplasmic reticulum; Glycoprotein; Metal-binding; Polymorphism; Reference proteome; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 301 AA 
Protein Sequence
MARHGLPLLP LLSLLVGAWL KLGNGQATSM VQLQGGRFLM GTNSPDSRDG DGPVREATVK 60
PFAIDIFPVT NKDFRDFVRE KKYRTEAEMF GWSFVFEDFV SDELRNKATQ PMKSVLWWLP 120
VEKAFWRQPA GPGSGIRERL EHPVLHVSWN DARAYCAWRG KRLPTEEEWE FAARGGLKGQ 180
VYPWGNWFQP NRTNLWQGKF PKGDKAEDGF HGVSPVNAFP AQNNYGLYDL LGNVWEWTAS 240
PYQAAEQDMR VLRGASWIDT ADGSANHRAR VTTRMGNTPD SASDNLGFRC AADAGRPPGE 300
L 301 
Gene Ontology
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0043687; P:post-translational protein modification; TAS:Reactome. 
Interpro
 IPR016187; C-type_lectin_fold.
 IPR005532; FGE_dom. 
Pfam
 PF03781; FGE-sulfatase 
SMART
  
PROSITE
  
PRINTS