CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-017031
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Extracellular sulfatase Sulf-2 
Protein Synonyms/Alias
 hSulf-2 
Gene Name
 SULF2 
Gene Synonyms/Alias
 KIAA1247; UNQ559/PRO1120 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 694       KGLQEKDKVWLLREQ  ubiquitination  [1, 2]
 834       RNMDLGLKDGGSYEQ  ubiquitination  [1, 2]
 859       EMKRPSSKSLGQLWE  ubiquitination  [1, 2]
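Each peptide listed for a modification site is 15 residues long with the modified lysine in the middle, which suggests a window of 7 residues on either side of the site. The sketch below reproduces the peptide for position 694 from the matching stretch of the protein sequence. The function name `lysine_window`, the `SEGMENT` constant, and the 7-residue flank are assumptions made for illustration; the record itself does not state how CPLM builds its peptides.

```python
# Residues 661-720 of the protein sequence given in this record.
SEGMENT = "DCHKISYHTQ HKGRLKHRGS SLHPFRKGLQ EKDKVWLLRE QKRKKKLRKL LKRLQNNDTC"
SEGMENT_START = 661  # 1-based position of the first residue in SEGMENT


def lysine_window(segment: str, segment_start: int, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based lysine `position`.

    Hypothetical helper: the window is clipped at the segment boundaries,
    so sites near either end yield a shorter peptide.
    """
    residues = "".join(segment.split())  # drop the spaces between blocks of ten
    index = position - segment_start     # convert to a 0-based offset
    if not 0 <= index < len(residues):
        raise ValueError(f"position {position} lies outside the segment")
    if residues[index] != "K":
        raise ValueError(f"residue {position} is {residues[index]}, not lysine")
    return residues[max(0, index - flank): index + flank + 1]


if __name__ == "__main__":
    print(lysine_window(SEGMENT, SEGMENT_START, 694))
```

The same call with the full sequence and `segment_start=1` would cover the other two sites; the lysine check guards against off-by-one errors when positions are copied by hand.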
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
Functional Description
 Exhibits arylsulfatase activity and highly specific endoglucosamine-6-sulfatase activity. It can remove sulfate from the C-6 position of glucosamine within specific subregions of intact heparin. 
Sequence Annotation
 METAL 52 52 Calcium (By similarity).
 METAL 53 53 Calcium (By similarity).
 METAL 88 88 Calcium; via 3-oxoalanine (By similarity).
 METAL 317 317 Calcium (By similarity).
 METAL 318 318 Calcium (By similarity).
 MOD_RES 88 88 3-oxoalanine (Cys) (By similarity).
 CARBOHYD 65 65 N-linked (GlcNAc...) (Potential).
 CARBOHYD 112 112 N-linked (GlcNAc...) (Potential).
 CARBOHYD 132 132 N-linked (GlcNAc...) (Potential).
 CARBOHYD 149 149 N-linked (GlcNAc...) (Potential).
 CARBOHYD 171 171 N-linked (GlcNAc...) (Potential).
 CARBOHYD 198 198 N-linked (GlcNAc...).
 CARBOHYD 241 241 N-linked (GlcNAc...) (Potential).
 CARBOHYD 561 561 N-linked (GlcNAc...) (Potential).
 CARBOHYD 608 608 N-linked (GlcNAc...) (Potential).
 CARBOHYD 717 717 N-linked (GlcNAc...) (Potential).
 CARBOHYD 754 754 N-linked (GlcNAc...) (Potential).
 CARBOHYD 764 764 N-linked (GlcNAc...) (Potential).  
Keyword
 Alternative splicing; Calcium; Complete proteome; Endoplasmic reticulum; Glycoprotein; Golgi apparatus; Hydrolase; Metal-binding; Polymorphism; Reference proteome; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 870 AA 
Protein Sequence
MGPPSLVLCL LSATVFSLLG GSSAFLSHHR LKGRFQRDRR NIRPNIILVL TDDQDVELGS 60
MQVMNKTRRI MEQGGAHFIN AFVTTPMCCP SRSSILTGKY VHNHNTYTNN ENCSSPSWQA 120
QHESRTFAVY LNSTGYRTAF FGKYLNEYNG SYVPPGWKEW VGLLKNSRFY NYTLCRNGVK 180
EKHGSDYSKD YLTDLITNDS VSFFRTSKKM YPHRPVLMVI SHAAPHGPED SAPQYSRLFP 240
NASQHITPSY NYAPNPDKHW IMRYTGPMKP IHMEFTNMLQ RKRLQTLMSV DDSMETIYNM 300
LVETGELDNT YIVYTADHGY HIGQFGLVKG KSMPYEFDIR VPFYVRGPNV EAGCLNPHIV 360
LNIDLAPTIL DIAGLDIPAD MDGKSILKLL DTERPVNRFH LKKKMRVWRD SFLVERGKLL 420
HKRDNDKVDA QEENFLPKYQ RVKDLCQRAE YQTACEQLGQ KWQCVEDATG KLKLHKCKGP 480
MRLGGSRALS NLVPKYYGQG SEACTCDSGD YKLSLAGRRK KLFKKKYKAS YVRSRSIRSV 540
AIEVDGRVYH VGLGDAAQPR NLTKRHWPGA PEDQDDKDGG DFSGTGGLPD YSAANPIKVT 600
HRCYILENDT VQCDLDLYKS LQAWKDHKLH IDHEIETLQN KIKNLREVRG HLKKKRPEEC 660
DCHKISYHTQ HKGRLKHRGS SLHPFRKGLQ EKDKVWLLRE QKRKKKLRKL LKRLQNNDTC 720
SMPGLTCFTH DNQHWQTAPF WTLGPFCACT SANNNTYWCM RTINETHNFL FCEFATGFLE 780
YFDLNTDPYQ LMNAVNTLDR DVLNQLHVQL MELRSCKGYK QCNPRTRNMD LGLKDGGSYE 840
QYRQFQRRKW PEMKRPSSKS LGQLWEGWEG 870 
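The sequence is printed in blocks of ten residues with a running residue count at the end of each line, so it has to be cleaned before positions can be looked up. A minimal sketch follows, using only the last two lines of the sequence as input; `parse_sequence_block` and `TAIL_BLOCK` are hypothetical names introduced for this example.

```python
import re

# Last two lines of the sequence as printed in this record (residues 781-870).
TAIL_BLOCK = """
YFDLNTDPYQ LMNAVNTLDR DVLNQLHVQL MELRSCKGYK QCNPRTRNMD LGLKDGGSYE 840
QYRQFQRRKW PEMKRPSSKS LGQLWEGWEG 870
"""


def parse_sequence_block(block: str) -> str:
    """Strip whitespace and running residue counts, leaving only residue letters."""
    residues = re.sub(r"[\s\d]+", "", block)
    if not residues.isalpha() or not residues.isupper():
        raise ValueError("unexpected characters in sequence block")
    return residues


if __name__ == "__main__":
    tail = parse_sequence_block(TAIL_BLOCK)
    # Residue 834 is at offset 834 - 781 within this 90-residue tail.
    print(len(tail), tail[834 - 781])
```

Applied to the whole sequence block, the length of the result can be compared against the Protein Length field as a quick integrity check.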
Gene Ontology
 GO:0009986; C:cell surface; IDA:UniProtKB.
 GO:0005783; C:endoplasmic reticulum; IDA:UniProtKB.
 GO:0005615; C:extracellular space; NAS:UniProtKB.
 GO:0005795; C:Golgi stack; IEA:UniProtKB-SubCell.
 GO:0005886; C:plasma membrane; ISS:UniProtKB.
 GO:0004065; F:arylsulfatase activity; IDA:UniProtKB.
 GO:0005509; F:calcium ion binding; IEA:InterPro.
 GO:0008449; F:N-acetylglucosamine-6-sulfatase activity; IDA:UniProtKB.
 GO:0060348; P:bone development; ISS:BHF-UCL.
 GO:0002063; P:chondrocyte development; ISS:UniProtKB.
 GO:0048706; P:embryonic skeletal system development; ISS:UniProtKB.
 GO:0014846; P:esophagus smooth muscle contraction; ISS:UniProtKB.
 GO:0035860; P:glial cell-derived neurotrophic factor receptor signaling pathway; ISS:UniProtKB.
 GO:0032836; P:glomerular basement membrane development; ISS:UniProtKB.
 GO:0003094; P:glomerular filtration; ISS:UniProtKB.
 GO:0030201; P:heparan sulfate proteoglycan metabolic process; IDA:UniProtKB.
 GO:0060384; P:innervation; ISS:UniProtKB.
 GO:0040037; P:negative regulation of fibroblast growth factor receptor signaling pathway; ISS:UniProtKB.
 GO:0030177; P:positive regulation of Wnt receptor signaling pathway; IDA:UniProtKB.
 GO:0010575; P:positive regulation of vascular endothelial growth factor production; ISS:UniProtKB. 
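Each Gene Ontology line above has three semicolon-separated fields: the GO identifier, a term prefixed with an aspect code (C, F or P), and an evidence code followed by its source. The sketch below splits one such line into named fields. `GoAnnotation` and `parse_go_line` are hypothetical names, and the parser assumes term names contain no semicolons.

```python
from typing import NamedTuple

# Aspect codes as used in the lines above.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line: str) -> GoAnnotation:
    """Split a 'GO:id; X:term; EVIDENCE:source.' line into its fields."""
    body = line.strip().rstrip(".")
    go_id, term_field, evidence_field = (part.strip() for part in body.split(";"))
    aspect_code, term = term_field.split(":", 1)
    evidence, source = evidence_field.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect_code], term, evidence, source)


if __name__ == "__main__":
    print(parse_go_line(" GO:0004065; F:arylsulfatase activity; IDA:UniProtKB."))
```

Grouping the parsed annotations by `evidence` makes it easy to separate experimentally supported terms (IDA) from those inferred by similarity (ISS) or electronic annotation (IEA).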
Interpro
 IPR017849; Alkaline_Pase-like_a/b/a.
 IPR017850; Alkaline_phosphatase_core.
 IPR014615; Extracellular_sulfatase.
 IPR024609; Extracellular_sulfatase_C.
 IPR000917; Sulfatase.
 IPR024607; Sulfatase_CS. 
Pfam
 PF12548; DUF3740
 PF00884; Sulfatase 
SMART
  
PROSITE
 PS00523; SULFATASE_1
 PS00149; SULFATASE_2 
PRINTS