CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-004344
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Amphiregulin 
Protein Synonyms/Alias
 AR; Colorectum cell-derived growth factor; CRDGF 
Gene Name
 AREG; AREGB 
Gene Synonyms/Alias
 SDGF 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
109VRVEQVVKPPQNKTEubiquitination[1]
187RCGEKSMKTHSMIDSubiquitination[2]
230LRRQYVRKYEGEAEEubiquitination[1, 2]
240GEAEERKKLRQENGNubiquitination[3]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661]
 [3] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
 Ligand of the EGF receptor/EGFR. Autocrine growth factor as well as a mitogen for a broad range of target cells including astrocytes, Schwann cells and fibroblasts. 
Sequence Annotation
 DOMAIN 142 182 EGF-like.
 CARBOHYD 30 30 N-linked (GlcNAc...) (Potential).
 CARBOHYD 113 113 N-linked (GlcNAc...) (Potential).
 CARBOHYD 119 119 N-linked (GlcNAc...).
 DISULFID 146 159 By similarity.
 DISULFID 154 170 By similarity.
 DISULFID 172 181 By similarity.  
Keyword
 3D-structure; Complete proteome; Cytokine; Direct protein sequencing; Disulfide bond; EGF-like domain; Glycoprotein; Growth factor; Membrane; Polymorphism; Reference proteome; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 252 AA 
Protein Sequence
MRAPLLPPAP VVLSLLILGS GHYAAGLDLN DTYSGKREPF SGDHSADGFE VTSRSEMSSG 60
SEISPVSEMP SSSEPSSGAD YDYSEEYDNE PQIPGYIVDD SVRVEQVVKP PQNKTESENT 120
SDKPKRKKKG GKNGKNRRNR KKKNPCNAEF QNFCIHGECK YIEHLEAVTC KCQQEYFGER 180
CGEKSMKTHS MIDSSLSKIA LAAIAAFMSA VILTAVAVIT VQLRRQYVRK YEGEAEERKK 240
LRQENGNVHA IA 252 
Gene Ontology
 GO:0009986; C:cell surface; IDA:BHF-UCL.
 GO:0005615; C:extracellular space; IDA:BHF-UCL.
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0008083; F:growth factor activity; TAS:ProtInc.
 GO:0008283; P:cell proliferation; TAS:ProtInc.
 GO:0007267; P:cell-cell signaling; TAS:ProtInc.
 GO:0007173; P:epidermal growth factor receptor signaling pathway; IDA:BHF-UCL.
 GO:0007186; P:G-protein coupled receptor signaling pathway; IDA:BHF-UCL.
 GO:0045740; P:positive regulation of DNA replication; IMP:BHF-UCL. 
Interpro
 IPR000742; EG-like_dom.
 IPR013032; EGF-like_CS.
 IPR015497; EGF_rcpt_ligand. 
Pfam
  
SMART
 SM00181; EGF 
PROSITE
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3 
PRINTS