CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-016426
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Multiple epidermal growth factor-like domains protein 9 
Protein Synonyms/Alias
 Multiple EGF-like domains protein 9; Epidermal growth factor-like protein 5; EGF-like protein 5 
Gene Name
 Megf9 
Gene Synonyms/Alias
 Egfl5; Kiaa0818 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
543       YREYQNRKLNAPFWT  ubiquitination  [1]
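
The modification row can be cross-checked against the Protein Sequence field of this record. The sketch below is illustrative only: it assumes the listed peptide is the modified lysine with seven flanking residues on each side (an inference from this entry, not a documented CPLM convention), and the helper names `clean_sequence` and `site_window` are hypothetical. It uses only residues 481-600, copied as printed in the record.

```python
# Residues 481-600 of the record's Protein Sequence, copied as printed
# (groups of 10 residues, trailing running residue count).
RAW_TAIL = """
TFAPTTLQTI FAVSSSENST SALADVSWTQ FNIIILTVII IVVVLLMGFV GAVYMYREYQ 540
NRKLNAPFWT IELKEDNISF SSYHDSIPNA DVSGLLEDDA NEVAPNGQLT LTTPIHNYKA 600
"""
TAIL_START = 481  # 1-based position of the first residue in RAW_TAIL


def clean_sequence(raw):
    """Drop whitespace and the running residue counts, keeping letters only."""
    return "".join(ch for ch in raw if ch.isalpha())


def site_window(tail, tail_start, position, flank=7):
    """Return (residue, window) for a 1-based position inside the fragment.

    flank=7 is an assumption inferred from the 15-residue peptide in the record.
    """
    idx = position - tail_start
    residue = tail[idx]
    window = tail[max(0, idx - flank): idx + flank + 1]
    return residue, window


tail = clean_sequence(RAW_TAIL)
residue, window = site_window(tail, TAIL_START, 543)
print(residue, window)
```

If the residue at the reported position is not a lysine, or the window differs from the listed peptide, the position and sequence are likely drawn from different sequence versions.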
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
 DOMAIN 202 251 Laminin EGF-like 1.
 DOMAIN 252 298 Laminin EGF-like 2.
 DOMAIN 299 346 Laminin EGF-like 3.
 DOMAIN 347 397 Laminin EGF-like 4.
 DOMAIN 398 449 Laminin EGF-like 5.
 CARBOHYD 36 36 N-linked (GlcNAc...) (Potential).
 CARBOHYD 179 179 N-linked (GlcNAc...) (Potential).
 CARBOHYD 203 203 N-linked (GlcNAc...) (Potential).
 CARBOHYD 216 216 N-linked (GlcNAc...) (Potential).
 CARBOHYD 243 243 N-linked (GlcNAc...) (Potential).
 CARBOHYD 265 265 N-linked (GlcNAc...) (Potential).
 CARBOHYD 303 303 N-linked (GlcNAc...) (Potential).
 CARBOHYD 357 357 N-linked (GlcNAc...) (Potential).
 CARBOHYD 426 426 N-linked (GlcNAc...) (Potential).
 CARBOHYD 466 466 N-linked (GlcNAc...) (Potential).
 CARBOHYD 479 479 N-linked (GlcNAc...) (Potential).
 CARBOHYD 498 498 N-linked (GlcNAc...) (Potential).
 DISULFID 202 215 By similarity.
 DISULFID 204 222 By similarity.
 DISULFID 224 233 By similarity.
 DISULFID 236 249 By similarity.
 DISULFID 252 264 By similarity.
 DISULFID 254 270 By similarity.
 DISULFID 272 281 By similarity.
 DISULFID 284 296 By similarity.
 DISULFID 299 308 By similarity.
 DISULFID 301 315 By similarity.
 DISULFID 318 327 By similarity.
 DISULFID 330 344 By similarity.
 DISULFID 347 358 By similarity.
 DISULFID 349 369 By similarity.
 DISULFID 372 381 By similarity.
 DISULFID 384 395 By similarity.
 DISULFID 398 413 By similarity.
 DISULFID 400 420 By similarity.
 DISULFID 423 432 By similarity.
 DISULFID 435 447 By similarity.  
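
Each annotation line above follows the pattern "key, start, end, note", so it can be turned into a small record for range queries. The following is a minimal sketch under that assumption; the `Feature` type and `parse_feature` function are hypothetical names introduced for illustration, and only a few lines of the record are used as sample input.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str    # feature key, e.g. DOMAIN, CARBOHYD, DISULFID
    start: int  # 1-based start position (first cysteine for DISULFID)
    end: int    # 1-based end position (second cysteine for DISULFID)
    note: str   # free-text description without the trailing period


def parse_feature(line):
    """Split one annotation line into key, positions, and description."""
    key, start, end, *rest = line.split()
    return Feature(key, int(start), int(end), " ".join(rest).rstrip("."))


# A few lines copied from the Sequence Annotation field of this record.
LINES = [
    "DOMAIN 202 251 Laminin EGF-like 1.",
    "CARBOHYD 203 203 N-linked (GlcNAc...) (Potential).",
    "DISULFID 202 215 By similarity.",
    "DISULFID 204 222 By similarity.",
]

features = [parse_feature(line) for line in LINES]
domain = features[0]

# Features whose positions fall entirely inside the first Laminin EGF-like domain.
inside = [f for f in features[1:] if domain.start <= f.start and f.end <= domain.end]
print(domain.end - domain.start + 1, [f.key for f in inside])
```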
Keyword
 Complete proteome; Disulfide bond; Glycoprotein; Laminin EGF-like domain; Membrane; Reference proteome; Repeat; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 600 AA 
Protein Sequence
MNGGAERAMR SLPSLGGLAL LCCAAAAAAS TASAGNVTGG GGAEGQVVPS PSPGLRDQAS 60
SPFPKTAAPT AQAPRTGPPR TTVRKTGATT PSAGSPEIIP PLRTSAQPAA TPFPALDLSP 120
ATPSEDGHTP TTESPPSRPA PTTLASTVGQ PPTTSVVTTA QASSTPGTPT AESPDRSSNS 180
SGVPPTAPVT EAPTSPPPEH MCNCSEVGSL DVKRCNQTTG QCDCHVGYQG LHCDTCKEGF 240
YLNHTVGLCL PCHCSPHGAV SILCNSSGNC QCKVGVTGSM CDKCQDGHYG FGKTGCLPCQ 300
CNNRSDSCDV HTGACLNCQE NSKGEHCEEC KEGFYPSPDA AKQCHRCPCS AVTSTGNCTI 360
ESGELEPTCD QCKDGYTGQN CNKCENGYYN SDSICTQCEC HGHVDPIKTP KICKPESGEC 420
INCLHNTTGF WCEKCLEGYV RDLQRNCIKQ EVIVPTPEGS TILVSNASLT TSVPTPVINS 480
TFAPTTLQTI FAVSSSENST SALADVSWTQ FNIIILTVII IVVVLLMGFV GAVYMYREYQ 540
NRKLNAPFWT IELKEDNISF SSYHDSIPNA DVSGLLEDDA NEVAPNGQLT LTTPIHNYKA 600 
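
The sequence block is printed in groups of ten residues with a running residue count at the end of each line. A hedged sketch of how such a block might be parsed and sanity-checked against the other fields of this record follows; `parse_sequence_block` and `residue_at` are hypothetical helper names, and the positions checked are a small sample taken from the Sequence Annotation and Lysine Modification fields.

```python
RAW = """
MNGGAERAMR SLPSLGGLAL LCCAAAAAAS TASAGNVTGG GGAEGQVVPS PSPGLRDQAS 60
SPFPKTAAPT AQAPRTGPPR TTVRKTGATT PSAGSPEIIP PLRTSAQPAA TPFPALDLSP 120
ATPSEDGHTP TTESPPSRPA PTTLASTVGQ PPTTSVVTTA QASSTPGTPT AESPDRSSNS 180
SGVPPTAPVT EAPTSPPPEH MCNCSEVGSL DVKRCNQTTG QCDCHVGYQG LHCDTCKEGF 240
YLNHTVGLCL PCHCSPHGAV SILCNSSGNC QCKVGVTGSM CDKCQDGHYG FGKTGCLPCQ 300
CNNRSDSCDV HTGACLNCQE NSKGEHCEEC KEGFYPSPDA AKQCHRCPCS AVTSTGNCTI 360
ESGELEPTCD QCKDGYTGQN CNKCENGYYN SDSICTQCEC HGHVDPIKTP KICKPESGEC 420
INCLHNTTGF WCEKCLEGYV RDLQRNCIKQ EVIVPTPEGS TILVSNASLT TSVPTPVINS 480
TFAPTTLQTI FAVSSSENST SALADVSWTQ FNIIILTVII IVVVLLMGFV GAVYMYREYQ 540
NRKLNAPFWT IELKEDNISF SSYHDSIPNA DVSGLLEDDA NEVAPNGQLT LTTPIHNYKA 600
"""


def parse_sequence_block(raw):
    """Join the residue groups and verify each line's running residue count."""
    parts = []
    for line in raw.strip().splitlines():
        *groups, count = line.split()
        parts.extend(groups)
        total = sum(len(g) for g in parts)
        if total != int(count):
            raise ValueError(f"running count {count} does not match {total}")
    return "".join(parts)


def residue_at(seq, position):
    """Return the residue at a 1-based position."""
    return seq[position - 1]


sequence = parse_sequence_block(RAW)

# Sample positions taken from other fields of this record.
disulfide_pairs = [(202, 215), (204, 222), (435, 447)]
glycosylation_sites = [36, 179, 498]

print(len(sequence))
print(all(residue_at(sequence, p) == "C" for pair in disulfide_pairs for p in pair))
print(all(residue_at(sequence, p) == "N" for p in glycosylation_sites))
print(residue_at(sequence, 543))
```

Checking the running counts while parsing catches dropped or duplicated residues early, before any position-based lookup is attempted.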
Gene Ontology
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW. 
Interpro
 IPR002049; EGF_laminin.
 IPR001368; TNFR/NGFR_Cys_rich_reg. 
Pfam
 PF00053; Laminin_EGF 
SMART
 SM00180; EGF_Lam
 SM00208; TNFR 
PROSITE
 PS00022; EGF_1
 PS01186; EGF_2
 PS01248; EGF_LAM_1
 PS50027; EGF_LAM_2 
PRINTS