CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-007097
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(XV) chain 
Protein Synonyms/Alias
 Restin; Endostatin-XV; Related to endostatin; Restin-I; Restin-2; Restin-II; Restin-3; Restin-III; Restin-4; Restin-IV 
Gene Name
 COL15A1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1239RADFQCFKQARAAGLubiquitination[1]
1360ASPLSTGKILDQKAYubiquitination[1]
Reference
 [1] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094
Functional Description
 Structural protein that stabilizes microvessels and muscle cells, both in heart and in skeletal muscle. 
Sequence Annotation
 DOMAIN 66 249 Laminin G-like.
 REPEAT 358 408 1.
 REPEAT 409 459 2.
 REPEAT 460 509 3.
 REPEAT 510 555 4.
 DOMAIN 619 680 Collagen-like 1.
 DOMAIN 681 731 Collagen-like 2.
 DOMAIN 823 865 Collagen-like 3.
 DOMAIN 879 927 Collagen-like 4.
 REGION 229 555 Nonhelical region 1 (NC1).
 REGION 358 555 4 X tandem repeats.
 REGION 556 573 Triple-helical region 1 (COL1).
 REGION 574 618 Nonhelical region 2 (NC2).
 REGION 619 732 Triple-helical region 2 (COL2).
 REGION 733 763 Nonhelical region 3 (NC3).
 REGION 764 798 Triple-helical region 3 (COL3).
 REGION 799 822 Nonhelical region 4 (NC4).
 REGION 823 867 Triple-helical region 4 (COL4).
 REGION 868 878 Nonhelical region 5 (NC5).
 REGION 879 949 Triple-helical region 5 (COL5).
 REGION 950 983 Nonhelical region 6 (NC6).
 REGION 984 1013 Triple-helical region 6 (COL6).
 REGION 1014 1027 Nonhelical region 7 (NC7).
 REGION 1028 1045 Triple-helical region 7 (COL7).
 REGION 1046 1052 Nonhelical region 8 (NC8).
 REGION 1053 1107 Triple-helical region 8 (COL8).
 REGION 1108 1117 Nonhelical region 9 (NC9).
 REGION 1118 1132 Triple-helical region 9 (COL9).
 REGION 1133 1388 Nonhelical region 10 (NC10).
 CARBOHYD 265 265 O-linked (GalNAc...).
 CARBOHYD 306 306 N-linked (GlcNAc...) (Potential).
 CARBOHYD 324 324 N-linked (GlcNAc...) (Potential).
 CARBOHYD 687 687 N-linked (GlcNAc...) (Potential).
 CARBOHYD 807 807 N-linked (GlcNAc...) (Potential).
 CARBOHYD 814 814 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1046 1046 N-linked (GlcNAc...) (Potential).
 DISULFID 1237 1377
 DISULFID 1339 1369  
Keyword
 3D-structure; Angiogenesis; Cell adhesion; Collagen; Complete proteome; Developmental protein; Differentiation; Direct protein sequencing; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Polymorphism; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1388 AA 
Protein Sequence
MAPRRNNGQC WCLLMLLSVS TPLPAVTQTR GATETASQGH LDLTQLIGVP LPSSVSFVTG 60
YGGFPAYSFG PGANVGRPAR TLIPSTFFRD FAISVVVKPS STRGGVLFAI TDAFQKVIYL 120
GLRLSGVEDG HQRIILYYTE PGSHVSQEAA AFSVPVMTHR WNRFAMIVQG EEVTLLVNCE 180
EHSRIPFQRS SQALAFESSA GIFMGNAGAT GLERFTGSLQ QLTVHPDPRT PEELCDPEES 240
SASGETSGLQ EADGVAEILE AVTYTQASPK EAKVEPINTP PTPSSPFEDM ELSGEPVPEG 300
TLETTNMSII QHSSPKQGSG EILNDTLEGV HSVDGDPITD SGSGAGAFLD IAEEKNLAAT 360
AAGLAEVPIS TAGEAEASSV PTGGPTLSMS TENPEEGVTP GPDNEERLAA TAAGEAEALA 420
SMPGEVEASG VAPGELDLSM SAQSLGEEAT VGPSSEDSLT TAAAATEVSL STFEDEEASG 480
VPTDGLAPLT ATMAPERAVT SGPGDEEDLA AATTEEPLIT AGGEESGSPP PDGPPLPLPT 540
VAPERWITPA QREHVGMKGQ AGPKGEKGDA GEELPGPPEP SGPVGPTAGA EAEGSGLGWG 600
SDVGSGSGDL VGSEQLLRGP PGPPGPPGLP GIPGKPGTDV FMGPPGSPGE DGPAGEPGPP 660
GPEGQPGVDG ATGLPGMKGE KGARGPNGSV GEKGDPGNRG LPGPPGKKGQ AGPPGVMGPP 720
GPPGPPGPPG PGCTMGLGFE DTEGSGSTQL LNEPKLSRPT AAIGLKGEKG DRGPKGERGM 780
DGASIVGPPG PRGPPGHIKV LSNSLINITH GFMNFSDIPE LVGPPGPDGL PGLPGFPGPR 840
GPKGDTGLPG FPGLKGEQGE KGEPGAILTE DIPLERLMGK KGEPGMHGAP GPMGPKGPPG 900
HKGEFGLPGR PGRPGLNGLK GTKGDPGVIM QGPPGLPGPP GPPGPPGAVI NIKGAIFPIP 960
VRPHCKMPVD TAHPGSPELI TFHGVKGEKG SWGLPGSKGE KGDQGAQGPP GPPLDLAYLR 1020
HFLNNLKGEN GDKGFKGEKG EKGDINGSFL MSGPPGLPGN PGPAGQKGET VVGPQGPPGA 1080
PGLPGPPGFG RPGDPGPPGP PGPPGPPAIL GAAVALPGPP GPPGQPGLPG SRNLVTAFSN 1140
MDDMLQKAHL VIEGTFIYLR DSTEFFIRVR DGWKKLQLGE LIPIPADSPP PPALSSNPHQ 1200
LLPPPNPISS ANYEKPALHL AALNMPFSGD IRADFQCFKQ ARAAGLLSTY RAFLSSHLQD 1260
LSTIVRKAER YSLPIVNLKG QVLFNNWDSI FSGHGGQFNM HIPIYSFDGR DIMTDPSWPQ 1320
KVIWHGSSPH GVRLVDNYCE AWRTADTAVT GLASPLSTGK ILDQKAYSCA NRLIVLCIEN 1380
SFMTDARK 1388 
Gene Ontology
 GO:0005604; C:basement membrane; IEA:Compara.
 GO:0005582; C:collagen type XV; TAS:UniProtKB.
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0005615; C:extracellular space; IDA:BHF-UCL.
 GO:0016021; C:integral to membrane; NAS:UniProtKB.
 GO:0005201; F:extracellular matrix structural constituent; IC:UniProtKB.
 GO:0001525; P:angiogenesis; IEA:UniProtKB-KW.
 GO:0007155; P:cell adhesion; IC:UniProtKB.
 GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome.
 GO:0007165; P:signal transduction; NAS:UniProtKB. 
Interpro
 IPR016186; C-type_lectin-like.
 IPR016187; C-type_lectin_fold.
 IPR008160; Collagen.
 IPR010515; Collagenase_NC10/endostatin.
 IPR008985; ConA-like_lec_gl_sf.
 IPR001791; Laminin_G. 
Pfam
 PF01391; Collagen
 PF06482; Endostatin 
SMART
 SM00282; LamG
 SM00210; TSPN 
PROSITE
 PS50025; LAM_G_DOMAIN 
PRINTS