CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-007098
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(XVIII) chain 
Protein Synonyms/Alias
 Endostatin 
Gene Name
 COL18A1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1677RIFSFDGKDVLRHPTubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
 COLA18A probably plays a major role in determining the retinal structure as well as in the closure of the neural tube. 
Sequence Annotation
 DOMAIN 329 446 FZ.
 DOMAIN 456 644 Laminin G-like.
 REGION 645 751 Nonhelical region 1 (NC1).
 REGION 752 785 Triple-helical region 1 (COL1).
 REGION 786 795 Nonhelical region 2 (NC2).
 REGION 796 875 Triple-helical region 2 (COL2).
 REGION 876 899 Nonhelical region 3 (NC3).
 REGION 900 1021 Triple-helical region 3 (COL3).
 REGION 1022 1044 Nonhelical region 4 (NC4).
 REGION 1045 1127 Triple-helical region 4 (COL4).
 REGION 1128 1141 Nonhelical region 5 (NC5).
 REGION 1142 1183 Triple-helical region 5 (COL5).
 REGION 1184 1196 Nonhelical region 6 (NC6).
 REGION 1197 1269 Triple-helical region 6 (COL6).
 REGION 1270 1279 Nonhelical region 7 (NC7).
 REGION 1280 1312 Triple-helical region 7 (COL7).
 REGION 1313 1324 Nonhelical region 8 (NC8).
 REGION 1325 1346 Triple-helical region 8 (COL8).
 REGION 1347 1353 Nonhelical region 9 (NC9).
 REGION 1354 1411 Triple-helical region 9 (COL9).
 REGION 1412 1424 Nonhelical region 10 (NC10).
 REGION 1425 1442 Triple-helical region 10 (COL10).
 REGION 1443 1754 Nonhelical region 11 (NC11).
 MOTIF 1330 1332 Cell attachment site (Potential).
 METAL 1572 1572 Zinc.
 METAL 1574 1574 Zinc.
 METAL 1582 1582 Zinc.
 METAL 1647 1647 Zinc.
 CARBOHYD 68 68 N-linked (GlcNAc...) (Potential).
 CARBOHYD 129 129 N-linked (GlcNAc...) (Potential).
 CARBOHYD 164 164 N-linked (GlcNAc...) (Potential).
 CARBOHYD 926 926 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1567 1567 O-linked (GalNAc...).
 DISULFID 334 397 By similarity.
 DISULFID 344 390 By similarity.
 DISULFID 381 419 By similarity.
 DISULFID 408 443 By similarity.
 DISULFID 412 432 By similarity.
 DISULFID 1604 1744 By similarity.
 DISULFID 1706 1736 By similarity.  
Keyword
 3D-structure; Alternative promoter usage; Alternative splicing; Cell adhesion; Collagen; Complete proteome; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Polymorphism; Reference proteome; Repeat; Secreted; Signal; Zinc. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1754 AA 
Protein Sequence
MAPYPCGCHI LLLLFCCLAA ARANLLNLNW LWFNNEDTSH AATTIPEPQG PLPVQPTADT 60
TTHVTPRNGS TEPATAPGSP EPPSELLEDG QDTPTSAESP DAPEENIAGV GAEILNVAKG 120
IRSFVQLWND TVPTESLARA ETLVLETPVG PLALAGPSST PQENGTTLWP SRGIPSSPGA 180
HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSGRASL SSLLGGAPPW GSLQDPDSQG 240
LSPAAAAPSQ QLQRPDVRLR TPLLHPLVMG SLGKHAAPSA FSSGLPGALS QVAVTTLTRD 300
SGAWVSHVAN SVGPGLANNS ALLGADPEAP AGRCLPLPPS LPVCGHLGIS RFWLPNHLHH 360
ESGEQVRAGA RAWGGLLQTH CHPFLAWFFC LLLVPPCGSV PPPAPPPCCQ FCEALQDACW 420
SRLGGGRLPV ACASLPTQED GYCVLIGPAA ERISEEVGLL QLLGDPPPQQ VTQTDDPDVG 480
LAYVFGPDAN SGQVARYHFP SLFFRDFSLL FHIRPATEGP GVLFAITDSA QAMVLLGVKL 540
SGVQDGHQDI SLLYTEPGAG QTHTAASFRL PAFVGQWTHL ALSVAGGFVA LYVDCEEFQR 600
MPLARSSRGL ELEPGAGLFV AQAGGADPDK FQGVIAELKV RRDPQVSPMH CLDEEGDDSD 660
GASGDSGSGL GDARELLREE TGAALKPRLP APPPVTTPPL AGGSSTEDSR SEEVEEQTTV 720
ASLGAQTLPG SDSVSTWDGS VRTPGGRVKE GGLKGQKGEP GVPGPPGRAG PPGSPCLPGP 780
PGLPCPVSPL GPAGPALQTV PGPQGPPGPP GRDGTPGRDG EPGDPGEDGK PGDTGPQGFP 840
GTPGDVGPKG DKGDPGVGER GPPGPQGPPG PPGPSFRHDK LTFIDMEGSG FGGDLEALRG 900
PRGFPGPPGP PGVPGLPGEP GRFGVNSSDV PGPAGLPGVP GREGPPGFPG LPGPPGPPGR 960
EGPPGRTGQK GSLGEAGAPG HKGSKGAPGP AGARGESGLA GAPGPAGPPG PPGPPGPPGP 1020
GLPAGFDDME GSGGPFWSTA RSADGPQGPP GLPGLKGDPG VPGLPGAKGE VGADGVPGFP 1080
GLPGREGIAG PQGPKGDRGS RGEKGDPGKD GVGQPGLPGP PGPPGPVVYV SEQDGSVLSV 1140
PGPEGRPGFA GFPGPAGPKG NLGSKGERGS PGPKGEKGEP GSIFSPDGGA LGPAQKGAKG 1200
EPGFRGPPGP YGRPGYKGEI GFPGRPGRPG MNGLKGEKGE PGDASLGFGM RGMPGPPGPP 1260
GPPGPPGTPV YDSNVFAESS RPGPPGLPGN QGPPGPKGAK GEVGPPGPPG QFPFDFLQLE 1320
AEMKGEKGDR GDAGQKGERG EPGGGGFFGS SLPGPPGPPG PPGPRGYPGI PGPKGESIRG 1380
QPGPPGPQGP PGIGYEGRQG PPGPPGPPGP PSFPGPHRQT ISVPGPPGPP GPPGPPGTMG 1440
ASSGVRLWAT RQAMLGQVHE VPEGWLIFVA EQEELYVRVQ NGFRKVQLEA RTPLPRGTDN 1500
EVAALQPPVV QLHDSNPYPR REHPHPTARP WRADDILASP PRLPEPQPYP GAPHHSSYVH 1560
LRPARPTSPP AHSHRDFQPV LHLVALNSPL SGGMRGIRGA DFQCFQQARA VGLAGTFRAF 1620
LSSRLQDLYS IVRRADRAAV PIVNLKDELL FPSWEALFSG SEGPLKPGAR IFSFDGKDVL 1680
RHPTWPQKSV WHGSDPNGRR LTESYCETWR TEAPSATGQA SSLLGGRLLG QSAASCHHAY 1740
IVLCIENSFM TASK 1754 
Gene Ontology
 GO:0005604; C:basement membrane; IEA:Compara.
 GO:0005581; C:collagen; TAS:ProtInc.
 GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome.
 GO:0005615; C:extracellular space; IDA:BHF-UCL.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0005198; F:structural molecule activity; IEA:InterPro.
 GO:0001525; P:angiogenesis; IEA:Compara.
 GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
 GO:0030574; P:collagen catabolic process; TAS:Reactome.
 GO:0001886; P:endothelial cell morphogenesis; IEA:Compara.
 GO:0022617; P:extracellular matrix disassembly; TAS:Reactome.
 GO:0008285; P:negative regulation of cell proliferation; TAS:ProtInc.
 GO:0009887; P:organ morphogenesis; TAS:ProtInc.
 GO:0043065; P:positive regulation of apoptotic process; IEA:Compara.
 GO:0030335; P:positive regulation of cell migration; IEA:Compara.
 GO:0008284; P:positive regulation of cell proliferation; IEA:Compara.
 GO:0007601; P:visual perception; TAS:ProtInc. 
Interpro
 IPR016186; C-type_lectin-like.
 IPR016187; C-type_lectin_fold.
 IPR026917; COL18A1.
 IPR008160; Collagen.
 IPR010515; Collagenase_NC10/endostatin.
 IPR008985; ConA-like_lec_gl_sf.
 IPR010363; DUF959_COL18_N.
 IPR020067; Frizzled_dom.
 IPR001791; Laminin_G. 
Pfam
 PF01391; Collagen
 PF06121; DUF959
 PF06482; Endostatin
 PF01392; Fz 
SMART
 SM00063; FRI
 SM00282; LamG
 SM00210; TSPN 
PROSITE
 PS50038; FZ
 PS50025; LAM_G_DOMAIN 
PRINTS