CPLM 1.0 - Compendium of Protein Lysine Modification
Tag / Content
CPLM ID
 CPLM-038998 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Procollagen, type XVIII, alpha 1, isoform CRA_a 
Protein Synonyms/Alias
 Protein Col18a1 
Gene Name
 Col18a1 
Gene Synonyms/Alias
 rCG_60896 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
207       AGAADPDKFQGMISE  acetylation  [1]
1005      TYQTMLDKIREVPEG  acetylation  [1]
1202      SVPIVNLKDEVLSPS  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Collagen; Complete proteome; Disulfide bond; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1311 AA 
Protein Sequence
MAPRWHLLDM LTSLVLLLVA RVSWAEPENV AEDVGLLQLL GDPLPQKISQ VEDPHVGLAY 60
VFGPYSKSSQ MAQYHFPKLF FRDFSLLFEV RPTTEAAGVL FAITDAAQVV VSLGVKLSEV 120
RDGQQNISLL YTEPGASQTQ TGASFRLPAF VGQWTHFALS VDGSSVALYV DCEEFQRVPF 180
ARSPHGLELE RGAGLFVGQA GAADPDKFQG MISELRVRKT PRVSPVHCLD EEDDDDDRAS 240
GDFGSGLEES SNLHRQETYL RPGLPQPPPV TSPPLAGGSA TEDSRTEEKE EEATVDSKGA 300
DTLPVTDSSG VWDGDVQNPG GGLIKGGLKG QKGEPGAQGP PGPAGPQGPA GPAVQSPSSQ 360
PVPGAQGPPG PQGPPGKDGI PGRDGEPGDP GEDGRPGDTG PQGFPGTPGD VGPKGEKGDP 420
GIGPRGPPGP PGPPGPSFRQ DKLTFIDMEG SGGFSGDLES LRGPRGFPGP PGPPGVPGLP 480
GEPGRFGVNS SYAPGPAGLP GVPGKEGPPG FPGPPGPPGK EGPPGVAGQK GSVGDAGSPG 540
PKGSKGDLGP IGMPGKSGLP GLPGPVGPPG PPGPPGPPGP GFAAGFDDME GSGTPLWSTA 600
RSSDGLQGDP GVTGPPGAKG EVGADGVQGI PGLPGREGVA GPPGPKGEKG TQGEKGNPGK 660
DGVGRPGLPG PPGPPGPVIY VSNEDRAVVS TPGPEGKPGY AGFPGPAGPK GDLGSKGEQG 720
LPGPKGEKGE PGSIFSPDGT ALGQAQKGAK GEPGFRGPPG PYGRPGYKGE IGFPGRPGRP 780
GTNGLKGEKG EPGEASLGFS MRGLPGPPGP PGPPGPPGVP VYDSNAFVES GRPGLPGQQG 840
VQGPPGPKGD KGEVGPPGPP GQFPIDLFHL EAEMKGDKGD RGDAGRKGER GEPGAPGGGF 900
FSSSVPGPPG PPGYPGIPGP KGESIRGPPG PPGPQGPPGI GYEGRQGPPG PPGPPGPPSF 960
PGPHRQTVSV PGPPGPPGPP GPPGAMGASA GQVRIWATYQ TMLDKIREVP EGWLIFVAER 1020
EELYVRVRNG FRKVLLEART ALPHGTDNEV AALQPPLVQL HEGSSYTRRE HSYPTARPWR 1080
ADDILANPPR LPDRQPYPGV PHHHHHHHHH HSSHEHRPPA HPSPSPAHTH QDFHPVLHLV 1140
ALNTPLSGGM RGIRGADFQC FQQARAVGLS GTFRAFLSSR LQDLYSIVRR ADRSSVPIVN 1200
LKDEVLSPSW DTLFSGSQGQ LHSGARIFSF DGRDVLRHPA WPQKSVWHGS DPSGRRLMES 1260
YCETWRTEAT GVTGQASSLL SGRLLEQKAE SCHNSYIVLC IENSFMTSFS K 1311 
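
The positions listed under Lysine Modification are 1-based indices into this sequence. In this record each peptide appears to be a 15-residue window centered on the modified lysine; that is an observation from the entries here, not a documented CPLM rule. The minimal Python sketch below checks that reading for position 207, using only the first four lines of the sequence block (residues 1-240). The helper names `parse_sequence` and `site_window` and the default `flank` of 7 are assumptions introduced for illustration.

```python
import re

# First four lines of the record's "Protein Sequence" block (residues 1-240),
# copied verbatim, including the spacing and the running residue counts.
SEQUENCE_BLOCK = """
MAPRWHLLDM LTSLVLLLVA RVSWAEPENV AEDVGLLQLL GDPLPQKISQ VEDPHVGLAY 60
VFGPYSKSSQ MAQYHFPKLF FRDFSLLFEV RPTTEAAGVL FAITDAAQVV VSLGVKLSEV 120
RDGQQNISLL YTEPGASQTQ TGASFRLPAF VGQWTHFALS VDGSSVALYV DCEEFQRVPF 180
ARSPHGLELE RGAGLFVGQA GAADPDKFQG MISELRVRKT PRVSPVHCLD EEDDDDDRAS 240
"""


def parse_sequence(block: str) -> str:
    """Strip whitespace and running residue counts, leaving bare residues."""
    return re.sub(r"[\s\d]+", "", block)


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues around a 1-based position, clipped at the termini."""
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


sequence = parse_sequence(SEQUENCE_BLOCK)
position = 207                 # first entry in the Lysine Modification table
peptide = "AGAADPDKFQGMISE"    # peptide reported for that entry

print(sequence[position - 1])                      # residue at the reported site
print(site_window(sequence, position) == peptide)  # does the window match?
```

The same check can be run for positions 1005 and 1202 by pasting the full 1311-residue block into `SEQUENCE_BLOCK`.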
Gene Ontology
 GO:0030938; C:collagen type XVIII; TAS:RGD.
 GO:0005615; C:extracellular space; IDA:RGD.
 GO:0005198; F:structural molecule activity; IEA:InterPro.
 GO:0007155; P:cell adhesion; IEA:InterPro.
 GO:0042493; P:response to drug; IEP:RGD.
 GO:0051599; P:response to hydrostatic pressure; IEP:RGD. 
Interpro
 IPR016186; C-type_lectin-like.
 IPR016187; C-type_lectin_fold.
 IPR026917; COL18A1.
 IPR008160; Collagen.
 IPR010515; Collagenase_NC10/endostatin.
 IPR008985; ConA-like_lec_gl_sf.
 IPR013320; ConA-like_subgrp.
 IPR001791; Laminin_G. 
Pfam
 PF01391; Collagen
 PF06482; Endostatin 
SMART
 SM00282; LamG
 SM00210; TSPN 
PROSITE
  
PRINTS