Tag | Content |
---|
CPLM ID | CPLM-038998 |
UniProt Accession | |
Genbank Protein ID | |
Genbank Nucleotide ID | |
Protein Name | Procollagen, type XVIII, alpha 1, isoform CRA_a |
Protein Synonyms/Alias | Protein Col18a1 |
Gene Name | Col18a1 |
Gene Synonyms/Alias | rCG_60896 |
Created Date | July 27, 2013 |
Organism | Rattus norvegicus (Rat) |
NCBI Taxa ID | 10116 |
Lysine Modification | Position | Peptide | Type | References |
---|
207 | AGAADPDKFQGMISE | acetylation | [1] | 1005 | TYQTMLDKIREVPEG | acetylation | [1] | 1202 | SVPIVNLKDEVLSPS | acetylation | [1] |
|
Reference | [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns. Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV. Cell Rep. 2012 Aug 30;2(2):419-31. [ PMID: 22902405] |
Functional Description | |
Sequence Annotation | |
Keyword | Collagen; Complete proteome; Disulfide bond; Reference proteome. |
Sequence Source | UniProt (SWISSPROT/TrEMBL); GenBank; EMBL |
Protein Length | 1311 AA |
Protein Sequence | MAPRWHLLDM LTSLVLLLVA RVSWAEPENV AEDVGLLQLL GDPLPQKISQ VEDPHVGLAY 60 VFGPYSKSSQ MAQYHFPKLF FRDFSLLFEV RPTTEAAGVL FAITDAAQVV VSLGVKLSEV 120 RDGQQNISLL YTEPGASQTQ TGASFRLPAF VGQWTHFALS VDGSSVALYV DCEEFQRVPF 180 ARSPHGLELE RGAGLFVGQA GAADPDKFQG MISELRVRKT PRVSPVHCLD EEDDDDDRAS 240 GDFGSGLEES SNLHRQETYL RPGLPQPPPV TSPPLAGGSA TEDSRTEEKE EEATVDSKGA 300 DTLPVTDSSG VWDGDVQNPG GGLIKGGLKG QKGEPGAQGP PGPAGPQGPA GPAVQSPSSQ 360 PVPGAQGPPG PQGPPGKDGI PGRDGEPGDP GEDGRPGDTG PQGFPGTPGD VGPKGEKGDP 420 GIGPRGPPGP PGPPGPSFRQ DKLTFIDMEG SGGFSGDLES LRGPRGFPGP PGPPGVPGLP 480 GEPGRFGVNS SYAPGPAGLP GVPGKEGPPG FPGPPGPPGK EGPPGVAGQK GSVGDAGSPG 540 PKGSKGDLGP IGMPGKSGLP GLPGPVGPPG PPGPPGPPGP GFAAGFDDME GSGTPLWSTA 600 RSSDGLQGDP GVTGPPGAKG EVGADGVQGI PGLPGREGVA GPPGPKGEKG TQGEKGNPGK 660 DGVGRPGLPG PPGPPGPVIY VSNEDRAVVS TPGPEGKPGY AGFPGPAGPK GDLGSKGEQG 720 LPGPKGEKGE PGSIFSPDGT ALGQAQKGAK GEPGFRGPPG PYGRPGYKGE IGFPGRPGRP 780 GTNGLKGEKG EPGEASLGFS MRGLPGPPGP PGPPGPPGVP VYDSNAFVES GRPGLPGQQG 840 VQGPPGPKGD KGEVGPPGPP GQFPIDLFHL EAEMKGDKGD RGDAGRKGER GEPGAPGGGF 900 FSSSVPGPPG PPGYPGIPGP KGESIRGPPG PPGPQGPPGI GYEGRQGPPG PPGPPGPPSF 960 PGPHRQTVSV PGPPGPPGPP GPPGAMGASA GQVRIWATYQ TMLDKIREVP EGWLIFVAER 1020 EELYVRVRNG FRKVLLEART ALPHGTDNEV AALQPPLVQL HEGSSYTRRE HSYPTARPWR 1080 ADDILANPPR LPDRQPYPGV PHHHHHHHHH HSSHEHRPPA HPSPSPAHTH QDFHPVLHLV 1140 ALNTPLSGGM RGIRGADFQC FQQARAVGLS GTFRAFLSSR LQDLYSIVRR ADRSSVPIVN 1200 LKDEVLSPSW DTLFSGSQGQ LHSGARIFSF DGRDVLRHPA WPQKSVWHGS DPSGRRLMES 1260 YCETWRTEAT GVTGQASSLL SGRLLEQKAE SCHNSYIVLC IENSFMTSFS K 1311 |
Gene Ontology | |
Interpro | |
Pfam | |
SMART | |
PROSITE | |
PRINTS | |