Tag | Content |
---|---|
CPLM ID | CPLM-001965 |
UniProt Accession | |
Genbank Protein ID | |
Genbank Nucleotide ID | |
Protein Name | Collagen alpha-2(I) chain |
Protein Synonyms/Alias | Alpha-2 type I collagen |
Gene Name | Col1a2 |
Gene Synonyms/Alias | |
Created Date | July 27, 2013 |
Organism | Rattus norvegicus (Rat) |
NCBI Taxa ID | 10116 |
Lysine Modification | Position | Peptide | Type | References |
---|---|---|---|---|
 | 549 | PQGVQGGKGEQGPAG | glycation | [1] |
 | 575 | GTAGEVGKPGERGLP | glycation | [1] |
 | 1020 | DKGEPGDKGARGLPG | glycation | [1] |
 | 1070 | GPSGPIGKDGRSGHP | glycation | [1] |
Reference | [1] Nonenzymatic glycation of type I collagen. The effects of aging on preferential glycation sites. Reiser KM, Amigable MA, Last JA. J Biol Chem. 1992 Dec 5;267(34):24207-16. [PMID: 1447170] |
Functional Description | Type I collagen is a member of group I collagen (fibrillar forming collagen). |
Sequence Annotation | DOMAIN 1139 1372 Fibrillar collagen NC1. MOTIF 783 785 Cell attachment site (Potential). MOTIF 828 830 Cell attachment site (Potential). MOTIF 1011 1013 Cell attachment site (Potential). METAL 1187 1187 Calcium (By similarity). METAL 1189 1189 Calcium (By similarity). METAL 1190 1190 Calcium; via carbonyl oxygen (By similarity). METAL 1192 1192 Calcium; via carbonyl oxygen (By similarity). METAL 1195 1195 Calcium (By similarity). MOD_RES 86 86 Pyrrolidone carboxylic acid (Probable). MOD_RES 90 90 Allysine. CARBOHYD 1273 1273 N-linked (GlcNAc...) (Potential). DISULFID 1169 1201 By similarity. DISULFID 1209 1370 By similarity. DISULFID 1278 1323 By similarity. |
Keyword | 3D-structure; Calcium; Collagen; Complete proteome; Direct protein sequencing; Disulfide bond; Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; Pyrrolidone carboxylic acid; Reference proteome; Repeat; Secreted; Signal. |
Sequence Source | UniProt (SWISSPROT/TrEMBL); GenBank; EMBL |
Protein Length | 1372 AA |
Protein Sequence | MLSFVDTRTL LLLAVTSCLA TCQSLQMGSV RKGPTGDRGP RGQRGPAGPR GRDGVDGPVG 60 PPGPPGAPGP PGPPGPPGLT GNFAAQYSDK GVSAGPGPMG LMGPRGPPGA VGAPGPQGFQ 120 GPAGEPGEPG QTGPAGSRGP AGPPGKAGED GHPGKPGRPG ERGVVGPQGA RGFPGTPGLP 180 GFKGIRGHNG LDGLKGQPGA QGVKGEPGAP GENGTPGQAG ARGLPGERGR VGAPGPAGAR 240 GSDGSVGPVG PAGPIGSAGP PGFPGAPGPK GELGPVGNPG PAGPAGPRGE AGLPGLSGPV 300 GPPGNPGANG LTGAKGATGL PGVAGAPGLP GPRGIPGPVG AAGATGPRGL VGEPGPAGSK 360 GETGNKGEPG SAGAQGPPGP SGEEGKRGSP GEPGSAGPAG PPGLRGSPGS RGLPGADGRA 420 GVMGPPGNRG STGPAGVRGP NGDAGRPGEP GLMGPRGLPG SPGNVGPAGK EGPVGLPGID 480 GRPGPIGPAG PRGEAGNIGF PGPKGPSGDP GKPGEKGHPG LAGARGAPGP DGNNGAQGPP 540 GPQGVQGGKG EQGPAGPPGF QGLPGPSGTA GEVGKPGERG LPGEFGLPGP AGPRGERGPP 600 GESGAAGPSG PIGIRGPSGA PGPDGNKGEA GAVGAPGSAG ASGPGGLPGE RGAAGIPGGK 660 GEKGETGLRG EIGNPGRDGA RGAPGAIGAP GPAGASGDRG EAGAAGPSGP AGPRGSPGER 720 GEVGPAGPNG FAGPAGSAGQ PGAKGEKGTK GPKGENGIVG PTGPVGAAGP SGPNGPPGPA 780 GSRGDGGPPG MTGFPGAAGR TGPPGPSGIT GPPGPPGAAG KEGIRGPRGD QGPVGRTGEI 840 GASGPPGFAG EKGPSGEPGT TGPPGTAGPQ GLLGAPGILG LPGSRGERGQ PGIAGALGEP 900 GPLGIAGPPG ARGPPGAVGS PGVNGAPGEA GRDGNPGSDG PPGRDGQPGH KGERGYPGNI 960 GPTGAAGAPG PHGSVGPAGK HGNRGEPGPA GSVGPVGAVG PRGPSGPQGI RGDKGEPGDK 1020 GARGLPGLKG HNGLQGLPGL AGLHGDQGAP GPVGPAGPRG PAGPSGPIGK DGRSGHPGPV 1080 GPAGVRGSQG SQGPAGPPGP PGPPGPPGVS GGGYDFGFEG GFYRADQPRS QPSLRPKDYE 1140 VDATLKSLNN QIETLLTPEG SRKNPARTCR DLRLSHPEWK SDYYWIDPNQ GCTMDAIKVY 1200 CDFSTGETCI QAQPVNTPAK NAYSRAQANK HVWLGETING GSQFEYNAEG VSSKEMATQL 1260 AFMRLLANRA SQNITYHCKN SIAYLDEETG RLNKAVILQG SNDVELVAEG NSRFTYTVLV 1320 DGCSKKTNEW DKTVIEYKTN KPSRLPFLDI APLDIGGTNQ EFRVEVGPVC FK 1372 |
Gene Ontology | GO:0005581; C:collagen; IEA:UniProtKB-KW. GO:0005576; C:extracellular region; TAS:Reactome. GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro. GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. |
Interpro | |
Pfam | |
SMART | |
PROSITE | |
PRINTS | |
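
The Position and Peptide columns of the Lysine Modification table can be cross-checked against the Protein Sequence field. The sketch below is illustrative only: it assumes that positions are 1-based and that each listed peptide is the modified lysine plus 7 flanking residues on either side (inferred from the 15-residue peptides, not stated in the record). The names `ROW_541_600`, `strip_layout`, and `peptide_window` are hypothetical helpers, and only the 541-600 row of the sequence is used to keep the example short.

```python
# Residues 541-600 of the Protein Sequence field, copied with the
# 10-residue block spacing used in the record.
ROW_541_600 = (
    "GPQGVQGGKG EQGPAGPPGF QGLPGPSGTA "
    "GEVGKPGERG LPGEFGLPGP AGPRGERGPP"
)
ROW_START = 541  # 1-based position of the first residue in the fragment


def strip_layout(fragment: str) -> str:
    """Remove the whitespace that separates the 10-residue blocks."""
    return "".join(fragment.split())


def peptide_window(fragment: str, first_position: int, site: int,
                   flank: int = 7) -> str:
    """Return the residues around a 1-based site.

    Assumption: the window is the site plus `flank` residues on each
    side. It is cut short if the fragment ends before the window does.
    """
    residues = strip_layout(fragment)
    index = site - first_position  # convert to a 0-based index
    if not 0 <= index < len(residues):
        raise ValueError(f"site {site} is outside the fragment")
    start = max(0, index - flank)
    return residues[start:index + flank + 1]


if __name__ == "__main__":
    for site in (549, 575):
        print(site, peptide_window(ROW_541_600, ROW_START, site))
```

Under these assumptions, the windows for positions 549 and 575 reproduce the peptides listed in the table (PQGVQGGKGEQGPAG and GTAGEVGKPGERGLP), with lysine as the central residue. Checking positions 1020 and 1070 the same way would need the corresponding rows of the sequence.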