CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-013806
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Collagen alpha-1(XXVII) chain 
Protein Synonyms/Alias
  
Gene Name
 Col27a1 
Gene Synonyms/Alias
 Kiaa1870 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
82FIFTQRAKLQAPTANubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
 Plays a role during the calcification of cartilage and the transition of cartilage to bone (By similarity). 
Sequence Annotation
 DOMAIN 72 237 Laminin G-like.
 DOMAIN 610 664 Collagen-like 1.
 DOMAIN 673 732 Collagen-like 2.
 DOMAIN 742 801 Collagen-like 3.
 DOMAIN 817 876 Collagen-like 4.
 DOMAIN 877 936 Collagen-like 5.
 DOMAIN 937 996 Collagen-like 6.
 DOMAIN 997 1038 Collagen-like 7.
 DOMAIN 1039 1096 Collagen-like 8.
 DOMAIN 1117 1176 Collagen-like 9.
 DOMAIN 1177 1236 Collagen-like 10.
 DOMAIN 1240 1299 Collagen-like 11.
 DOMAIN 1325 1384 Collagen-like 12.
 DOMAIN 1424 1483 Collagen-like 13.
 DOMAIN 1484 1543 Collagen-like 14.
 DOMAIN 1544 1603 Collagen-like 15.
 DOMAIN 1645 1845 Fibrillar collagen NC1.
 REGION 610 1603 Triple-helical.
 METAL 1693 1693 Calcium (By similarity).
 METAL 1695 1695 Calcium (By similarity).
 METAL 1698 1698 Calcium; via carbonyl oxygen (By
 METAL 1701 1701 Calcium (By similarity).
 CARBOHYD 272 272 N-linked (GlcNAc...) (Potential).
 CARBOHYD 340 340 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1754 1754 N-linked (GlcNAc...) (Potential).
 DISULFID 1675 1707 By similarity.
 DISULFID 1681 1681 Interchain (with C-1285) (By similarity).
 DISULFID 1698 1698 Interchain (with C-1268) (By similarity).
 DISULFID 1716 1843 By similarity.
 DISULFID 1752 1796 By similarity.  
Keyword
 Calcium; Collagen; Complete proteome; Disulfide bond; Extracellular matrix; Glycoprotein; Metal-binding; Reference proteome; Repeat; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1845 AA 
Protein Sequence
MGTGFARGAR GTAASGPGGG FLFAWILVSF TCHLASTQGA PEDVDVLQRL GLSWTKAGGG 60
RSPTPPGVIP FPSGFIFTQR AKLQAPTANV LPTTLGRELA LVLSLCSHRV NHAFLFAIRS 120
RKHKLQLGLQ FLPGRTIIHL GPRQSVAFDL DVHDGRWHHL ALELRGRTVT MVTACGQHRV 180
PVPLPSRRDS MLDPQGSFLL GKVNPRAVQF EGALCQFSIH PVAQVAHNYC AHLRERCRQV 240
DTYSPQVGTL FPWDSGPAFA LHPEPALLGL GNLTRTPATL GARPVSRALA VTLAPAMPTK 300
PLRTVHPDVS EHSSSQTPLS PAKQSARKTP SPSSSASLAN STRVYRPAAA QPRQITTTSP 360
TKRSPTKPSV SPLSVTPMKS PHATQKTGVP SFTKPVPPTQ KPAPFTSYLA PSKASSPTVR 420
PVQKTFMTPR PPVPSPQPLR PTTGLSKKFT NPTVAKSKSK TTSWASKPVL ARSSVPKTLQ 480
QTVLSQSPVS YLGSQTLAPA LPPLGVGNPR TMPPTRDSAL TPAGSKKFTG RETSKKTRQK 540
SSPRKPEPLS PGKSARDASP RDLTTKPSRP STPALVLAPA YLLSSSPQPT SSSFPFFHLL 600
GPTPFPMLMG PPGSKGDCGL PGPPGLPGLP GSPGARGPRG PPGPYGNPGP PGPPGAKGQK 660
GDPGLSPGQA HDGAKGNMGL PGLSGNPGPL GRKGHKGHPG AAGHPGEQGQ PGPEGSPGAK 720
GYPGRQGFPG PVGDPGPKGS RGYIGLPGLF GLPGSDGERG LPGVPGKRGE MGRPGFPGDF 780
GERGPPGLDG NPGEIGLPGP PGVLGLIGDT GALGPVGYPG PKGMKGLMGG VGEPGLKGDK 840
GEQGVPGVSG DPGFQGDKGS HGLPGLPGGR GKPGPLGKAG DKGSLGFPGP PGPEGFPGDI 900
GPPGDNGPEG MKGKPGARGL PGPPGQLGPE GDEGPMGPPG VPGLEGQPGR KGFPGRPGLD 960
GSKGEPGDPG RPGPVGEQGL MGFIGLVGEP GIVGEKGDRG VMGPPGAPGP KGSMGHPGTP 1020
GGIGNPGEPG PWGPPGSRGL PGMRGAKGHR GPRGPDGPAG EQGSKGLKGR VGPRGRPGQP 1080
GQQGAAGERG HSGAKGFLGI PGPSGPPGAK GLPGEPGSQG PQGPVGPPGE MGPKGPPGAV 1140
GEPGLPGDSG MKGDLGPLGP PGEQGLIGQR GEPGLEGDHG PVGPDGLKGD RGDPGPDGEH 1200
GEKGQEGLKG EDGSPGPPGI TGVPGREGKP GKQGEKGQRG AKGAKGHQGY LGEMGIPGEP 1260
GPPGTPGPKG SRGTLGPTGA PGRMGAQGEP GLAGYNGHKG ITGPLGPPGP KGEKGDQGED 1320
GKTEGPPGPP GDRGPVGDRG DRGEPGDPGY PGQEGVQGLR GEPGQQGQPG HPGPRGRPGP 1380
KGSKGEEGPK GKPGKAGPSG RRGTQGLQGL PGPRGVVGRQ GPEGTAGSDG IPGRDGRPGY 1440
QGDQGNDGDP GPVGPAGRRG NPGVAGLPGA QGPPGFKGES GLPGQLGPPG KRGTEGGTGL 1500
PGNQGEPGSK GQPGDSGEMG FPGVAGLFGP KGPPGDIGFK GIQGPRGPPG LMGKEGIIGP 1560
PGMLGPSGLP GPKGDRGSRG DLGLQGPRGP PGPRGRPGPP GPPWHPIQFQ QDDLGAAFQT 1620
WMDAQGAVRS EGYSYPDQLA LDQGGEIFKT LHYLSNLIQS IKTPLGTKEN PARVCRDLMD 1680
CEQRMADGTY WVDPNLGCSS DTIEVSCNFT QGGQTCLKPI TASKAEFAVS RVQMNFLHLL 1740
SSEGTQHITI HCLNMTVWQE GPGRSSARQA VRFRAWNGQV FEAGGQFRPE VSMDGCKVHD 1800
GRWHQTLFTF RTQDPQQLPI VSVDNLPPVS SGKQYRLEVG PACFL 1845 
Gene Ontology
 GO:0005583; C:fibrillar collagen; IDA:MGI.
 GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. 
Interpro
 IPR008160; Collagen.
 IPR008985; ConA-like_lec_gl_sf.
 IPR000885; Fib_collagen_C.
 IPR001791; Laminin_G. 
Pfam
 PF01410; COLFI
 PF01391; Collagen 
SMART
 SM00038; COLFI
 SM00210; TSPN 
PROSITE
 PS50025; LAM_G_DOMAIN
 PS51461; NC1_FIB 
PRINTS