CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-035384
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Col14a1 
Protein Synonyms/Alias
  
Gene Name
 Col14a1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
56        SWKAPRGKFGGYKLL  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1793 AA 
Protein Sequence
MTILQCKMRA WLILAFLAVA YFCTIVRGQV APPTRLRYNV ISHDSIQISW KAPRGKFGGY 60
KLLVAPASGG KTNQMNLQNG ATKAIIQGLL PEQNYTVQLI AYYKDKESKP AQGQFRIKDL 120
EKRKDPTKPR VKAVDKGNGS KSTSPEEIKF FCETPAIADI VILVDGSWSI GRFNFRLVRN 180
FLENLVTAFN VGSEKTRIGL AQYSGDPRIE WHLNAFNTKD EVIDAVRSLP YKGGNTLTGL 240
ALNFIFENSF KPEAGSRSGV SKIGILITDG KSQDDIIPPS RNLREAGVEL FAIGVKNADL 300
SELQEIASEP DSTHVYNVAE FDLMHTVVES LTRTVCSRVE EQDKEIKASA LATIGPPTEL 360
ITSEVTARSF MVNWTHSPGK VEKYRVVYYP TRGGKPEEVV ADGRVSSIVL KNLMSSTEYQ 420
IAVFAVSAHT ASEGLRGTET TLALPMASDL ELYDVTENSM RVKWDAVPGA TGYLILYAPL 480
TEGLAGDEKE MKIGETHTDI ELSGLFPNTE YTVTVYAMFG EEASDPATGQ ETTLPLTPPR 540
NLRISNVGSN SARLTWDPTS GKITGYRIVY TSADGTEINE VEVDPITTFP LKGLTPLTDY 600
SIAIFSIYEE GQSLPLVGEF TTEEVPAQQY LEIDEVKTDS FRVTWHPLSA EEGQHKLMWI 660
PVYGGKTQEV ALKEEQDSYV VEGLDPGTEY EVSLLAVLDD GSESEVVTAV GTTLDDFWTE 720
APTTVEPTSP VTSVLQTGIR NLVVDDEAAT SLRVTWDISD SNVEHFRVTY LTAQGDPKEE 780
VVMVPGVQNS LLLKNLLPDT EYKVTVTPIY TVGEGVSVSA PGKTLPTSGP QNLRVSEEWY 840
NRLRITWDPP SAPVKGYRIV YKPVSVPGQT LETFVGADVN TIVMTNLLSG MDYNVKIFAS 900
QAAGYSDALT GLVQTLFLGV TDLQANQVEM TSLCARWQIH RHATAYRIVL ESLQDTQAQE 960
STVGGGVNRH CFYGLQPDSE YKISVYTKLQ EIEGPSVSIM QKTQSLPTEP PTFPPTIPPA 1020
KEVCKAAKAD LVFMVDGSWS IGDDNFNKII NFLYSTVGAL DKIGADGTQV AMVQFTDDPR 1080
TEFKLDAYKT KETLLDAIRH ISYKGGNTKT GKAIKHVRDT LFTADSGTRR GIPKVIVVIT 1140
DGRSQDDVNK ISREMQADGY NIFAIGVADA DYSELVRIGS KPSSRHVFFV DDFDAFKKIE 1200
DELITFVCET ASATCPMVHK DGVDIAGFKM MEMFGLVEKD FSAVEGVSME PGTFNLFPCY 1260
QIHKDALVSQ PTKYLHPEGL PADYTMTFLF RILPDTPQEP FALWEILNKK SEPLVGIILD 1320
NGGKTLTYFS YDYKSDSFKV LYTGKDIETE FSSFSVLHVV VSKTLAKVVV DCKEVGQKAI 1380
NASANITSDG VEVLGRMVRS RGPNGNSAPF QLQMFDIVCS TSWASKDRCC ELPGLRDEES 1440
CPDLPRSCSC SETNEVALGP AGPPGGPGLR GPKGQQGEPG PKGPEGPRGE TGPAGPQGPP 1500
GPQGPSGLSI QGMPGMPGDK GDKGDAGLPG PQGVPGGVGS PGRDGSPGQR TQPIKSGTDK 1560
PGGSPGPIGI PGAPGVPGIA GSMGPQGALG PPGVPGAKGE RGERGDLQSQ AMVRAVARQV 1620
CEQLIQSHMA RYTAILNQIP SQSSSIRTIQ GPPGEPGRPG SPGTPGEQGP PGAPGFPGNA 1680
GVPGTPGERG LTGVKGDKGN PGIGTQGPRG PPGPAGPSGE SRPGSPGPPG SPGPRGPPGH 1740
LGVPGPQGPS GQPGYCDPSS CSAYGVGVSH PDQPEFTPVQ DEQEALDLWS AGI 1793 
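The acetylation site listed under Lysine Modification can be cross-checked against the sequence block. The following is a minimal Python sketch, not part of the CPLM record. It assumes the block format shown above (ten-residue groups followed by a running position counter), and the helper names `parse_sequence_block` and `site_window` are hypothetical. Only the first two rows of the sequence are included, which is enough to cover position 56.

```python
import re

# First two rows of the numbered sequence block, copied from this record.
SEQUENCE_BLOCK = """
MTILQCKMRA WLILAFLAVA YFCTIVRGQV APPTRLRYNV ISHDSIQISW KAPRGKFGGY 60
KLLVAPASGG KTNQMNLQNG ATKAIIQGLL PEQNYTVQLI AYYKDKESKP AQGQFRIKDL 120
"""


def parse_sequence_block(block: str) -> str:
    """Remove whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Z]", "", block)


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position."""
    start = max(0, position - 1 - flank)
    end = min(len(sequence), position + flank)
    return sequence[start:end]


sequence = parse_sequence_block(SEQUENCE_BLOCK)
residue = sequence[56 - 1]          # residue at 1-based position 56
window = site_window(sequence, 56)  # 15-residue window centered on the site

print(residue, window)
```

Under these assumptions, the residue at position 56 should be a lysine and the window should match the peptide given in the modification table.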
Gene Ontology
  
Interpro
 IPR008160; Collagen.
 IPR008985; ConA-like_lec_gl_sf.
 IPR003961; Fibronectin_type3.
 IPR013783; Ig-like_fold.
 IPR001791; Laminin_G.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00041; fn3
 PF00092; VWA 
SMART
 SM00060; FN3
 SM00210; TSPN
 SM00327; VWA 
PROSITE
 PS50853; FN3
 PS50234; VWFA 
PRINTS