CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-034769
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Procollagen, type VII, alpha 1 (Predicted) 
Protein Synonyms/Alias
 Protein Col7a1 
Gene Name
 Col7a1 
Gene Synonyms/Alias
 Col7a1_predicted; rCG_25407 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
2136GDAGEPGKRGPDGNPacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2588 AA 
Protein Sequence
MRLRLLVAAL CTAEILVGAP GVWAQPRDRV TCTRLYAADI VFLIDGSSSI GRSNFREVRG 60
FLEGLVLPFS GAANAQGVRF ATVQYSDDPQ TEFGLDTLGS GGDTIRAIRE LSYKGGNTRT 120
GAALLHVSDR VFLPHLTRPG IPKVCILITD GKSQDLVDTA AQKLKRQGVK LFAVGIKNAD 180
PEELKRIASQ PTSDFFFFVN DFSILRTLLP LISRRVCTTA GGVPVTLPSD DTPSGPRDLV 240
LSEPSSQSLR VQWTAASGPV TGYKIQYTPL TGLGQPLPSE RKEVNVPAGE TSMRLQGLRP 300
LTDYQVTVVA LYANSIGEAV SGTARTTAKE GLELSVQNIT SHSLLVAWRR VPGANGYRVT 360
WRDLSGGGAQ QQDLSPGQGS VFLDHLKPGT DYEVTVSALF GRSVGPAASL TTRTASSVEQ 420
TLRPIILSPT SILLSWNLVP EARGYRLEWH RESGLEPPQK VELLPDVTRY QLDGLQPGTE 480
YRLTLYTLLE GREVATPATV VPTGLEQLVG PVMNLQATEL PGQRLRVSWN PVLGATEYRF 540
TVRTPQGVER TLLLPGSQTT FDLDDVRAGI SYTVRVSARV GTHEGDASTL TIHRDQEAPL 600
IIPGLQVVAS DATRIRVAWG LVPGASGFRI SWRTGSGPES SQTLAPDSTV TDILGLQPGT 660
SYQVAVSALR GREEGPPVVI VARTDPLGPV RRVHLTQAGS SSVSITWTGV PGATGYRVSW 720
HSGHGPEKSQ LVSGDATVAE IDGLEPDTEY IVRVRTHVAG VDGAPASVVV RTAPEPVGSV 780
SKLQILNASS DVLRVTWVGV PGATSYRLAW GRSEGGPMRH QIVPGNKDSA EITGLEGGVS 840
YSVRVTALVG DREGAPVSIV VTTPPEVPAP LETLQVVQSG EHSLRLHWNR VPGAQGFRLH 900
WQPEGGQEQS LTLGPESNSY NLVGLEPATK YHVWLSVLGQ PGEAAPRKVT AYTETPHIPS 960
TGLRVVDTSV NSVTLTWTPV SGASSYILSW RPLRGTGQEV PGGPQTLPGN SSSHRVTGLE 1020
PGVPYVFSLT PIQSGVRGSE ITVTQKPACP HGQVDVVILL HATRDNAHNA EAVKRFLERL 1080
VSALGPLGPQ AAQVGLLSYS HRPSPLFPLN SSHDLGVILQ KIRDIPYVDP SGNNLGTAVT 1140
TAHRYLLAPN APGRRRQVPG VMVLLVDEPL REDILSPIRE AQASGLKVMM LGLVGADPEQ 1200
LRLLAPGMDP IQTFFAVDNG LDLDRAGSDL AVALCQAAVA IQPQLEPCAV PCPKGQKGEP 1260
GVTGPQGQAG PPGPPGLPGR TGAPGPQGAP GSTQAKGERG FPGPEGPPGS PGLPGVPGSP 1320
GVKGSPGWSG PRGDRGERGP QGPKGEPGEP GQVIGGGRPG LPGKKGDPGP SGPPGPHGPL 1380
GDPGPRGPPG LPGTSVKGDK GDRGERGPPG PGTGASEQGS PGLPGLPGSP GPQGPPGRTG 1440
EKGEKGDCED GGPGLPGQPG VPGEPGLRGA PGVTGPKGDR GLTGTPGEPG EKGERGPPGP 1500
VGPQGLPGAA GRPGVEGPEG PPGPPGRRGE KGEPGRPGDP ALGPGGAGAK GEKGDAGLPG 1560
PRGASGIKGE QGAPGLALPG DPGPKGDPGD RGPIGLTGRA GPTGDSGPPG EKGEPGRPGS 1620
PGPVGPRGRD GEAGEKGDEG APGEPGLPGK AGERGLRGAP GPRGPVGEKG NEGDPGEDGR 1680
NGTPGPSGPK GDRGEPGPPG LPGRLVDAAL ESRDKGEPGQ EGPRGPKGDP GPPGASGERG 1740
IDGLRGPPGP QGDPGVRGPA GDKGDRGSPG LDGRNGLDGK PGAPGPPGLH GASGKAGDPG 1800
RDGLPGLRGE HGPPGPPGPP GVPGKPGDDG KPGLNGKNGE PGDPGEDGRK GEKGDSGVPG 1860
REGPDGPKGE RGAPGNPGLQ GPPGLPGQVG PPGQGFPGVP GVTGPKGDRG ETGSKGEQGL 1920
PGERGLRGEP GSLPNAERFL ETAGIKVSAL REIVDTWGES SGSFLLVPER RQGPKGDPGD 1980
PGPPGKEGSI GLPGERGLKG ERGDPGPQGP PGLALGERGP PGPSGLAGEP GKPGIPGLPG 2040
RAGAAGEAGR PGERGERGEK GERGEQGRDG HPGLPGPPGP PGPKVAIEEL GPGPAREQGP 2100
PGLKGAKGEP GSDGVPGPKG DRGVPGIKGD AGEPGKRGPD GNPGLPGERG VSGPEGKPGL 2160
QGPRGTPGPV GSHGDPGPPG APGLAGPAGP QGPSGLKGEP GETGPPGRGL PGPTGAVGLP 2220
GPPGPSGLVG PQGSPGLPGQ VGETGKPGPP GRDGSSGKDG ERGGPGVPGL PGLPGPVGPK 2280
GEPGPVGAPG QVVVGPPGAK GEKGAPGDLA GALLGEPGAK GDRGLPGPRG EKGEAGHAGE 2340
PGDPGEDGQK GAPGLKGLKG EPGIGVQGPP GPSGPPGMKG DLGPPGAPGA PGIVGFPGQP 2400
GPRGETGQPG PVGERGLAGP PGREGAPGPL GPPGPPGSVG APGASGFKGD KGDSGAGLPG 2460
PRGERGEPGL RGEDGHPGQE GPRGLMGPPG SRGERGEKGD PGAAGLKGDK GDSAVIEGPA 2520
GPRGAKGDMG ERGPRGIDGD QGPRGESGDP GDKGSKGEPG DKGSAGSTGV RGLTGPKARR 2580
ARCCRDPR 2588 
Gene Ontology
  
Interpro
 IPR008160; Collagen.
 IPR003961; Fibronectin_type3.
 IPR013783; Ig-like_fold.
 IPR002035; VWF_A. 
Pfam
 PF01391; Collagen
 PF00041; fn3
 PF00092; VWA 
SMART
 SM00060; FN3
 SM00327; VWA 
PROSITE
 PS50853; FN3
 PS50234; VWFA 
PRINTS