CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-039331 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Col24a1 
Protein Synonyms/Alias
  
Gene Name
 Col24a1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
755       GERGRTGKKGDKGQT  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1349 AA 
Protein Sequence
MNEEERISPL FPEFNITQHE EAAGLPSPKK ASSSIAQTNQ DTVTNLKKAI TANLHTNELM 60
EMERILNSTL YKVIHGPSVD NHLELRKEGE FYPDATNPME NSYEPQPYEY YYYEDYNAML 120
DLEYLRGPKG DTGPPGPPGP MGIPGPSGKR GPRGIPGPHG NPGLPGLPGP KGPKGDPGLS 180
PGQAASGEKG DPGLLGLVGP PGLQGEKGLK GHTGLPGLRG EQGIPGLAGN VGSPGYPGRQ 240
GLAGPEGNPG SKGVRGFIGS PGEAGQLGPE GERGTPGVRG KKGPKGRQGF PGDFGDRGPA 300
GLDGSPGLVG GIGPPGFPGI RGNVGPAGPV GPPGVPGPMG LSGSRGPPGI KGDKGEQGVA 360
GEPGEPGYPG DKGAIGSPGP PGIRGKSGPS GQPGDPGPQG PTGPPGPEGF PGDIGIPGQN 420
GPEGPKGHLG SRGPPGPPGL KGTQGEEGPI GPFGELGSRG KPGRKGYMGE PGPEGLKGET 480
GDQGDIGKIG ETGPVGLPGE VGITGSIGEK GERGSPGPLG PQGEKGVMGY PGPPGAPGPM 540
GPIGLPGLVG ARGAPGSPGP KGQRGPRGPD GLAGDQGGHG AKGEKGNQGK RGLPGRAGKT 600
GSPGERGVQG KPGLQGLPGS SGDMGPAGEP GPRGLPGDAG LPGEMGVEGP PGTEGDSGLQ 660
GEPGAKGDVG PTGSEGATGE PGPRGEPGAP GEEGLQGKDG LKGAPGGSGL PGEDGEKGEM 720
GLPGTAGPVG RPGQMGPPGS EGIVGTPGER GRTGKKGDKG QTGPVGEAGS RGSPGRVGDS 780
GPKGARGTRG AVGPLGLMGP EGEPGIPGYR GLEGQPGPSG LPGPKGEKGY PGEDSTVLGP 840
PGPRGEPGPM GERGERGEHG EEGYKGHMGV PGLRGAAGQQ GPPGEPGDQG EQGLKGERGS 900
EGPQGKKGVP GPSGKPGIPG LPGLPGPKGL QGYHGVDGLS GYPGKPGLLG KQGLPVSKGQ 960
RGVTLSFIRN GRKDEDDIFN IPNALCYSGE QGLPGQPGIP GQRGQRGTQG DQGRRGEPGL 1020
KGQPGEHGNQ GLTGFQGFPG PRGPEGDAGI IGIVGPKGPV GQRGNTGPLG REGIIGPTGG 1080
TGPRGEKGFR GETGPQGPRG QPGPPGPPGA PGPRRQMDIN AAIGALIESN SAQQMESYQN 1140
TEVTFLSQGT EISKTLAYLS SLLSSIKNPL GTRENPARIC KDLLSCQYEV SDGKYWIDPN 1200
LGCSSDAFEV FCNFSAGGQT CVSPVSVTKL EFGVGKVQMN FLHLLSSEAT HIITLHCLNT 1260
PRRTGTPADG PELPISFKGW NGQIFEENTL LEPQVLSDDC KIQDGSWHKA KFLFHTQNPN 1320
QLPVTEVQNL PHLRTEQKHY IESSSVCFL 1349 
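The numbered sequence block can be cross-checked against the Lysine Modification entry. The following is a minimal sketch, not part of the CPLM record: it assumes each sequence line ends with the position of its last residue (as in the block above) and that the listed peptide is the modified lysine with 7 flanking residues on each side. The helper names `parse_block_line` and `window` are hypothetical.

```python
# One 60-residue line copied from the sequence block (ends at residue 780).
BLOCK_LINE = "GLPGTAGPVG RPGQMGPPGS EGIVGTPGER GRTGKKGDKG QTGPVGEAGS RGSPGRVGDS 780"


def parse_block_line(line):
    """Return (start_position, residues) for one numbered sequence line."""
    *groups, end = line.split()
    residues = "".join(groups)
    # The trailing number is the 1-based position of the last residue.
    start = int(end) - len(residues) + 1
    return start, residues


def window(residues, start, position, flank=7):
    """Return residues position-flank .. position+flank (1-based positions)."""
    i = position - start
    if i - flank < 0 or i + flank >= len(residues):
        raise ValueError("window extends past the supplied fragment")
    return residues[i - flank : i + flank + 1]


start, residues = parse_block_line(BLOCK_LINE)
site = 755  # position listed under Lysine Modification
residue_at_site = residues[site - start]
peptide = window(residues, start, site)
print(start, residue_at_site, peptide)
```

Under these assumptions, the residue at the listed position should be a lysine (K) and the window should match the peptide given in the modification table.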
Gene Ontology
 GO:0005581; C:collagen; IEA:InterPro.
 GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro. 
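The Gene Ontology lines follow a three-field, semicolon-separated layout (GO ID; aspect code and term; evidence code and source). A minimal sketch of splitting such a line, assuming the single-letter aspect codes C, F and P stand for cellular component, molecular function and biological process; `GoAnnotation` and `parse_go_line` are hypothetical names, not part of any CPLM tooling:

```python
from typing import NamedTuple


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


# Assumed meaning of the single-letter aspect codes.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Split one 'GO:id; X:term; EVIDENCE:source.' line into its fields."""
    fields = [f.strip() for f in line.strip().rstrip(".").split(";")]
    go_id, aspect_term, evidence_source = fields
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect], term, evidence, source)


annotations = [
    parse_go_line(" GO:0005581; C:collagen; IEA:InterPro."),
    parse_go_line(" GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro. "),
]
for ann in annotations:
    print(ann.go_id, ann.aspect, ann.term, ann.evidence, ann.source)
```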
Interpro
 IPR008160; Collagen.
 IPR000885; Fib_collagen_C. 
Pfam
 PF01410; COLFI
 PF01391; Collagen 
SMART
 SM00038; COLFI 
PROSITE
 PS51461; NC1_FIB 
PRINTS