CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038131
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Muc5ac 
Protein Synonyms/Alias
  
Gene Name
 Muc5ac 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
63VTIIPPLKTIPVVRAacetylation[1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; Reference proteome; Secreted. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2751 AA 
Protein Sequence
MGVGRRKLVP FWVLALALAC SQCTGQAQQD SLKSYHEHRS DVPHPQGHVG TPLNRVTIIP 60
PLKTIPVVRA FNPGHTRRVC STWGNFHYKT FDGQVFYFPG LCNYVFSAHC GDAYEDFNIQ 120
LRRVQESNTT TLSRVTMKLD GLVVELTKSS VLVNNHPVQL PFSQSGVLIE LSNGYLKVVA 180
RLGLLFVWNE DDSLLLELDT KYTNKTCGLC GDFNGSPKSN EFLSNNVRLT PLEFGNLQKM 240
DGPTEQCQDP LPVPQKNCSA RSGICEMILK GELFSGCAAL VDISSYVEAC RQDVCLCESL 300
DPSDCICHTL AEYSRQCAHA GGQPQDWRGP NLCSQTCPLN MQHQECGSPC VDTCSNPQHS 360
QVCEDHCIAG CFCPEGMVLD DINQMGCVPV SQCACLYNGT LYAPGTNYST DCTNTCSGGQ 420
WSCQDIPCAG TCSVMGGSHM STFDGRQYTV HGDCTYVLSK PCDSNAFTVL VELRKCGLTE 480
SETCLKTVTL NLGGGQTEIM VKATGEVFVN QIYTQLPVST ANATFFRPST FFIVGETNLG 540
LQLEIQLSPI MQTSVRLKPG LRGLTCGLCG NFNSMQADDF QTISGVVEGT AAAFFNTFKT 600
QAACPNVKNI FQDPCSLSVE NEKYAQHWCS LLTNASGPFS QCHATVNPST FFSNCMYDTC 660
NCEKSEDCMC AALSSYVRAC AAKGVLLSDW RDGICTKPTI TCPKSMTYQY HISTCQPTCR 720
ALNEKDVTCH VSFIPVDGCT CPKGTFLDDL GKCVQATSCP CYYKGSTVPN GESVQDSGAI 780
CTCTQGALTC IGGPAPTPVC DAPMIYFDCH NATPGDTGAG CQKSCHTLDM TCYSSECVPG 840
CVCPNGLVAD GNGGCVVTED CPCVHNEATY RPGETIQVGC NNCTCENRMW QCTDKPCLAT 900
CAVYGDGHYI TFDGQRYSFN GDCEYTLLQD NCGGNGSSQD AFRVITENIP CGTTGTTCSK 960
SIKIFLGNYE LKLSDSKMEV VQKDVGQEPP YFVHQMGNYL VVETDIGLVL LWDKKTSIFL 1020
RLSPEFKGRV CGLCGNFDDN AINDFTTRSQ SVVSDMLEFG NSWKLSPSCP DVLVPKDPCT 1080
ANPYRKSWAQ KQCSIINSET FSACHAHVEP AKYYEACVND ACACDSGGDC ECFCTTVAAY 1140
AQACHEVGVC VSWRTPDICP LFCDYYNPEG QCEWHYQPCG APCMRTCQNP TGQCLQDLRG 1200
LEGCYPKCPP TAPIFDEGTM QCVSNCTVTF PCRVNGKLYR PGASVPSDKN CDSCICTESG 1260
VRCTHNAGAC VCTYNGQQFH PGEIIYHTTD GIGGCISAHC RANGTIERSV DTCNSTTPTP 1320
PTTFSFSTPP VMTSMQPSST HSSPTPSVGS SGASSKAAST TSSILSVKSP VTAPMTMSTS 1380
ASAVTTSGCR EECLWSPWMD VSRPGRGIDS GDFDTLENLR AHGYPICQVP KAVECRAEAS 1440
PGVPLPELQQ HLECSTTVGL ICYNSDQLSG LCDNYQIKVQ CCTPVSCPTS QTTHVISSSR 1500
TTNLDNTTSS VPVTSTEHPY SSTVTSGSST HTPGLSPSSS VPSSPTPASS TPAPVSSTTV 1560
KTTLPITSPT PEPTPAISSV SISTSGSTMP SSETTHECKQ ELCNWTNWLD GSYPGSGRNS 1620
GDFDTFVNLR SKGYKFCEKP RNVECRAQFF PNTPLEELGQ NVTCSREEGL ICLNKNQLPP 1680
MCYNYEIRIE CCTVVNNCST ASVTTHPTSH GVSTKTETNW TTHVYSSPTK DTSSHSATID 1740
TKTWTSGISH TTTQPVTTHC QLQCNWTKWF DTDFPVPGPH GGDLETYSNI ERSGERLCHR 1800
EEITQLQCRA KNYPEREMED LGQVVKCDPS VGLVCNNRDQ GGDSGMCLNY EVRLLCCHIP 1860
EDCPRTDQTS PVTLSHKPSS AVVSPSSVSP SLSTSHRVHS TTPCFCSVSG QLYPLGSIIY 1920
NQTDLDGHCY YAMCSQDCQV VKRVSQDCPS TMPPPATTLS TSTTPPVTGR DRCNVFPPRL 1980
RGETWPMPNC SQATCEGNNV ISLSPRQCPE LNEPSCANGY PPLKVDDQDG CCQHYQCQCV 2040
CSGWGDPHYI TFDGTYYTFL DNCTYVLVQQ IVPVFGYFRV LIDNYYCDVG DSVSCPQSII 2100
VEYHQDRVVL TRRPVSGVMT NQIIFNNKVV SPGFQQNGIV TSRVGIKMYV TIQEIGVRVM 2160
FSGLIFSVEV PFNLFANNTE GQCGTCTNDK KDECRLPGGS IASSCSEMSL HWKVPNQPSC 2220
QGPPPTPTSV VPRPSPTPCP PSPLCELILS NTFKLCHDVI PPLQFYQGCL FDYCHMLDLE 2280
VVCSGLELYA SLCAAQGVCI PWRSQTNNTC SFTCPDNQVY QPCGPSNPHY CYRDDSISPS 2340
LTLQEAGPKT EGCFCPDSTT LFSTNDSICV PSCQWCLGPR GEPVEPGHTI SIDCQDCICK 2400
EATLTCQKKA CPQPTCPEPG FVPVPVALEA GQCCPQFSCA CNSSHCPPPL HCPKNSSLIV 2460
TYEEGACCPT QNCSSQKGCE VNGTLYQPGD VVSSSLCERC LCEVSSNPLS DVFMVSCETE 2520
LCNTQCPKGS EYQAMPGQCC GKCIPKTCPF KNNSGSTYFY QPGELWAEPG NPCVTHKCEK 2580
FQDVLMVVTM KTECPKINCP QGQAQLREDG CCYDCPLPNQ QKCTVHQRQQ IIRQQNCSSE 2640
GPVSISYCQG NCGDSISMYS LEANKVEHTC ECCQELQTSQ RNVTLRCDDG SSQTFSYTQV 2700
EKCGCLGQQC HALGDTSHAE SSEQEFKSKE SEEHGQQLAF RVSEDMLGPF Q 2751 
Gene Ontology
 GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
 GO:0043206; P:extracellular fibril organization; ISO:MGI. 
Interpro
 IPR006207; Cys_knot_C.
 IPR002919; TIL_dom.
 IPR014853; Unchr_dom_Cys-rich.
 IPR006552; VWC_out.
 IPR001007; VWF_C.
 IPR001846; VWF_type-D.
 IPR025155; WxxW_domain. 
Pfam
 PF08742; C8
 PF13330; Mucin2_WxxW
 PF01826; TIL
 PF00094; VWD 
SMART
 SM00832; C8
 SM00041; CT
 SM00214; VWC
 SM00215; VWC_out
 SM00216; VWD 
PROSITE
 PS01185; CTCK_1
 PS01225; CTCK_2
 PS01208; VWFC_1
 PS50184; VWFC_2
 PS51233; VWFD 
PRINTS