CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-030028
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Basement membrane-specific heparan sulfate proteoglycan core protein 
Protein Synonyms/Alias
  
Gene Name
 Hspg2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
106QLEDASAKEFREVSEubiquitination[1]
3164MLKIASVKPSDAGTYacetylation[2]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
 [2] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 4375 AA 
Protein Sequence
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL 60
ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL EDASAKEFRE VSEAVVEKLE 120
PEYRKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS 180
PWGFKFRRLG TVPQFPRVCT ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEPVPE 240
LSSSTPAVGK VSPLPLWPEA ATTPPPPVTH GPQFLLPSVP GPSACGPQEA SCHSGHCIPR 300
DYLCDGQEDC RDGSDELGCA SPPPCEPNEF ACENGHCALK LWRCDGDFDC EDRTDEANCS 360
VKQPGEVCGP THFQCVSTNR CIPASFHCDE ESDCPDRSDE FGCMPPQVVT PPQQSIQASR 420
GQTVTFTCVA TGVPTPIINW RLNWGHIPAH PRVTMTSEGG RGTLIIRDVK EADQGAYTCE 480
AMNSRGMVFG IPDGVLELVP QRGPCPDGHF YLEDSASCLP CFCFGVTNVC QSSLRFRDQI 540
RLSFDQPNDF KGVNVTMPSQ PGVPPLSSTQ LQIDPALQEF QLVDLSRRFL VHDAFWALPK 600
QFLGNKVDSY GGFLRYKVRY ELARGMLEPV QKPDVILVGA GYRLHSRGHT PTHPGTLNQR 660
QVQLSEEHWV HESGRPVQRA EMLQALASLE AVLLQTVYNT KMASVGLSDI VMDTTVTHTT 720
IHGRAHSVEE CRCPIGYSGL SCESCDAHFT RVPGGPYLGT CSGCNCNGHA SSCDPVYGHC 780
LNCQHNTEGP QCDKCKPGFF GDATKATATA CRPCPCPYID ASRRFSDTCF LDTDGQATCD 840
ACAPGYTGRR CESCAPGYEG NPIQPGGKCR PTTQEIVRCD ERGSLGTSGE TCRCKNNVVG 900
RLCNECSDGS FHLSKQNPDG CLKCFCMGVS RQCSSSSWSR AQVLGASEQP SQFSLSNAAG 960
THTTSEGVSS PAPGELSFSS FHNLLSEPYF WSLPASFRGD KVTSYGGELR FTVTQRPRPS 1020
SAPLHRQPLV VLQGNNIVLE HHASRDPSPG QPSNFIVPFQ EQAWQRPDGQ PATREHLLMA 1080
LAGIDALLIQ ASYTQQPAES RVSGISMDVA VPENTGQDSA REVEQCTCPP GYRGPSCQDC 1140
DTGYTRVPSG LYLGTCERCN CHGHSETCEP ETGACQSCQH HTEGASCEQC QPGYYGDAQR 1200
GTPQDCQPCP CYGAPAAGQA AHTCFLDTDG HPTCDSCSPG HSGRHCERCA PGYYGNPSQG 1260
QPCHRDGQVP EVLGCGCDPH GSISSQCDAA GQCQCKAQVE GRTCSHCRPH HFHLSASNPE 1320
GCLPCFCMGV TQQCASSSYS RQLISTHFAP GDFQGFALVN PQRNSQLTGG FTVEPVHDGA 1380
RLSFSNFAHL GQESFYWQLP EIYQGDKVAA YGGKLRYTLS YTAGPQGSPL LDPDIQITGN 1440
NIMLVASQPA LQGPERRSYE IIFREEFWRR PDGQPATREH LLMALADLDE LLVRATFSSV 1500
PRAASISAVS LEVAQPGPSS GPRALEVEEC RCPPGYVGLS CQDCAPGYTR TGSGLYLGQC 1560
ELCECNGHSD LCHPETGACS RCQHNTAGEF CELCATGYYG DATAGTPEDC QPCACPLTNP 1620
ENMFSRTCES LGAGGYRCTA CEPGYTGQYC EQCAPGYEGD PNVQGGRCQP LTKESLEVQI 1680
HPSRSVVPQG GPHSLRCQVS GSPPHYFYWS REDGRPLPSS AQQRHQGSEL HFPSVQPSDA 1740
GVYICTCRNL IHTSNSRAEL LVAEAPSKPI TVTVEEQRSQ SVRPGADVTF ICTAKSKSPA 1800
YTLVWTRLHN GKLPSRAMDF NGILTIRNVQ PSDAGTYVCT GSNMFAMDQG TATLHVQVSG 1860
TSTAPVASIH PPQLTVQPGQ QAEFRCSATG NPTPMLEWIG GPSGQLPAKA QIHNGILRLP 1920
AIEPSDQGQY LCRALSSAGQ HVARAMLQVH GGSGPRVQVS PERTQVHEGR TVRLYCRAAG 1980
VPSASITWRK EGGSLPPQAR SENTDIPTLL IPAITAADAG FYLCVATSPT GTAQARIQVV 2040
VLSVPVRIES SSPSVTEGQT LDLNCAVMGL TYTQVTWYKR GGSLPPHAQV HGSRLRLPQV 2100
SPADSGDYVC RVESDVGPKE ASIVVSVLHS PHSGPSYTPA TSITPPIRIE SSSSHVAEGQ 2160
TLDLNCVVPG QAQVTWRKRG GSLPARHQTH GSLLRLHQVS PADSGEYVCH VVLGSEHTET 2220
SVLVTIEPAE SIPAPGPAPP VRIEASSSTV TEGHMLDLNC VVAGQAHAQV TWYKRGGSLP 2280
ARHQVRGSRL YILQASPADA GEYVCRAGNG QEATITVTVT RNHGANLAYP PGSTSPIRIE 2340
SSSSHVAEGQ TLDLNCVVQG QAHAQVTWHK RGGSLPARHQ THGSLLRLHQ VSPVDSGEYV 2400
CRVEGGAVPL ESSVLVTIEP AGTAPGVIPP VRIESSSSHV SEGQSLDLNC LVSGQTHPQI 2460
SWHKRGGSLP ARHQVHGSRL RLLQVTPTDS GEYVCRVVSG SGTQEASILV TIQQTLSPSH 2520
SQSVVHPVRI ESSSPSLANG HTLDLNCLVA SLTPHTITWY KRGGSLPSRH QIVGSRLRIP 2580
QVTPADSGEY VCHVSNGAGS QETSLIVTIE SRGPSHVPSV SPPMRIETSS PTVTEGQTLD 2640
LNCVVVGRPQ ATITWYKRGG SLPFRHQAHG SRLRLHHMSV ADSGEYVCRA NNNIDAQETS 2700
IMISVSPSTN SPPAPASPAP IRIESSSSRV AEGQTLDLNC VVPGHAHAQV TWHKRGGSLP 2760
THHQTHGSRL RLYQVSSADS GEYVCSVLSS SGPLEASVLV SITPAAANVH IPGVVPPIRI 2820
ETSSSRVAEG QTLDLSCVVP GQAHAQVTWH KRGGSLPAGH QVHGHMLRLN RVSPADSGEY 2880
SCQVTGSSGT LEASVLVTIE ASEPSPIPAP GLAQPVYIES SSSHLTEGQT VDLKCVVPGQ 2940
AHAQVTWHKR GSSLPARHQT HGSLLRLYQL SPADSGEYVC QVAGSSHPEH EASFKLTVPS 3000
SQNSSFRLRS PVISIEPPSS TVQQGQDASF KCLIHEGATP IKVEWKIRDQ ELEDNVHISP 3060
NGSIITIVGT RPSNHGAYRC VASNVYGMAQ SVVNLSVHGP PTVSVLPEGP VHVKMGKDIT 3120
LECISSGEPR SSPRWTRLGI PVKLEPRMFG LMNSHAMLKI ASVKPSDAGT YVCQAQNALG 3180
TAQKQVELIV DTGTVAPGAP QVQVEESELT LEAGHTATLH CSATGNPPPT IHWSKLRAPL 3240
PWQHRIEGNT LVIPRVAQQD SGQYICNATN SAGHTEATVV LHVESPPYAT IIPEHTSAQP 3300
GNLVQLQCLA HGTPPLTYQW SLVGGVLPEK AVARNQVLRL EPTVPEDSGR YRCQVSNRVG 3360
SAEAFAQVLV QGSSSNLPDT SIPGGSTPTV QVTPQLETRN IGASVEFHCA VPNERGTHLR 3420
WLKEGGQLPP GHSVQDGVLR IQNLDQSCQG TYVCQAHGPW GQAQATAQLI VQALPSVLIN 3480
VRTSVHSVVV GHSVEFECLA LGDPKPQVTW SKVGGHLRPG IVQSGSIIRI AHVELADAGQ 3540
YRCAATNAAG TTQSHVLLLV QALPQISTPP EIRVPAGSAA VFPCMASGYP TPAITWSKVD 3600
GDLPPDSRLE NNMLMLPSVR PEDAGTYVCT ATNRQGKVKA FAYLQVPERV IPYFTQTPYS 3660
FLPLPTIKDA YRKFEIKITF RPDSADGMLL YNGQKRSPTN LANRQPDFIS FGLVGGRPEF 3720
RFDAGSGMAT IRHPTPLALG QFHTVTLLRS LTQGSLIVGN LAPVNGTSQG KFQGLDLNEE 3780
LYLGGYPDYG AIPKAGLSSG FVGCVRELRI QGEEVVFHDV NLTTHGISHC PTCQDRPCQN 3840
GGQCQDSESS SYTCVCPAGF TGSRCEHSQA LHCHPEACGP DATCVNRPDG RGYTCRCHLG 3900
RSGVRCEEGV TVTTPSMSGA GSYLALPALT NMHHELRLDV EFKPLEPNGI LLFSGGKSGP 3960
VEDFVSLAMV GGHLEFRYEL GSGLAVLRSH EPLTLGRWHR VSAERLNKDG SLRVDGGRPV 4020
LRSSPGKSQG LNLHTLLYLG GVEPSVQLSP ATNMSAHFHG CVGEVSVNGK RLDLTYSFLG 4080
SQGVGQCYDS SPCERQPCQN GATCMPAGEY EFQCLCQDGF KGDLCEHEEN PCQLHEPCLN 4140
GGTCRGARCL CLPGFSGPRC QQGAGYGVVE SDWHPEGSGG NDAPGQYGAY FYDNGFLGLP 4200
GNSFSRSLPE VPETIEFEVR TSTADGLLLW QGVVREASRS KDFISLGLQD GHLVFSYQLG 4260
SGEARLVSED PINDGEWHRI TALREGQRGS IQVDGEDLVT GRSPGPNVAV NTKDIIYIGG 4320
APDVATLTRG KFSSGITGCI KNLVLHTARP GAPPPQPLDL QHRAQAGANT RPCPS 4375 
Gene Ontology
 GO:0005605; C:basal lamina; IDA:MGI.
 GO:0007420; P:brain development; IMP:MGI.
 GO:0048738; P:cardiac muscle tissue development; IMP:MGI.
 GO:0060351; P:cartilage development involved in endochondral bone morphogenesis; IMP:MGI.
 GO:0002062; P:chondrocyte differentiation; IMP:MGI.
 GO:0048704; P:embryonic skeletal system morphogenesis; IMP:MGI.
 GO:0001958; P:endochondral ossification; IMP:MGI.
 GO:0030198; P:extracellular matrix organization; IMP:MGI.
 GO:0008104; P:protein localization; IMP:MGI. 
Interpro
 IPR008985; ConA-like_lec_gl_sf.
 IPR013320; ConA-like_subgrp.
 IPR000742; EG-like_dom.
 IPR013032; EGF-like_CS.
 IPR002049; EGF_laminin.
 IPR007110; Ig-like_dom.
 IPR013783; Ig-like_fold.
 IPR013098; Ig_I-set.
 IPR003599; Ig_sub.
 IPR003598; Ig_sub2.
 IPR018031; Laminin_B_subgr.
 IPR000034; Laminin_B_type_IV.
 IPR001791; Laminin_G.
 IPR023415; LDLR_class-A_CS.
 IPR002172; LDrepeatLR_classA_rpt.
 IPR000082; SEA_dom. 
Pfam
 PF00008; EGF
 PF12661; hEGF
 PF07679; I-set
 PF00052; Laminin_B
 PF00053; Laminin_EGF
 PF00054; Laminin_G_1
 PF02210; Laminin_G_2
 PF00057; Ldl_recept_a 
SMART
 SM00181; EGF
 SM00180; EGF_Lam
 SM00409; IG
 SM00408; IGc2
 SM00281; LamB
 SM00282; LamG
 SM00192; LDLa
 SM00200; SEA 
PROSITE
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3
 PS01248; EGF_LAM_1
 PS50027; EGF_LAM_2
 PS50835; IG_LIKE
 PS50025; LAM_G_DOMAIN
 PS51115; LAMININ_IVA
 PS01209; LDLRA_1
 PS50068; LDLRA_2
 PS50024; SEA 
PRINTS