CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-038251
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Basement membrane-specific heparan sulfate proteoglycan core protein 
Protein Synonyms/Alias
  
Gene Name
 Hspg2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide  Type  References
106  QLEDASAKEFREVSE  ubiquitination  [1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 4383 AA 
Protein Sequence
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL 60
ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL EDASAKEFRE VSEAVVEKLE 120
PEYRKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS 180
PWGFKFRRLG TVPQFPRVCT ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEPVPE 240
LSSSTPAVGK VSPLPLWPEA ATTPPPPVTH GPQFLLPSVP GPSACGPQEA SCHSGHCIPR 300
DYLCDGQEDC RDGSDELGCA SPPPCEPNEF ACENGHCALK LWRCDGDFDC EDRTDEANCS 360
VKQPGEVCGP THFQCVSTNR CIPASFHCDE ESDCPDRSDE FGCMPPQVVT PPQQSIQASR 420
GQTVTFTCVA TGVPTPIINW RLNWGHIPAH PRVTMTSEGG RGTLIIRDVK EADQGAYTCE 480
AMNSRGMVFG IPDGVLELVP QRGPCPDGHF YLEDSASCLP CFCFGVTNVC QSSLRFRDQI 540
RLSFDQPNDF KGVNVTMPSQ PGVPPLSSTQ LQIDPALQEF QLVDLSRRFL VHDAFWALPK 600
QFLGNKVDSY GGFLRYKVRY ELARGMLEPV QKPDVILVGA GYRLHSRGHT PTHPGTLNQR 660
QVQLSEEHWV HESGRPVQRA EMLQALASLE AVLLQTVYNT KMASVGLSDI VMDTTVTHTT 720
IHGRAHSVEE CRCPIGYSGL SCESCDAHFT RVPGGPYLGT CSGCNCNGHA SSCDPVYGHC 780
LNCQHNTEGP QCDKCKPGFF GDATKATATA CRPCPCPYID ASRRFSDTCF LDTDGQATCD 840
ACAPGYTGRR CESCAPGYEG NPIQPGGKCR PTTQEIVRCD ERGSLGTSGE TCRCKNNVVG 900
RLCNECSDGS FHLSKQNPDG CLKCFCMGVS RQCSSSSWSR AQVLGASEQP SQFSLSNAAG 960
THTTSEGVSS PAPGELSFSS FHNLLSEPYF WSLPASFRGD KVTSYGGELR FTVTQRPRPS 1020
SAPLHRQPLV VLQGNNIVLE HHASRDPSPG QPSNFIVPFQ EQAWQRPDGQ PATREHLLMA 1080
LAGIDALLIQ ASYTQQPAES RVSGISMDVA VPENTGQDSA REVEQCTCPP GYRGPSCQDC 1140
DTGYTRVPSG LYLGTCERCN CHGHSETCEP ETGACQSCQH HTEGASCEQC QPGYYGDAQR 1200
GTPQDCQPCP CYGAPAAGQA AHTCFLDTDG HPTCDSCSPG HSGRHCERCA PGYYGNPSQG 1260
QPCHRDGQVP EVLGCGCDPH GSISSQCDAA GQCQCKAQVE GRTCSHCRPH HFHLSASNPE 1320
GCLPCFCMGV TQQCASSSYS RQLISTHFAP GDFQGFALVN PQRNSQLTGG FTVEPVHDGA 1380
RLSFSNFAHL GQESFYWQLP EIYQGDKVAA YGGKLRYTLS YTAGPQGSPL LDPDIQITGN 1440
NIMLVASQPA LQGPERRSYE IIFREEFWRR PDGQPATREH LLMALADLDE LLVRATFSSV 1500
PRAASISAVS LEVAQPGPSS GPRALEVEEC RCPPGYVGLS CQDCAPGYTR TGSGLYLGQC 1560
ELCECNGHSD LCHPETGACS RCQHNTAGEF CELCATGYYG DATAGTPEDC QPCACPLTNP 1620
ENMFSRTCES LGAGGYRCTA CEPGYTGQYC EQCAPGYEGD PNVQGGRCQP LTKESLEVQI 1680
HPSRSVVPQG GPHSLRCQVS GSPPHYFYWS REDGRPLPSS AQQRHQGSEL HFPSVQPSDA 1740
GVYICTCRNL IHTSNSRAEL LVAEAPSKPI TVTVEEQRSQ SVRPGADVTF ICTAKSKSPA 1800
YTLVWTRLHN GKLPSRAMDF NGILTIRNVQ PSDAGTYVCT GSNMFAMDQG TATLHVQVSG 1860
TSTAPVASIH PPQLTVQPGQ QAEFRCSATG NPTPMLEWIG GPSGQLPAKA QIHNGILRLP 1920
AIEPSDQGQY LCRALSSAGQ HVARAMLQVH GGSGPRVQVS PERTQVHEGR TVRLYCRAAG 1980
VPSASITWRK EGGSLPPQAR SENTDIPTLL IPAITAADAG FYLCVATSPT GTAQARIQVV 2040
VLSASGANSV PVRIESSSPS VTEGQTLDLN CAVMGLTYTQ VTWYKRGGSL PPHAQVHGSR 2100
LRLPQVSPAD SGDYVCRVES DVGPKEASIV VSVLHSPHSG PSYTPATSIT PPIRIESSSS 2160
HVAEGQTLDL NCVVPGQAQV TWRKRGGSLP ARHQTHGSLL RLHQVSPADS GEYVCHVVLG 2220
SEHTETSVLV TIEPAESIPA PGPAPPVRIE ASSSTVTEGH MLDLNCVVAG QAHAQVTWYK 2280
RGGSLPARHQ VRGSRLYILQ ASPADAGEYV CRAGNGQEAT ITVTVTRNHG ANLAYPPGST 2340
SPIRIESSSS HVAEGQTLDL NCVVQGQAHA QVTWHKRGGS LPARHQTHGS LLRLHQVSPV 2400
DSGEYVCRVE GGAVPLESSV LVTIEPAGTA PGVIPPVRIE SSSSHVSEGQ SLDLNCLVSG 2460
QTHPQISWHK RGGSLPARHQ VHGSRLRLLQ VTPTDSGEYV CRVVSGSGTQ EASILVTIQQ 2520
TLSPSHSQSV VHPVRIESSS PSLANGHTLD LNCLVASLTP HTITWYKRGG SLPSRHQIVG 2580
SRLRIPQVTP ADSGEYVCHV SNGAGSQETS LIVTIESRGP SHVPSVSPPM RIETSSPTVT 2640
EGQTLDLNCV VVGRPQATIT WYKRGGSLPF RHQAHGSRLR LHHMSVADSG EYVCRANNNI 2700
DAQETSIMIS VSPSTNSPPA PASPAPIRIE SSSSRVAEGQ TLDLNCVVPG HAHAQVTWHK 2760
RGGSLPTHHQ THGSRLRLYQ VSSADSGEYV CSVLSSSGPL EASVLVSITP AAANVHIPGE 2820
VPFPPIRIET SSSRVAEGQT LDLSCVVPGQ AHAQVTWHKR GGSLPAGHQV HGHMLRLNRV 2880
SPADSGEYSC QVTGSSGTLE ASVLVTIEAS EPSPIPAPGL AQPVYIESSS SHLTEGQTVD 2940
LKCVVPGQAH AQVTWHKRGS SLPARHQTHG SLLRLYQLSP ADSGEYVCQV AGSSHPEHEA 3000
SFKLTVPSSQ NSSFRLRSPV ISIEPPSSTV QQGQDASFKC LIHEGATPIK VEWKIRDQEL 3060
EDNVHISPNG SIITIVGTRP SNHGAYRCVA SNVYGMAQSV VNLSVHGPPT VSVLPEGPVH 3120
VKMGKDITLE CISSGEPRSS PRWTRLGIPV KLEPRMFGLM NSHAMLKIAS VKPSDAGTYV 3180
CQAQNALGTA QKQVELIVDT GTVAPGAPQV QVEESELTLE AGHTATLHCS ATGNPPPTIH 3240
WSKLRAPLPW QHRIEGNTLV IPRVAQQDSG QYICNATNSA GHTEATVVLH VESPPYATII 3300
PEHTSAQPGN LVQLQCLAHG TPPLTYQWSL VGGVLPEKAV ARNQVLRLEP TVPEDSGRYR 3360
CQVSNRVGSA EAFAQVLVQG SSSNLPDTSI PGGSTPTVQV TPQLETRNIG ASVEFHCAVP 3420
NERGTHLRWL KEGGQLPPGH SVQDGVLRIQ NLDQSCQGTY VCQAHGPWGQ AQATAQLIVQ 3480
ALPSVLINVR TSVHSVVVGH SVEFECLALG DPKPQVTWSK VGGHLRPGIV QSGSIIRIAH 3540
VELADAGQYR CAATNAAGTT QSHVLLLVQA LPQISTPPEI RVPAGSAAVF PCMASGYPTP 3600
AITWSKVDGD LPPDSRLENN MLMLPSVRPE DAGTYVCTAT NRQGKVKAFA YLQVPERVIP 3660
YFTQTPYSFL PLPTIKDAYR KFEIKITFRP DSADGMLLYN GQKRSPTNLA NRQPDFISFG 3720
LVGGRPEFRF DAGSGMATIR HPTPLALGQF HTVTLLRSLT QGSLIVGNLA PVNGTSQGKF 3780
QGLDLNEELY LGGYPDYGAI PKAGLSSGFV GCVRELRIQG EEVVFHDVNL TTHGISHCPT 3840
CQDRPCQNGG QCQDSESSSY TCVCPAGFTG SRCEHSQALH CHPEACGPDA TCVNRPDGRG 3900
YTCRCHLGRS GVRCEEGVTV TTPSMSGAGS YLALPALTNM HHELRLDVEF KPLEPNGILL 3960
FSGGKSGPVE DFVSLAMVGG HLEFRYELGS GLAVLRSHEP LTLGRWHRVS AERLNKDGSL 4020
RVDGGRPVLR SSPGKSQGLN LHTLLYLGGV EPSVQLSPAT NMSAHFHGCV GEVSVNGKRL 4080
DLTYSFLGSQ GVGQCYDSSP CERQPCQNGA TCMPAGEYEF QCLCQDGFKG DLCEHEENPC 4140
QLHEPCLNGG TCRGARCLCL PGFSGPRCQQ GAGYGVVESD WHPEGSGGND APGQYGAYFY 4200
DNGFLGLPGN SFSRSLPEVP ETIEFEVRTS TADGLLLWQG VVREASRSKD FISLGLQDGH 4260
LVFSYQLGSG EARLVSEDPI NDGEWHRITA LREGQRGSIQ VDGEDLVTGR SPGPNVAVNT 4320
KDIIYIGGAP DVATLTRGKF SSGITGCIKN LVLHTARPGA PPPQPLDLQH RAQAGANTRP 4380
CPS 4383 
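The sequence block above is laid out in groups of 10 residues with a running residue count at the end of each line, so the reported modification site can be checked directly against it. The following is a minimal sketch, not part of the CPLM record: the helper names `parse_block` and `site_window` are hypothetical, and the 7-residue flank is an assumption inferred from the 15-residue peptide listed under Lysine Modification. Only the first two lines of the block (residues 1-120) are copied in, which is enough to cover position 106.

```python
import re

# First two lines of the sequence block, copied verbatim
# (residues 1-120, including the trailing running count on each line).
BLOCK = """\
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL 60
ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL EDASAKEFRE VSEAVVEKLE 120
"""


def parse_block(block: str) -> str:
    """Join a grouped sequence block into one string, dropping spaces, newlines and counts."""
    return re.sub(r"[^A-Z]", "", block)


def site_window(seq: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position (hypothetical helper)."""
    start = max(0, position - 1 - flank)
    return seq[start:position + flank]


seq = parse_block(BLOCK)
print(len(seq))               # residues parsed from the two lines
print(seq[106 - 1])           # residue at 1-based position 106
print(site_window(seq, 106))  # 15-residue window centred on position 106
```

If the window printed for position 106 matches the peptide in the Lysine Modification table and the central residue is K, the position and the sequence numbering agree.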
Gene Ontology
 GO:0005605; C:basal lamina; IDA:MGI.
 GO:0005615; C:extracellular space; IEA:Compara.
 GO:0070062; C:extracellular vesicular exosome; IEA:Compara.
 GO:0007420; P:brain development; IMP:MGI.
 GO:0048738; P:cardiac muscle tissue development; IMP:MGI.
 GO:0060351; P:cartilage development involved in endochondral bone morphogenesis; IMP:MGI.
 GO:0002062; P:chondrocyte differentiation; IMP:MGI.
 GO:0048704; P:embryonic skeletal system morphogenesis; IMP:MGI.
 GO:0001958; P:endochondral ossification; IMP:MGI.
 GO:0030198; P:extracellular matrix organization; IMP:MGI.
 GO:0008104; P:protein localization; IMP:MGI. 
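Each Gene Ontology line above follows the pattern `GO ID; aspect:term; evidence:source.`, where the aspect letter is C for cellular component and P for biological process. A minimal sketch of splitting one such line into fields is shown here; the `GoAnnotation` type and `parse_go_line` function are hypothetical names introduced for illustration, and the parser assumes the term itself contains no semicolon, which holds for the entries in this record.

```python
from typing import NamedTuple


class GoAnnotation(NamedTuple):
    go_id: str     # e.g. "GO:0005605"
    aspect: str    # "C" (cellular component) or "P" (biological process) in this record
    term: str      # human-readable GO term
    evidence: str  # evidence code, e.g. "IDA", "IMP", "IEA"
    source: str    # annotation source, e.g. "MGI"


def parse_go_line(line: str) -> GoAnnotation:
    """Split one 'GO:...; X:term; EVID:source.' line into its fields."""
    # Assumes exactly three semicolon-separated fields per line.
    parts = [p.strip() for p in line.strip().rstrip(".").split(";")]
    go_id, aspect_term, evidence_source = parts
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return GoAnnotation(go_id, aspect, term, evidence, source)


annotation = parse_go_line(" GO:0005605; C:basal lamina; IDA:MGI.")
print(annotation)
```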
Interpro
 IPR008985; ConA-like_lec_gl_sf.
 IPR013320; ConA-like_subgrp.
 IPR000742; EG-like_dom.
 IPR013032; EGF-like_CS.
 IPR002049; EGF_laminin.
 IPR007110; Ig-like_dom.
 IPR013783; Ig-like_fold.
 IPR013098; Ig_I-set.
 IPR003599; Ig_sub.
 IPR003598; Ig_sub2.
 IPR018031; Laminin_B_subgr.
 IPR000034; Laminin_B_type_IV.
 IPR001791; Laminin_G.
 IPR023415; LDLR_class-A_CS.
 IPR002172; LDrepeatLR_classA_rpt.
 IPR000082; SEA_dom. 
Pfam
 PF00008; EGF
 PF12661; hEGF
 PF07679; I-set
 PF00052; Laminin_B
 PF00053; Laminin_EGF
 PF00054; Laminin_G_1
 PF02210; Laminin_G_2
 PF00057; Ldl_recept_a 
SMART
 SM00181; EGF
 SM00180; EGF_Lam
 SM00409; IG
 SM00408; IGc2
 SM00281; LamB
 SM00282; LamG
 SM00192; LDLa
 SM00200; SEA 
PROSITE
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3
 PS01248; EGF_LAM_1
 PS50027; EGF_LAM_2
 PS50835; IG_LIKE
 PS50025; LAM_G_DOMAIN
 PS51115; LAMININ_IVA
 PS01209; LDLRA_1
 PS50068; LDLRA_2
 PS50024; SEA 
PRINTS