CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-039103
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Hspg2 
Protein Synonyms/Alias
  
Gene Name
 Hspg2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
106       QLEDANAKEFREVSE  acetylation  [1]
118       VSEAVVEKLETEYVK  acetylation  [1]
631       GMLEPVQKPDVILVG  acetylation  [1]
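Each modification row carries a position, a peptide window, a modification type, and a reference index. The following is a minimal sketch of how such a row could be split into fields, assuming the layout is always digits, then an uppercase peptide, then a lowercase type, then a bracketed reference number. The name `parse_site_row` and the regular expression are illustrative and not part of CPLM. The loop also prints the central residue of each window, which should be the modified lysine (K).

```python
import re

# Hypothetical pattern: position, uppercase peptide, lowercase type, [reference].
# Whitespace between fields is optional, so run-together rows also parse.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[(\d+)\]\s*$")

def parse_site_row(row):
    """Split one modification row into its four fields."""
    m = ROW.match(row)
    if m is None:
        raise ValueError(f"unrecognized row: {row!r}")
    position, peptide, mod_type, ref = m.groups()
    return {
        "position": int(position),
        "peptide": peptide,
        "type": mod_type,
        "reference": int(ref),
    }

rows = [
    "106QLEDANAKEFREVSEacetylation[1]",
    "118VSEAVVEKLETEYVKacetylation[1]",
    "631GMLEPVQKPDVILVGacetylation[1]",
]
sites = [parse_site_row(r) for r in rows]

for site in sites:
    # The peptide is an odd-length window, so integer division finds its center.
    center = site["peptide"][len(site["peptide"]) // 2]
    print(site["position"], center, site["type"])
```

For the three rows in this entry, the central residue of each 15-residue window is K.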
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2001 AA 
Protein Sequence
MGQRAVGSLL LGLLLHARLL AVTQGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL 60
ADDASGDGLG SGDLGSGDFQ MVYFRALVNF TRSIEYSPQL EDANAKEFRE VSEAVVEKLE 120
TEYVKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS 180
PWGFKFRRLG TVPQFPRVCT ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEAVSE 240
ISSLPSAVVK VSPLPLWPEA ATLPPPVMHG PQFLLPNVPG PTACGPQEAS CHSGHCIPRD 300
YLCDGQEDCR DGSDELGCGP PPPCEPNEFS CENGHCALKL WRCDGDFDCE DHTDEANCPV 360
KHPGEVCGPT QFQCVSTNRC IPASFHCDEE TDCPDRSDEF GCMPPQVVTP PQQSIQASRG 420
QTVTFTCVAT GVPTPIINWR LNWGHIPVNP RVTMTSEGGR GTLIIRDVKE ADQGAYTCEA 480
MNSRGMVFGI PDGVLELVPQ RGPCPDGHFY LEDSASCLPC FCFGVTNLCQ SSRRFRDQIR 540
LNFDQPSDFK GVNVTMPSQP GVPPLSSTQL QIDPTLQEFQ LVDLSRRFLV HDAFWALPKQ 600
FLGNKVDSYG GFLRYKVRYE LARGMLEPVQ KPDVILVGAG YRLHSRGHTP THPGALNQRQ 660
VQLSEEHWVH ESGRPVQRAE MLQALASLEA VLLQTVYNTK MTSVGLSDIV MDTTVTYTTI 720
HGRAHSVEEC RCPIGYSGLS CESCDAHFTR VPGGPYLGTC SGCNCNGHAS SCDPVYGHCL 780
NCQHNTEGPQ CDKCKPGFFG DATKATATAC RPCPCPYIDA SQRFSDTCFL DTDGQATCDA 840
CAPGYTGRRC ESCAPGYEGN PIQPGGKCRP SNQELVRCDE RGSLSATGEA CRCKNNVVGR 900
SCNECSEGSF HLSKQNPDGC LKCFCMGVSR QCSSSSWSRA QVLGASVQPS QFTLTNAAGT 960
HTTSEGVSSP APGELLFSSF HNLLSEPYFW SLPASFRGDK VTSYGGELRF TVTQRPRPGS 1020
APLHRQPLVV LQGNNIVLEH HASREPSPGQ PSNFIVPFQE QVWQRPDGQP ATREHLLMAL 1080
AGIDALLIQA SHTQQPAESR LSGISMDVAV PENTGQDPAR EVEHCTCPPG YRGPSCQDCD 1140
TGYTRVASGL YLGTCERCNC HGHSETCEPE TGACQSCQHH TEGASCEQCQ PGYYGDPQRG 1200
TPQDCQPCPC YGTPAAGQAS HTCFLDTDGH PTCDSCSPGH SGRHCERCAP GYHGNPSQGQ 1260
PCHRDGQVPE VPGCDCDPHG SISSQCDATG QCQCKAQVEG RTCSHCRPHH FHLSASNPEG 1320
CLPCFCMGVT QQCTSSSYSR QLISTHFAPG DFQGFALVNP QRNSQLTGGF TVEPVPDGAR 1380
LSFSNFAHLG QESFYWQLPE TYQGDKVAAY GGKLRYTLSY TSGPQGSPLS DPDIQITGNN 1440
IMLVASQPAL QGPERRSYEI IFREEFWRRP DGQPATREHL LMALADLDEL LVRATFSSVP 1500
RAASISAVSL EVAQPGPSGG PQALEVEECR CPPGYVGLSC QDCAPGYTRT GSGLYLGQCE 1560
LCECNGHSEV CHPETGACTG CQHNTVGEFC ETCATGYYGD ATAGTPEDCQ PCACPLTNPE 1620
NMFSRTCESL GAGGYRCTAC EPGYTGQYCE QCAPGYEGNP NVQGGRCQPL TKESLEVQIH 1680
PSRSVVPQGG PHSLRCQVNG SPPHYFYWSR EDGRPLPSGA QQRHQGSELH FPSVQPSDAG 1740
VYICTCRNLI HTSNSRAELL VAEAPSKPIT VTVEEQRSQS VRPGADVTFI CTAKSKSPAY 1800
TLVWTRLHNG KLPSRAMDFN GILTIRNVQP SDAGTYVCTG SNMFAMDQGT ATLHVQVSGT 1860
STAPVASIHP PQLTVQPGQL AEFRCSATGN PTPTLEWIGG PSGQLPTKAQ IHNGILRLPA 1920
IEPSDQGQYL CRALSSAGQH VVRAMLQVHG GSGPRVQVSP ERTQVHEGRT VRLYCRAAGV 1980
PSASITWRKE GGSLPPQARS E 2001 
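The sequence block is printed in groups of ten residues with a running residue count at the end of each line. The following is a minimal sketch, assuming that layout, of how the spaces and counters can be stripped so that a residue can be looked up by its 1-based position. The names `clean_sequence` and `residue_at` are illustrative, and only the first two lines of the block are used as input.

```python
def clean_sequence(block):
    """Keep only letters, dropping spaces, line breaks, and residue counters."""
    return "".join(ch for ch in block if ch.isalpha())

def residue_at(sequence, position):
    """Return the residue at a 1-based position."""
    return sequence[position - 1]

# First two lines of the sequence block (residues 1 to 120).
excerpt = """
MGQRAVGSLL LGLLLHARLL AVTQGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL 60
ADDASGDGLG SGDLGSGDFQ MVYFRALVNF TRSIEYSPQL EDANAKEFRE VSEAVVEKLE 120
"""
sequence = clean_sequence(excerpt)

print(len(sequence))
print(residue_at(sequence, 106), residue_at(sequence, 118))
```

In this excerpt, positions 106 and 118 are both lysine, consistent with the first two sites listed under Lysine Modification.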
Gene Ontology
  
Interpro
 IPR013032; EGF-like_CS.
 IPR002049; EGF_laminin.
 IPR007110; Ig-like_dom.
 IPR013783; Ig-like_fold.
 IPR013098; Ig_I-set.
 IPR003598; Ig_sub2.
 IPR018031; Laminin_B_subgr.
 IPR000034; Laminin_B_type_IV.
 IPR023415; LDLR_class-A_CS.
 IPR002172; LDrepeatLR_classA_rpt.
 IPR000082; SEA_dom. 
Pfam
 PF07679; I-set
 PF00052; Laminin_B
 PF00053; Laminin_EGF
 PF00057; Ldl_recept_a 
SMART
 SM00180; EGF_Lam
 SM00408; IGc2
 SM00281; LamB
 SM00192; LDLa
 SM00200; SEA 
PROSITE
 PS00022; EGF_1
 PS01248; EGF_LAM_1
 PS50027; EGF_LAM_2
 PS50835; IG_LIKE
 PS51115; LAMININ_IVA
 PS01209; LDLRA_1
 PS50068; LDLRA_2
 PS50024; SEA 
PRINTS