CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-024771
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein HEG homolog 1 
Protein Synonyms/Alias
  
Gene Name
 Heg1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position  Peptide          Type            References
 1256      SPYTDVPKNPRSQEW  ubiquitination  [1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
 Receptor component of the CCM signaling pathway, which is a crucial regulator of heart and vessel formation and integrity. May act by stabilizing endothelial cell junctions. 
Sequence Annotation
 DOMAIN 941 979 EGF-like 1.
 DOMAIN 981 1019 EGF-like 2; calcium-binding (Potential).
 CARBOHYD 1093 1093 N-linked (GlcNAc...) (Potential).
 DISULFID 945 956 By similarity.
 DISULFID 950 967 By similarity.
 DISULFID 969 978 By similarity.
 DISULFID 985 996 By similarity.
 DISULFID 990 1005 By similarity.
 DISULFID 1007 1018 By similarity.  
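The DISULFID positions above can be cross-checked against the Protein Sequence field, since each bonded position should be a cysteine. The following is a minimal sketch, not part of the CPLM record: it assumes residues 901-1020 have been copied from the sequence field, and the names BLOCK, BLOCK_START and residue_at are hypothetical helpers introduced for illustration.

```python
import re

# Residues 901-1020, copied from the Protein Sequence field of this record.
BLOCK = """
LTPSSPQTTD VSTAEMLTSR YITFAAQSTS QSPTALPPLT PVNSCTVNPC LHDGKCIVDL 960
TGRGYRCVCP PAWQGENCSV DVNECLSSPC PPLATCNNTQ GSFTCRCPVG YQLEKGICNL 1020
"""
BLOCK_START = 901  # 1-based position of the first residue in BLOCK

# Keep only uppercase letters: this drops spaces, newlines and the
# trailing position counters (960, 1020).
fragment = re.sub(r"[^A-Z]", "", BLOCK)


def residue_at(position):
    """Return the residue at a 1-based protein position (hypothetical helper)."""
    index = position - BLOCK_START
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} is outside the copied fragment")
    return fragment[index]


# DISULFID pairs as listed in the Sequence Annotation field.
DISULFIDE_PAIRS = [(945, 956), (950, 967), (969, 978),
                   (985, 996), (990, 1005), (1007, 1018)]

# Map every bonded position to the residue found there.
checked = {pos: residue_at(pos) for pair in DISULFIDE_PAIRS for pos in pair}
all_cysteine = all(res == "C" for res in checked.values())

print(checked)
print("all bonded positions are Cys:", all_cysteine)
```

Only a fragment of the sequence is embedded to keep the sketch short. Checking the CARBOHYD site at position 1093 the same way would require copying the 1081-1140 line as well.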
Keyword
 Alternative splicing; Calcium; Cell junction; Cell membrane; Complete proteome; Developmental protein; Disulfide bond; EGF-like domain; Glycoprotein; Membrane; Reference proteome; Repeat; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1337 AA 
Protein Sequence
MATPRAPRWP PPSLLLLLLL PLLLLPPAAP GARGSLPSPA HRTLLPVAGP LSPPGAGHTA 60
PGPGVATRRG RSGRVPRGVS AAAARNRWLE SNNPEPHIGC SPSYQSQEDH SGSRKGVTAQ 120
NARMSHSSSE GPENPPLLPE TSAEWSNMAS SHRADIAGLR RGPSPEITTA PTAHSSLLSL 180
ESLPESPSSS RSQRRITPSQ TESGTSLGFL ERTRELPEEG TVHTQVAGTW VSRQASHPAL 240
EPGEPTVLSQ KRNSSGQEHS GPPFSWSQSH PPPSDHPSSS GSIKNGNNFT ALQNPSVTQT 300
KSMLITDTYT NGVPRTLRSL PVGVDPADET EGFPEHSRLG ITSMSVRSSP SVKDSRTNSG 360
LTEHLGDGEG TELSTENGYG LPSIHWQSDA PSFGGRQLAS SSEAGDGRAM PLTEAVFRSD 420
PSIGGGESTG RWILTKKKTS TDAAESSALH PEAGGAGGLT QSSHAAQQPR GGGEDSGMGG 480
RSYAESSSSS SSTSSSESLD SSAPLREHSL TGLSYTREHG SDAGQRTSSD HTDHGYVPST 540
FTKGERTLLS ITDNTSYSEA SESSTSSVKI SDSPSQAQPK QSSMSSDDDE PAQSSTESPV 600
LHTSNLPTYT STVNMPNTLV LDTGTKPVED PSDSRVPSTQ PSPSQPQPFS SALPSTRSPG 660
STSETTTSSP SPSPISLLVS TLAPYSVSQT TFPHPSSTLV PHRPREPRVT SVQMSTAISA 720
IALIPSNQTA NPKNQSTPQQ EKPITEAKSP SLVSPPTDST KAVTVSLPPG APWSPALTGF 780
STGPALPATS TSLAQMSPAL TSAMPQTTHS PVTSPSTLSH VEALTSGAVV VHTTPKKPHL 840
PTNPEILVPH ISTEGAITTE GNREHTDPTT QPIPLTTSTT SAGERTTELG RAEESSPSHF 900
LTPSSPQTTD VSTAEMLTSR YITFAAQSTS QSPTALPPLT PVNSCTVNPC LHDGKCIVDL 960
TGRGYRCVCP PAWQGENCSV DVNECLSSPC PPLATCNNTQ GSFTCRCPVG YQLEKGICNL 1020
VRTFVTEFKL KKTFLNTTAE NHSNTQELEN EIAQTLNVCF STLPGYIRTT AHVSREPSTV 1080
FISLKTTFAL ASNVTLFDLA DRIQKYVNSC RSSAEVCQLL GSQRRVFRAG SLCKRKSPEC 1140
DKETSICTDL DGVALCQCKS GYFQFNKMDH SCRACEDGYR LENETCMSCP FGLGGLNCGN 1200
PYQLITVVIA AAGGGLLLIL GVALIVTCCR KSKNDISKLI FKSGDFQMSP YTDVPKNPRS 1260
QEWGREAIEM HENGSTKNLL QMTDVYYSPT NVRNPELERN GLYPAYTGLP GSRHSCIFPG 1320
QYNPSFISDE SRRRDYF 1337 
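The Lysine Modification field reports position 1256 with the peptide SPYTDVPKNPRSQEW, which appears to be the modified lysine with seven flanking residues on each side. The following is a minimal sketch of how that can be checked against the sequence. It assumes residues 1201-1320 are copied from the block above, and the names BLOCK, BLOCK_START and window are hypothetical, not part of CPLM.

```python
import re

# Residues 1201-1320, copied from the Protein Sequence field of this record.
BLOCK = """
PYQLITVVIA AAGGGLLLIL GVALIVTCCR KSKNDISKLI FKSGDFQMSP YTDVPKNPRS 1260
QEWGREAIEM HENGSTKNLL QMTDVYYSPT NVRNPELERN GLYPAYTGLP GSRHSCIFPG 1320
"""
BLOCK_START = 1201  # 1-based position of the first residue in BLOCK

# Keep only uppercase letters: this drops spaces, newlines and the
# trailing position counters (1260, 1320).
fragment = re.sub(r"[^A-Z]", "", BLOCK)


def window(position, flank=7):
    """Return the residues from position-flank to position+flank (1-based).

    Hypothetical helper; it refuses windows that fall outside the fragment
    instead of silently truncating them.
    """
    index = position - BLOCK_START
    if index - flank < 0 or index + flank >= len(fragment):
        raise ValueError("window extends beyond the copied fragment")
    return fragment[index - flank:index + flank + 1]


site = 1256
residue = fragment[site - BLOCK_START]  # expected to be lysine (K)
peptide = window(site)                  # 15-residue window centred on the site

print(site, residue, peptide)
```

For this record the residue at 1256 is K and the window reproduces the peptide listed in the Lysine Modification field.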
Gene Ontology
 GO:0005911; C:cell-cell junction; IDA:MGI.
 GO:0009897; C:external side of plasma membrane; IDA:MGI.
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0005509; F:calcium ion binding; IEA:InterPro.
 GO:0003209; P:cardiac atrium morphogenesis; IMP:MGI.
 GO:0045216; P:cell-cell junction organization; IMP:MGI.
 GO:0001886; P:endothelial cell morphogenesis; IMP:MGI.
 GO:0001701; P:in utero embryonic development; IMP:MGI.
 GO:0030324; P:lung development; IMP:MGI.
 GO:0003017; P:lymph circulation; IMP:MGI.
 GO:0001945; P:lymph vessel development; IMP:MGI.
 GO:0035264; P:multicellular organism growth; IGI:MGI.
 GO:0060039; P:pericardium development; IMP:MGI.
 GO:0009791; P:post-embryonic development; IMP:MGI.
 GO:0050878; P:regulation of body fluid levels; IMP:MGI.
 GO:0001570; P:vasculogenesis; IGI:MGI.
 GO:0048845; P:venous blood vessel morphogenesis; IGI:MGI.
 GO:0003281; P:ventricular septum development; IMP:MGI. 
Interpro
 IPR000742; EG-like_dom.
 IPR001881; EGF-like_Ca-bd.
 IPR013032; EGF-like_CS.
 IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
 IPR018097; EGF_Ca-bd_CS. 
Pfam
 PF00008; EGF
 PF07645; EGF_CA 
SMART
 SM00181; EGF
 SM00179; EGF_CA 
PROSITE
 PS00010; ASX_HYDROXYL
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3
 PS01187; EGF_CA 
PRINTS