CPLM 1.0 - Compendium of Protein Lysine Modification
Tag	Content
CPLM ID CPLM-023168
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein HEG homolog 1 
Protein Synonyms/Alias
  
Gene Name
 HEG1 
Gene Synonyms/Alias
 KIAA1237 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position	Peptide	Type	References
1300	SPYAEYPKNPRSQEW	ubiquitination	[1]
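The peptide in the row above appears to be a 15-residue window centred on the modified lysine (seven residues on each side of K1300). The following is a minimal sketch that checks this against the record's own Protein Sequence field. The segment is the sequence row ending at residue 1320; the function name `site_window` and the flank width of 7 are assumptions inferred from the peptide length, not a documented CPLM procedure.

```python
# Residues 1261-1320, copied from the record's Protein Sequence field.
SEGMENT_START = 1261
SEGMENT = (
    "LLILGIALIV TCCRKNKNDI SKLIFKSGDF QMSPYAEYPK NPRSQEWGRE AIEMHENGST"
).replace(" ", "")


def site_window(segment, segment_start, position, flank=7):
    """Return residues position-flank .. position+flank (1-based numbering).

    `flank=7` is an assumption inferred from the 15-residue peptide.
    """
    idx = position - segment_start  # 0-based index inside the segment
    if idx - flank < 0 or idx + flank >= len(segment):
        raise ValueError("window extends beyond the supplied segment")
    return segment[idx - flank: idx + flank + 1]


site_residue = SEGMENT[1300 - SEGMENT_START]
peptide = site_window(SEGMENT, SEGMENT_START, 1300)
print(site_residue, peptide)
```

If the window matches the Peptide column and the central residue is K, the listed position is consistent with the sequence as numbered in this record.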
Reference
 [1] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
Functional Description
 Receptor component of the CCM signaling pathway, which is a crucial regulator of heart and vessel formation and integrity. May act through the stabilization of endothelial cell junctions (By similarity). 
Sequence Annotation
 DOMAIN 985 1023 EGF-like 1.
 DOMAIN 1025 1063 EGF-like 2; calcium-binding (Potential).
 CARBOHYD 67 67 O-linked (GalNAc...).
 CARBOHYD 123 123 N-linked (GlcNAc...) (Potential).
 CARBOHYD 159 159 N-linked (GlcNAc...).
 CARBOHYD 180 180 N-linked (GlcNAc...) (Potential).
 CARBOHYD 314 314 N-linked (GlcNAc...) (Potential).
 CARBOHYD 462 462 N-linked (GlcNAc...) (Potential).
 CARBOHYD 520 520 N-linked (GlcNAc...).
 CARBOHYD 610 610 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1137 1137 N-linked (GlcNAc...) (Potential).
 DISULFID 989 1000 By similarity.
 DISULFID 994 1011 By similarity.
 DISULFID 1013 1022 By similarity.
 DISULFID 1029 1040 By similarity.
 DISULFID 1034 1049 By similarity.
 DISULFID 1051 1062 By similarity.  
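As a consistency check on the annotation above, each DISULFID line should name two positions that are both cysteine in the protein sequence. The sketch below parses the six DISULFID lines and looks them up in residues 961-1080, copied from the record's Protein Sequence field. The helper names `parse_disulfides` and `residue_at` are hypothetical, and the whitespace-separated "KEY start end description" layout is assumed from the lines as shown here.

```python
# Residues 961-1080, copied from the record's Protein Sequence field.
SEGMENT_START = 961
SEGMENT = (
    "LAPKSATFAV QSSTQSPTTV SSSASVNSCA VNPCLHNGEC VADNTSRGYH CRCPPSWQGD "
    "DCSVDVNECL SNPCPSTAMC NNTQGSFICK CPVGYQLEKG ICNLVRTFVT EFKLKRTFLN"
).replace(" ", "")

FEATURES = """\
DISULFID 989 1000 By similarity.
DISULFID 994 1011 By similarity.
DISULFID 1013 1022 By similarity.
DISULFID 1029 1040 By similarity.
DISULFID 1034 1049 By similarity.
DISULFID 1051 1062 By similarity.
"""


def parse_disulfides(text):
    """Return (start, end) pairs from lines whose first field is DISULFID."""
    pairs = []
    for line in text.splitlines():
        fields = line.split()
        if fields and fields[0] == "DISULFID":
            pairs.append((int(fields[1]), int(fields[2])))
    return pairs


def residue_at(position):
    """Look up a 1-based sequence position inside the copied segment."""
    return SEGMENT[position - SEGMENT_START]


bonds = parse_disulfides(FEATURES)
all_cysteine = all(
    residue_at(a) == "C" and residue_at(b) == "C" for a, b in bonds
)
print(len(bonds), all_cysteine)
```

The same lookup can be extended to the DOMAIN and CARBOHYD lines, provided the segment covers the positions being checked.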
Keyword
 3D-structure; Alternative splicing; Calcium; Cell junction; Cell membrane; Complete proteome; Developmental protein; Disulfide bond; EGF-like domain; Glycoprotein; Membrane; Polymorphism; Reference proteome; Repeat; Secreted; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1381 AA 
Protein Sequence
MASPRASRWP PPLLLLLLPL LLLPPAAPGT RDPPPSPARR ALSLAPLAGA GLELQLERRP 60
EREPPPTPPR ERRGPATPGP SYRAPEPGAA TQRGPSGRAP RGGSADAAWK HWPESNTEAH 120
VENITFYQNQ EDFSTVSSKE GVMVQTSGKS HAASDAPENL TLLAETADAR GRSGSSSRTN 180
FTILPVGYSL EIATALTSQS GNLASESLHL PSSSSEFDER IAAFQTKSGT ASEMGTERAM 240
GLSEEWTVHS QEATTSAWSP SFLPALEMGE LTTPSRKRNS SGPDLSWLHF YRTAASSPLL 300
DLSSSSESTE KLNNSTGLQS SSVSQTKTMH VATVFTDGGP RTLRSLTVSL GPVSKTEGFP 360
KDSRIATTSS SVLLSPSAVE SRRNSRVTGN PGDEEFIEPS TENEFGLTSL RWQNDSPTFG 420
EHQLASSSEV QNGSPMSQTE TVSRSVAPMR GGEITAHWLL TNSTTSADVT GSSASYPEGV 480
NASVLTQFSD STVQSGGSHT ALGDRSYSES SSTSSSESLN SSAPRGERSI AGISYGQVRG 540
TAIEQRTSSD HTDHTYLSST FTKGERALLS ITDNSSSSDI VESSTSYIKI SNSSHSEYSS 600
FFHAQTERSN ISSYDGEYAQ PSTESPVLHT SNLPSYTPTI NMPNTSVVLD TDAEFVSDSS 660
SSSSSSSSSS SSGPPLPLPS VSQSHHLFSS ILPSTRASVH LLKSTSDAST PWSSSPSPLP 720
VSLTTSTSAP LSVSQTTLPQ SSSTPVLPRA RETPVTSFQT STMTSFMTML HSSQTADLKS 780
QSTPHQEKVI TESKSPSLVS LPTESTKAVT TNSPLPPSLT ESSTEQTLPA TSTNLAQMSP 840
TFTTTILKTS QPLMTTPGTL SSTASLVTGP IAVQTTAGKQ LSLTHPEILV PQISTEGGIS 900
TERNRVIVDA TTGLIPLTSV PTSAKEMTTK LGVTAEYSPA SRSLGTSPSP QTTVVSTAED 960
LAPKSATFAV QSSTQSPTTV SSSASVNSCA VNPCLHNGEC VADNTSRGYH CRCPPSWQGD 1020
DCSVDVNECL SNPCPSTAMC NNTQGSFICK CPVGYQLEKG ICNLVRTFVT EFKLKRTFLN 1080
TTVEKHSDLQ EVENEITKTL NMCFSALPSY IRSTVHASRE SNAVVISLQT TFSLASNVTL 1140
FDLADRMQKC VNSCKSSAEV CQLLGSQRRI FRAGSLCKRK SPECDKDTSI CTDLDGVALC 1200
QCKSGYFQFN KMDHSCRACE DGYRLENETC MSCPFGLGGL NCGNPYQLIT VVIAAAGGGL 1260
LLILGIALIV TCCRKNKNDI SKLIFKSGDF QMSPYAEYPK NPRSQEWGRE AIEMHENGST 1320
KNLLQMTDVY YSPTSVRNPE LERNGLYPAY TGLPGSRHSC IFPGQYNPSF ISDESRRRDY 1380
F 1381 
Gene Ontology
 GO:0005911; C:cell-cell junction; IEA:Compara.
 GO:0009897; C:external side of plasma membrane; IEA:Compara.
 GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0005509; F:calcium ion binding; IEA:InterPro.
 GO:0003209; P:cardiac atrium morphogenesis; IEA:Compara.
 GO:0045216; P:cell-cell junction organization; IEA:Compara.
 GO:0001886; P:endothelial cell morphogenesis; IEA:Compara.
 GO:0001701; P:in utero embryonic development; IEA:Compara.
 GO:0030324; P:lung development; IEA:Compara.
 GO:0003017; P:lymph circulation; IEA:Compara.
 GO:0001945; P:lymph vessel development; IEA:Compara.
 GO:0035264; P:multicellular organism growth; IEA:Compara.
 GO:0060039; P:pericardium development; IEA:Compara.
 GO:0009791; P:post-embryonic development; IEA:Compara.
 GO:0050878; P:regulation of body fluid levels; IEA:Compara.
 GO:0001570; P:vasculogenesis; IEA:Compara.
 GO:0048845; P:venous blood vessel morphogenesis; IEA:Compara.
 GO:0003281; P:ventricular septum development; IEA:Compara. 
Interpro
 IPR000742; EGF-like_dom.
 IPR001881; EGF-like_Ca-bd.
 IPR013032; EGF-like_CS.
 IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
 IPR018097; EGF_Ca-bd_CS. 
Pfam
 PF00008; EGF
 PF07645; EGF_CA 
SMART
 SM00181; EGF
 SM00179; EGF_CA 
PROSITE
 PS00010; ASX_HYDROXYL
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3
 PS01187; EGF_CA 
PRINTS