CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-038428
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein HEG homolog 1 
Protein Synonyms/Alias
  
Gene Name
 Heg1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
1001      SPYTDVPKNPRSQEW  ubiquitination  [1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; EGF-like domain; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1082 AA 
Protein Sequence
MATPRAPRWP PPSLLLLLLL PLLLLPPAAP GARGSLPSPA HRTLLPVAGP LSPPGAGHTA 60
PGPGVATRRG RSGRVPRGVS AALESLPESP SSSRSQRRIT PSQTESGTSL GFLERTRELP 120
EEGTVHTQVA GTWVSRQASH PALEPGEPTV LSQKRNSSGQ EHSGPPFSWS QSHPPPSDHP 180
SSPEDGAMLS DSSDLADSTS GARTPHTSAM STRSGERTLR SLDLSSAATR PARPTPRGNV 240
TEHAGLLSGA PTLGVTGLSY TREHGSDAGQ RTSSDHTDHG YVPSTFTKGE RTLLSITDNT 300
SYSEASESST SSVKISDSPS QAQPKQSSMS SDDDEPAQSS TESPVLHTSN LPTYTSTVNM 360
PNTLVLDTGT KPVEDPSDSR VPSTQPSPSQ PQPFSSALPS TRSPGSTSET TTSSPSPSPI 420
SLLVSTLAPY SVSQTTFPHP SSTLVPHRPR EPRVTSVQMS TAISAIALIP SNQTANPKNQ 480
STPQQEKPIT EAKSPSLVSP PTDSTKAVTV SLPPGAPWSP ALTGFSTGPA LPATSTSLAQ 540
MSPALTSAMP QTTHSPVTSP STLSHVEALT SGAVVVHTTP KKPHLPTNPE ILVPHISTEG 600
AITTEGNREH TDPTTQPIPL TTSTTSAGER TTELGRAEES SPSHFLTPSS PQTTDVSTAE 660
MLTSRYITFA AQSTSQSPTA LPPLTPVNSC TVNPCLHDGK CIVDLTGRGY RCVCPPAWQG 720
ENCSVDVNEC LSSPCPPLAT CNNTQGSFTC RCPVGYQLEK GICNLVRTFV TEFKLKKTFL 780
NTTAENHSNT QELENEIAQT LNVCFSTLPG YIRTTAHVSR EPSTVFISLK TTFALASNVT 840
LFDLADRIQK YVNSCRSSAE VCQLLGSQRR VFRAGSLCKR KSPECDKETS ICTDLDGVAL 900
CQCKSGYFQF NKMDHSCRAC EDGYRLENET CMSCPFGLGG LNCGNPYQLI TVVIAAAGGG 960
LLLILGVALI VTCCRKSKND ISKLIFKSGD FQMSPYTDVP KNPRSQEWGR EAIEMHENGS 1020
TKNLLQMTDV YYSPTNVRNP ELERNGLYPA YTGLPGSRHS CIFPGQYNPS FISDESRRRD 1080
YF 1082 
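The sequence block above is laid out in groups of ten residues, with a running residue count at the end of each line. The sketch below shows one way such a line might be parsed and used to check the modification site listed in this record (position 1001, peptide SPYTDVPKNPRSQEW). The function names `parse_sequence_block` and `window_around` are hypothetical helpers introduced for illustration and are not part of CPLM. The 7-residue flank is an assumption inferred from the 15-residue peptide in this record.

```python
def parse_sequence_block(text):
    """Join a numbered sequence block into one continuous residue string."""
    residues = []
    for line in text.strip().splitlines():
        # Keep the letter groups only; the trailing integer is a running count.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def window_around(seq, position, flank=7, start=1):
    """Return the peptide centred on a 1-based position.

    `start` is the 1-based position of seq[0], so a partial segment
    of the protein can be used instead of the full sequence.
    """
    idx = position - start
    return seq[max(0, idx - flank): idx + flank + 1]


# The line of this record that covers residues 961-1020.
block = "LLLILGVALI VTCCRKSKND ISKLIFKSGD FQMSPYTDVP KNPRSQEWGR EAIEMHENGS 1020"
segment = parse_sequence_block(block)

site = 1001
residue = segment[site - 961]                      # residue at the annotated site
peptide = window_around(segment, site, start=961)  # 15-residue window around it

print(residue, peptide)
```

With the full 1082-residue sequence, `start` can be left at its default of 1 and the same call applies.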
Gene Ontology
 GO:0005911; C:cell-cell junction; IDA:MGI.
 GO:0009897; C:external side of plasma membrane; IDA:MGI.
 GO:0005509; F:calcium ion binding; IEA:InterPro.
 GO:0003209; P:cardiac atrium morphogenesis; IMP:MGI.
 GO:0045216; P:cell-cell junction organization; IMP:MGI.
 GO:0001886; P:endothelial cell morphogenesis; IMP:MGI.
 GO:0001701; P:in utero embryonic development; IMP:MGI.
 GO:0030324; P:lung development; IMP:MGI.
 GO:0003017; P:lymph circulation; IMP:MGI.
 GO:0001945; P:lymph vessel development; IMP:MGI.
 GO:0035264; P:multicellular organism growth; IGI:MGI.
 GO:0060039; P:pericardium development; IMP:MGI.
 GO:0009791; P:post-embryonic development; IMP:MGI.
 GO:0050878; P:regulation of body fluid levels; IMP:MGI.
 GO:0001570; P:vasculogenesis; IGI:MGI.
 GO:0048845; P:venous blood vessel morphogenesis; IGI:MGI.
 GO:0003281; P:ventricular septum development; IMP:MGI. 
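Each Gene Ontology line above follows the pattern `GO ID; aspect:term name; evidence code:source`, where the aspect letter is C (cellular component), F (molecular function), or P (biological process). A minimal sketch of splitting such lines into fields is given below. The helper `parse_go_line` and the dictionary keys are hypothetical names chosen for illustration, and the three sample lines are copied from this record.

```python
from collections import Counter

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Split one 'GO:id; X:term; EVIDENCE:source.' line into its fields."""
    # Drop surrounding whitespace and the trailing period, then split on ';'.
    go_id, term, evidence = [p.strip() for p in line.strip().rstrip(".").split(";")]
    aspect, name = term.split(":", 1)
    code, source = evidence.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect],
        "name": name,
        "evidence": code,
        "source": source,
    }


lines = [
    " GO:0005911; C:cell-cell junction; IDA:MGI.",
    " GO:0005509; F:calcium ion binding; IEA:InterPro.",
    " GO:0003209; P:cardiac atrium morphogenesis; IMP:MGI.",
]
parsed = [parse_go_line(l) for l in lines]
by_aspect = Counter(entry["aspect"] for entry in parsed)

for entry in parsed:
    print(entry["id"], entry["aspect"], entry["name"], entry["evidence"], entry["source"])
```

Grouping by evidence code instead of aspect would separate the curated annotations (IDA, IMP, IGI) from the electronically inferred one (IEA).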
Interpro
 IPR000742; EGF-like_dom.
 IPR001881; EGF-like_Ca-bd.
 IPR013032; EGF-like_CS.
 IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
 IPR018097; EGF_Ca-bd_CS. 
Pfam
 PF00008; EGF
 PF07645; EGF_CA 
SMART
 SM00181; EGF
 SM00179; EGF_CA 
PROSITE
 PS00010; ASX_HYDROXYL
 PS00022; EGF_1
 PS01186; EGF_2
 PS50026; EGF_3
 PS01187; EGF_CA 
PRINTS