CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-024766
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Apolipoprotein B-100 
Protein Synonyms/Alias
 Apo B-100; Apolipoprotein B-48; Apo B-48 
Gene Name
 Apob 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position Peptide Type References
 157 PKYILNIKRGIISAL ubiquitination [1]
 320 TKSTSSPKQADAVLK ubiquitination [1]
 434 QEIFNTAKEQQSRAT ubiquitination [1]
 525 KPSLLIQKAALQALR ubiquitination [1]
 782 AYLRILGKELSFVRL ubiquitination [1]
 1305 IPLPLGGKSSKDLKM ubiquitination [1]
 1828 LNVGGNFKGTYQNNE ubiquitination [1]
 1991 FKTKLNDKVYSQDFE ubiquitination [1]
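 Each modification row carries a 1-based position, a 15-residue peptide, a modification type and a reference index. The sketch below is one way to split such a row into fields, whether the fields are separated by spaces or run together. The names `ROW`, `parse_site_row` and `center_residue` are illustrative and not part of CPLM, and the check that the modified lysine sits in the middle of the peptide is an assumption inferred from the rows in this entry.

```python
import re

# Assumed row layout (inferred from this entry, not from a CPLM specification):
# position, upper-case peptide, lower-case modification type, reference index.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[(\d+)\]\s*$")


def parse_site_row(row: str) -> dict:
    """Split one modification row into its four fields."""
    match = ROW.match(row)
    if match is None:
        raise ValueError(f"unrecognised site row: {row!r}")
    position, peptide, mod_type, reference = match.groups()
    return {
        "position": int(position),
        "peptide": peptide,
        "type": mod_type,
        "reference": int(reference),
    }


def center_residue(peptide: str) -> str:
    """Return the middle residue of an odd-length peptide window."""
    return peptide[len(peptide) // 2]


site = parse_site_row("157PKYILNIKRGIISALubiquitination[1]")
print(site["position"], site["peptide"], center_residue(site["peptide"]))
```

 The same function accepts the space-separated form, for example `parse_site_row("157 PKYILNIKRGIISAL ubiquitination [1]")`.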
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
 Apolipoprotein B is a major protein constituent of chylomicrons (apo B-48), LDL (apo B-100) and VLDL (apo B-100). Apo B-100 functions as a recognition signal for the cellular binding and internalization of LDL particles by the apoB/E receptor. 
Sequence Annotation
 DOMAIN 46 672 Vitellogenin.
 REGION 32 126 Heparin-binding (By similarity).
 REGION 232 306 Heparin-binding (By similarity).
 REGION 902 959 Heparin-binding (By similarity).
 REGION 2042 2177 Heparin-binding (By similarity).
 REGION 3155 3230 Heparin-binding (By similarity).
 REGION 3168 3178 Basic (possible receptor binding region)
 REGION 3368 3388 LDL receptor binding (By similarity).
 REGION 3378 3511 Heparin-binding (By similarity).
 REGION 3381 3389 Basic (possible receptor binding region)
 MOD_RES 2005 2005 N6-acetyllysine (By similarity).
 CARBOHYD 34 34 N-linked (GlcNAc...) (Potential).
 CARBOHYD 185 185 N-linked (GlcNAc...) (Potential).
 CARBOHYD 983 983 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1368 1368 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1377 1377 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1523 1523 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2554 2554 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2773 2773 N-linked (GlcNAc...).
 CARBOHYD 2976 2976 N-linked (GlcNAc...).
 CARBOHYD 3095 3095 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3331 3331 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3353 3353 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3460 3460 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3860 3860 N-linked (GlcNAc...) (Potential).
 DISULFID 39 88 By similarity.
 DISULFID 78 97 By similarity.
 DISULFID 186 212 By similarity.
 DISULFID 245 261 By similarity.
 DISULFID 385 390 By similarity.
 DISULFID 478 513 By similarity.
 DISULFID 966 976 By similarity.  
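 The annotation lines above share a simple layout: a feature key, two residue numbers and a free-text note. The sketch below shows one way to read them and to ask which DOMAIN or REGION features cover a given modification site. `Feature`, `parse_feature` and `features_covering` are hypothetical names. DISULFID lines are left out of the coverage check on the assumption that their two numbers name the bonded cysteines rather than a range.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    key: str
    start: int
    end: int
    note: str


def parse_feature(line: str) -> Feature:
    """Parse a line such as 'REGION 32 126 Heparin-binding (By similarity).'"""
    parts = line.split(None, 3)
    note = parts[3].strip() if len(parts) > 3 else ""
    return Feature(parts[0], int(parts[1]), int(parts[2]), note)


def features_covering(features: List[Feature], position: int) -> List[Feature]:
    """Return the DOMAIN/REGION features whose span includes a 1-based position."""
    ranged = ("DOMAIN", "REGION")
    return [f for f in features if f.key in ranged and f.start <= position <= f.end]


lines = [
    " DOMAIN 46 672 Vitellogenin.",
    " REGION 32 126 Heparin-binding (By similarity).",
    " REGION 232 306 Heparin-binding (By similarity).",
    " DISULFID 39 88 By similarity.",
]
features = [parse_feature(line) for line in lines]
for feature in features_covering(features, 157):
    print(feature.key, feature.start, feature.end, feature.note)
```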
Keyword
 Acetylation; Cholesterol metabolism; Chylomicron; Coiled coil; Complete proteome; Cytoplasm; Disulfide bond; Glycoprotein; Heparin-binding; LDL; Lipid metabolism; Lipid transport; Lipoprotein; Palmitate; Reference proteome; Repeat; RNA editing; Secreted; Signal; Steroid metabolism; Sterol metabolism; Transport; VLDL. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 4505 AA 
Protein Sequence
MGPRKPALRT PLLLLFLLLF LDTSVWAQDE VLENLSFSCP KDATRFKHLR KYVYNYEAES 60
SSGVQGTADS RSATKINCKV ELEVPQICGF IMRTNQCTLK EVYGFNPEGK ALMKKTKNSE 120
EFAAAMSRYE LKLAIPEGKQ IVLYPDKDEP KYILNIKRGI ISALLVPPET EEDQQELFLD 180
TVYGNCSTQV TVNSRKGTVP TEMSTERNLQ QCDGFQPIST SVSPLALIKG LVHPLSTLIS 240
SSQTCQYTLD PKRKHVSEAV CDEQHLFLPF SYKNKYGIMT RVTQKLSLED TPKINSRFFS 300
EGTNRMGLAF ESTKSTSSPK QADAVLKTLQ ELKKLSISEQ NAQRANLFNK LVTELRGLTG 360
EAITSLLPQL IEVSSPITLQ ALVQCGQPQC YTHILQWLKT EKAHPLLVDI VTYLMALIPN 420
PSTQRLQEIF NTAKEQQSRA TLYALSHAVN SYFDVDHSRS PVLQDIAGYL LKQIDNECTG 480
NEDHTFLILR VIGNMGRTME QVMPALKSSV LSCVRSTKPS LLIQKAALQA LRKMELEDEV 540
RTILFDTFVN GVAPVEKRLA AYLLLMKNPS SSDINKIAQL LQWEQSEQVK NFVASHIANI 600
LNSEELYVQD LKVLIKNALE NSQFPTIMDF RKFSRNYQIS KSASLPMFDP VSVKIEGNLI 660
FDPSSYLPRE SLLKTTLTVF GLASLDLFEI GLEGKGFEPT LEALFGKQGF FPDSVNKALY 720
WVNGRVPDGV SKVLVDHFGY TTDGKHEQDM VNGIMPIVDK LIKDLKSKEI PEARAYLRIL 780
GKELSFVRLQ DLQVLGKLLL SGAQTLQGIP QMVVQAIREG SKNDLFLHYI FMDNAFELPT 840
GAGLQLQVSS SGVFTPGIKA GVRLELANIQ AELVAKPSVS LEFVTNMGII IPDFAKSSVQ 900
MNTNFFHESG LEARVALKAG QLKVIIPSPK RPVKLFSGSN TLHLVSTTKT EVIPPLVENR 960
QSWSTCKPLF TGMNYCTTGA YSNASSTESA SYYPLTGDTR YELELRPTGE VEQYSATATY 1020
ELLKEDKSLV DTLKFLVQAE GVQQSEATVL FKYNRRSRTL SSEVLIPGFD VNFGTILRVN 1080
DESAKDKNTY KLILDIQNKK ITEVSLVGHL SYDKKGDGKI KGVVSIPRLQ AEARSEVHTH 1140
WSSTKLLFQM DSSATAYGST ISKRVTWRYD NEIIEFDWNT GTNVDTKKVA SNFPVDLSHY 1200
PRMLHEYANG LLDHRVPQTD VTFRDMGSKL IVATNTWLQM ATRGLPYPQT LQDHLNSLSE 1260
LNLLKMGLSD FHIPDNLFLK TDGRVKYTMN RNKINIDIPL PLGGKSSKDL KMPESVRTPA 1320
LNFKSVGFHL PSREVQVPTF TIPKTHQLQV PLLGVLDLST NVYSNLYNWS ASYTGGNTSR 1380
DHFSLQAQYR MKTDSVVDLF SYSVQGSGET TYDSKNTFTL SCDGSLHHKF LDSKFKVSHV 1440
EKFGNSPVSK GLLTFETSSA LGPQMSATVH LDSKKKQHLY VKDIKVDGQF RASSFYAQGK 1500
YGLSCERDVT TGQLSGESNM RFNSTYFQGT NQIVGMYQDG ALSITSTSDL QDGIFKNTAS 1560
LKYENYELTL KSDSSGQYEN FAASNKLDVT FSTQSALLRS EHQANYKSLR LVTLLSGSLT 1620
SQGVELNADI LGTDKINTGA HKATLKIARD GLSTSATTNL KYSPLLLENE LNAELGLSGA 1680
SMKLSTNGRF KEHHAKFSLD GRAALTEVSL GSIYQAMILG ADSKNIFNFK LSREGLRLSN 1740
DLMGSYAEMK LDHTHSLNIA GLSLDFFSKM DNIYSGDKFY KQNFNLQLQP YSFITTLSND 1800
LRYGALDLTN NGRFRLEPLK LNVGGNFKGT YQNNELKHIY TISYTDLVVA SYRADTVAKV 1860
QGVEFSHRLN ADIEGLTSSV DVTTSYNSDP LHFNNVFHFS LAPFTLGIDT HTSGDGKLSF 1920
WGEHTGQLYS KFLLKAEPLA LIVSHDYKGS TSHSLPYESS ISTALEHTVS ALLTPAEQTS 1980
TWKFKTKLND KVYSQDFEAY NTKDKIGVEL SGRADLSGLY SPIKLPFFYS EPVNVLNGLE 2040
VNDAVDKPQE FTIIAVVKYD KNQDVHTINL PFFKSLPDYL ERNRRGMISL LEAMRGELQR 2100
LSVDQFVRKY RAALSRLPQQ IHHYLNASDW ERQVAGAKEK ITSFMENYRI TDNDVLIAID 2160
SAKINFNEKL SQLETYAIQF DQYIKDNYDP HDLKRTIAEI IDRIIEKLKI LDEQYHIRVN 2220
LAKSIHNLYL FVENVDLNQV SSSNTSWIQN VDSNYQVRIQ IQEKLQQLRT QIQNIDIQQL 2280
AAEVKRQMDA IDVTMHLDQL RTAILFQRIS DIIDRVKYFV MNLIEDFKVT EKINTFRVIV 2340
RELIEKYEVD QHIQVLMDKS VELAHRYSLS EPLQKLSNVL QRIEIKDYYE KLVGFIDDTV 2400
EWLKALSFKN TIEELNRLTD MLVKKLKAFD YHQFVDKTNS KIREMTQRIN AEIQALKLPQ 2460
KMEALKLLVE DFKTTVSNSL ERLKDTKVTV VIDWLQDILT QMKDHFQDTL EDVRDRIYQM 2520
DIQRELEHFL SLVNQVYSTL VTYMSDWWTL TAKNITDFAE QYSIQNWAES IKVLVEQGFI 2580
VPEMQTFLWT MPAFEVSLRA LQEGNFQTPV FIVPLTDLRI PSIRINFKML KNIKIPLRFS 2640
TPEFTLLNTF HVHSFTIDLL EIKAKIIRTI DQILSSELQW PLPEMYLRDL DVVNIPLARL 2700
TLPDFHVPEI TIPEFTIPNV NLKDLHVPDL HIPEFQLPHL SHTIEIPAFG KLHSILKIQS 2760
PLFILDANAN IQNVTTSGNK AEIVASVTAK GESQFEALNF DFQAQAQFLE LNPHPPVLKE 2820
SMNFSSKHVR MEHEGEIVFD GKAIEGKSDT VASLHTEKNE VEFNNGMTVK VNNQLTLDSH 2880
TKYFHKLSVP RLDFSSKASL NNEIKTLLEA GHVALTSSGT GSWNWACPNF SDEGIHSSQI 2940
SFTVDGPIAF VGLSNNINGK HLRVIQKLTY ESGFLNYSKF EVESKVESQH VGSSILTANG 3000
RALLKDAKAE MTGEHNANLN GKVIGTLKNS LFFSAQPFEI TASTNNEGNL KVGFPLKLTG 3060
KIDFLNNYAL FLSPRAQQAS WQASTRFNQY KYNQNFSAIN NEHNIEASIG MNGDANLDFL 3120
NIPLTIPEIN LPYTEFKTPL LKDFSIWEET GLKEFLKTTK QSFDLSVKAQ YKKNSDKHSI 3180
VVPLGMFYEF ILNNVNSWDR KFEKVRNNAL HFLTTSYNEA KIKVDKYKTE NSLNQPSGTF 3240
QNHGYTIPVV NIEVSPFAVE TLASSHVIPT AISTPSVTIP GPNIMVPSYK LVLPPLELPV 3300
FHGPGNLFKF FLPDFKGFNT IDNIYIPAMG NFTYDFSFKS SVITLNTNAG LYNQSDIVAH 3360
FLSSSSFVTD ALQYKLEGTS RLMRKRGLKL ATAVSLTNKF VKGSHDSTIS LTKKNMEASV 3420
RTTANLHAPI FSMNFKQELN GNTKSKPTVS SSIELNYDFN SSKLHSTATG GIDHKFSLES 3480
LTSYFSIESF TKGNIKSSFL SQEYSGSVAN EANVYLNSKG TRSSVRLQGA SKVDGIWNVE 3540
VGENFAGEAT LQRIYTTWEH NMKNHLQVYS YFFTKGKQTC RATLELSPWT MSTLLQVHVS 3600
QLSSLLDLHH FDQEVILKAN TKNQKISWKG GVQVESRVLQ HNAQFSNDQE EIRLDLAGSL 3660
DGQLWDLEAI FLPVYGKSLQ ELLQMDGKRQ YLQASTSLLY TKNPNGYLLS LPVQELADRF 3720
IIPGIKLNDF SGVKIYKKLS TSPFALNLTM LPKVKFPGID LLTQYSTPEG SSVPIFEATI 3780
PEIHLTVSQF TLPKSLPVGN TVFDLNKLAN MIADVDLPSV TLPEQTIVIP PLEFSVPAGI 3840
FIPFFGELTA RAGMASPLYN VTWSAGWKTK ADHVETFLDS MCTSTLQFLE YALKVVETHK 3900
IEEDLLTYNI KGTLQHCDFN VEYNEDGLFK GLWDWQGEAH LDITSPALTD FHLYYKEDKT 3960
SLSASAASST IGTVGLDSST DDQSVELNVY FHPQSPPEKK LSIFKTEWRY KESDGERYIK 4020
INWEEEAASR LLGSLKSNVP KASKAIYDYA NKYHLEYVSS ELRKSLQVNA EHARRMVDEM 4080
NMSFQRVARD TYQNLYEEML AQKSLSIPEN LKKRVLDSIV HVTQKYHMAV MWLMDSFIHF 4140
LKFNRVQFPG YAGTYTVDEL YTIVMKETKK SLSQLFNGLG NLLSYVQNQV EKSRLINDIT 4200
FKCPFFSKPC KLKDLILIFR EELNILSNIG QQDIKFTTIL SSLQGFLERV LDIIEEQIKC 4260
LKDNESTCVA DHINMVFKIQ VPYAFKSLRE DIYFVLGEFN DFLQSILQEG SYKLQQVHQY 4320
MKALREEYFD PSMVGWTVKY YEIEENMVEL IKTLLVSFRD VYSEYSVTAA DFASKMSTQV 4380
EQFVSRDIRE YLSMLTDING KWMEKIAELS IVAKETMKSW VTAVAKIMSD YPQQFHSNLQ 4440
DFSDQLSSYY EKFVGESTRL IDLSIQNYHV FLRYITELLR KLQVATANNV SPYIKLAQGE 4500
LMITF 4505 
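 The sequence block is printed in groups of ten residues with a running residue count at the end of each line. The sketch below shows one way to strip that layout and recover the peptide window around a modified lysine. `clean_sequence` and `site_window` are illustrative names, and the flank of 7 residues is an assumption inferred from the 15-residue peptides listed under Lysine Modification. Only the first three lines of the sequence are used here.

```python
import re


def clean_sequence(block: str) -> str:
    """Remove spaces, line breaks and residue counters from a numbered sequence block."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def site_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position, clipped at the termini."""
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


# First three lines of the sequence block from this entry (residues 1-180).
first_lines = """
MGPRKPALRT PLLLLFLLLF LDTSVWAQDE VLENLSFSCP KDATRFKHLR KYVYNYEAES 60
SSGVQGTADS RSATKINCKV ELEVPQICGF IMRTNQCTLK EVYGFNPEGK ALMKKTKNSE 120
EFAAAMSRYE LKLAIPEGKQ IVLYPDKDEP KYILNIKRGI ISALLVPPET EEDQQELFLD 180
"""

sequence = clean_sequence(first_lines)
print(len(sequence), sequence[157 - 1], site_window(sequence, 157))
```

 Applied to the full block, `len(clean_sequence(...))` can be compared with the stated Protein Length as a quick integrity check.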
Gene Ontology
 GO:0005783; C:endoplasmic reticulum; IDA:MGI.
 GO:0034363; C:intermediate-density lipoprotein particle; IEA:Compara.
 GO:0034362; C:low-density lipoprotein particle; IEA:UniProtKB-KW.
 GO:0034359; C:mature chylomicron; IEA:Compara.
 GO:0034361; C:very-low-density lipoprotein particle; IEA:UniProtKB-KW.
 GO:0031983; C:vesicle lumen; IEA:Compara.
 GO:0012506; C:vesicle membrane; IEA:Compara.
 GO:0017127; F:cholesterol transporter activity; IEA:Compara.
 GO:0008201; F:heparin binding; IEA:UniProtKB-KW.
 GO:0005543; F:phospholipid binding; IEA:Compara.
 GO:0048844; P:artery morphogenesis; IGI:MGI.
 GO:0071379; P:cellular response to prostaglandin stimulus; IEA:Compara.
 GO:0071356; P:cellular response to tumor necrosis factor; IEA:Compara.
 GO:0033344; P:cholesterol efflux; IGI:MGI.
 GO:0042632; P:cholesterol homeostasis; IMP:MGI.
 GO:0008203; P:cholesterol metabolic process; IEA:UniProtKB-KW.
 GO:0009566; P:fertilization; IMP:MGI.
 GO:0001701; P:in utero embryonic development; IMP:MGI.
 GO:0042158; P:lipoprotein biosynthetic process; IGI:MGI.
 GO:0042159; P:lipoprotein catabolic process; IGI:MGI.
 GO:0042953; P:lipoprotein transport; IMP:MGI.
 GO:0034383; P:low-density lipoprotein particle clearance; IEA:Compara.
 GO:0034374; P:low-density lipoprotein particle remodeling; IEA:Compara.
 GO:0007399; P:nervous system development; IMP:MGI.
 GO:0010886; P:positive regulation of cholesterol storage; IEA:Compara.
 GO:0010744; P:positive regulation of macrophage derived foam cell differentiation; IEA:Compara.
 GO:0009791; P:post-embryonic development; IMP:MGI.
 GO:0045540; P:regulation of cholesterol biosynthetic process; IMP:MGI.
 GO:0009743; P:response to carbohydrate stimulus; IEA:Compara.
 GO:0032496; P:response to lipopolysaccharide; IEA:Compara.
 GO:0010269; P:response to selenium ion; IEA:Compara.
 GO:0009615; P:response to virus; IEA:Compara.
 GO:0030317; P:sperm motility; IMP:MGI.
 GO:0007283; P:spermatogenesis; IMP:MGI.
 GO:0019433; P:triglyceride catabolic process; IMP:MGI.
 GO:0006642; P:triglyceride mobilization; IMP:MGI. 
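 Each Gene Ontology line holds a GO identifier, an aspect letter with the term name, and an evidence code with its source. In GO usage, C, F and P stand for cellular component, molecular function and biological process. The sketch below is one possible way to split a line into those parts. `GoAnnotation` and `parse_go_line` are hypothetical names, and the parser assumes that term names contain no semicolons.

```python
from typing import NamedTuple

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line: str) -> GoAnnotation:
    """Parse a line such as 'GO:0005783; C:endoplasmic reticulum; IDA:MGI.'"""
    fields = [part.strip() for part in line.strip().rstrip(".").split(";")]
    go_id, aspect_term, evidence_source = fields
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect], term, evidence, source)


annotation = parse_go_line(" GO:0005783; C:endoplasmic reticulum; IDA:MGI.")
print(annotation.go_id, annotation.aspect, annotation.term, annotation.evidence)
```

 Grouping the parsed records by `evidence` separates electronically inferred annotations (IEA) from those with other evidence codes such as IDA, IGI and IMP.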
Interpro
 IPR022176; ApoB100_C.
 IPR015819; Lipid_transp_b-sht_shell.
 IPR001747; Lipid_transpt_N.
 IPR009454; Lipid_transpt_open_b-sht.
 IPR015816; Vitellinogen_b-sht_N.
 IPR015255; Vitellinogen_open_b-sht.
 IPR015817; Vitellinogen_open_b-sht_sub1.
 IPR015818; Vitellinogen_open_b-sht_sub2.
 IPR011030; Vitellinogen_superhlx. 
Pfam
 PF12491; ApoB100_C
 PF06448; DUF1081
 PF09172; DUF1943
 PF01347; Vitellogenin_N 
SMART
 SM00638; LPD_N 
PROSITE
 PS51211; VITELLOGENIN 
PRINTS