CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038353
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Apolipoprotein B-100 
Protein Synonyms/Alias
  
Gene Name
 Apob 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
144PKYILNIKRGIISALubiquitination[1]
307TKSTSSPKQADAVLKubiquitination[1]
421QEIFNTAKEQQSRATubiquitination[1]
512KPSLLIQKAALQALRubiquitination[1]
769AYLRILGKELSFVRLubiquitination[1]
1150AYGSTISKRVTWRYDacetylation[2]
1272IPLPLGGKSSKDLKMubiquitination[1]
1795LNVGGNFKGTYQNNEubiquitination[1]
1958FKTKLNDKVYSQDFEubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
 [2] Quantitative assessment of the impact of the gut microbiota on lysine epsilon-acetylation of host proteins using gnotobiotic mice.
 Simon GM, Cheng J, Gordon JI.
 Proc Natl Acad Sci U S A. 2012 Jul 10;109(28):11133-8. [PMID: 22733758
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 4456 AA 
Protein Sequence
MGPRKPALRT PLLLLFLLLF LDTSVWAQDA TRFKHLRKYV YNYEAESSSG VQGTADSRSA 60
TKINCKVELE VPQICGFIMR TNQCTLKEVY GFNPEGKALM KKTKNSEEFA AAMSRYELKL 120
AIPEGKQIVL YPDKDEPKYI LNIKRGIISA LLVPPETEED QQELFLDTVY GNCSTQVTVN 180
SRKGTVPTEM STERNLQQCD GFQPISTSVS PLALIKGLVH PLSTLISSSQ TCQYTLDPKR 240
KHVSEAVCDE QHLFLPFSYK NKYGIMTRVT QKLSLEDTPK INSRFFSEGT NRMGLAFEST 300
KSTSSPKQAD AVLKTLQELK KLSISEQNAQ RANLFNKLVT ELRGLTGEAI TSLLPQLIEV 360
SSPITLQALV QCGQPQCYTH ILQWLKTEKA HPLLVDIVTY LMALIPNPST QRLQEIFNTA 420
KEQQSRATLY ALSHAVNSYF DVDHSRSPVL QDIAGYLLKQ IDNECTGNED HTFLILRVIG 480
NMGRTMEQVM PALKSSVLSC VRSTKPSLLI QKAALQALRK MELEDEVRTI LFDTFVNGVA 540
PVEKRLAAYL LLMKNPSSSD INKIAQLLQW EQSEQVKNFV ASHIANILNS EELYVQDLKV 600
LIKNALENSQ FPTIMDFRKF SRNYQISKSA SLPMFDPVSV KIEGNLIFDP SSYLPRESLL 660
KTTLTVFGLA SLDLFEIGLE GKGFEPTLEA LFGKQGFFPD SVNKALYWVN GRVPDGVSKV 720
LVDHFGYTTD GKHEQDMVNG IMPIVDKLIK DLKSKEIPEA RAYLRILGKE LSFVRLQDLQ 780
VLGKLLLSGA QTLQGIPQMV VQAIREGSKN DLFLHYIFMD NAFELPTGAG LQLQVSSSGV 840
FTPGIKAGVR LELANIQAEL VAKPSVSLEF VTNMGIIIPD FAKSSVQMNT NFFHESGLEA 900
RVALKAGQLK VIIPSPKRPV KLFSGSNTLH LVSTTKTEVI PPLVENRQSW STCKPLFTGM 960
NYCTTGAYSN ASSTESASYY PLTGDTRYEL ELRPTGEVEQ YSATATYELL KEDKSLVDTL 1020
KFLVQAEGVQ QSEATVLFKY NRRSRTLSSE VLIPGFDVNF GTILRVNDES AKDKNTYKLI 1080
LDIQNKKITE VSLVGHLSYD KKGDGKIKGV VSIPRLQAEA RSEVHTHWSS TKLLFQMDSS 1140
ATAYGSTISK RVTWRYDNEI IEFDWNTGTN VDTKKVASNF PVDLSHYPRM LHEYANGLLD 1200
HRVPQTDVTF RDMGSKLIVD HLNSLSELNL LKMGLSDFHI PDNLFLKTDG RVKYTMNRNK 1260
INIDIPLPLG GKSSKDLKMP ESVRTPALNF KSVGFHLPSR EVQVPTFTIP KTHQLQVPLL 1320
GVLDLSTNVY SNLYNWSASY TGGNTSRDHF SLQAQYRMKT DSVVDLFSYS VQGSGETTYD 1380
SKNTFTLSCD GSLHHKFLDS KFKVSHVEKF GNSPVSKGLL TFETSSALGP QMSATVHLDS 1440
KKKQHLYVKD IKVDGQFRAS SFYAQGKYGL SCERDVTTGQ LSGESNMRFN STYFQGTNQI 1500
VGMYQDGALS ITSTSDLQDG IFKNTASLKY ENYELTLKSD SSGQYENFAA SNKLDVTFST 1560
QSALLRSEHQ ANYKSLRLVT LLSGSLTSQG VELNADILGT DKINTGAHKA TLKIARDGLS 1620
TSATTNLKYS PLLLENELNA ELGLSGASMK LSTNGRFKEH HAKFSLDGRA ALTEVSLGSI 1680
YQAMILGADS KNIFNFKLSR EGLRLSNDLM GSYAEMKLDH THSLNIAGLS LDFFSKMDNI 1740
YSGDKFYKQN FNLQLQPYSF ITTLSNDLRY GALDLTNNGR FRLEPLKLNV GGNFKGTYQN 1800
NELKHIYTIS YTDLVVASYR ADTVAKVQGV EFSHRLNADI EGLTSSVDVT TSYNSDPLHF 1860
NNVFHFSLAP FTLGIDTHTS GDGKLSFWGE HTGQLYSKFL LKAEPLALIV SHDYKGSTSH 1920
SLPYESSIST ALEHTVSALL TPAEQTSTWK FKTKLNDKVY SQDFEAYNTK DKIGVELSGR 1980
ADLSGLYSPI KLPFFYSEPV NVLNGLEVND AVDKPQEFTI IAVVKYDKNQ DVHTINLPFF 2040
KSLPDYLERN RRGMISLLEA MRGELQRLSV DQFVRKYRAA LSRLPQQIHH YLNASDWERQ 2100
VAGAKEKITS FMENYRITDN DVLIAIDSAK INFNEKLSQL ETYAIQFDQY IKDNYDPHDL 2160
KRTIAEIIDR IIEKLKILDE QYHIRVNLAK SIHNLYLFVE NVDLNQVSSS NTSWIQNVDS 2220
NYQVRIQIQE KLQQLRTQIQ NIDIQQLAAE VKRQMDAIDV TMHLDQLRTA ILFQRISDII 2280
DRVKYFVMNL IEDFKVTEKI NTFRVIVREL IEKYEVDQHI QVLMDKSVEL AHRYSLSEPL 2340
QKLSNVLQRI EIKDYYEKLV GFIDDTVEWL KALSFKNTIE ELNRLTDMLV KKLKAFDYHQ 2400
FVDKTNSKIR EMTQRINAEI QALKLPQKME ALKLLVEDFK TTVSNSLERL KDTKVTVVID 2460
WLQDILTQMK DHFQDTLEDV RDRIYQMDIQ RELEHFLSLV NQVYSTLVTY MSDWWTLTAK 2520
NITDFAEQYS IQNWAESIKV LVEQGFIVPE MQTFLWTMPA FEVSLRALQE GNFQTPVFIV 2580
PLTDLRIPSI RINFKMLKNI KIPLRFSTPE FTLLNTFHVH SFTIDLLEIK AKIIRTIDQI 2640
LSSELQWPLP EMYLRDLDVV NIPLARLTLP DFHVPEITIP EFTIPNVNLK DLHVPDLHIP 2700
EFQLPHLSHT IEIPAFGKLH SILKIQSPLF ILDANANIQN VTTSGNKAEI VASVTAKGES 2760
QFEALNFDFQ AQAQFLELNP HPPVLKESMN FSSKHVRMEH EGEIVFDGKA IEGKSDTVAS 2820
LHTEKNEVEF NNGMTVKVNN QLTLDSHTKY FHKLSVPRLD FSSKASLNNE IKTLLEAGHV 2880
ALTSSGTGSW NWACPNFSDE GIHSSQISFT VDGPIAFVGL SNNINGKHLR VIQKLTYESG 2940
FLNYSKFEVE SKVESQHVGS SILTANGRAL LKDAKAEMTG EHNANLNGKV IGTLKNSLFF 3000
SAQPFEITAS TNNEGNLKVG FPLKLTGKID FLNNYALFLS PRAQQASWQA STRFNQYKYN 3060
QNFSAINNEH NIEASIGMNG DANLDFLNIP LTIPEINLPY TEFKTPLLKD FSIWEETGLK 3120
EFLKTTKQSF DLSVKAQYKK NSDKHSIVVP LGMFYEFILN NVNSWDRKFE KVRNNALHFL 3180
TTSYNEAKIK VDKYKTENSL NQPSGTFQNH GYTIPVVNIE VSPFAVETLA SSHVIPTAIS 3240
TPSVTIPGPN IMVPSYKLVL PPLELPVFHG PGNLFKFFLP DFKGFNTIDN IYIPAMGNFT 3300
YDFSFKSSVI TLNTNAGLYN QSDIVAHFLS SSSFVTDALQ YKLEGTSRLM RKRGLKLATA 3360
VSLTNKFVKG SHDSTISLTK KNMEASVRTT ANLHAPIFSM NFKQELNGNT KSKPTVSSSI 3420
ELNYDFNSSK LHSTATGGID HKFSLESLTS YFSIESFTKG NIKSSFLSQE YSGSVANEAN 3480
VYLNSKGTRS SVRLQGASKV DGIWNVEVGE NFAGEATLQR IYTTWEHNMK NHLQVYSYFF 3540
TKGKQTCRAT LELSPWTMST LLQVHVSQLS SLLDLHHFDQ EVILKANTKN QKISWKGGVQ 3600
VESRVLQHNA QFSNDQEEIR LDLAGSLDGQ LWDLEAIFLP VYGKSLQELL QMDGKRQYLQ 3660
ASTSLLYTKN PNGYLLSLPV QELADRFIIP GIKLNDFSGV KIYKKLSTSP FALNLTMLPK 3720
VKFPGIDLLT QYSTPEGSSV PIFEATIPEI HLTVSQFTLP KSLPVGNTVF DLNKLANMIA 3780
DVDLPSVTLP EQTIVIPPLE FSVPAGIFIP FFGELTARAG MASPLYNVTW SAGWKTKADH 3840
VETFLDSMCT STLQFLEYAL KVVETHKIEE DLLTYNIKGT LQHCDFNVEY NEDGLFKGLW 3900
DWQGEAHLDI TSPALTDFHL YYKEDKTSLS ASAASSTIGT VGLDSSTDDQ SVELNVYFHP 3960
QSPPEKKLSI FKTEWRYKES DGERYIKINW EEEAASRLLG SLKSNVPKAS KAIYDYANKY 4020
HLEYVSSELR KSLQVNAEHA RRMVDEMNMS FQRVARDTYQ NLYEEMLAQK SLSIPENLKK 4080
RVLDSIVHVT QKYHMAVMWL MDSFIHFLKF NRVQFPGYAG TYTVDELYTI VMKETKKSLS 4140
QLFNGLGNLL SYVQNQVEKS RLINDITFKC PFFSKPCKLK DLILIFREEL NILSNIGQQD 4200
IKFTTILSSL QGFLERVLDI IEEQIKCLKD NESTCVADHI NMVFKIQVPY AFKSLREDIY 4260
FVLGEFNDFL QSILQEGSYK LQQVHQYMKA LREEYFDPSM VGWTVKYYEI EENMVELIKT 4320
LLVSFRDVYS EYSVTAADFA SKMSTQVEQF VSRDIREYLS MLTDINGKWM EKIAELSIVA 4380
KETMKSWVTA VAKIMSDYPQ QFHSNLQDFS DQLSSYYEKF VGESTRLIDL SIQNYHVFLR 4440
YITELLRKLQ VATANN 4456 
Gene Ontology
 GO:0005783; C:endoplasmic reticulum; IDA:MGI.
 GO:0005319; F:lipid transporter activity; IEA:InterPro.
 GO:0048844; P:artery morphogenesis; IGI:MGI.
 GO:0033344; P:cholesterol efflux; IGI:MGI.
 GO:0042632; P:cholesterol homeostasis; IMP:MGI.
 GO:0009566; P:fertilization; IMP:MGI.
 GO:0001701; P:in utero embryonic development; IMP:MGI.
 GO:0042158; P:lipoprotein biosynthetic process; IGI:MGI.
 GO:0042159; P:lipoprotein catabolic process; IGI:MGI.
 GO:0042953; P:lipoprotein transport; IMP:MGI.
 GO:0007399; P:nervous system development; IMP:MGI.
 GO:0009791; P:post-embryonic development; IMP:MGI.
 GO:0045540; P:regulation of cholesterol biosynthetic process; IMP:MGI.
 GO:0030317; P:sperm motility; IMP:MGI.
 GO:0007283; P:spermatogenesis; IMP:MGI.
 GO:0019433; P:triglyceride catabolic process; IMP:MGI.
 GO:0006642; P:triglyceride mobilization; IMP:MGI. 
Interpro
 IPR022176; ApoB100_C.
 IPR015819; Lipid_transp_b-sht_shell.
 IPR001747; Lipid_transpt_N.
 IPR009454; Lipid_transpt_open_b-sht.
 IPR015816; Vitellinogen_b-sht_N.
 IPR015255; Vitellinogen_open_b-sht.
 IPR015817; Vitellinogen_open_b-sht_sub1.
 IPR015818; Vitellinogen_open_b-sht_sub2.
 IPR011030; Vitellinogen_superhlx. 
Pfam
 PF12491; ApoB100_C
 PF06448; DUF1081
 PF09172; DUF1943
 PF01347; Vitellogenin_N 
SMART
 SM00638; LPD_N 
PROSITE
 PS51211; VITELLOGENIN 
PRINTS