CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-001863
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Complement C4-B 
Protein Synonyms/Alias
 Complement C4 beta chain; Complement C4 alpha chain; C4a anaphylatoxin; Complement C4 gamma chain 
Gene Name
 C4b 
Gene Synonyms/Alias
 C4 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
272       QARYIYGKPVQGVAY  ubiquitination  [1]
302       RGLETQAKLVEGRTH  ubiquitination  [1]
314       RTHISISKDQFQAAL  ubiquitination  [1]
323       QFQAALDKINIGVRD  ubiquitination  [1]
683       KRNVNFQKAVSEKLG  ubiquitination  [1]
688       FQKAVSEKLGQYSSP  ubiquitination  [1]
910       AAANVPLKVVARGVF  ubiquitination  [1]
925       DLGDAVSKILQIEKE  ubiquitination  [1]
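
Each peptide listed above appears to be a 15-residue window: the modified lysine plus seven residues on either side. The following is a minimal Python sketch of that relationship, assuming 1-based positions as used in this record. The function name `lysine_window` and the constant `FRAGMENT` (residues 241-300 copied from the Protein Sequence field of this record) are illustrative and not part of CPLM.

```python
# Residues 241-300 of the protein sequence in this record (spaces removed).
FRAGMENT = "VKITPWKPYILMVPSNSDEIQLDIQARYIYGKPVQGVAYTRFALMDEQGKRTFLRGLETQ"
FRAGMENT_START = 241  # 1-based number of the first residue in FRAGMENT


def lysine_window(fragment, start, position, flank=7):
    """Return the residues within `flank` of the lysine at 1-based `position`.

    `fragment` is a stretch of the protein whose first residue has the
    1-based number `start`. Near a fragment boundary the window is shorter.
    """
    idx = position - start  # convert to a 0-based index into the fragment
    if not 0 <= idx < len(fragment):
        raise ValueError(f"position {position} lies outside the fragment")
    if fragment[idx] != "K":
        raise ValueError(f"residue {position} is {fragment[idx]!r}, not lysine")
    return fragment[max(0, idx - flank): idx + flank + 1]


print(lysine_window(FRAGMENT, FRAGMENT_START, 272))  # QARYIYGKPVQGVAY
```

The same call with the full 1738-residue sequence and `start=1` would be expected to reproduce the other listed peptides, provided the positions and the sequence refer to the same isoform.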
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
 Non-enzymatic component of C3 and C5 convertases and thus essential for the propagation of the classical complement pathway. Covalently binds to immunoglobulins and immune complexes (IC) and enhances the solubilization of immune aggregates and the clearance of IC through CR1 on erythrocytes. Catalyzes the transacylation of the thioester carbonyl group to form ester bonds with carbohydrate antigens (By similarity). 
Sequence Annotation
 DOMAIN 700 734 Anaphylatoxin-like.
 DOMAIN 1589 1736 NTR.
 MOD_RES 1413 1413 Sulfotyrosine.
 MOD_RES 1416 1416 Sulfotyrosine.
 MOD_RES 1417 1417 Sulfotyrosine.
 CARBOHYD 224 224 N-linked (GlcNAc...) (Potential).
 CARBOHYD 743 743 N-linked (GlcNAc...).
 CARBOHYD 1324 1324 N-linked (GlcNAc...).
 CARBOHYD 1387 1387 N-linked (GlcNAc...) (Potential).
 DISULFID 700 726 By similarity.
 DISULFID 701 733 By similarity.
 DISULFID 714 734 By similarity.
 DISULFID 1589 1667 By similarity.
 DISULFID 1612 1736 By similarity.
 CROSSLNK 1006 1009 Isoglutamyl cysteine thioester (Cys-Gln).
Keyword
 Cleavage on pair of basic residues; Complement pathway; Complete proteome; Disulfide bond; Glycoprotein; Immunity; Inflammatory response; Innate immunity; Reference proteome; Secreted; Signal; Sulfation; Thioester bond. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1738 AA 
Protein Sequence
MRLLWGLAWV FSFCASSLQK PRLLLFSPSV VNLGTPLSVG VQLLDAPPGQ EVKGSVFLRN 60
PKGGSCSPKK DFKLSSGDDF VLLSLEVPLE DVRSCGLFDL RRAPHIQLVA QSPWLRNTAF 120
KATETQGVNL LFSSRRGHIF VQTDQPIYNP GQRVRYRVFA LDQKMRPSTD FLTITVENSH 180
GLRVLKKEIF TSTSIFQDAF TIPDISEPGT WKISARFSDG LESNRSTHFE VKKYVLPNFE 240
VKITPWKPYI LMVPSNSDEI QLDIQARYIY GKPVQGVAYT RFALMDEQGK RTFLRGLETQ 300
AKLVEGRTHI SISKDQFQAA LDKINIGVRD LEGLRLYAAT AVIESPGGEM EEAELTSWRF 360
VSSAFSLDLS RTKRHLVPGA HFLLQALVQE MSGSEASNVP VKVSATLVSG SDSQVLDIQQ 420
STNGIGQVSI SFPIPPTVTE LRLLVSAGSL YPAIARLTVQ APPSRGTGFL SIEPLDPRSP 480
SVGDTFILNL QPVGIPAPTF SHYYYMIISR GQIMAMGREP RKTVTSVSVL VDHQLAPSFY 540
FVAYFYHQGH PVANSLLINI QSRDCEGKLQ LKVDGAKEYR NADMMKLRIQ TDSKALVALG 600
AVDMALYAVG GRSHKPLDMS KVFEVINSYN VGCGPGGGDD ALQVFQDAGL AFSDGDRLTQ 660
TREDLSCPKE KKSRQKRNVN FQKAVSEKLG QYSSPDAKRC CQDGMTKLPM KRTCEQRAAR 720
VPQQACREPF LSCCKFAEDL RRNQTRSQAH LARNNHNMLQ EEDLIDEDDI LVRTSFPENW 780
LWRVEPVDSS KLLTVWLPDS MTTWEIHGVS LSKSKGLCVA KPTRVRVFRK FHLHLRLPIS 840
IRRFEQFELR PVLYNYLNDD VAVSVHVTPV EGLCLAGGGM MAQQVTVPAG SARPVAFSVV 900
PTAAANVPLK VVARGVFDLG DAVSKILQIE KEGAIHREEL VYNLDPLNNL GRTLEIPGSS 960
DPNIVPDGDF SSLVRVTASE PLETMGSEGA LSPGGVASLL RLPQGCAEQT MIYLAPTLTA 1020
SNYLDRTEQW SKLSPETKDH AVDLIQKGYM RIQQFRKNDG SFGAWLHRDS STWLTAFVLK 1080
ILSLAQEQVG NSPEKLQETA SWLLAQQLGD GSFHDPCPVI HRAMQGGLVG SDETVALTAF 1140
VVIALHHGLD VFQDDDAKQL KNRVEASITK ANSFLGQKAS AGLLGAHAAA ITAYALTLTK 1200
ASEDLRNVAH NSLMAMAEET GEHLYWGLVL GSQDKVVLRP TAPRSPTEPV PQAPALWIET 1260
TAYALLHLLL REGKGKMADK AASWLTHQGS FHGAFRSTQD TVVTLDALSA YWIASHTTEE 1320
KALNVTLSSM GRNGLKTHGL HLNNHQVKGL EEELKFSLGS TISVKVEGNS KGTLKILRTY 1380
NVLDMKNTTC QDLQIEVKVT GAVEYAWDAN EDYEDYYDMP AADDPSVPLQ PVTPLQLFEG 1440
RRSRRRREAP KVVEEQESRV QYTVCIWRNG KLGLSGMAIA DITLLSGFHA LRADLEKLTS 1500
LSDRYVSHFE TDGPHVLLYF DSVPTTRECV GFGASQEVVV GLVQPSSAVL YDYYSPDHKC 1560
SVFYAAPTKS QLLATLCSGD VCQCAEGKCP RLLRSLERRV EDKDGYRMRF ACYYPRVEYG 1620
FTVKVLREDG RAAFRLFESK ITQVLHFRKD TMASIGQTRN FLSRASCRLR LEPNKEYLIM 1680
GMDGETSDNK GDPQYLLDSN TWIEEMPSEQ MCKSTRHRAA CFQLKDFLME FSSRGCQV 1738 
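
The sequence above is laid out in groups of ten residues, with a running residue count at the end of each line. The following is a minimal Python sketch for turning that layout back into one contiguous string and checking each count along the way. The function name `parse_sequence_block` is hypothetical, and only the first two lines of the sequence are reproduced here to keep the example short.

```python
# First two lines of the sequence block from this record.
BLOCK = """\
MRLLWGLAWV FSFCASSLQK PRLLLFSPSV VNLGTPLSVG VQLLDAPPGQ EVKGSVFLRN 60
PKGGSCSPKK DFKLSSGDDF VLLSLEVPLE DVRSCGLFDL RRAPHIQLVA QSPWLRNTAF 120
"""


def parse_sequence_block(text):
    """Join space-separated residue groups and validate the line counters.

    Each non-empty line is assumed to end with an integer giving the
    cumulative number of residues up to the end of that line.
    """
    sequence = ""
    for line in text.strip().splitlines():
        *groups, counter = line.split()
        sequence += "".join(groups)
        if len(sequence) != int(counter):
            raise ValueError(
                f"expected {counter} residues after this line, got {len(sequence)}"
            )
    return sequence


seq = parse_sequence_block(BLOCK)
print(len(seq))  # 120
```

Applied to the full block, the final counter should agree with the Protein Length field (1738 AA); a mismatch would point to a copy or extraction error rather than a property of the protein.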
Gene Ontology
 GO:0005615; C:extracellular space; IEA:InterPro.
 GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
 GO:0006958; P:complement activation, classical pathway; IEA:UniProtKB-KW.
 GO:0006954; P:inflammatory response; IEA:UniProtKB-KW.
 GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
 GO:0010951; P:negative regulation of endopeptidase activity; IEA:GOC. 
Interpro
 IPR009048; A-macroglobulin_rcpt-bd.
 IPR011626; A2M_comp.
 IPR002890; A2M_N.
 IPR011625; A2M_N_2.
 IPR000020; Anaphylatoxin/fibulin.
 IPR018081; Anaphylatoxin_.
 IPR001840; Anaphylatoxn.
 IPR001599; Macroglobln_a2.
 IPR019742; MacrogloblnA2_CS.
 IPR019565; MacrogloblnA2_thiol-ester-bond.
 IPR001134; Netrin_domain.
 IPR018933; Netrin_module_non-TIMP.
 IPR008930; Terpenoid_cyclase/PrenylTrfase.
 IPR008993; TIMP-like_OB-fold. 
Pfam
 PF00207; A2M
 PF07678; A2M_comp
 PF01835; A2M_N
 PF07703; A2M_N_2
 PF07677; A2M_recep
 PF01821; ANATO
 PF01759; NTR
 PF10569; Thiol-ester_cl 
SMART
 SM00104; ANATO
 SM00643; C345C 
PROSITE
 PS00477; ALPHA_2_MACROGLOBULIN
 PS01177; ANAPHYLATOXIN_1
 PS01178; ANAPHYLATOXIN_2
 PS50189; NTR 
PRINTS
 PR00004; ANAPHYLATOXN.