CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-004894
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Pregnancy zone protein 
Protein Synonyms/Alias
 C3 and PZP-like alpha-2-macroglobulin domain-containing protein 6 
Gene Name
 PZP 
Gene Synonyms/Alias
 CPAMD6 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position    Peptide            Type         References
1399        VSGFIPLKPTVKMLE    glycation    [1]
1403        IPLKPTVKMLERSSS    glycation    [1]
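The two peptides listed for this record are each 15 residues long with the modified lysine at the centre, which suggests a window of 7 flanking residues on either side of the reported position. The sketch below checks that reading against residues 1381-1440 of this record's sequence. The names `FRAGMENT`, `FRAGMENT_START` and `window` are illustrative only, and the 7-residue flank is an assumption inferred from this entry, not a documented CPLM rule.

```python
# Residues 1381-1440 of this record, copied from its Protein Sequence block.
FRAGMENT = (
    "ASNMVIVDVK" "MVSGFIPLKP" "TVKMLERSSS"
    "VSRTEVSNNH" "VLIYVEQVTN" "QTLSFSFMVL"
)
FRAGMENT_START = 1381  # 1-based position of the first residue in FRAGMENT


def window(position, flank=7):
    """Return residues position-flank .. position+flank (1-based, inclusive).

    The flank of 7 is an assumption inferred from the 15-residue peptides
    shown in this record.
    """
    i = position - FRAGMENT_START  # 0-based index into FRAGMENT
    if i - flank < 0 or i + flank >= len(FRAGMENT):
        raise ValueError("window extends beyond the stored fragment")
    return FRAGMENT[i - flank : i + flank + 1]


for pos in (1399, 1403):
    peptide = window(pos)
    # The centre residue (index 7 of 15) should be the modified lysine.
    print(pos, peptide, peptide[7])
```

Both windows reproduce the peptides in the table, and the centre residue is K in each case.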
Reference
 [1] Proteomic profiling of nonenzymatically glycated proteins in human plasma and erythrocyte membranes.
 Zhang Q, Tang N, Schepmoes AA, Phillips LS, Smith RD, Metz TO.
 J Proteome Res. 2008 May;7(5):2025-32. [PMID: 18396901]
Functional Description
 Is able to inhibit all four classes of proteinases by a unique 'trapping' mechanism. This protein has a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein which traps the proteinase. The entrapped enzyme remains active against low molecular weight substrates (activity against high molecular weight substrates is greatly reduced). Following cleavage in the bait region, a thioester bond is hydrolyzed and mediates the covalent binding of the protein to the proteinase. 
Sequence Annotation
 REGION 685 735 Bait region.
 CARBOHYD 24 24 N-linked (GlcNAc...) (Potential).
 CARBOHYD 54 54 N-linked (GlcNAc...) (Potential).
 CARBOHYD 69 69 N-linked (GlcNAc...) (Potential).
 CARBOHYD 246 246 N-linked (GlcNAc...) (Potential).
 CARBOHYD 392 392 N-linked (GlcNAc...) (Potential).
 CARBOHYD 406 406 N-linked (GlcNAc...).
 CARBOHYD 753 753 N-linked (GlcNAc...) (Potential).
 CARBOHYD 875 875 N-linked (GlcNAc...) (Potential).
 CARBOHYD 932 932 N-linked (GlcNAc...).
 CARBOHYD 997 997 N-linked (GlcNAc...).
 CARBOHYD 1430 1430 N-linked (GlcNAc...).
 CROSSLNK 978 981 Isoglutamyl cysteine thioester (Cys-Gln).  
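Each annotation line above appears to follow the pattern "feature key, start, end, free-text note". As a minimal sketch under that assumption, the lines can be split into structured fields as follows. `Feature`, `FEATURE_RE` and `parse_feature` are hypothetical names introduced here, not part of any CPLM or UniProt tool.

```python
import re
from typing import NamedTuple


class Feature(NamedTuple):
    key: str    # feature key, e.g. CARBOHYD
    start: int  # first residue, 1-based
    end: int    # last residue, 1-based
    note: str   # free-text description


# Assumed layout: KEY START END NOTE, separated by whitespace.
FEATURE_RE = re.compile(r"^\s*(\S+)\s+(\d+)\s+(\d+)\s+(.*?)\s*$")


def parse_feature(line):
    """Parse one annotation line into a Feature, or raise ValueError."""
    match = FEATURE_RE.match(line)
    if match is None:
        raise ValueError(f"unrecognised annotation line: {line!r}")
    key, start, end, note = match.groups()
    return Feature(key, int(start), int(end), note)


# Three lines copied from the Sequence Annotation block of this record.
lines = [
    " REGION 685 735 Bait region.",
    " CARBOHYD 406 406 N-linked (GlcNAc...).",
    " CROSSLNK 978 981 Isoglutamyl cysteine thioester (Cys-Gln).  ",
]
features = [parse_feature(line) for line in lines]
for feature in features:
    print(feature)
```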
Keyword
 Alternative splicing; Bait region; Complete proteome; Direct protein sequencing; Disulfide bond; Glycoprotein; Polymorphism; Protease inhibitor; Reference proteome; Secreted; Serine protease inhibitor; Signal; Thioester bond. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1482 AA 
Protein Sequence
MRKDRLLHLC LVLLLILLSA SDSNSTEPQY MVLVPSLLHT EAPKKGCVLL SHLNETVTVS 60
ASLESGRENR SLFTDLVAEK DLFHCVSFTL PRISASSEVA FLSIQIKGPT QDFRKRNTVL 120
VLNTQSLVFV QTDKPMYKPG QTVRFRVVSV DENFRPRNEL IPLIYLENPR RNRIAQWQSL 180
KLEAGINQLS FPLSSEPIQG SYRVVVQTES GGRIQHPFTV EEFVLPKFEV KVQVPKIISI 240
MDEKVNITVC GEYTYGKPVP GLATVSLCRK LSRVLNCDKQ EVCEEFSQQL NSNGCITQQV 300
HTKMLQITNT GFEMKLRVEA RIREEGTDLE VTANRISEIT NIVSKLKFVK VDSHFRQGIP 360
FFAQVLLVDG KGVPIPNKLF FISVNDANYY SNATTNEQGL AQFSINTTSI SVNKLFVRVF 420
TVHPNLCFHY SWVAEDHQGA QHTANRVFSL SGSYIHLEPV AGTLPCGHTE TITAHYTLNR 480
QAMGELSELS FHYLIMAKGV IVRSGTHTLP VESGDMKGSF ALSFPVESDV APIARMFIFA 540
ILPDGEVVGD SEKFEIENCL ANKVDLSFSP AQSPPASHAH LQVAAAPQSL CALRAVDQSV 600
LLMKPEAELS VSSVYNLLTV KDLTNFPDNV DQQEEEQGHC PRPFFIHNGA IYVPLSSNEA 660
DIYSFLKGMG LKVFTNSKIR KPKSCSVIPS VSAGAVGQGY YGAGLGVVER PYVPQLGTYN 720
VIPLNNEQSS GPVPETVRSY FPETWIWELV AVNSSGVAEV GVTVPDTITE WKAGAFCLSE 780
DAGLGISSTA SLRAFQPFFV ELTMPYSVIR GEVFTLKATV LNYLPKCIRV SVQLKASPAF 840
LASQNTKGEE SYCICGNERQ TLSWTVTPKT LGNVNFSVSA EAMQSLELCG NEVVEVPEIK 900
RKDTVIKTLL VEAEGIEQEK TFSSMTCASG ANVSEQLSLK LPSNVVKESA RASFSVLGDI 960
LGSAMQNIQN LLQMPYGCGE QNMVLFAPNI YVLNYLNETQ QLTQEIKAKA VGYLITGYQR 1020
QLNYKHQDGS YSTFGERYGR NQGNTWLTAF VLKTFAQARS YIFIDEAHIT QSLTWLSQMQ 1080
KDNGCFRSSG SLLNNAIKGG VEDEATLSAY VTIALLEIPL PVTNPIVRNA LFCLESAWNV 1140
AKEGTHGSHV YTKALLAYAF SLLGKQNQNR EILNSLDKEA VKEDNLVHWE RPQRPKAPVG 1200
HLYQTQAPSA EVEMTSYVLL AYLTAQPAPT SGDLTSATNI VKWIMKQQNA QGGFSSTQDT 1260
VVALHALSRY GAATFTRTEK TAQVTVQDSQ TFSTNFQVDN NNLLLLQQIS LPELPGEYVI 1320
TVTGERCVYL QTSMKYNILP EKEDSPFALK VQTVPQTCDG HKAHTSFQIS LTISYTGNRP 1380
ASNMVIVDVK MVSGFIPLKP TVKMLERSSS VSRTEVSNNH VLIYVEQVTN QTLSFSFMVL 1440
QDIPVGDLKP AIVKVYDYYE TDESVVAEYI APCSTDTEHG NV 1482 
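The sequence block is laid out in groups of 10 residues, with a running residue count at the end of each line. The sketch below joins the groups and checks each count, using the last two lines of this record as input. `parse_sequence_block` is a hypothetical helper, and the layout it expects is inferred from this entry only.

```python
def parse_sequence_block(lines, start=1):
    """Join 10-residue groups and verify the running count on each line.

    `start` is the 1-based position of the first residue in `lines`.
    """
    parts = []
    count = start - 1
    for line in lines:
        *groups, total = line.split()  # the last token is the running count
        residues = "".join(groups)
        count += len(residues)
        if count != int(total):
            raise ValueError(f"expected count {total}, got {count}")
        parts.append(residues)
    return "".join(parts)


# Last two lines of the Protein Sequence block (residues 1381-1482).
tail_lines = [
    "ASNMVIVDVK MVSGFIPLKP TVKMLERSSS VSRTEVSNNH VLIYVEQVTN QTLSFSFMVL 1440",
    "QDIPVGDLKP AIVKVYDYYE TDESVVAEYI APCSTDTEHG NV 1482 ",
]
tail = parse_sequence_block(tail_lines, start=1381)
print(len(tail), tail[1399 - 1381], tail[1403 - 1381])
```

The final running count, 1482, agrees with the Protein Length field, and the residues at positions 1399 and 1403 are both lysine, consistent with the Lysine Modification entries.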
Gene Ontology
 GO:0005615; C:extracellular space; IEA:InterPro.
 GO:0070062; C:extracellular vesicular exosome; IDA:UniProtKB.
 GO:0004866; F:endopeptidase inhibitor activity; TAS:UniProtKB.
 GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:UniProtKB-KW.
 GO:0007565; P:female pregnancy; TAS:UniProtKB. 
Interpro
 IPR009048; A-macroglobulin_rcpt-bd.
 IPR011626; A2M_comp.
 IPR002890; A2M_N.
 IPR011625; A2M_N_2.
 IPR001599; Macroglobln_a2.
 IPR019742; MacrogloblnA2_CS.
 IPR019565; MacrogloblnA2_thiol-ester-bond.
 IPR008930; Terpenoid_cyclase/PrenylTrfase.
 IPR010916; TonB_box_CS. 
Pfam
 PF00207; A2M
 PF07678; A2M_comp
 PF01835; A2M_N
 PF07703; A2M_N_2
 PF07677; A2M_recep
 PF10569; Thiol-ester_cl 
SMART
  
PROSITE
 PS00477; ALPHA_2_MACROGLOBULIN 
PRINTS