CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041899
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Gag-pro-pol polyprotein 
Protein Synonyms/Alias
  
Gene Name
 gag-pro-pol 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
353PTNLAKVKGITQGPNubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
  
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1737 AA 
Protein Sequence
MGQTVTTPLS LTLQHWGDVQ RIASNQSVDV KKRRWVTFCS AEWPTFNVGW PQDGTFNLGI 60
ISQVKSRVFC PGPHGHPDQV PYIVTWEALA YDPPPWVKPF VSPKPPPLPT APVLPPGPSA 120
QPPSRSALYP ALTPSIKSKP PKPQVLPDSG GPLIDLLTED PPPYGAQPSS SARENNEEEA 180
ATTSEVSPPS PMVSRLRGRR DPPAADSTTS QAFPLRMGGD GQLQYWPFSS SDLYNWKNNN 240
PSFSEDPGKL TALIESVLIT HQPTWDDCQQ LLGTLLTGEE KQRVLLEARK AVRGNDGRPT 300
QLPNEVNAAF PLERPDWDYT TTEGRNHLVL YRQLLLAGLQ NAGRSPTNLA KVKGITQGPN 360
ESPSAFLERL KEAYRRYTPY DPEDPGQETN VSMSFIWQSA PDIGRKLERL EDLKSKTLGD 420
LVREAEKIFN KRETPEEREE RIRREIEEKE ERRRAEDEQR ERERDRRRHR EMSKLLATVV 480
IGQRQDRQGG ERRRPQLDKD QCAYCKEKGH WAKDCPKKPR GPRGPRPQTS LLTLGDQGGQ 540
GQEPPPEPRI TLKVGGQPVT FLVDTGAQHS VLTQNPGPLS DKSAWVQGAT GGKRYRWTTD 600
RKVHLATGKV THSFLHVPDC PYPLLGRDLL TKLKAQIHFE GSGAQVVGPM GQPLQVLTLN 660
IEDEYRLHET SKEPDVPLGS TWLSDFPQAW AETGGMGLAV RQAPLIIPLK ATSTPVSIKQ 720
YPMSQEARLG IKPHIQRLLD QGILVPCQSP WNTPLLPVKK PGTNDYRPVQ DLREVNKRVE 780
DIHPTVPNPY NLLSGLPPSH QWYTVLDLKD AFFCLRLHPT SQPLFAFEWR DPEMGISGQL 840
TWTRLPQGFK NSPTLFDEAL HRDLADFRIQ HPDLILLQYV DDLLLAATSE QDCQRGTRAL 900
LQTLGNLGYR ASAKKAQICQ KQVKYLGYLL KEGQRWLTEA RKETVMGQPT PKTPRQLREF 960
LGTAGFCRLW IPGFAEMAAP LYPLTKTGTL FNWGPDQQKA YQEIKQALLT APALGLPDLT 1020
KPFELFVDEK QGYAKGVLTQ KLGPWRRPVA YLSKKLDPVA AGWPPCLRMV AAIAVLTKDA 1080
GKLTMGQPLV ILAPHAVEAL VKQPPDRWLS NARMTHYQAM LLDTDRVQFG PVVALNPATL 1140
LPLPGEETPH DCLEILAETH GTRPDLTDQP LPNADHTWYT DGSSYLHEGQ RRAGAAVTTE 1200
TEVIWARALP AGTSAQRAEL IALTQALKMA EGKKLNVYTD SRYAFATAHV HGEIYRRRGL 1260
LTSEGKEIKN KSEILALLKA LFLPKRLSII HCPGHQRGNS AEARGNRMAD QAAREAALRT 1320
DIETSTLLIE TSAPYTSSFF HYTETDKRDL DTLGAAYDET KRYWVFQGKP VMPSQDTFEL 1380
LDFLHQLTHL SYQKMRALLD RKESPYYMLN KDKILHEVAE SCQACVQVNA SKAKVGPGVR 1440
VRGHRPGTHW EIDFTEVKPG LYGYKYLLVF VDTFSGWVEA FPTKHETAKV VTKKLLEEIF 1500
PRFGMPQVLG TDNGPAFISQ VSQSVAKLLG IDWKLHCAYR PQSSGQVERM NRTIKETLTK 1560
LTLATGARDW VLLLPLALYR ARNTPGPHGL TPYEILYGAP PPLVNFHDPE MSKFTNSPSL 1620
QAHLQALQAV QREVWKPLAA AYQDQQDQPV IPHPFRVGDT VWVRRHQTKN LEPRWKGPYT 1680
VLLTTPTALK VDGIAAWIHA AHVKAATTPP AGTASGPTWK VQRSQNPLKI RLTRGAP 1737 
Gene Ontology
 GO:0019028; C:viral capsid; IEA:InterPro.
 GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
 GO:0004523; F:ribonuclease H activity; IEA:InterPro.
 GO:0003723; F:RNA binding; IEA:InterPro.
 GO:0003964; F:RNA-directed DNA polymerase activity; IEA:InterPro.
 GO:0005198; F:structural molecule activity; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0015074; P:DNA integration; IEA:InterPro.
 GO:0006508; P:proteolysis; IEA:InterPro.
 GO:0006278; P:RNA-dependent DNA replication; IEA:InterPro.
 GO:0019068; P:virion assembly; IEA:InterPro. 
Interpro
 IPR000840; G_retro_matrix_N.
 IPR002079; Gag_p12.
 IPR003036; Gag_p30.
 IPR001584; Integrase_cat-core.
 IPR018061; Pept_A2A_retrovirus_sg.
 IPR001995; Peptidase_A2_cat.
 IPR021109; Peptidase_aspartic.
 IPR001969; Peptidase_aspartic_AS.
 IPR008919; Retrov_capsid_N.
 IPR010999; Retrovr_matrix_N.
 IPR012337; RNaseH-like_dom.
 IPR002156; RNaseH_domain.
 IPR000477; RVT.
 IPR001878; Znf_CCHC. 
Pfam
 PF01140; Gag_MA
 PF01141; Gag_p12
 PF02093; Gag_p30
 PF00075; RNase_H
 PF00665; rve
 PF00077; RVP
 PF00078; RVT_1 
SMART
 SM00343; ZnF_C2HC 
PROSITE
 PS50175; ASP_PROT_RETROV
 PS00141; ASP_PROTEASE
 PS50994; INTEGRASE
 PS50879; RNASE_H
 PS50878; RT_POL
 PS50158; ZF_CCHC 
PRINTS