CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-008065
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein PRRC2A 
Protein Synonyms/Alias
 HLA-B-associated transcript 2; Large proline-rich protein BAT2; Proline-rich and coiled-coil-containing protein 2A; Protein G2 
Gene Name
 PRRC2A 
Gene Synonyms/Alias
 BAT2; G2 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 27        LNLFDTYKGKSLEIQ  acetylation     [1, 2]
 35        GKSLEIQKPAVAPRH  acetylation     [2, 3, 4]
 502       EKFGAPDKRLKAEPA  acetylation     [4]
 1196      AKSLAPKKPPTGPLP  acetylation     [1]
 1500      PGGHPRHKPGLPQAP  ubiquitination  [5]
 2056      ELHPVELKPFQDYQK  ubiquitination  [4, 5, 6]
 2063      KPFQDYQKLSSNLGG  ubiquitination  [4, 5, 6]
 2091      SGLNSRLKATPSTYS  ubiquitination  [6]
Reference
 [1] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
 [2] Proteomic investigations reveal a role for RNA processing factor THRAP3 in the DNA damage response.
 Beli P, Lukashchuk N, Wagner SA, Weinert BT, Olsen JV, Baskcomb L, Mann M, Jackson SP, Choudhary C.
 Mol Cell. 2012 Apr 27;46(2):212-25. [PMID: 22424773]
 [3] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [5] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [6] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
Functional Description
 May play a role in the regulation of pre-mRNA splicing. 
Sequence Annotation
 REPEAT 41 95 1-1.
 REPEAT 98 154 1-2.
 REPEAT 281 337 1-3.
 REPEAT 337 430 2-1.
 REPEAT 488 561 2-2.
 REPEAT 1757 1812 1-4.
 REPEAT 1916 1965 3-1.
 REPEAT 1982 2031 3-2.
 REPEAT 2057 2106 3-3.
 REGION 41 1812 4 X 57 AA type A repeats.
 REGION 337 561 2 X type B repeats.
 REGION 1916 2106 3 X 50 AA type C repeats.
 MOD_RES 27 27 N6-acetyllysine.
 MOD_RES 166 166 Phosphoserine (By similarity).
 MOD_RES 342 342 Phosphoserine.
 MOD_RES 350 350 Phosphoserine.
 MOD_RES 380 380 Phosphoserine.
 MOD_RES 383 383 Phosphoserine.
 MOD_RES 456 456 Phosphoserine.
 MOD_RES 610 610 Phosphothreonine.
 MOD_RES 759 759 Phosphoserine.
 MOD_RES 761 761 Phosphoserine.
 MOD_RES 764 764 Phosphoserine.
 MOD_RES 782 782 Phosphothreonine (By similarity).
 MOD_RES 808 808 Phosphoserine.
 MOD_RES 1004 1004 Phosphoserine (By similarity).
 MOD_RES 1085 1085 Phosphoserine.
 MOD_RES 1089 1089 Phosphoserine.
 MOD_RES 1092 1092 Phosphoserine.
 MOD_RES 1094 1094 Phosphotyrosine.
 MOD_RES 1110 1110 Phosphoserine.
 MOD_RES 1147 1147 Phosphoserine.
 MOD_RES 1196 1196 N6-acetyllysine.
 MOD_RES 1219 1219 Phosphoserine.
 MOD_RES 1306 1306 Phosphoserine.
 MOD_RES 1353 1353 Phosphothreonine.
 MOD_RES 2113 2113 Phosphoserine.  
Keyword
 Acetylation; Alternative splicing; Complete proteome; Cytoplasm; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2157 AA 
Protein Sequence
MSDRSGPTAK GKDGKKYSSL NLFDTYKGKS LEIQKPAVAP RHGLQSLGKV AIARRMPPPA 60
NLPSLKAENK GNDPNVSLVP KDGTGWASKQ EQSDPKSSDA STAQPPESQP LPASQTPASN 120
QPKRPPAAPE NTPLVPSGVK SWAQASVTHG AHGDGGRASS LLSRFSREEF PTLQAAGDQD 180
KAAKERESAE QSSGPGPSLR PQNSTTWRDG GGRGPDELEG PDSKLHHGHD PRGGLQPSGP 240
PQFPPYRGMM PPFMYPPYLP FPPPYGPQGP YRYPTPDGPS RFPRVAGPRG SGPPMRLVEP 300
VGRPSILKED NLKEFDQLDQ ENDDGWAGAH EEVDYTEKLK FSDEEDGRDS DEEGAEGHRD 360
SQSASGEERP PEADGKKGNS PNSEPPTPKT AWAETSRPPE TEPGPPAPKP PLPPPHRGPA 420
GNWGPPGDYP DRGGPPCKPP APEDEDEAWR QRRKQSSSEI SLAVERARRR REEEERRMQE 480
ERRAACAEKL KRLDEKFGAP DKRLKAEPAA PPAAPSTPAP PPAVPKELPA PPAPPPASAP 540
TPETEPEEPA QAPPAQSTPT PGVAAAPTLV SGGGSTSSTS SGSFEASPVE PQLPSKEGPE 600
PPEEVPPPTT PPVPKVEPKG DGIGPTRQPP SQGLGYPKYQ KSLPPRFQRQ QQEQLLKQQQ 660
QHQWQQHQQG SAPPTPVPPS PPQPVTLGAV PAPQAPPPPP KALYPGALGR PPPMPPMNFD 720
PRWMMIPPYV DPRLLQGRPP LDFYPPGVHP SGLVPRERSD SGGSSSEPFD RHAPAMLRER 780
GTPPVDPKLA WVGDVFTATP AEPRPLTSPL RQAADEDDKG MRSETPPVPP PPPYLASYPG 840
FPENGAPGPP ISRFPLEEPG PRPLPWPPGS DEVAKIQTPP PKKEPPKEET AQLTGPEAGR 900
KPARGVGSGG QGPPPPRRES RTETRWGPRP GSSRRGIPPE EPGAPPRRAG PIKKPPPPTK 960
VEELPPKPLE QGDETPKPPK PDPLKITKGK LGGPKETPPN GNLSPAPRLR RDYSYERVGP 1020
TSCRGRGRGE YFARGRGFRG TYGGRGRGAR SREFRSYREF RGDDGRGGGT GGPNHPPAPR 1080
GRTASETRSE GSEYEEIPKR RRQRGSETGS ETHESDLAPS DKEAPTPKEG TLTQVPLAPP 1140
PPGAPPSPAP ARFTARGGRV FTPRGVPSRR GRGGGRPPPQ VCPGWSPPAK SLAPKKPPTG 1200
PLPPSKEPLK EKLIPGPLSP VARGGSNGGS NVGMEDGERP RRRRHGRAQQ QDKPPRFRRL 1260
KQERENAARG SEGKPSLTLP ASAPGPEEAL TTVTVAPAPR RAAAKSPDLS NQNSDQANEE 1320
WETASESSDF TSERRGDKEA PPPVLLTPKA VGTPGGGGGG AVPGISAMSR GDLSQRAKDL 1380
SKRSFSSQRP GMERQNRRPG PGGKAGSSGS SSGGGGGGPG GRTGPGRGDK RSWPSPKNRS 1440
RPPEERPPGL PLPPPPPSSS AVFRLDQVIH SNPAGIQQAL AQLSSRQGSV TAPGGHPRHK 1500
PGLPQAPQGP SPRPPTRYEP QRVNSGLSSD PHFEEPGPMV RGVGGTPRDS AGVSPFPPKR 1560
RERPPRKPEL LQEESLPPPH SSGFLGSKPE GPGPQAESRD TGTEALTPHI WNRLHTATSR 1620
KSYRPSSMEP WMEPLSPFED VAGTEMSQSD SGVDLSGDSQ VSSGPCSQRS SPDGGLKGAA 1680
EGPPKRPGGS SPLNAVPCEG PPGSEPPRRP PPAPHDGDRK ELPREQPLPP GPIGTERSQR 1740
TDRGTEPGPI RPSHRPGPPV QFGTSDKDSD LRLVVGDSLK AEKELTASVT EAIPVSRDWE 1800
LLPSAAASAE PQSKNLDSGH CVPEPSSSGQ RLYPEVFYGS AGPSSSQISG GAMDSQLHPN 1860
SGGFRPGTPS LHPYRSQPLY LPPGPAPPSA LLSGLALKGQ FLDFSTMQAT ELGKLPAGGV 1920
LYPPPSFLYS PAFCPSPLPD TSLLQVRQDL PSPSDFYSTP LQPGGQSGFL PSGAPAQQML 1980
LPMVDSQLPV VNFGSLPPAP PPAPPPLSLL PVGPALQPPS LAVRPPPAPA TRVLPSPARP 2040
FPASLGRAEL HPVELKPFQD YQKLSSNLGG PGSSRTPPTG RSFSGLNSRL KATPSTYSGV 2100
FRTQRVDLYQ QASPPDALRW IPKPWERTGP PPREGPSRRA EEPGSRGDKE PGLPPPR 2157 
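The peptides listed under Lysine Modification appear to be 15-residue windows centered on the modified lysine, with 7 residues on each side. This width is inferred from the entries and is not stated in the record. The Python sketch below checks that reading against the first 60 residues of the sequence above. The function name `lysine_window` and the `flank` parameter are hypothetical, and the truncation at the sequence ends is an assumption, since no listed site lies near a terminus.

```python
def lysine_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues around a 1-based lysine position.

    Assumption: the window is `flank` residues on each side and is
    simply truncated at the sequence ends.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[position - 1] != "K":
        raise ValueError(f"residue {position} is not a lysine")
    start = max(0, position - 1 - flank)       # 0-based, inclusive
    end = min(len(sequence), position + flank)  # 0-based, exclusive
    return sequence[start:end]


# Residues 1-60 of the PRRC2A sequence, with spaces and the counter removed.
n_terminal = (
    "MSDRSGPTAKGKDGKKYSSLNLFDTYKGKS"
    "LEIQKPAVAPRHGLQSLGKVAIARRMPPPA"
)

print(lysine_window(n_terminal, 27))  # LNLFDTYKGKSLEIQ
print(lysine_window(n_terminal, 35))  # GKSLEIQKPAVAPRH
```

Both results match the peptides listed for positions 27 and 35. The remaining sites can be checked the same way against the full 2157-residue sequence.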
Gene Ontology
 GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. 
Interpro
 IPR009738; BAT2_N. 
Pfam
 PF07001; BAT2_N 
SMART
  
PROSITE
  
PRINTS