CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-015862
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein PRRC2A 
Protein Synonyms/Alias
 HLA-B-associated transcript 2; Proline-rich and coiled-coil-containing protein 2A 
Gene Name
 Prrc2a 
Gene Synonyms/Alias
 Bat2 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position  Peptide          Type         References
 27        LNLFDTYKGKSLEIQ  acetylation  [1]
 29        LFDTYKGKSLEIQKP  acetylation  [2]
 35        GKSLEIQKPAVAPRH  acetylation  [2, 3]
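
The peptides listed for each modified lysine appear to be 15-residue windows centered on the modified position (7 residues on each side). The record does not state this rule, so the flank width below is an assumption inferred from the three entries. A minimal Python sketch, using only the first 60 residues of the protein sequence given in this record and a hypothetical helper named `peptide_window`, reproduces the windows:

```python
# First 60 residues of the mouse PRRC2A sequence, copied from this record.
N_TERMINUS = (
    "MSDRSGPTAK" "GKDGKKYSSL" "NLFDTYKGKS"
    "LEIQKPAVAP" "RHGLQSLGKV" "AIARRMPPPA"
)


def peptide_window(sequence, position, flank=7):
    """Return the residues around a 1-based lysine position.

    The flank of 7 is an assumption inferred from the listed peptides;
    the window is clipped at the sequence ends.
    """
    if sequence[position - 1] != "K":
        raise ValueError(f"residue {position} is not a lysine")
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


# Positions taken from the Lysine Modification table.
for pos in (27, 29, 35):
    print(pos, peptide_window(N_TERMINUS, pos))
```

Under this assumption, the three windows match the peptides given for positions 27, 29, and 35.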
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
 [3] Proteomic investigations of lysine acetylation identify diverse substrates of mitochondrial deacetylase sirt3.
 Sol EM, Wagner SA, Weinert BT, Kumar A, Kim HS, Deng CX, Choudhary C.
 PLoS One. 2012;7(12):e50545. [PMID: 23236377]
Functional Description
 May play a role in the regulation of pre-mRNA splicing (By similarity). 
Sequence Annotation
 REPEAT 41 95 1-1.
 REPEAT 98 154 1-2.
 REPEAT 281 337 1-3.
 REPEAT 337 428 2-1.
 REPEAT 486 559 2-2.
 REPEAT 1756 1811 1-4.
 REPEAT 1917 1966 3-1.
 REPEAT 1983 2032 3-2.
 REPEAT 2058 2107 3-3.
 REGION 41 1811 4 X 57 AA type A repeats.
 REGION 337 559 2 X type B repeats.
 REGION 1917 2107 3 X 50 AA type C repeats.
 MOD_RES 27 27 N6-acetyllysine (By similarity).
 MOD_RES 166 166 Phosphoserine.
 MOD_RES 342 342 Phosphoserine.
 MOD_RES 350 350 Phosphoserine.
 MOD_RES 378 378 Phosphoserine (By similarity).
 MOD_RES 381 381 Phosphoserine (By similarity).
 MOD_RES 454 454 Phosphoserine (By similarity).
 MOD_RES 609 609 Phosphothreonine.
 MOD_RES 759 759 Phosphoserine.
 MOD_RES 761 761 Phosphoserine.
 MOD_RES 764 764 Phosphoserine (By similarity).
 MOD_RES 782 782 Phosphothreonine.
 MOD_RES 808 808 Phosphoserine.
 MOD_RES 1002 1002 Phosphoserine.
 MOD_RES 1083 1083 Phosphoserine (By similarity).
 MOD_RES 1087 1087 Phosphoserine.
 MOD_RES 1090 1090 Phosphoserine.
 MOD_RES 1092 1092 Phosphotyrosine (By similarity).
 MOD_RES 1108 1108 Phosphoserine (By similarity).
 MOD_RES 1145 1145 Phosphoserine (By similarity).
 MOD_RES 1194 1194 N6-acetyllysine (By similarity).
 MOD_RES 1217 1217 Phosphoserine.
 MOD_RES 1302 1302 Phosphoserine (By similarity).
 MOD_RES 1349 1349 Phosphothreonine (By similarity).
 MOD_RES 1401 1401 Phosphothreonine (By similarity).
 MOD_RES 2114 2114 Phosphoserine (By similarity).  
Keyword
 Acetylation; Complete proteome; Cytoplasm; Direct protein sequencing; Nucleus; Phosphoprotein; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2158 AA 
Protein Sequence
MSDRSGPTAK GKDGKKYSSL NLFDTYKGKS LEIQKPAVAP RHGLQSLGKV AIARRMPPPA 60
NLPSLKAENK GNDPNVSLVP KDGTGWASKQ EQSDPKSSDA STAQPPESQP LPASQTPASN 120
QPKRPPTAPE NTPSVPSGVK SWAQASVTHG AHGDGGRASN LLSRFSREEF PTLQAAGDQD 180
KAAKERESAE QSSGPGPSLR PQNSTTWRDG GGRGPDDLEG PDSKLHHGHD PRGGLQPSGP 240
PQFPPYRGMM PPFMYPPYLP FPPPYGPQGP YRYPTPDGPS RFPRVAGPRG SGPPMRLVEP 300
VGRPSILKED NLKEFDQLDQ ENDDGWAGAH EEVDYTEKLK FSDEEDGRDS DEEGAEGHKD 360
SQSAAAEEPE TDGKKGTSPG SELPPPKTAW TENARPSETE PAPPTPKPPP PPPHRGPVGN 420
WGPPGDYPDR GGPPCKPPAP EDEDEAWRQR RKQSSSEISL AVERARRRRE EEERRMQEER 480
RAACAEKLKR LDEKFGAPDK RLKAEPAAPP VTPAAPALPP VVPKEIPAAP ALPPTPTPTP 540
EKEPEEPAQA PPVQAAPSPG VAPVPTLVSG GGCTANSNSS GSFEASPVEP QLPSKEGPEP 600
PEEVPPPTTP PAPKMEPKGD GVGSTRQPPS QGLGYPKYQK SLPPRFQRQQ QEQLLKQQQQ 660
QQQWQQQQQG TAPPAPVPPS PPQPVTLGAV PAPQAPPPPP KALYPGALGR PPPMPPMNFD 720
PRWMMIPPYV DPRLLQGRPP LDFYPPGVHP SGLVPRERSD SGGSSSEPFE RHAPPLLRER 780
GTPPVDPKLA WVGDVFTTTP TDPRPLTSPL RQAADEEEKS MRSETPPVPP PPPYLANYPG 840
FPENGTPGPP ISRFPLEESA PPGPRPLPWP PGNDEAAKMQ APPPKKEPSK EEPPQLSGPE 900
AGRKPARGGQ GPPPPRRENR TETRWGPRPG SCRRGIPPEE PGVPPRRAGP IKKPPPPVKV 960
EELPPKSLEQ GDETPKVPKP DALKTAKGKV GPKETPPGGN LSPAPRLRRD YSYERVGPTS 1020
CRGRGRGEYF ARGRGFRGTY GGRGRGARSR EFRSYREFRG DDGRGGGSGG TNHPSAPRGR 1080
TASETRSEGS EYEEIPKRRR QRGSETGSET HESDLAPSDK EAPPPKEGVL GQVPLAPPQP 1140
GAPPSPAPAR FSTARGGRVF TPRGVPSRRG RGGGRPPPVC SGWSPPAKSL VPKKPPTGPL 1200
PPSKEPLKEK LISGPLSPMS RAGNMGVGME DGERPRRRRH GRAQQQDKPP RFRRLKQERE 1260
NAARGADGKP PSLTLAASTP GPEETLTAAT VPPPPRRTAA KSPDLSNQNS DQANEEWETA 1320
SESSDFASER RGDKETPPAA LMTSKAVGTP GANAGGAGPG ISAMSRGDLS QRAKDLSKRS 1380
FSSQRPGMDR QNRRPGTGGK TGSGGGSSGG GGAGPGGRTG PGRGDKRSWP SPKNRSRPPE 1440
ERPPGLPLPP PPPSSSAVFR LDQVIHSNPA GIQQALAQLS SRQGNVTAPG GHPRPKPGPP 1500
QAPQGSSPRP PTRYDPPRAS SAISSDPHFE EPGPMVRGVG GTPRDSAGVN PFPPKRRERP 1560
PRKPELLQEE TVPASHSSGF LGSKPEVPGP QEESRDSGTE ALTPHIWNRL HTATSRKSYQ 1620
PGSIEPWMEP LSPFEDVAGT EMSQSDSGVD LSGDSQVSSG PCSQRSSPDG GLKGSAEGPP 1680
KRPGGPSPLN AVPGESASGS EPSEPPRRRP PASHEGERKE LPREQPLPPG PIGTERSQRT 1740
DRGPEPGPLR PAHRPGSQVE FGTTNKDSDL CLVVGDTLKG EKELVASATE AVPISRDWEL 1800
LPSASTSAEP QPKSLGSGQC VPEPSPSGQR PYPEVFYGSP GPPNSQQVSG GAPIDSQLHP 1860
NSGGFRPGTP SLHQYRSQPL YLPPGPAPPS ALLSGVALKG QFLDFSALQA TELGKLPAGG 1920
VLYPPPSFLY SAAFCPSPLP DPPLLQVRQD LPSPSDFYST PLQPGGQSGF LPSGAPAQQM 1980
LLPVVDSQLP VVNFGSLPPA PPPAPPPLSL LPVGPALQPP NLAVRPPPAP AARVLPSPAR 2040
PFAPSLGRAE LHPVELKPFQ DYRKLSSNLG GPGSSRTPPS GRPFSGLNSR LKAPPSTYSG 2100
VFRTQRIDLY QQASPPDALR WMPKPWERAG PPSREGPPRR AEEPGSRGEK EPGLPPPR 2158 
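
The sequence above is displayed in blocks of 10 residues, with what appears to be a running residue count at the end of each line (60, 120, ..., 2158). A minimal Python sketch of how such lines might be joined into one contiguous string is shown below. The reading of the trailing integer as a cumulative count is an assumption based on the values shown, the function names are hypothetical, and only the first two lines of the record are used as sample input.

```python
# First two display lines of the sequence, copied from this record.
SAMPLE_LINES = [
    "MSDRSGPTAK GKDGKKYSSL NLFDTYKGKS LEIQKPAVAP RHGLQSLGKV AIARRMPPPA 60",
    "NLPSLKAENK GNDPNVSLVP KDGTGWASKQ EQSDPKSSDA STAQPPESQP LPASQTPASN 120",
]


def parse_sequence_line(line):
    """Split one display line into (residues, trailing_count)."""
    *blocks, count = line.split()
    return "".join(blocks), int(count)


def join_sequence(lines):
    """Concatenate display lines, checking each trailing count.

    Assumes the trailing integer is the cumulative number of residues
    up to and including that line.
    """
    sequence = ""
    for line in lines:
        residues, count = parse_sequence_line(line)
        sequence += residues
        if len(sequence) != count:
            raise ValueError(
                f"expected {count} residues, found {len(sequence)}"
            )
    return sequence


sequence = join_sequence(SAMPLE_LINES)
print(len(sequence))
```

With the full set of lines as input, the same check would compare the final length against the Protein Length field (2158 AA).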
Gene Ontology
 GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. 
Interpro
 IPR009738; BAT2_N. 
Pfam
 PF07001; BAT2_N 
SMART
  
PROSITE
  
PRINTS