CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-007020
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription-associated protein 1 
Protein Synonyms/Alias
 p400 kDa component of SAGA 
Gene Name
 TRA1 
Gene Synonyms/Alias
 YHR099W 
Created Date
 July 27, 2013 
Organism
 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) 
NCBI Taxa ID
 559292 
Lysine Modification
Position  Peptide          Type            References
552       VEMTESDKVVKNDVE  acetylation     [1]
3432      LFNKSLSKNVETRRR  ubiquitination  [2]
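Each Peptide entry appears to be a 15-residue window with the modified lysine at its center (7 flanking residues on each side). The sketch below illustrates that reading under stated assumptions: the function name peptide_window and the flank parameter are hypothetical, and the two fragments are copied from the Protein Sequence field of this record (residues 541-560 and 3421-3440).

```python
def peptide_window(fragment, fragment_start, site, flank=7):
    """Return the residues around a modified lysine.

    fragment       -- a stretch of the protein sequence
    fragment_start -- 1-based position of the fragment's first residue
    site           -- 1-based position of the modified lysine
    flank          -- residues kept on each side (assumed to be 7 here)
    """
    idx = site - fragment_start
    if not 0 <= idx < len(fragment):
        raise ValueError("site lies outside the supplied fragment")
    if fragment[idx] != "K":
        raise ValueError("residue at the given site is not a lysine")
    lo = max(0, idx - flank)
    hi = min(len(fragment), idx + flank + 1)
    return fragment[lo:hi]


# Fragments taken from the Protein Sequence field of this record.
acetyl_window = peptide_window("DSPDVEMTESDKVVKNDVEM", 541, 552)
ubiq_window = peptide_window("QLYRLFNKSLSKNVETRRRS", 3421, 3432)
print(acetyl_window)
print(ubiq_window)
```

Both windows match the Peptide column above, which supports reading the Position value as the 1-based index of the central lysine.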
Reference
 [1] mChIP-KAT-MS, a method to map protein interactions and acetylation sites for lysine acetyltransferases.
 Mitchell L, Huard S, Cotrut M, Pourhanifeh-Lemeri R, Steunou AL, Hamza A, Lambert JP, Zhou H, Ning Z, Basu A, Côté J, Figeys DA, Baetz K.
 Proc Natl Acad Sci U S A. 2013 Apr 23;110(17):E1641-50. [PMID: 23572591]
 [2] Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation.
 Swaney DL, Beltrao P, Starita L, Guo A, Rush J, Fields S, Krogan NJ, Villén J.
 Nat Methods. 2013 Jul;10(7):676-82. [PMID: 23749301]
Functional Description
 Essential component of histone acetyltransferase (HAT) complexes, which serves as a target for activators during recruitment of HAT complexes. Essential for vegetative growth. Functions as a component of the transcription regulatory histone acetyltransferase (HAT) complexes SAGA, SALSA and SLIK. SAGA is involved in RNA polymerase II-dependent transcriptional regulation of approximately 10% of yeast genes. At the promoters, SAGA is required for recruitment of the basal transcription machinery. It influences RNA polymerase II transcriptional activity through different activities such as TBP interaction (SPT3, SPT8 and SPT20) and promoter selectivity, interaction with transcription activators (GCN5, ADA2, ADA3 and TRA1), and chromatin modification through histone acetylation (GCN5) and deubiquitination (UBP8). SAGA acetylates nucleosomal histone H3 to some extent (to form H3K9ac, H3K14ac, H3K18ac and H3K23ac). SAGA interacts with DNA via upstream activating sequences (UASs). SALSA, an altered form of SAGA, may be involved in positive transcriptional regulation. SLIK is proposed to have partly overlapping functions with SAGA. SLIK preferentially acetylates methylated histone H3, at least after activation at the GAL1-10 locus. 
Sequence Annotation
 DOMAIN 2622 3177 FAT.
 DOMAIN 3414 3711 PI3K/PI4K.
 DOMAIN 3712 3744 FATC.
 MOD_RES 172 172 Phosphoserine.
 MOD_RES 542 542 Phosphoserine.  
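The annotation lines follow a simple "KEY start end description." layout, so they can be turned into records and compared with the modified lysines listed earlier. This is a minimal sketch, assuming whitespace-separated fields and inclusive 1-based coordinates; parse_feature and domains_containing are hypothetical helper names, not part of any CPLM tooling.

```python
ANNOTATION = """
 DOMAIN 2622 3177 FAT.
 DOMAIN 3414 3711 PI3K/PI4K.
 DOMAIN 3712 3744 FATC.
 MOD_RES 172 172 Phosphoserine.
 MOD_RES 542 542 Phosphoserine.
"""


def parse_feature(line):
    """Split one annotation line into (key, start, end, description)."""
    key, start, end, desc = line.strip().rstrip(".").split(None, 3)
    return key, int(start), int(end), desc


def domains_containing(features, position):
    """Names of DOMAIN features whose inclusive range covers position."""
    return [desc for key, start, end, desc in features
            if key == "DOMAIN" and start <= position <= end]


features = [parse_feature(l) for l in ANNOTATION.splitlines() if l.strip()]

# Domain lengths, counting both end points.
lengths = {desc: end - start + 1
           for key, start, end, desc in features if key == "DOMAIN"}
print(lengths)

# Where do the two modified lysines of this record fall?
print(552, domains_containing(features, 552))
print(3432, domains_containing(features, 3432))
```

Under these assumptions, the ubiquitinated K3432 lies inside the annotated PI3K/PI4K domain, while the acetylated K552 lies outside all three annotated domains.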
Keyword
 Activator; Chromatin regulator; Complete proteome; Direct protein sequencing; Nucleus; Phosphoprotein; Reference proteome; Repeat; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3744 AA 
Protein Sequence
MSLTEQIEQF ASRFRDDDAT LQSRYSTLSE LYDIMELLNS PEDYHFFLQA VIPLLLNQLK 60
EVPISYDAHS PEQKLRNSML DIFNRCLMNQ TFQPYAMEVL EFLLSVLPKE NEENGILCMK 120
VLTTLFKSFK SILQDKLDSF IRIIIQIYKN TPNLINQTFY EAGKAEQGDL DSPKEPQADE 180
LLDEFSKNDE EKDFPSKQSS TEPRFENSTS SNGLRSSMFS FKILSECPIT MVTLYSSYKQ 240
LTSTSLPEFT PLIMNLLNIQ IKQQQEAREQ AESRGEHFTS ISTEIINRPA YCDFILAQIK 300
ATSFLAYVFI RGYAPEFLQD YVNFVPDLII RLLQDCPSEL SSARKELLHA TRHILSTNYK 360
KLFLPKLDYL FDERILIGNG FTMHETLRPL AYSTVADFIH NIRSELQLSE IEKTIKIYTG 420
YLLDESLALT VQIMSAKLLL NLVERILKLG KENPQEAPRA KKLLMIIIDS YMNRFKTLNR 480
QYDTIMKYYG RYETHKKEKA EKLKNSIQDN DKESEEFMRK VLEPSDDDHL MPQPKKEDIN 540
DSPDVEMTES DKVVKNDVEM FDIKNYAPIL LLPTPTNDPI KDAFYLYRTL MSFLKTIIHD 600
LKVFNPPPNE YTVANPKLWA SVSRVFSYEE VIVFKDLFHE CIIGLKFFKD HNEKLSPETT 660
KKHFDISMPS LPVSATKDAR ELMDYLAFMF MQMDNATFNE IIEQELPFVY ERMLEDSGLL 720
HVAQSFLTSE ITSPNFAGIL LRFLKGKLKD LGNVDFNTSN VLIRLFKLSF MSVNLFPNIN 780
EVVLLPHLND LILNSLKYST TAEEPLVYFY LIRTLFRSIG GGRFENLYRS IKPILQVLLQ 840
SLNQMILTAR LPHERELYVE LCITVPVRLS VLAPYLPFLM KPLVFALQQY PDLVSQGLRT 900
LELCIDNLTA EYFDPIIEPV IDDVSKALFN LLQPQPFNHA ISHNVVRILG KLGGRNRQFL 960
KPPTDLTEKT ELDIDAIADF KINGMPEDVP LSVTPGIQSA LNILQSYKSD IHYRKSAYKY 1020
LTCVLLLMTK SSAEFPTNYT ELLKTAVNSI KLERIGIEKN FDLEPTVNKR DYSNQENLFL 1080
RLLESVFYAT SIKELKDDAM DLLNNLLDHF CLLQVNTTLL NKRNYNGTFN IDLKNPNFML 1140
DSSLILDAIP FALSYYIPEV REVGVLAYKR IYEKSCLIYG EELALSHSFI PELAKQFIHL 1200
CYDETYYNKR GGVLGIKVLI DNVKSSSVFL KKYQYNLANG LLFVLKDTQS EAPSAITDSA 1260
EKLLIDLLSI TFADVKEEDL GNKVLENTLT DIVCELSNAN PKVRNACQKS LHTISNLTGI 1320
PIVKLMDHSK QFLLSPIFAK PLRALPFTMQ IGNVDAITFC LSLPNTFLTF NEELFRLLQE 1380
SIVLADAEDE SLSTNIQKTT EYSTSEQLVQ LRIACIKLLA IALKNEEFAT AQQGNIRIRI 1440
LAVFFKTMLK TSPEIINTTY EALKGSLAEN SKLPKELLQN GLKPLLMNLS DHQKLTVPGL 1500
DALSKLLELL IAYFKVEIGR KLLDHLTAWC RVEVLDTLFG QDLAEQMPTK IIVSIINIFH 1560
LLPPQADMFL NDLLLKVMLL ERKLRLQLDS PFRTPLARYL NRFHNPVTEY FKKNMTLRQL 1620
VLFMCNIVQR PEAKELAEDF EKELDNFYDF YISNIPKNQV RVVSFFTNMV DLFNTMVITN 1680
GDEWLKKKGN MILKLKDMLN LTLKTIKENS FYIDHLQLNQ SIAKFQALYL RFTELSERDQ 1740
NPLLLDFIDF SFSNGIKASY SLKKFIFHNI IASSNKEKQN NFINDATLFV LSDKCLDARI 1800
FVLKNVINST LIYEVATSGS LKSYLVEDKK PKWLELLHNK IWKNSNAILA YDVLDHHDLF 1860
RFELLQLSAI FIKADPEIIA EIKKDIIKFC WNFIKLEDTL IKQSAYLVTS YFISKFDFPI 1920
KVVTQVFVAL LRSSHVEARY LVKQSLDVLT PVLHERMNAA GTPDTWINWV KRVMVENSSS 1980
QNNILYQFLI SHPDLFFNSR DLFISNIIHH MNKITFMSNS NSDSHTLAID LASLILYWEN 2040
KTLEITNVNN TKTDSDGDVV MSDSKSDINP VEADTTAIIV DANNNSPISL HLREACTAFL 2100
IRYVCASNHR AIETELGLRA INILSELISD KHWTNVNVKL VYFEKFLIFQ DLDSENILYY 2160
CMNALDVLYV FFKNKTKEWI MENLPTIQNL LEKCIKSDHH DVQEALQKVL QVIMKAIKAQ 2220
GVSVIIEEES PGKTFIQMLT SVITQDLQET SSVTAGVTLA WVLFMNFPDN IVPLLTPLMK 2280
TFSKLCKDHL SISQPKDAMA LEEARITTKL LEKVLYILSL KVSLLGDSRR PFLSTVALLI 2340
DHSMDQNFLR KIVNMSRSWI FNTEIFPTVK EKAAILTKML AFEIRGEPSL SKLFYEIVLK 2400
LFDQEHFNNT EITVRMEQPF LVGTRVEDIG IRKRFMTILD NSLERDIKER LYYVIRDQNW 2460
EFIADYPWLN QALQLLYGSF NREKELSLKN IYCLSPPSIL QEYLPENAEM VTEVNDLELS 2520
NFVKGHIASM QGLCRIISSD FIDSLIEIFY QDPKAIHRAW VTLFPQVYKS IPKNEKYGFV 2580
RSIITLLSKP YHTRQISSRT NVINMLLDSI SKIESLELPP HLVKYLAISY NAWYQSINIL 2640
ESIQSNTSID NTKIIEANED ALLELYVNLQ EEDMFYGLWR RRAKYTETNI GLSYEQIGLW 2700
DKAQQLYEVA QVKARSGALP YSQSEYALWE DNWIQCAEKL QHWDVLTELA KHEGFTDLLL 2760
ECGWRVADWN SDRDALEQSV KSVMDVPTPR RQMFKTFLAL QNFAESRKGD QEVRKLCDEG 2820
IQLSLIKWVS LPIRYTPAHK WLLHGFQQYM EFLEATQIYA NLHTTTVQNL DSKAQEIKRI 2880
LQAWRDRLPN TWDDVNMWND LVTWRQHAFQ VINNAYLPLI PALQQSNSNS NINTHAYRGY 2940
HEIAWVINRF AHVARKHNMP DVCISQLARI YTLPNIEIQE AFLKLREQAK CHYQNMNELT 3000
TGLDVISNTN LVYFGTVQKA EFFTLKGMFL SKLRAYEEAN QAFATAVQID LNLAKAWAQW 3060
GFFNDRRLSE EPNNISFASN AISCYLQAAG LYKNSKIREL LCRILWLISI DDASGMLTNA 3120
FDSFRGEIPV WYWITFIPQL LTSLSHKEAN MVRHILIRIA KSYPQALHFQ LRTTKEDFAV 3180
IQRQTMAVMG DKPDTNDRNG RRQPWEYLQE LNNILKTAYP LLALSLESLV AQINDRFKST 3240
TDEDLFRLIN VLLIDGTLNY NRLPFPRKNP KLPENTEKNL VKFSTTLLAP YIRPKFNADF 3300
IDNKPDYETY IKRLRYWRRR LENKLDRASK KENLEVLCPH LSNFHHQKFE DIEIPGQYLL 3360
NKDNNVHFIK IARFLPTVDF VRGTHSSYRR LMIRGHDGSV HSFAVQYPAV RHSRREERMF 3420
QLYRLFNKSL SKNVETRRRS IQFNLPIAIP LSPQVRIMND SVSFTTLHEI HNEFCKKKGF 3480
DPDDIQDFMA DKLNAAHDDA LPAPDMTILK VEIFNSIQTM FVPSNVLKDH FTSLFTQFED 3540
FWLFRKQFAS QYSSFVFMSY MMMINNRTPH KIHVDKTSGN VFTLEMLPSR FPYERVKPLL 3600
KNHDLSLPPD SPIFHNNEPV PFRLTPNIQS LIGDSALEGI FAVNLFTISR ALIEPDNELN 3660
TYLALFIRDE IISWFSNLHR PIIENPQLRE MVQTNVDLII RKVAQLGHLN STPTVTTQFI 3720
LDCIGSAVSP RNLARTDVNF MPWF 3744 
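The sequence is printed in blocks of 10 residues, 60 per line, with a running residue count at the end of each line. A minimal sketch of turning that layout back into one string follows; parse_sequence_block is a hypothetical helper, and only the first two lines of the record are used to keep the example short.

```python
def parse_sequence_block(text):
    """Join a blocked sequence listing into a single string.

    Each line is assumed to hold space-separated residue blocks followed
    by the cumulative residue count; that count is used as a sanity check.
    """
    sequence = ""
    for line in text.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        # A trailing all-digit token is the running residue count.
        expected_end = int(tokens.pop()) if tokens[-1].isdigit() else None
        sequence += "".join(tokens)
        if expected_end is not None and len(sequence) != expected_end:
            raise ValueError(
                f"expected {expected_end} residues, found {len(sequence)}")
    return sequence


# First two lines of the Protein Sequence field of this record.
BLOCK = """
MSLTEQIEQF ASRFRDDDAT LQSRYSTLSE LYDIMELLNS PEDYHFFLQA VIPLLLNQLK 60
EVPISYDAHS PEQKLRNSML DIFNRCLMNQ TFQPYAMEVL EFLLSVLPKE NEENGILCMK 120
"""

seq = parse_sequence_block(BLOCK)

# 1-based positions of every lysine in the parsed stretch.
lysines = [i + 1 for i, aa in enumerate(seq) if aa == "K"]
print(len(seq), lysines)
```

Run over the full listing, the same check should end at the stated protein length of 3744 residues, and the 1-based indexing matches the Position column of the Lysine Modification table.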
Gene Ontology
 GO:0070209; C:ASTRA complex; IDA:SGD.
 GO:0035267; C:NuA4 histone acetyltransferase complex; IPI:SGD.
 GO:0000124; C:SAGA complex; IDA:SGD.
 GO:0046695; C:SLIK (SAGA-like) complex; IDA:SGD.
 GO:0016773; F:phosphotransferase activity, alcohol group as acceptor; IEA:InterPro.
 GO:0006281; P:DNA repair; IDA:SGD.
 GO:0016573; P:histone acetylation; IPI:SGD.
 GO:0045944; P:positive regulation of transcription from RNA polymerase II promoter; IMP:SGD.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
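Each Gene Ontology line packs three fields separated by semicolons: the GO identifier, an aspect letter with the term name (C, F or P for component, function and process), and an evidence code with its source. The sketch below splits a few of the lines above into those parts; parse_go is a hypothetical helper and the field layout is inferred from this record only.

```python
from collections import defaultdict

GO_LINES = """
 GO:0000124; C:SAGA complex; IDA:SGD.
 GO:0016773; F:phosphotransferase activity, alcohol group as acceptor; IEA:InterPro.
 GO:0016573; P:histone acetylation; IPI:SGD.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW.
"""


def parse_go(line):
    """Return (go_id, aspect, term, evidence, source) for one GO line."""
    go_id, aspect_term, evidence_source = [
        part.strip() for part in line.strip().rstrip(".").split(";")
    ]
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return go_id, aspect, term, evidence, source


records = [parse_go(l) for l in GO_LINES.splitlines() if l.strip()]

# Group term names by aspect letter.
by_aspect = defaultdict(list)
for go_id, aspect, term, evidence, source in records:
    by_aspect[aspect].append(term)

print(dict(by_aspect))
```

Splitting on the semicolon rather than the comma matters here, since term names such as "transcription, DNA-dependent" contain commas of their own.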
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR003152; FATC.
 IPR011009; Kinase-like_dom.
 IPR000403; PI3/4_kinase_cat_dom.
 IPR003151; PIK-rel_kinase_FAT.
 IPR014009; PIK_FAT.
 IPR011990; TPR-like_helical. 
Pfam
 PF02259; FAT
 PF02260; FATC
 PF00454; PI3_PI4_kinase 
SMART
 SM00146; PI3Kc 
PROSITE
 PS51189; FAT
 PS51190; FATC
 PS00915; PI3_4_KINASE_1
 PS00916; PI3_4_KINASE_2
 PS50290; PI3_4_KINASE_3
 PS50005; TPR
 PS50293; TPR_REGION 
PRINTS