CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-016930
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Transcription-associated protein 1 
Protein Synonyms/Alias
 dTRA1 
Gene Name
 Nipped-A 
Gene Synonyms/Alias
 Tra1; CG2905 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position
Peptide
Type
References
2800LMKIALAKTEQCYLKacetylation[1]
2807KTEQCYLKHYGFKINacetylation[1]
3370TNDFDFSKPGAMKLHacetylation[1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702
Functional Description
 Part of the Tip60 chromatin-remodeling complex which is involved in DNA repair. Upon induction of DNA double-strand breaks, this complex acetylates phosphorylated H2AV in nucleosomes and exchanges it with unmodified H2AV. 
Sequence Annotation
 REPEAT 158 196 HEAT 1.
 REPEAT 397 443 HEAT 2.
 REPEAT 802 840 HEAT 3.
 REPEAT 1247 1285 HEAT 4.
 REPEAT 1394 1432 HEAT 5.
 REPEAT 1888 1926 HEAT 6.
 DOMAIN 2640 3186 FAT.
 DOMAIN 3470 3734 PI3K/PI4K.
 DOMAIN 3771 3803 FATC.  
Keyword
 Activator; Chromatin regulator; Complete proteome; Nucleus; Reference proteome; Repeat; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3803 AA 
Protein Sequence
MSVIENVPVN TFRNYLNILN DSSSKDELKL KATQELSEHF EMIMQSPAYP SFLDNSLKIF 60
MRILQDGEPQ FIQENTMQHN LSKDVMLVAT ATVLAMNRLG TFVSCRALLD SGSQLHLIHP 120
YLHISFNLRR RNRSTGVGDF SFDTHEMIHR LPITESLRQH VKTIITMMLK ILKTDNEENV 180
LVCLRIIIEL HKHFRPSFNS EGMVEIPKER LDESHALEVI VTDLCGPFFY KPEACNKAPV 240
KCYIQLFLGF VKEIYTNLPN HLTSIFETSN DVWVTDLKDL NLEVLLSESY SVRTIHVEKA 300
LDSNSQQQII RVDDEELIDD IIRDSRCFNS SAKLLHAFAK VSPIKSLFSS SGRAKVSIEK 360
GPSSVPTKRL NLMKNCPKEA AHLRKELLIA ARHIFATDLR QKFIPSIEQL FDEDLLIGKG 420
VTLDSIRPLA YSTLADLAHH VRQSLNIDVL IKAVNLFSKN VHDESLAVGI QTMSCKLLLN 480
LVDCLRHHSE TEPQRSKALL SKLLKVFVKK FETIAKIQLP LIIQKCKGHA FSGALVNSSG 540
NASLSHINAP DLKDDISNIQ VSASGSQWIY SVNVAEFRSL VKTLVGGVKT ITWGFFNSKF 600
QLTDTKLANH EKIFGPEIVC SYIDLVYYAM EALDIYTINV NPNQQRTSGL ISRSKEEKEV 660
LEHFSGIFLM MHSQNFQEIF STTINFLVER IYKNQSLQVI ANSFLANPTT SPLFATVLVE 720
YLLNKMEEMG SNLERSNLYL RLFKLVFGSV SLFPVENEQM LRPHLHKIVN RSMELALISE 780
EPYNYFLLLR ALFRSIGGGS HDLLYQEFLP LLPNLLEGLN RLQSGFHKQH MRDLFVELCL 840
TVPVRLSSLL PYLPMLMDPL VSALNGSPTL ISQGLRTLEL CVDNLQPDFL YDHIQPVRAA 900
LMQALWKTLR NQDNAALVAF RVLGKFGGGN RKMMVEPQAL SYIINDKPTI SIVTYFQEYE 960
TPIDFPVDEA IKSAFRALGS NSTDQFYRRQ SWEVIRCFLA AFISLDDEKH MLLKLFTHVD 1020
FVENKIMNWS TFQHKAGNET VRETHQTALI GMLVASATKD LRDSVCPVMA AVVRHYTMVA 1080
IAQQAGPFPQ KGYQATHGID PMILIDALAS CMGHEEKELC KPGIACMGII LDTATNIMGN 1140
KDRACKLPII QYLAEKMVSL CYDRPWYSKV GGCQAIQFLC KHMSLRALFQ NLFNFLKAFM 1200
FVLMDLEGDV SNGAIEITKS YMKSMLEICL TPINECYKNI DLKDLQAKAT YEVIHELVRH 1260
ITSPNTIVRE ESMVLLKHIG TIQSKTVSEV MDPHKDVLAD IIPPKKHLLR HQPANAQIGL 1320
MDGNTFCTTL EPRLFTIDLT NTYHKLFFHE LLTLSEAEDA TLAKLDCYKN VPNLIPLRTS 1380
ALRALAACHY ISDIGYKEKI INIIFKVMES DKSELQTTAF HCMKHFITGV TLEKEKVQSA 1440
MRPLLLKLGD HRNLSIPAIK RLSYFTQIFP QMFNEKLSEQ ILQHCSKIME IFVSEYKSTS 1500
PNVNFFASSK GGEYEQKIVI LIEMFFYISA SVKYIEKLCQ LVLKTEKNLM IEASSPYREA 1560
LIKFLQRFPT ETVDLFLTES LMIDPQWNRL FIYLLKHETG VSFRAVIKSS RYNNLIHYLN 1620
THTEFPEALK YEIQHQAVLI IFTLMESDDQ WIPTRQDIVD ALKNCWQNYL STLSSEDVLC 1680
DLWHLIGKIL LHYFSNNTND IELLFQLLRA LCFRFIPDVY FLRDFLQHTV AQSFTVNWKR 1740
NAFFYFVENF NNSFLSEELK AKIITAVIIP CFAVSFDKGE GNKLIGAPPT PYQEDEKNIV 1800
SVFINKVFDP DKQYDDAVRI ALLQLACLLV ERASQHIHDG DANNKRQGNK LRRLMTFAWP 1860
CLLSKSSVDP TARYHGHLLL SHIIARLAIH KKIVLQVFHS LLKGHALEAR SIVKQALDVL 1920
TPAMPLRMED GNTMLTHWTK KIIVEEGHAM QQLFHILQLI IRHYKVYFPV RHQLVQHLIN 1980
YMQRLGFPPT ASIEHKKLAV DLAEVIIKWE LHRIKDDRET KTDGTEEELI QESSVKRSGI 2040
DLVETRKKSF DIIRETTVQG KCVMLLKMAM RPEIWPQPFD IKLNWLDKVL ATVETPHHNL 2100
NNICTGIDFL TFLTTILSPD QLVSIIRPVQ RGLSLCIIHQ NTRIVRLMHM FLTRIMAIFP 2160
PDTQHKHEDL DLLYTAVSKM IAENLTSYEK SPQPNASSLF GTLMILKACT TNNASYIDRI 2220
LVQFIRVLNH LTRDHINTIG GNTVISQSPD SNALPLELLV LSLELIKNRI FVMSVEIRKL 2280
FIGTILVSLI EKSTEVKIIK CIIKMLDEWI KTKEPNVMTQ VPSIREKSAL LVKLMQNVEK 2340
KFTDEIELNI QFLEIINFIY RDEILKQTEL TNKLEGAFLN GLRFQNPNVR SKFFEILDSS 2400
MRRRLHDRLL YIICSQAWDT IGSHYWIKQC IELLILTANT MMQIQCSNEQ FKIPSITSVI 2460
PVNSSETQEN SFVSFLSSHS ESFDIIQTVD DKDDVYDIDL NADRKEDCQQ ILPNRRVTLV 2520
ELVYKQAEFL EANRNIRTDQ MLVATSQLCH IDTQLAQSVW LSMFPRIWSI FTEDQRCNIT 2580
KELIPFLSSG TNVNQKDCHP STLNTFVESL TKCAPPIYIP PNLLAYLGKS HNLWHRAILV 2640
LEDMAVNQSM QSKDIDGGEN QFSDLDVQQS NNIFDSLSKM YSSMHEEDLW AGLWLKFAHY 2700
PETNIAVSYE QMGFFEEAQG AYDLAMTKFK QDLSNGVVNT YVNSELLLWE NHWMRCAKEL 2760
NQWDILLDYA QTNKDKNMFL ILESSWRVPD WNLMKIALAK TEQCYLKHYG FKINLYKGYL 2820
SILHQEERQT GNIERYVEIA SSLCIREWRR LPNIVSHIHL PYLQASQQIM ELHEASQIHQ 2880
GLAQSRNNSL HDMKAIVKTW RNRLPIISDD LSHWSDIFTW RQHHYQIITQ HLEQQSDQGS 2940
TMLGVHASAQ AIISFGKIAR KHNLTGVCQE TLSRIYTIPS VPIVDCFQKI RQQVKCYLQM 3000
PSTSGKNEIN EALEVIESTN LKYFTGEMNA EFYALKGLLL AQIGRSEEAG KSFSVAAQLH 3060
DGLTKAWAMW GDYMEQIFLK ERKITLAVDA LICYLQASRN QIESKTRKYI AKVLWFLSYD 3120
NNTKILISTL EKHVAGIPPS YWLPWIPQLL CCLEQFEGDV ILNLLSQIGR LYPQAVYFPI 3180
RTLYLTLKIE QREKHKTAEQ AVKSSCSNID GTTLSFGRGA SHGNIPSINP IKATPPMWRC 3240
SKVMQLQREV HPTILSSLEG IVDQMVWFRE SWTEEVLRQL RQGLIKCYAI AFEKRDTVQH 3300
STITPHTLHF VKKLGSTFGI GIENVPGSVT SSISNSAASE SLARRAQVTF QDPVFQKMKE 3360
QFTNDFDFSK PGAMKLHNLI SKLKTWIKVL ETKVKKLPTS FLIEDKCRFL SNFSQKTAEV 3420
ELPGELLIPL SSHYYVRIAR FMPRVEIVQK NNTAARRLYI RGTNGKIYPY LVVLDSGLGD 3480
ARREERVLQL KRMLNYYLEK QKETSRRFLN ITVPRVVPIS PQMRLAEDNP NSISLLKIFK 3540
KCCQSMQVDY DMPIVKYYDR LSEVQARGTP TTHTLLREIF SEIQWTMVPK TLLKHWALKT 3600
FLAATDFWHF RKMLTLQLAL AFLCEHALNL TRLNADMMYL HQDSGLMNIS YFKFDVNDDK 3660
CQLNQHRPVP FRLTPNVGEF ITHFGITGPL SAAIVATARC FIQPNYKLSS ILQTILRDEI 3720
IALQKKGFRE CKLIEGSEDR YSDGNCMEHS VNIVNSAVDI IMTRFNKISY FDSIENKKIS 3780
VLVQSATNID NLCRMDPAWH PWL 3803 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:FlyBase.
 GO:0005875; C:microtubule associated complex; IDA:FlyBase.
 GO:0035267; C:NuA4 histone acetyltransferase complex; IDA:UniProtKB.
 GO:0005703; C:polytene chromosome puff; IDA:FlyBase.
 GO:0000124; C:SAGA complex; IDA:FlyBase.
 GO:0016773; F:phosphotransferase activity, alcohol group as acceptor; IEA:InterPro.
 GO:0000910; P:cytokinesis; IMP:FlyBase.
 GO:0016573; P:histone acetylation; IDA:UniProtKB.
 GO:0043486; P:histone exchange; IDA:UniProtKB.
 GO:0006911; P:phagocytosis, engulfment; IMP:FlyBase.
 GO:0045747; P:positive regulation of Notch signaling pathway; IGI:FlyBase.
 GO:0006355; P:regulation of transcription, DNA-dependent; IDA:UniProtKB.
 GO:0006351; P:transcription, DNA-dependent; IDA:UniProtKB.
 GO:0035222; P:wing disc pattern formation; IGI:FlyBase. 
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR003152; FATC.
 IPR011009; Kinase-like_dom.
 IPR000403; PI3/4_kinase_cat_dom.
 IPR003151; PIK-rel_kinase_FAT.
 IPR014009; PIK_FAT. 
Pfam
 PF02259; FAT
 PF02260; FATC
 PF00454; PI3_PI4_kinase 
SMART
 SM00146; PI3Kc 
PROSITE
 PS51189; FAT
 PS51190; FATC
 PS50077; HEAT_REPEAT
 PS00915; PI3_4_KINASE_1
 PS00916; PI3_4_KINASE_2
 PS50290; PI3_4_KINASE_3 
PRINTS