CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-035094
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 E1A binding protein p400, isoform CRA_a 
Protein Synonyms/Alias
 Protein Ep400 
Gene Name
 Ep400 
Gene Synonyms/Alias
 rCG_21858 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
345       SMGSSGIKKVPKKLE  acetylation  [1]
864       ELEEKRKKALNLQKV  acetylation  [1]
870       KKALNLQKVSRRGKE  acetylation  [1]
1394      HEAELLYKKKVTRKL  acetylation  [1]
2230      KQQTPFAKPLPTYVK  acetylation  [1]
2361      GKRSPPIKPLLGMNP  acetylation  [1]
2685      AQMKAVGKLAPEHII  acetylation  [1]
3030      VRLKTPTKPPCQ***  acetylation  [1]
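Each row of the lysine modification table carries a 1-based position, a 15-residue peptide window, the modification type, and a reference index. In the rows listed here the modified lysine sits in the middle of the window (index 7 when counting from 0), and a window that runs past the end of the protein is padded with `*`. The following is a minimal Python sketch of a row parser under those assumptions; the names `LysineSite`, `ROW_RE`, and `parse_site_row` are hypothetical and are not part of CPLM. The pattern accepts rows with or without whitespace between the fields.

```python
import re
from typing import NamedTuple


class LysineSite(NamedTuple):
    position: int   # 1-based position of the modified lysine
    peptide: str    # 15-residue window, '*' marks padding past the terminus
    mod_type: str   # e.g. "acetylation"
    reference: int  # index into the record's reference list


# Assumed row layout: digits, 15 window characters, lowercase type, [ref].
ROW_RE = re.compile(r"^\s*(\d+)\s*([A-Z*]{15})\s*([a-z]+)\s*\[(\d+)\]\s*$")


def parse_site_row(row: str) -> LysineSite:
    match = ROW_RE.match(row)
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    position, peptide, mod_type, reference = match.groups()
    # Sanity check: the central residue of the window should be lysine.
    if peptide[7] != "K":
        raise ValueError(f"central residue is not K in {peptide!r}")
    return LysineSite(int(position), peptide, mod_type, int(reference))


rows = [
    "345SMGSSGIKKVPKKLEacetylation[1]",
    "3030      VRLKTPTKPPCQ***  acetylation  [1]",
]
sites = [parse_site_row(row) for row in rows]
for site in sites:
    print(site.position, site.peptide, site.mod_type, site.reference)
```

The central-residue check is a cheap way to catch a misaligned split between the position digits and the peptide.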
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3034 AA 
Protein Sequence
MHHGSGPQNV QHQLQRSRSF TGSEEEQPAH PNLPPSPAAP FAPSASPSAP QSPGYQIQQL 60
MSRSPVAGQN VNITLQNVGP VVGGNQQITL APLPLPNPTS PGFQFGAQQR RFEHGSPSYI 120
QVTSPLSQQV QTQSPTQPNP GPGQTLQNVR AGAPGPGLGI CSSSPTGGFV DASVLVRQIS 180
LSPSSGGHFV FQEAPGLTQM AQGAQVQLQH SGAPITVRER RLSQPHAQSG GTIHHLGPQS 240
PATAGGTGLQ PLASPNHITT ASLPPQISSI IQGQLIQQQQ QVLQGQPMNR SLGFERTPGV 300
LLPGVGGPSA FGMTSPPPPT SPSRTTMPPG LSSVPLTSMG SSGIKKVPKK LEEIPPASQE 360
MAQMRKQCLD YHYKEMEALK EVFKEYLIEL FFLQHLQGNM MDFLAFKKKH YAPLQAYLRQ 420
NDLDIEEEEE EEEEEEEKSE VINDEHQALT GSLVVGPGST TEADPFKRQQ VMPPTEQSKR 480
PRLEVGHPGV VFQHPGVNAG VPLQQLMPTV QGGMPPTPQA TQLTGQKQSQ QQYDPSTGPP 540
VQNAASLHTP PPQLPARLPP ASVPATALPS TLQFSQQSQM VEAPTQLQIP VKTQQLNAPI 600
PAPLPSQLPA PSSQPAQPAL HVSMPGKAQM QTSQLSSQTQ TVASTRPPLE SAQPCQRSLP 660
TSSSSSSLVP VSGSGPGPSP ARSSPVNRPS SATNKALSPV TSRSPGVAVS APPKPQSPAQ 720
NAASSQDGSQ DKLAEQITLE NQIHQRIADL RKEGLWSLRR LPKLQEAPRP KSHWDYLLEE 780
MQWMATDFAQ ERRWKLAAAK KLVRTVARHH EEKKLREERG KKEEQSRLRR IAATTAREIE 840
YFWSNIEQVV EIKLQVELEE KRKKALNLQK VSRRGKESRL KGFDTSPEHS MDLGISGRKR 900
KASTSLTDDE VEDEEETIEE EEAHEGLVDH HTELSNLAKE AELPLIDLMK LYEGAFLPNF 960
QWPQPEPDHE ESSGEEDAED CPSDRESRRD SVLIDSLFIM DQFKAAERMS IGKSNTKDIT 1020
EVTAVAEAIL PKGSARVTTA VKFSAPSLLY GALRDYQKIG LDWLAKLYRK NLNGILADEA 1080
GLGKTVQIIA FFAHLACNEG NWGPHLVVMR SCNILKWELE LKRWCPGLKT LSYVGTHREL 1140
KAKRQEWTEP NNFHICITSY KQFFRGYTAF SRVHWKCLVV DEMQRVKGMT ERHWEAIFKL 1200
QSQQRLLLID VPLHNTFLEL WTMVHFLIPG ISRPYLSFPL KAPNEENQDY YHKMVIRLHR 1260
VTQPFILRRT KRDVEKQLTR KYEHVLKCRL SSRQKALYED VILQPRTQEA LKSGHFVSVL 1320
SVLTRLQRIC NHPGLVEPRV PGSSYAAGSL QYKSASLILR VLEREFWKET DLSMFDLIGL 1380
ENKISRHEAE LLYKKKVTRK LMEEVFTSPP PSARPAAVKL KASRLFQPVQ YGQKPEGRTV 1440
AFPSTHPPRM ANTNASTATP QGQVRGRPPI ATFSANPDTK GGEVVKIAQL ASIAGPQSRV 1500
AQPETPVTLQ FQGNKFTLSH SQLRQLTAGQ PLQLQGSVLQ IVSAPGQPYL RAPGPVVMQT 1560
VSQAGAVHST LGSKPPTSGP SPAPLSTQVG VPGRVAVSAM AVGEPGLASK PASPAAGPTQ 1620
EEKSRLLKER LDQIHFINER RCSQAPVYGR DLLRICSLPG RRKRPLCWSL DSNLGKGPKG 1680
ADYDTSLSKS EGDLILTLSQ RQESLQDVLD RVACVIPPVV ATPPSLWVER PPSLYSSRLR 1740
ALRQCLREHT GPYHRQLQQL TALRSLQFPE LRLVQFDSGK LEALAILLQK LKSEGRRVLI 1800
LSQMVLMLDI LEMFLNFHYL TYVRIDENAN SEQRQELMRS FNRDRRIFCA LLSTHSRATG 1860
INLVEADTVV FYDNDLNPVM DAKAQEWCDR IGRCKDIHIY RLVSGNSIEE KLLKNGTKDL 1920
IREVAAQGND YSMAFLTQRT IQELFEVYSP MDDAGFPVKA EEFVVLSQEP SVSETIAPKI 1980
ARPFIEALKS IECLEEDAQR STEEAVPGSS TVAVSSDSDG SRYDEEPSQL EELADFMEQL 2040
TPIEKYALNY LELFHTTTEQ EKERSSEDLV MATMKDWETR NARALQEREA QLQLEQEEAE 2100
LLTYTREDAY SMEYVYEDAD GQTEVMPLWT PPTPPQDDND IYIDSVMCLM YETTPIPEAK 2160
LPPVYVRKER KRHKTDPSAA GRKKKQRHGE AVVPPRSLFD RATPGMLKIR REGKEQKKNL 2220
LLKQQTPFAK PLPTYVKSSG EPAQDSPDWL IGEDWALLQA VKQLLELPLN LTIVSPAHTP 2280
NWDLVSDVVN SCSRIYRSSK QCRNRYENVI IPREEGKSKN NRPLRTSQIY AQDENATHTQ 2340
LYTSHFELMK MTAGKRSPPI KPLLGMNPFQ KNPKHASVLA ESGINYDKPL PPIQVASLRA 2400
ERIAKEKKAL ADQQKAQQPP VTQPPPQQQP QQQQPPQQQQ QQPPPPPQQP PPPVPQPQAA 2460
SSQTPAGQPV VQPQVQAQPQ PPPVQAQSKG QATMTTVGSA AVLAGTIKTS VTGTSIPTGT 2520
VSGNVIVNTI AGVPATFQSI NKRLASPVAP GALTTSGGSA PAQVVHTQQR AVGSPATATT 2580
DLVSMTTTQG VRAVTSVTAS AVVTTNLTPV QTPTRSLVTQ VSQATGVQLP GKTITPAAHF 2640
QLLRQQQQQQ QQQQQQQQQQ QTSQVQVPQL QGQAQSPAQM KAVGKLAPEH IIKMQKQKMQ 2700
LPPQPPPPQA QPGPPQQPAQ VQVQTAQPPQ QQQSPQLTTV TAPRPGALLT GTTVANLQVA 2760
RLTRVPTSQL QAQGQMQTQT PQPAQVALAK PPVVSVPAAV VSSPGVTTLP MNVAGISVAI 2820
GQPQKTAGQT VVAQPVHVQQ LLKLKQQAVQ QQKAIQPQVA QGQAAVQQKL TTQQITTQGP 2880
QQKVAYAAQP ALKTQFLTTP ISQAQKLAGT QQVQTQIQVA KLPQVVQQQT PVASIQQVAS 2940
ASQQASPQTV TLTQATAAGQ QVQMIPTVTA TAQVVQQKLI QQQVVTTASA SLQTPGGPSP 3000
AQLPTSSDSP NQQPKLQMRV PAVRLKTPTK PPCQ 3034 
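The sequence block above is laid out in groups of 10 residues, with a running residue count at the end of each line. A minimal Python sketch of how such a block might be joined into one string is shown here, using the first two lines of this record as input; the function name `parse_sequence_block` is hypothetical. The running count is used as a consistency check rather than being discarded.

```python
def parse_sequence_block(lines):
    """Join grouped sequence lines and verify each trailing running count."""
    sequence = ""
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        # The last field is the cumulative residue count, when present.
        expected = int(fields.pop()) if fields[-1].isdigit() else None
        sequence += "".join(fields)
        if expected is not None and len(sequence) != expected:
            raise ValueError(
                f"expected {expected} residues so far, found {len(sequence)}"
            )
    return sequence


example = [
    "MHHGSGPQNV QHQLQRSRSF TGSEEEQPAH PNLPPSPAAP FAPSASPSAP QSPGYQIQQL 60",
    "MSRSPVAGQN VNITLQNVGP VVGGNQQITL APLPLPNPTS PGFQFGAQQR RFEHGSPSYI 120",
]
seq = parse_sequence_block(example)
print(len(seq))
```

Applied to the whole block, the final running count should agree with the stated protein length of 3034 AA.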
Gene Ontology
 GO:0035267; C:NuA4 histone acetyltransferase complex; IEA:Compara.
 GO:0016607; C:nuclear speck; IEA:Compara.
 GO:0005524; F:ATP binding; IEA:InterPro.
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0004386; F:helicase activity; IEA:InterPro.
 GO:0043968; P:histone H2A acetylation; IEA:Compara.
 GO:0043967; P:histone H4 acetylation; IEA:Compara. 
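Each Gene Ontology line above has three semicolon-separated fields: the GO identifier, an aspect code with the term name (C for cellular component, F for molecular function, P for biological process), and an evidence code with its source (IEA stands for "inferred from electronic annotation"). A minimal Python sketch of splitting one such line follows; `GoAnnotation` and `parse_go_line` are hypothetical names, and the sketch assumes the term name itself contains no semicolon.

```python
from typing import NamedTuple

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line: str) -> GoAnnotation:
    # Drop surrounding whitespace and the trailing period, then split fields.
    parts = [part.strip() for part in line.strip().rstrip(".").split(";")]
    if len(parts) != 3:
        raise ValueError(f"unexpected GO line: {line!r}")
    go_id, term_field, evidence_field = parts
    aspect_code, term = term_field.split(":", 1)
    evidence, source = evidence_field.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect_code], term, evidence, source)


ann = parse_go_line(" GO:0005524; F:ATP binding; IEA:InterPro.")
print(ann)
```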
Interpro
 IPR013999; HAS_subgr.
 IPR014012; Helicase/SANT-assoc_DNA-bd.
 IPR014001; Helicase_ATP-bd.
 IPR001650; Helicase_C.
 IPR006562; HSA.
 IPR017877; Myb-like_dom.
 IPR027417; P-loop_NTPase.
 IPR001005; SANT/Myb.
 IPR000330; SNF2_N. 
Pfam
 PF00271; Helicase_C
 PF07529; HSA
 PF00176; SNF2_N 
SMART
 SM00487; DEXDc
 SM00490; HELICc
 SM00573; HSA
 SM00717; SANT 
PROSITE
 PS51192; HELICASE_ATP_BIND_1
 PS51194; HELICASE_CTER
 PS51204; HSA
 PS50090; MYB_LIKE 
PRINTS