CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-034186
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Transcription initiation factor TFIID subunit 1 
Protein Synonyms/Alias
  
Gene Name
 Taf1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
571KSRILLGKTGVIKEEacetylation[1]
Reference
 [1] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337
Functional Description
  
Sequence Annotation
  
Keyword
 Bromodomain; Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1902 AA 
Protein Sequence
MGPGWAGLLQ DKGGGSPSVV MSDTDSDEES AGGGPFSLTG FLFGNINGAG QLEGESVLDD 60
ECKKHLAGLG ALGLGSLITE LTANEELSGS DGALVNDEGW IRSREDAVDY SDINEVAEDE 120
SRRYQQTMGS LQPLCHTDYD EDDYDADCED IDCKLMPPPP PPPGPLKKEK DQDDITGEKV 180
DFSSSSDSES EMGPQDAAQS ESKDGQLTLP LAGIMQHDAT KLLPSVTELF PEFRPGKVLR 240
FLRLFGPGKN VPSVWRSARR KRKKKHRELI QEGQVQEEEC SVELEVNQKS LWNYDYAPPP 300
LPDQCLSDDE ITMMAPVESK FSQSTGDTDK VMDTKPRVAE WRYGPARLWY DMLGVPEDGS 360
GFDYGFKMKK TEHESTIKCN IMKKLRKLEE NSGVDLLADE NFLMVTQLHW EDDIIWDGED 420
VKHKGTKPQR ASLAGWLPSS MTRNAMAYNV QQGRPCIKKP TSQTKQNTHT QNSISQPSGS 480
LHTTLDDDKP WYSIFPIDNE DLVYGRWEDN IIWDAQNMPR ILEPPVLTLD PNDENLILEI 540
PDEKEEATSN SPSKENKKES SLKKSRILLG KTGVIKEEPQ QNMSQPEVKD PWNLSNDEYY 600
YPKQQGLRGT FGGNIIQHSI PAVELRQPFF PTHMGPIKLR QFHRPPLKKY SFGALSQPGP 660
HSVQPLLKHI KKKAKMREQE RQASGGGEMF FMRTPQDLTG KDGDLILAEY SEENGPLMMQ 720
VGMATKIKNY YKRKPGKDPG APDCKYGETV YCHTSPFLGS LHPGQLLQAF ENNLFRAPIY 780
LHKMPESDFL IIRTRQGYFI RELVDIFVVG QQCPLFEVPG PNSKRANTHI RDFLQVFIYR 840
LFWKSKDRPR RIRMEDIKKA FPSHSESSIR KRLKLCADFK RTGMDSNWWV LKSDFRLPTE 900
EEIRAMVSPE QCCAYYSMIA AEQRLKDAGY GEKSFFAPEE ENEEDFQMKI DDEVRTAPWN 960
TTRAFIAAMK GKCLLEVTGV ADPTGCGEGF SYVKIPNKPT QQKDDKEPQP VKKTVTGTDA 1020
DLRRLSLKNA KQLLRKFGVP EEEIKKLSRW EVIDVVRTMS TEQARSGEGP MSKFARGSRF 1080
SVAEHQERYK EECQRIFDLQ NKVLSSTEVL STDTDSSSAE DSDFEEMGKN IENMLQNKKT 1140
SSQLSREREE QERKELQRML LEADGEAAGS AAAGNNHRDD DTASVTSLNS SATGRCLKIY 1200
RTFRDEEGKE YVRCETVRKA TVIDAYVRIR TTKDEEFIRK FALFDEQHRE EMRKERRRIQ 1260
EQLRRLKRNQ EKEKLKGPPE KKPKKMKERP DLKLKCGACG AIGHMRTNKF CPLYYQTNAP 1320
PSNPVAMTEE QEEELEKTVI HNDNEELIKV EGTKIVLGKQ LIESADEVRR KSLVLKFPKQ 1380
QLPPKKKRRV GTTVHCDYLN RPHKSIHRRR TDPMVTLSSI LESIINDMRD LPNTYPFHTP 1440
VNAKVVKDYY KIITRPMDLQ TLRENVRKRL YPSREEFREH LELIVKNSAT YNGPKHSLTQ 1500
ISQSMLDLCD EKLKEKEDKL ARLEKAINPL LDDDDQVAFS FILDNIVTQK MMAVPDSWPF 1560
HHPVNKKFVP DYYKVIVSPM DLETIRKNIS KHKYQSRESF LDDVNLILAN SVKYNGPESQ 1620
YTKTAQEIVN VCHQTLTEYD EHLTQLEKDI CTAKEAALEE AELESLDPMT PGPYTPQPPD 1680
LYDNNTSLSV SRDASVYQDE SNLSVLDIPS ATSEKQLTQE GGDGDGDLAD EEEGTVQQPQ 1740
ASVLYEDLLM SEGEDDEEDA GSDEEGDNPF FAIQLSESGS DSDVESGSLR PKQPRVLQEN 1800
TRMGMENEES MMSYEGDGGD ASRGLEDSNI SYGSYEEPDP KSNTQDTSFS SIGGYEVSEE 1860
EEDEEEQRSG PSVLSQVHLS EDEEDSEDFH SIAGDTDLDS DE 1902 
Gene Ontology
 GO:0045120; C:pronucleus; IDA:MGI.
 GO:0005669; C:transcription factor TFIID complex; ISO:MGI.
 GO:0003677; F:DNA binding; IDA:MGI.
 GO:0006352; P:DNA-dependent transcription, initiation; IEA:InterPro.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro. 
Interpro
 IPR001487; Bromodomain.
 IPR018359; Bromodomain_CS.
 IPR011177; TAF1_animal.
 IPR009067; TAF_II_230-bd.
 IPR022591; TFIID_sub1_DUF3591. 
Pfam
 PF00439; Bromodomain
 PF12157; DUF3591
 PF09247; TBP-binding 
SMART
 SM00297; BROMO 
PROSITE
 PS00633; BROMODOMAIN_1
 PS50014; BROMODOMAIN_2 
PRINTS
 PR00503; BROMODOMAIN.