CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-003692
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Heme-responsive zinc finger transcription factor HAP1 
Protein Synonyms/Alias
 CYP1 activatory protein; Heme activator protein 1 
Gene Name
 HAP1 
Gene Synonyms/Alias
 CYP1; YLR256W; L9672.1 
Created Date
 July 27, 2013 
Organism
 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) 
NCBI Taxa ID
 559292 
Lysine Modification
Position  Peptide          Type            References
280       YSKLKSSKCPINHAQ  ubiquitination  [1]
399       VDHRNYMKDYPSDMA  ubiquitination  [1]
487       VIALFIEKFFKHLYP  ubiquitination  [1]
725       VSNNGSKKANPSTNP  ubiquitination  [1]
811       LRDFSDTKLPSASRI  ubiquitination  [1]
1124      SRMLLFQKLTKQLSK  ubiquitination  [1]
Reference
 [1] Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation.
 Swaney DL, Beltrao P, Starita L, Guo A, Rush J, Fields S, Krogan NJ, Villén J.
 Nat Methods. 2013 Jul;10(7):676-82. [PMID: 23749301]
Functional Description
 Regulation of oxygen-dependent gene expression. It modulates the expression of Iso-1 (CYP1) and Iso-2 (CYP3) cytochrome c. In response to heme, promotes transcription of genes encoding functions required for respiration, controlling oxidative damage and repression of anaerobic genes. Binds to the sequence 5'-CGGNNNTNNCGG-3' (By similarity). Is non-functional in terms of Iso-1 cytochrome c expression in strain S288c and its derivatives. 
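The binding consensus 5'-CGGNNNTNNCGG-3' uses the IUPAC letter N for any base. The following is a minimal sketch of how one might scan a DNA string for it, assuming plain-string input and the four standard bases only. The helper names `consensus_to_regex` and `find_consensus_sites` and the test sequence are hypothetical illustrations and are not part of CPLM or UniProt.

```python
import re

# Consensus from the functional description; N stands for any base (IUPAC).
CONSENSUS = "CGGNNNTNNCGG"

def consensus_to_regex(consensus):
    """Translate a consensus made of A/C/G/T/N into a regex pattern string."""
    return "".join("[ACGT]" if base == "N" else base for base in consensus)

def find_consensus_sites(dna, consensus=CONSENSUS):
    """Return 0-based start offsets of consensus matches on the given strand.

    The pattern is wrapped in a lookahead so overlapping matches are reported.
    """
    pattern = re.compile("(?=" + consensus_to_regex(consensus) + ")")
    return [m.start() for m in pattern.finditer(dna.upper())]

# Made-up sequence for illustration only, not a real promoter.
example = "ttaCGGACTTATCGGaa"
print(find_consensus_sites(example))  # prints [3]
```

This sketch searches only the strand it is given; since the consensus is not its own reverse complement, a fuller scan would also need to check the reverse-complemented sequence.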
Sequence Annotation
 REPEAT 280 285 HRM 1.
 REPEAT 299 304 HRM 2.
 REPEAT 323 328 HRM 3.
 REPEAT 347 352 HRM 4.
 REPEAT 389 394 HRM 5.
 REPEAT 415 420 HRM 6.
 REPEAT 1192 1197 HRM 7.
 DNA_BIND 64 93 Zn(2)-C6 fungal-type.
 REGION 244 444 Heme-responsive; required for HMC
 METAL 64 64 Zinc 1 (By similarity).
 METAL 64 64 Zinc 2 (By similarity).
 METAL 67 67 Zinc 1 (By similarity).
 METAL 74 74 Zinc 1 (By similarity).
 METAL 81 81 Zinc 1 (By similarity).
 METAL 81 81 Zinc 2 (By similarity).
 METAL 84 84 Zinc 2 (By similarity).
 METAL 93 93 Zinc 2 (By similarity).  
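Each annotation line above follows the pattern "feature key, start, end, free-text note". As a rough sketch of how such lines could be turned into records, the snippet below parses a few of them and groups the METAL entries by zinc ion. The `Feature` record, `parse_feature` and `zinc_ligands` are hypothetical names, and the parser assumes whitespace-separated fields exactly as shown in this entry.

```python
import re
from collections import defaultdict, namedtuple

# Hypothetical record type for one annotation line.
Feature = namedtuple("Feature", "kind start end note")

def parse_feature(line):
    """Split 'KIND START END free text' into a Feature record."""
    kind, start, end, note = line.strip().split(None, 3)
    return Feature(kind, int(start), int(end), note)

def zinc_ligands(features):
    """Map each zinc ion number to the residue positions that bind it."""
    groups = defaultdict(list)
    for f in features:
        match = re.match(r"Zinc (\d+)", f.note)
        if f.kind == "METAL" and match:
            groups[int(match.group(1))].append(f.start)
    return dict(groups)

# A subset of the annotation lines from this entry.
ANNOTATION = """
 REPEAT 280 285 HRM 1.
 DNA_BIND 64 93 Zn(2)-C6 fungal-type.
 METAL 64 64 Zinc 1 (By similarity).
 METAL 64 64 Zinc 2 (By similarity).
 METAL 67 67 Zinc 1 (By similarity).
 METAL 74 74 Zinc 1 (By similarity).
 METAL 81 81 Zinc 1 (By similarity).
 METAL 81 81 Zinc 2 (By similarity).
 METAL 84 84 Zinc 2 (By similarity).
 METAL 93 93 Zinc 2 (By similarity).
"""

features = [parse_feature(line) for line in ANNOTATION.splitlines() if line.strip()]
print(zinc_ligands(features))  # {1: [64, 67, 74, 81], 2: [64, 81, 84, 93]}
```

Grouped this way, positions 64 and 81 appear under both zinc ions, and all listed positions fall inside the DNA_BIND range 64 to 93.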
Keyword
 Activator; Coiled coil; Complete proteome; DNA-binding; Heme; Iron; Metal-binding; Nucleus; Reference proteome; Repeat; Transcription; Transcription regulation; Zinc. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1502 AA 
Protein Sequence
MSNTPYNSSV PSIASMTQSS VSRSPNMHTA TTPGANTSSN SPPLHMSSDS SKIKRKRNRI 60
PLSCTICRKR KVKCDKLRPH CQQCTKTGVA HLCHYMEQTW AEEAEKELLK DNELKKLRER 120
VKSLEKTLSK VHSSPSSNSL KSYNTPESSN LFMGSDEHTT LVNANTGSAS SASHMHQQQQ 180
QQQQQEQQQD FSRSANANAN SSSLSISNKY DNDELDLTKD FDLLHIKSNG TIHLGATHWL 240
SIMKGDPYLK LLWGHIFAMR EKLNEWYYQK NSYSKLKSSK CPINHAQAPP SAAAAATRKC 300
PVDHSAFSSG MVAPKEETPL PRKCPVDHTM FSSGMIPPRE DTSSQKRCPV DHTMYSAGMM 360
PPKDETPSPF STKAMIDHNK HTMNPPQSKC PVDHRNYMKD YPSDMANSSS NPASRCPIDH 420
SSMKNTAALP ASTHNTIPHH QPQSGSHARS HPAQSRKHDS YMTESEVLAT LCEMLPPKRV 480
IALFIEKFFK HLYPAIPILD EQNFKNHVNQ MLSLSSMNPT VNNFGMSMPS SSTLENQPIT 540
QINLPKLSDS CNLGILIIIL RLTWLSIPSN SCEVDLGEES GSFLVPNESS NMSASALTSM 600
AKEESLLLKH ETPVEALELC QKYLIKFDEL SSISNNNVNL TTVQFAIFYN FYMKSASNDL 660
TTLTNTNNTG MANPGHDSES HQILLSNITQ MAFSCGLHRD PDNFPQLNAT IPATSQDVSN 720
NGSKKANPST NPTLNNNMSA ATTNSSSRSG SADSRSGSNP VNKKENQVSI ERFKHTWRKI 780
WYYIVSMDVN QSLSLGSPRL LRNLRDFSDT KLPSASRIDY VRDIKELIIV KNFTLFFQID 840
LCIIAVLNHI LNVSLARSVR KFELDSLINL LKNLTYGTEN VNDVVSSLIN KGLLPTSEGG 900
SVDSNNDEIY GLPKLPDILN HGQHNQNLYA DGRNTSSSDI DKKLDLPHES TTRALFFSKH 960
MTIRMLLYLL NYILFTHYEP MGSEDPGTNI LAKEYAQEAL NFAMDGYRNC MIFFNNIRNT 1020
NSLFDYMNVI LSYPCLDIGH RSLQFIVCLI LRAKCGPLTG MRESSIITNG TSSGFNSSVE 1080
DEDVKVKQES SDELKKDDFM KDVNLDSGDS LAEILMSRML LFQKLTKQLS KKYNYAIRMN 1140
KSTGFFVSLL DTPSKKSDSK SGGSSFMLGN WKHPKVSNMS GFLAGDKDQL QKCPVYQDAL 1200
GFVSPTGANE GSAPMQGMSL QGSTARMGGT QLPPIRSYKP ITYTSSNLRR MNETGEAEAK 1260
RRRFNDGYID NNSNNDIPRG ISPKPSNGLS SVQPLLSSFS MNQLNGGTIP TVPSLTNITS 1320
QMGALPSLDR ITTNQINLPD PSRDEAFDNS IKQMTPMTSA FMNANTTIPS STLNGNMNMN 1380
GAGTANTDTS ANGSALSTLT SPQGSDLASN SATQYKPDLE DFLMQNSNFN GLMINPSSLV 1440
EVVGGYNDPN NLGRNDAVDF LPVDNVEIDG VGIKINYHLL TSIYVTSILS YTVLEDDAND 1500
EK 1502 
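The sequence is laid out in blocks of 10 residues, 60 per line, with a running residue count at the end of each line. Each peptide listed under Lysine Modification is a 15-residue window centred on the modified lysine. A minimal sketch of how that relationship could be checked is shown below; `parse_sequence_block` and `peptide_window` are hypothetical helper names, and only the single line covering residues 241 to 300 is used to keep the example short.

```python
def parse_sequence_block(text):
    """Join residues, dropping spaces, newlines and the running counts."""
    return "".join(ch for ch in text if ch.isalpha())

def peptide_window(seq, position, first_position=1, flank=7):
    """Return the residues within `flank` of a 1-based `position`.

    `first_position` is the numbering of seq[0], so a fragment of the
    protein can be used instead of the full sequence. The window is
    truncated if it runs past either end of `seq`.
    """
    index = position - first_position
    if not 0 <= index < len(seq):
        raise ValueError("position lies outside the supplied sequence")
    return seq[max(0, index - flank): index + flank + 1]

# The line of this entry that ends at residue 300 (residues 241-300).
FRAGMENT = parse_sequence_block(
    "SIMKGDPYLK LLWGHIFAMR EKLNEWYYQK NSYSKLKSSK CPINHAQAPP SAAAAATRKC 300"
)

window = peptide_window(FRAGMENT, 280, first_position=241)
print(window)       # YSKLKSSKCPINHAQ
print(window[7])    # K, the modified lysine at position 280
```

With the full 1502-residue sequence parsed the same way and `first_position` left at 1, the same call can be repeated for the other listed positions.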
Gene Ontology
 GO:0005634; C:nucleus; IDA:SGD.
 GO:0000978; F:RNA polymerase II core promoter proximal region sequence-specific DNA binding; IDA:SGD.
 GO:0001077; F:RNA polymerase II core promoter proximal region sequence-specific DNA binding transcription factor activity involved in positive regulation of transcription; IMP:SGD.
 GO:0001133; F:sequence-specific transcription regulatory region DNA binding RNA polymerase II transcription factor recruiting transcription factor activity; IC:SGD.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0000436; P:carbon catabolite activation of transcription from RNA polymerase II promoter; IMP:SGD.
 GO:0071169; P:establishment of protein localization to chromatin; IMP:SGD.
 GO:0061428; P:negative regulation of transcription from RNA polymerase II promoter in response to hypoxia; IMP:SGD.
 GO:0043457; P:regulation of cellular respiration; IMP:SGD. 
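Each Gene Ontology line above carries three semicolon-separated fields: the GO identifier, an aspect letter with the term name (C, F or P), and an evidence code with its source. The sketch below shows one way such a line could be split into a record. `GoAnnotation` and `parse_go_line` are hypothetical names, and the parser assumes that term names contain no "; " sequence, which holds for the lines in this entry.

```python
from typing import NamedTuple

class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str

# Single-letter aspect codes as used in the lines above.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}

def parse_go_line(line):
    """Parse 'GO:id; X:term name; EVIDENCE:source.' into a GoAnnotation."""
    go_id, term_field, evidence_field = line.strip().rstrip(".").split("; ")
    aspect_code, term = term_field.split(":", 1)
    evidence, source = evidence_field.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect_code], term, evidence, source)

record = parse_go_line(" GO:0043457; P:regulation of cellular respiration; IMP:SGD. ")
print(record.go_id, record.aspect, record.evidence)
```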
Interpro
 IPR007219; Transcription_factor_fun.
 IPR001138; Zn2-C6_fun-type_DNA-bd. 
Pfam
 PF00172; Zn_clus 
SMART
 SM00906; Fungal_trans
 SM00066; GAL4 
PROSITE
 PS00463; ZN2_CY6_FUNGAL_1
 PS50048; ZN2_CY6_FUNGAL_2 
PRINTS