CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-038696
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 MAX gene-associated protein 
Protein Synonyms/Alias
  
Gene Name
 Mga 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
705       TRISQLEKELIEDLK  ubiquitination  [1]
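The modification entry above gives a position and a 15-residue peptide. A minimal sketch of how the two relate, assuming the peptide is a symmetric window centered on the modified lysine (an assumption that matches this entry but is not stated in the record; windows truncated near a protein terminus would not fit it). The function name `modified_residue` is hypothetical.

```python
def modified_residue(peptide: str, position: int) -> tuple[str, int, int]:
    """Return the central residue of the peptide window and the
    1-based start and end coordinates of that window in the protein.

    Assumes the window is symmetric around the modified residue.
    """
    flank = len(peptide) // 2          # residues on each side of the center
    center = peptide[flank]            # residue at the reported position
    return center, position - flank, position + flank


residue, start, end = modified_residue("TRISQLEKELIEDLK", 705)
print(residue, start, end)  # K 698 712
```

Under this assumption the reported position 705 is the lysine in the middle of the peptide, and the window spans residues 698 to 712.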
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; DNA-binding; Nucleus; Reference proteome; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3042 AA 
Protein Sequence
MEEKQQIILA NQDGGTVTGG APTFFVILKQ PGNGKTDQGI LVTNRDARAL LSRESSPGKS 60
KEKICLPADC TVGKITVTLD NNSMWNEFHN RSTEMILTKQ GRRMFPYCRY WITGLDSNLK 120
YILVMDISPV DSHRYKWNGR WWEPSGKAEP HILGRVFIHP ESPSTGHYWM HQPVSFYKLK 180
LTNNTLDQEG HIILHSMHRY LPRLHLVPAE KATEVIQLNG PGVHTFTFPQ TEFFAVTAYQ 240
NIQITQLKID YNPFAKGFRD DGLSSKPQRE GKQRNSSDQE GNSVSSSPAH RVRLTEGEGS 300
EIHSGDFDPV LRGHEASSLS LEKAPHNVKQ DFLGFMNTDS THEVPQLKHE ISESRIVNSF 360
EDDSQISSPS NPNGNFNVVI KEEPLDDYDY ELGECPEGIT VKQEETDEET DVYSNSDDDP 420
ILEKQLKRHN KVDNLEADHP SYKWLPNSPG VAKAKMFKLD AGKMPVVYLE PCAVTKSTVK 480
ISELPDNMLS TSRKDKSMLA ELEYLPAYIE NSDGTDFCLS KDSENSLRKH SPDLRIVQKY 540
TLLKEPNWKY PDILDNSSTE RIHDSSKGST AESFSGKEDL GKKRTTMLKM AIPSKTVTAS 600
HSASPNTPGK RGRPRKLRLS KAGRPPKNTG KSLTAAKNIP VGPGSTFPDV KPDLEDVDGV 660
LFVSFESKEA LDIHAVDGTT EEPSSLQTTT TNDSGCRTRI SQLEKELIED LKSLRHKQVI 720
HPALQEVGLK LNSVDPTVSI DLKYLGVQLP LAPATSFPLW NVTGTNPASP DAGFPFVSRT 780
GKTNDFTKIK GWRGKFQNAS ASRNEGGNSE ASLKNRSAFC SDKLDEYLEN EGKLMETNIG 840
FSSNAPTSPV VYQLPTKSTS YVRTLDSVLK KQSTISPSTS HSVKPQSVTT ASRKTKAQNK 900
QTTLSGRTKS SYKSILPYPV SPKQKNSHVS QGDKITKNSL SSTSDNQVTN LVVPSVDENA 960
FPKQISLRQA QQQHLQQQGT RPPGLSKSQV KLMDLEDCAL WEGKPRTYIT EERADVSLTT 1020
LLTAQASLKT KPIHTIIRKR APPCNNDFCR LGCVCSSLAL EKRQPAHCRR PDCMFGCTCL 1080
KRKVVLVKGG SKTKHFHKKA ANRDPLFYDT LGEEGREGGG VREDEEQLKE KKKRKKLEYT 1140
VCEAEPEQPV RHYPLWVKVE GEVDPEPVYI PTPSVIEPIK PLVLPQPDLS STTKGKLTPG 1200
IKPARTYTPK PNPVIREEDK DPVYLYFESM MTCARVRVYE RKKEEQRQLS PPLSPSSSFQ 1260
QQSSCYSSPE NRVTKELDSE QTLKQLICDL EDDSDKSQEK SWKSSCNEGE SSSTSYVHQR 1320
SPGGPTKLIE IISDCNWEED RNKILSILSQ HINSNMPQSL KVGSFIIELA SQRKCRGEKT 1380
PPVYSSRVKI SMPSSQDQDD MAEKSGSETP DGPLSPGKMD DISPVQTDAL DSVRERLHGG 1440
KGLPFYAGLS PSGKLVAYKR KPSSTTSGLI QVASNAKVAA SRKPRTLLPS TSNSKMASSG 1500
PATNRSGKNL KAFVPAKRPI AARPSPGGVF TQFVMSKVGA LQQKIPGVRT PQPLTGPQKF 1560
SIRPSPVMVV TPVVSSEQVQ VCSTVAAAVT TSPQVFLENV TAVPSLTANS DMGAKEATYS 1620
SSASTAGVVE ISETNNTTLV TSTQSTATVN LTKTTGITTS PVASVSFAKP LVASPTITLP 1680
VASTASTSIV MVTTAASSSV VTTPTSSLSS VPIILSGING SPPVSQRPEN APQIPVTTPQ 1740
ISSNNVKRTG PRLLLIPVQQ GSPTLRPIQN PQLQGQRMVL QPVRGPSGMN LFRHPNGQIV 1800
QLLPLHQIRG SNAQPSLQPV VFRNPGSMVG IRLPAPCKSS ETPSSSASSS AFSVMSPVIQ 1860
AVGSSPTVNV ISQAPSLLSS GSSFVSQAGT LTLRISPPET QNLASKTGSE SKITPSTGGQ 1920
PVGTASLIPL QSGSFALLQL PGQKPIPSSV LQHVASLQIK KESQSTDQKD ETNSIKREEE 1980
TKKALPSKDK ALDSEANIMK QNSGIIASEN TSNNSLDDGG DLLDEETLRE DARPYEYSYS 2040
TGSHTDEDKD GDEDSGNKNQ NSPKEKQTVP EVRAGSKNID IMALQSIRSI RPQKCVKVKV 2100
EPQEGSDNPE NPDDFLVLSK DSKFELSGNQ VKEQQSNSQA EAKKDCEDSL GKDSLRERWR 2160
KHLKGPLTQK YIGISQNFNK EANVQFFTEM KPCQENSEQD ISELLGKSGT IESGGVLKTE 2220
DGSWSGISSS AAFSIIPRRA TKGRRGSRHF QGHLLLPREQ MKPKQQTKDG RSSAADFTVL 2280
DLEDEDEEDE KTDDSLDEIV DVVSGYQSEE VDVEKNNYVD YLEDDEQVDV ETIEELSEEI 2340
NFPYKKTTAT HTQSFKQQCH SHISADEKAS EKSRKVSLIS SKLKDDCWGD KPHKETEAFA 2400
YYRRTHTANE RRRRGEMRDL FEKLKITLGL LHSSKVSKSL ILNRAFSEIQ GLTDQADKLI 2460
GQKNLLSRKR SILIRKVSSL SGKTEEVVLK KLEYIYAKQQ ALEAQKRKKK LGSDEFCVSP 2520
RIGTQLEGSS ASSVDLGQML MNNRRGKPLI LSRKRDQATE NASPSDTPHS SANLVMTPQG 2580
QLLTLKGPLF SGPVVAVSPA LLEGGLKPQV ASSTMSQSEN DDLFMMPRIV NVTSLAAEED 2640
LGGMSGNKYR HEVPDGKPLD HLRDIAGSEA SSLKDTERIS SRGNHRDSRK ALGPTQVLLA 2700
NKDSGFPHVA DVSTMQAAQE FIPKNMSGDV RGHRYKWKEC ELRGERLKSK ESQFHKLKMK 2760
DLKDSSIEME LRKVASAIEE AALHPSELLT NMEDEDDTDE TLTSLLNEIA FLNQQLNDDS 2820
GLAELSGSMD TEFSGDAQRA FISKLAPGNR SAFQVGHLGA GVKELPDVQE ESESISPLLL 2880
HLEDDDFSEN EKQLGDTASE PDVLKIVIDP EIKDSLVSHR KSSDGGQSTS GLPAEPESVS 2940
SPPILHMKTG PENSNTDTLW RPMPKLAPLG LKVANPPSDA DGQSLKVMPA LAPIAAKVGS 3000
IGHKMNLAGI DQEGRGSKVM PTLAPVVPKL GNSGAPSSSS GK 3042 
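The sequence above is laid out in groups of 10 residues, with the running residue count at the end of each line. A short sketch of how one such line could be parsed and checked against the modification entry, assuming that layout holds for every line; the helper name `parse_numbered_block` is hypothetical, and only the single line ending at residue 720 is used here to keep the example self-contained.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join the residue groups, dropping spaces and the trailing counts."""
    return "".join(re.findall(r"[A-Z]+", text))


# The line of the record that ends at residue 720.
line = "LFVSFESKEA LDIHAVDGTT EEPSSLQTTT TNDSGCRTRI SQLEKELIED LKSLRHKQVI 720"
line_end = 720

segment = parse_numbered_block(line)
first = line_end - len(segment) + 1        # 1-based position of the first residue

# Residue at the reported modification site.
site = segment[705 - first]

# 1-based start of the reported peptide within the protein.
peptide_start = first + segment.find("TRISQLEKELIEDLK")

print(first, site, peptide_start)  # 661 K 698
```

Applying the same function to the whole block and comparing `len()` of the result with the stated protein length (3042 AA) is a simple consistency check on a copied record.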
Gene Ontology
 GO:0005667; C:transcription factor complex; IDA:MGI.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; TAS:MGI.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR011598; bHLH_dom.
 IPR008967; p53-like_TF_DNA-bd.
 IPR001699; TF_T-box.
 IPR018186; TF_T-box_CS. 
Pfam
 PF00010; HLH
 PF00907; T-box 
SMART
 SM00353; HLH
 SM00425; TBOX 
PROSITE
 PS50888; BHLH
 PS01264; TBOX_2
 PS50252; TBOX_3 
PRINTS
 PR00937; TBOX.