CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-032450
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 MGA protein 
Protein Synonyms/Alias
  
Gene Name
 MGA 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
708ARISQLEKELIEDLKubiquitination[1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983
Functional Description
  
Sequence Annotation
  
Keyword
 DNA-binding; Nucleus; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2856 AA 
Protein Sequence
MEEKQQIILA NQDGGTVAGA APTFFVILKQ PGNGKTDQGI LVTNQDACAL ASSVSSPVKS 60
KGKICLPADC TVGGITVTLD NNSMWNEFYH RSTEMILTKQ GRRMFPYCRY WITGLDSNLK 120
YILVMDISPV DNHRYKWNGR WWEPSGKAEP HVLGRVFIHP ESPSTGHYWM HQPVSFYKLK 180
LTNNTLDQEG HIILHSMHRY LPRLHLVPAE KAVEVIQLNG PGVHTFTFPQ TEFFAVTAYQ 240
NIQITQLKID YNPFAKGFRD DGLNNKPQRD GKQKNSSDQE GNNISSSSGH RVRLTEGQGS 300
EIQPGDLDPL SRGHETSGKG LEKTSLNIKR DFLGFMDTDS ALSEVPQLKQ EISECLIASS 360
FEDDSRVASP LDQNGSFNVV IKEEPLDDYD YELGECPEGV TVKQEETDEE TDVYSNSDDD 420
PILEKQLKRH NKVDNPEADH LSSKWLPSSP SGVAKAKMFK LDTGKMPVVY LEPCAVTRST 480
VKISELPDNM LSTSRKDKSS MLAELEYLPT YIENSNETAF CLGKESENGL RKHSPDLRVV 540
QKYPLLKEPQ WKYPDISDSI STERILDDSK DSVGDSLSGK EDLGRKRTTM LKIATAAKVV 600
NANQNASPNV PGKRGRPRKL KLCKAGRPPK NTGKSLISTK NTPVSPGSTF PDVKPDLEDV 660
DGVLFVSFES KEALDIHAVD GTTEESSSLQ ASTTNDSGYR ARISQLEKEL IEDLKSLRHK 720
QVIHPGLQEV GLKLNSVDPT MSIDLKYLGV QLPLAPATSF PFWNLTGTNP ASPDAGFPFV 780
SRTGKTNDFT KIKGWRGKFH SASASRNEGG NSESSLKNRS AFCSDKLDEY LENEGKLMET 840
SMGFSSNAPT SPVVYQLPTK STSYVRTLDS VLKKQSTISP STSYSLKPHS VPPVSRKAKS 900
QNRQATFSGR TKSSYKSILP YPVSPKQKYS HVILGDKVTK NSSGIISENQ ANNFVVPTLD 960
ENIFPKQISL RQAQQQQQQQ QGSRPPGLSK SQVKLMDLED CALWEGKPRT YITEERADVS 1020
LTTLLTAQAS LKTKPIHTII RKRAPPCNND FCRLGCVCSS LALEKRQPAH CRRPDCMFGC 1080
TCLKRKVVLV KGGSKTKHFQ RKAAHRDPVF YDTLGEEARE EEEGIREEEE QLKEKKKRKK 1140
LEYTICETEP EQPVRHYPLW VKVEGEVDPE PVYIPTPSVI EPMKPLLLPQ PEVLSPTVKG 1200
KLLTGIKSPR SYTPKPNPVI REEDKDPVYL YFESMMTCAR VRVYERKKED QRQPSSSSSP 1260
SPSFQQQTSC HSSPENHNNA KEPDSEQQPL KQLTCDLEDD SDKLQEKSWK SSCNEGESSS 1320
TSYMHQRSPG GPTKLIEIIS DCNWEEDRNK ILSILSQHIN SNMPQSLKVG SFIIELASQR 1380
KSPGEKNPPV YSSRVKISMP SCQDQDDMAE KSGSETPDGP LSPGKMEDIS PVQTDALDSV 1440
RERLHGGKGL PFYAGLSPAG KLVAYKRKPS SSTSGLIQVA SNAKVAASRK PRTLLPSTSN 1500
SKMASSSGTA TNRPGKNLKA FVAAKRPIEN AAQIPVATPQ VSPNTVKRAG PRLLLIPVQQ 1560
GSPTLRPVSN TQLQGHRMVL QPVRSPSGMN LFRHPNGQIV QLLPLHQLRG SNTQPNLQPV 1620
MFRNPGSVMG IRLPAPSKPS ETPPSSTSSS AFSVMNPVIQ AVGSSSAVNV ITQAPSLLSS 1680
GASFVSQAGT LTLRISPPEP QSFASKTGSE TKITYSSGGQ PVGTASLIPL QSGSFALLQL 1740
PGQKPVPSSI LQHVASLQMK RESQNPDQKD ETNSIKREQE TKKVLQSEGE AVDPEANVIK 1800
QNSGAATSEE TLNDSLEDRG DHLDEECLPE EGCATVKPSE HSCITGSHTD QDYKDVNEEY 1860
GARNRKSSKE KVAVLEVRTI SEKASNKTVQ NLSKVQHQKL GDVKVEQQKG FDNPEENSSE 1920
FPVTFKEESK FELSGSKVME QQSNLQPEAK EKECGDSLEK DRERWRKHLK GPLTRKCVGA 1980
SQECKKEADE QLIKETKTCQ ENSDVFQQEQ GISDLLGKSG ITEDARVLKT ECDSWSRISN 2040
PSAFSIVPRR AAKSSRGNGH FQGHLLLPGE QIQPKQEKKG GRSSADFTVL DLEEDDEDDN 2100
EKTDDSIDEI VDVVSDYQSE EVDDVEKNNC VEYIEDDEEH VDIETVEELS EEINVAHLKT 2160
TAAHTQSFKQ PSCTHISADE KAAERSRKAP PIPLKLKPDY WSDKLQKEAE AFAYYRRTHT 2220
ANERRRRGEM RDLFEKLKIT LGLLHSSKVS KSLILTRAFS EIQGLTDQAD KLIGQKNLLT 2280
RKRNILIRKV SSLSGKTEEV VLKKLEYIYA KQQALEAQKR KKKMGSDEFD ISPRISKQQE 2340
GSSASSVDLG QMFINNRRGK PLILSRKKDQ ATENTSPLNT PHTSANLVMT PQGQLLTLKG 2400
PLFSGPVVAV SPDLLESDLK PQVAGSAVAL PENDDLFMMP RIVNVTSLAT EGGLVDMGGS 2460
KYPHEVPDSK PSDHLKDTVR NEDNSLEDKG RISSRGNRDG RVTLGPTQVF LANKDSGYPQ 2520
IVDVSNMQKA QEFLLKKISG DMRGIQYKWK ESESRGERVK SKDSSFHKLK MKDLKDSSIE 2580
MELRKVTSAI EEAALDSSEL LTNMEDEDDT DETLTSLLNE IAFLNQQLND DSVGLAELPS 2640
SMDTEFPGDA RRAFISKVPP GSRATFQVEH LGTGLKELPD VQGESDSISP LLLHLEDDDF 2700
SENEKQLAEP ASEPDVLKIV IDSEIKDSLL SNKKAIDGGK NTSGLPAEPE SVSSPPTLHM 2760
KTGLENSNST DTLWRPMPKL APLGLKVANP SSDADGQSLK VMPCLAPIAA KVGSVGHKMN 2820
LTGNDQEGRE SKVMPTLAPV VAKLGNSGAS PSSAGK 2856 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-KW.
 GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR011598; bHLH_dom.
 IPR008967; p53-like_TF_DNA-bd.
 IPR001699; TF_T-box.
 IPR018186; TF_T-box_CS. 
Pfam
 PF00010; HLH
 PF00907; T-box 
SMART
 SM00353; HLH
 SM00425; TBOX 
PROSITE
 PS50888; BHLH
 PS01264; TBOX_2
 PS50252; TBOX_3 
PRINTS
 PR00937; TBOX.