CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-032450
UniProt Accession
B9EGR5_HUMAN
;
B9EGR5
Genbank Protein ID
BC136659
Genbank Nucleotide ID
AAI36660.1
Protein Name
MGA protein
Protein Synonyms/Alias
Gene Name
MGA
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Homo sapiens (Human)
NCBI Taxa ID
9606
Lysine Modification
Position
Peptide
Type
References
708
ARISQLE
K
ELIEDLK
ubiquitination
[1]
Reference
[1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
Mol Cell. 2011 Oct 21;44(2):325-40. [
PMID: 21906983
]
Functional Description
Sequence Annotation
Keyword
DNA-binding; Nucleus; Transcription; Transcription regulation.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2856 AA
Protein Sequence
MEEKQQIILA NQDGGTVAGA APTFFVILKQ PGNGKTDQGI LVTNQDACAL ASSVSSPVKS 60
KGKICLPADC TVGGITVTLD NNSMWNEFYH RSTEMILTKQ GRRMFPYCRY WITGLDSNLK 120
YILVMDISPV DNHRYKWNGR WWEPSGKAEP HVLGRVFIHP ESPSTGHYWM HQPVSFYKLK 180
LTNNTLDQEG HIILHSMHRY LPRLHLVPAE KAVEVIQLNG PGVHTFTFPQ TEFFAVTAYQ 240
NIQITQLKID YNPFAKGFRD DGLNNKPQRD GKQKNSSDQE GNNISSSSGH RVRLTEGQGS 300
EIQPGDLDPL SRGHETSGKG LEKTSLNIKR DFLGFMDTDS ALSEVPQLKQ EISECLIASS 360
FEDDSRVASP LDQNGSFNVV IKEEPLDDYD YELGECPEGV TVKQEETDEE TDVYSNSDDD 420
PILEKQLKRH NKVDNPEADH LSSKWLPSSP SGVAKAKMFK LDTGKMPVVY LEPCAVTRST 480
VKISELPDNM LSTSRKDKSS MLAELEYLPT YIENSNETAF CLGKESENGL RKHSPDLRVV 540
QKYPLLKEPQ WKYPDISDSI STERILDDSK DSVGDSLSGK EDLGRKRTTM LKIATAAKVV 600
NANQNASPNV PGKRGRPRKL KLCKAGRPPK NTGKSLISTK NTPVSPGSTF PDVKPDLEDV 660
DGVLFVSFES KEALDIHAVD GTTEESSSLQ ASTTNDSGYR ARISQLEKEL IEDLKSLRHK 720
QVIHPGLQEV GLKLNSVDPT MSIDLKYLGV QLPLAPATSF PFWNLTGTNP ASPDAGFPFV 780
SRTGKTNDFT KIKGWRGKFH SASASRNEGG NSESSLKNRS AFCSDKLDEY LENEGKLMET 840
SMGFSSNAPT SPVVYQLPTK STSYVRTLDS VLKKQSTISP STSYSLKPHS VPPVSRKAKS 900
QNRQATFSGR TKSSYKSILP YPVSPKQKYS HVILGDKVTK NSSGIISENQ ANNFVVPTLD 960
ENIFPKQISL RQAQQQQQQQ QGSRPPGLSK SQVKLMDLED CALWEGKPRT YITEERADVS 1020
LTTLLTAQAS LKTKPIHTII RKRAPPCNND FCRLGCVCSS LALEKRQPAH CRRPDCMFGC 1080
TCLKRKVVLV KGGSKTKHFQ RKAAHRDPVF YDTLGEEARE EEEGIREEEE QLKEKKKRKK 1140
LEYTICETEP EQPVRHYPLW VKVEGEVDPE PVYIPTPSVI EPMKPLLLPQ PEVLSPTVKG 1200
KLLTGIKSPR SYTPKPNPVI REEDKDPVYL YFESMMTCAR VRVYERKKED QRQPSSSSSP 1260
SPSFQQQTSC HSSPENHNNA KEPDSEQQPL KQLTCDLEDD SDKLQEKSWK SSCNEGESSS 1320
TSYMHQRSPG GPTKLIEIIS DCNWEEDRNK ILSILSQHIN SNMPQSLKVG SFIIELASQR 1380
KSPGEKNPPV YSSRVKISMP SCQDQDDMAE KSGSETPDGP LSPGKMEDIS PVQTDALDSV 1440
RERLHGGKGL PFYAGLSPAG KLVAYKRKPS SSTSGLIQVA SNAKVAASRK PRTLLPSTSN 1500
SKMASSSGTA TNRPGKNLKA FVAAKRPIEN AAQIPVATPQ VSPNTVKRAG PRLLLIPVQQ 1560
GSPTLRPVSN TQLQGHRMVL QPVRSPSGMN LFRHPNGQIV QLLPLHQLRG SNTQPNLQPV 1620
MFRNPGSVMG IRLPAPSKPS ETPPSSTSSS AFSVMNPVIQ AVGSSSAVNV ITQAPSLLSS 1680
GASFVSQAGT LTLRISPPEP QSFASKTGSE TKITYSSGGQ PVGTASLIPL QSGSFALLQL 1740
PGQKPVPSSI LQHVASLQMK RESQNPDQKD ETNSIKREQE TKKVLQSEGE AVDPEANVIK 1800
QNSGAATSEE TLNDSLEDRG DHLDEECLPE EGCATVKPSE HSCITGSHTD QDYKDVNEEY 1860
GARNRKSSKE KVAVLEVRTI SEKASNKTVQ NLSKVQHQKL GDVKVEQQKG FDNPEENSSE 1920
FPVTFKEESK FELSGSKVME QQSNLQPEAK EKECGDSLEK DRERWRKHLK GPLTRKCVGA 1980
SQECKKEADE QLIKETKTCQ ENSDVFQQEQ GISDLLGKSG ITEDARVLKT ECDSWSRISN 2040
PSAFSIVPRR AAKSSRGNGH FQGHLLLPGE QIQPKQEKKG GRSSADFTVL DLEEDDEDDN 2100
EKTDDSIDEI VDVVSDYQSE EVDDVEKNNC VEYIEDDEEH VDIETVEELS EEINVAHLKT 2160
TAAHTQSFKQ PSCTHISADE KAAERSRKAP PIPLKLKPDY WSDKLQKEAE AFAYYRRTHT 2220
ANERRRRGEM RDLFEKLKIT LGLLHSSKVS KSLILTRAFS EIQGLTDQAD KLIGQKNLLT 2280
RKRNILIRKV SSLSGKTEEV VLKKLEYIYA KQQALEAQKR KKKMGSDEFD ISPRISKQQE 2340
GSSASSVDLG QMFINNRRGK PLILSRKKDQ ATENTSPLNT PHTSANLVMT PQGQLLTLKG 2400
PLFSGPVVAV SPDLLESDLK PQVAGSAVAL PENDDLFMMP RIVNVTSLAT EGGLVDMGGS 2460
KYPHEVPDSK PSDHLKDTVR NEDNSLEDKG RISSRGNRDG RVTLGPTQVF LANKDSGYPQ 2520
IVDVSNMQKA QEFLLKKISG DMRGIQYKWK ESESRGERVK SKDSSFHKLK MKDLKDSSIE 2580
MELRKVTSAI EEAALDSSEL LTNMEDEDDT DETLTSLLNE IAFLNQQLND DSVGLAELPS 2640
SMDTEFPGDA RRAFISKVPP GSRATFQVEH LGTGLKELPD VQGESDSISP LLLHLEDDDF 2700
SENEKQLAEP ASEPDVLKIV IDSEIKDSLL SNKKAIDGGK NTSGLPAEPE SVSSPPTLHM 2760
KTGLENSNST DTLWRPMPKL APLGLKVANP SSDADGQSLK VMPCLAPIAA KVGSVGHKMN 2820
LTGNDQEGRE SKVMPTLAPV VAKLGNSGAS PSSAGK 2856
Gene Ontology
GO:0005634
; C:nucleus; IEA:UniProtKB-KW.
GO:0003677
; F:DNA binding; IEA:UniProtKB-KW.
GO:0003700
; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
GO:0006351
; P:transcription, DNA-dependent; IEA:UniProtKB-KW.
Interpro
IPR011598
; bHLH_dom.
IPR008967
; p53-like_TF_DNA-bd.
IPR001699
; TF_T-box.
IPR018186
; TF_T-box_CS.
Pfam
PF00010
; HLH
PF00907
; T-box
SMART
SM00353
; HLH
SM00425
; TBOX
PROSITE
PS50888
; BHLH
PS01264
; TBOX_2
PS50252
; TBOX_3
PRINTS
PR00937
; TBOX.