CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-044085
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Serine/threonine-protein kinase SMG1 
Protein Synonyms/Alias
  
Gene Name
 SMG1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
529KLFIPSSKLLFLRYHubiquitination[1]
598EIKHEAFKNHVFNVDubiquitination[1]
1411NQLLEKIKEQTVPIRubiquitination[2]
1468DLVQHFKKLSTQGQVubiquitination[2]
1632WAYRWGRKVVDNASQubiquitination[2, 3]
1650VRLLPREKSEVQNLLubiquitination[2]
1798RLLRLLVKHAGELRQubiquitination[2]
1989QQLEDEVKRVQNNNTubiquitination[4]
2232EAALQAQKAQDSYQTubiquitination[3, 5]
2256PSELYYSKIGPALKTubiquitination[2, 3]
2370KSLRVPEKVPFRMTQubiquitination[2, 3]
2471SSRVAEIKVNWFKNRubiquitination[2]
2729IRQVERLKQEAVTVPubiquitination[2]
3030LKRLQTIKEFFRLCGubiquitination[2]
Reference
 [1] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [3] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
 [4] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [5] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661
Functional Description
  
Sequence Annotation
  
Keyword
 ATP-binding; Complete proteome; Kinase; Nucleotide-binding; Reference proteome; Transferase. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3662 AA 
Protein Sequence
MSRRAPGSRL SSGGGGGGTK YPRSWNDWQP RTDSASADPD NLKYSSSRDR GGSSSYGLQP 60
SNSAVVSRQR HDDTRVHADI QNDEKGGYSV NGGSGENTYG RKSLGQELRV NNVTSPEFTS 120
VQHGSRALAT KDMRKSQERS MSYSDESRLS NLLRRITRED DRDRRLATVK QLKEFIQQPE 180
NKLVLVKQLD NILAAVHDVL NESSKLLQEL RQEGACCLGL LCASLSYEAE KIFKWIFSKF 240
SSSAKDEVKL LYLCATYKAL ETVGEKKAFS SVMQLVMTSL QSILENVDTP ELLCKCVKCI 300
LLVARCYPHI FSTNFRDTVD ILVGWHIDHT QKPSLTQQVS GWLQSLEPFW VADLAFSTTL 360
LGQFLEDMEA YAEDLSHVAS GESVDEDVPP PSVSLPKLAA LLRVFSTVVR SIGERFSPIR 420
GPPITEAYVT DVLYRVMRCV TAANQVFFSE AVLTAANECV GVLLGSLDPS MTIHCDMVIT 480
YGLDQLENCQ TCGTDYIISV LNLLTLIVEQ INTKLPSSFV EKLFIPSSKL LFLRYHKEKE 540
VVAVAHAVYQ AVLSLKNIPV LETAYKLILG EMTCALNNLL HSLQLPEACS EIKHEAFKNH 600
VFNVDNAKFV VIFDLSALTT IGNAKNSLIG MWALSPTVFA LLSKNLMIVH SDLAVHFPAI 660
QYAVLYTLYS HCTRHDHFIS SSLSSSSPSL FDGAVISTVT TATKKHFSII LNLLGILLKK 720
DNLNQDTRKL LMTWALEAAV LMKKSETYAP LFSLPSFHKF CKGLLANTLV EDVNICLQAC 780
SSLHALSSSL PDDLLQRCVD VCRVQLVHSG TRIRQAFGKL LKSIPLDVVL SNNNHTEIQE 840
ISLALRSHMS KAPSNTFHPQ DFSDVISFIL YGNSHRTGKD NWLERLFYSC QRLDKRDQST 900
IPRNLLKTDA VLWQWAIWEA AQFTVLSKLR TPLGRAQDTF QTIEGIIRSL AAHTLNPDQD 960
VSQWTTADND EGHGNNQLRL VLLLQYLENL EKLMYNAYEG CANALTSPPK VIRTFFYTNR 1020
QTCQDWLTRI RLSIMRVGLL AGQPAVTVRH GFDLLTEMKT TSLSQGNELE VTIMMVVEAL 1080
CELHCPEAIQ GIAVWSSSIV GKNLLWINSV AQQAEGRFEK ASVEYQEHLC AMTGVDCCIS 1140
SFDKSVLTLA NAGRNSASPK HSLNGESRKT VLSKPTDSSP EVINYLGNKA CECYISIADW 1200
AAVQEWQNAI HDLKKSTSST SLNLKADFNY IKSLSSFESG KFVECTEQLE LLPGENINLL 1260
AGGSKEKIDM KKLLPNMLSP DPRELQKSIE VQLLRSSVCL ATALNPIEQD QKWQSITENV 1320
VKYLKQTSRI AIGPLRLSTL TVSQSLPVLS TLQLYCSSAL ENTVSNRLST EDCLIPLFSE 1380
ALRSCKQHDV RPWMQALRYT MYQNQLLEKI KEQTVPIRSH LMELGLTAAK FARKRGNVSL 1440
ATRLLAQCSE VQLGKTTTAQ DLVQHFKKLS TQGQVDEKWG PELDIEKTKL LYTAGQSTHA 1500
MEMLSSCAIS FCKSVKAEYA VAKSILTLAK WIQAEWKEIS GQLKQVYRAQ HQQNFTGLST 1560
LSKNILTLIE LPSVNTMEEE YPRIESESTV HIGVGEPDFI LGQLYHLSSV QAPEVAKSWA 1620
ALASWAYRWG RKVVDNASQG EGVRLLPREK SEVQNLLPDT ITEEEKERIY GILGQAVCRP 1680
AGIQDEDITL QITESEDNEE DDMVDVIWRQ LISSCPWLSE LDESATEGVI KVWRKVVDRI 1740
FSLYKLSCSA YFTFLKLNAG QIPLDEDDPR LHLSHRVEQS TDDMIVMATL RLLRLLVKHA 1800
GELRQYLEHG LETTPTAPWR GIIPQLFSRL NHPEVYVRQS ICNLLCRVAQ DSPHLILYPA 1860
IVGTISLSSE SQASGNKFST AIPTLLGNIQ GEELLVSECE GGSPPASQDS NKDEPKSGLN 1920
EDQAMMQDCY SKIVDKLSSA NPTMVLQVQM LVAELRRVTV LWDELWLGVL LQQHMYVLRR 1980
IQQLEDEVKR VQNNNTLRKE EKIAIMREKH TALMKPIVFA LEHVRSITAA PAETPHEKWF 2040
QDNYGDAIEN ALEKLKTPLN PAKPGSSWIP FKEIMLSLQQ RAQKRASYIL RLEEISPWLA 2100
AMTNTEIALP GEVSARDTVT IHSVGGTITI LPTKTKPKKL LFLGSDGKSY PYLFKGLEDL 2160
HLDERIMQFL SIVNTMFATI NRQETPRFHA RHYSVTPLGT RSGLIQWVDG ATPLFGLYKR 2220
WQQREAALQA QKAQDSYQTP QNPGIVPRPS ELYYSKIGPA LKTVGLSLDV SRRDWPLHVM 2280
KAVLEELMEA TPPNLLAKEL WSSCTTPDEW WRVTQSYARS TAVMSMVGYI IGLGDRHLDN 2340
VLIDMTTGEV VHIDYNVCFE KGKSLRVPEK VPFRMTQNIE TALGVTGVEG VFRLSCEQVL 2400
HIMRRGRETL LTLLEAFVYD PLVDWTAGGE AGFAGAVYGG GGQQAESKQS KREMEREITR 2460
SLFSSRVAEI KVNWFKNRDE MLVVLPKLDG SLDEYLSLQE QLTDVEKLQG KLLEEIEFLE 2520
GAEGVDHPSH TLQHRYSEHT QLQTQQRAVQ EAIQVKLNEF EQWITHYQAA FNNLEATQLA 2580
SLLQEISTQM DLGPPSYVPA TAFLQNAGQA HLISQCEQLE GEVGALLQQR RSVLRGCLEQ 2640
LHHYATVALQ YPKAIFQKHR IEQWKTWMEE LICNTTVERC QELYRKYEMQ YAPQPPPTVC 2700
QFITATEMTL QRYAADINSR LIRQVERLKQ EAVTVPVCED QLKEIERCIK VFLHENGEEG 2760
SLSLASVIIS ALCTLTRRNL MMEGAASSAG EQLVDLTSRD GAWFLEELCS MSGNVTCLVQ 2820
LLKQCHLVPQ DLDIPNPMEA SETVHLANGV YTSLQELNSN FRQIIFPEAL RCLMKGEYTL 2880
ESMLHELDGL IEQTTDGVPL QTLVESLQAY LRNAAMGLEE ETHAHYIDVA RLLHAQYGEL 2940
IQPRNGSVDE TPKMSAGQML LVAFDGMFAQ VETAFSLLVE KLNKMEIPIA WRKIDIIREA 3000
RSTQVNFFDD DNHRQVLEEI FFLKRLQTIK EFFRLCGTFS KTLSGSSSLE DQNTVNGPVQ 3060
IVNVKTLFRN SCFSEDQMAK PIKAFTADFV RQLLIGLPNQ ALGLTLCSFI SALGVDIIAQ 3120
VEAKDFGAES KVSVDDLCKK AVEHNIQIGK FSQLVMNRAT VLASSYDTAW KKHDLVRRLE 3180
TSISSCKTSL QRVQLHIAMF QQWQHEDLLI NRPQAMSVTP PPRSAILTSM KKKLHTLSQI 3240
ETSIATVQEK LAALESSIEQ RLKWAGGANP ALAPVLQDFE ATIAERRNLV LKESQRASQV 3300
TFLCSNIIHF ESLRTRTAEA LNLDAALFEL IKRCQQMCSF ASQFNSSVSE LELRLLQRVD 3360
TGLEHPIGSS EWLLSAHKQL TQDMSTQRAI QTEKEQQIET VCETIQNLVD NIKTVLTGHN 3420
RQLGDVKHLL KAMAKDEEAA LADGEDVPYE NSVRQFLGEY KSWQDNIQTV LFTLVQAMGQ 3480
VRSQEHVEML QEITPTLKEL KTQSQSIYNN LVSFASPLVT DATNECSSPT SSATYQPSFA 3540
AAVRSNTGQK TQPDVMSQNA RKLIQKNLAT SADTPPSTVP GTGKSVACSP KKAVRDPKTG 3600
KAVQERNSYA VSVWKRVKAK LEGRDVDPNR RMSVAEQVDY VIKEATNLDN LAQLYEGWTA 3660
WV 3662 
Gene Ontology
 GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
 GO:0016301; F:kinase activity; IEA:UniProtKB-KW.
 GO:0016773; F:phosphotransferase activity, alcohol group as acceptor; IEA:InterPro. 
Interpro
 IPR016024; ARM-type_fold.
 IPR003152; FATC.
 IPR011009; Kinase-like_dom.
 IPR000403; PI3/4_kinase_cat_dom.
 IPR018936; PI3/4_kinase_CS.
 IPR014009; PIK_FAT. 
Pfam
 PF02260; FATC
 PF00454; PI3_PI4_kinase 
SMART
 SM00146; PI3Kc 
PROSITE
 PS51189; FAT
 PS51190; FATC
 PS00916; PI3_4_KINASE_2
 PS50290; PI3_4_KINASE_3 
PRINTS