CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-012980
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 GON-4-like protein 
Protein Synonyms/Alias
 GON-4 homolog 
Gene Name
 GON4L 
Gene Synonyms/Alias
 GON4; KIAA1606 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
331KMTRSKLKEVVEKGVubiquitination[1]
534TADDEDWKMWLGGLMubiquitination[2]
655KELFEQLKMKKSSAKubiquitination[1]
662KMKKSSAKQLQEVEKubiquitination[1]
818PVCSLKAKNPQDKIVubiquitination[1]
828QDKIVFTKAEDNLLAubiquitination[1]
882RAPDNIIKFYKKTKQubiquitination[1]
888IKFYKKTKQLPVLGKubiquitination[1]
895KQLPVLGKCCEEIQPubiquitination[1]
921HRLPFWLKASLPSIQubiquitination[1]
1126PSKRRGVKASPCMKPmethylation[3]
2070GRRHVSGKPDTQERWacetylation[4]
2117APRGGLAKDSGTQAKacetylation[4]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [3] Mass spectrometry-based identification and characterisation of lysine and arginine methylation in the human proteome.
 Bremang M, Cuomo A, Agresta AM, Stugiewicz M, Spadotto V, Bonaldi T.
 Mol Biosyst. 2013 Jul 30;9(9):2231-47. [PMID: 23748837]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302
Functional Description
  
Sequence Annotation
 DOMAIN 1624 1696 PAH 1.
 DOMAIN 1706 1777 PAH 2.
 DOMAIN 2148 2201 Myb-like.
 MOD_RES 346 346 Phosphoserine.
 CROSSLNK 534 534 Glycyl lysine isopeptide (Lys-Gly)  
Keyword
 Alternative splicing; Complete proteome; Isopeptide bond; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Ubl conjugation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2241 AA 
Protein Sequence
MLPCKKRRTT VTESLQHKGN QEENNVDLES AVKPESDQVK DLSSVSLSWD PSHGRVAGFE 60
VQSLQDAGNQ LGMEDTSLSS GMLTQNTNVP ILEGVDVAIS QGITLPSLES FHPLNIHIGK 120
GKLHATGSKR GKKMTLRPGP VTQEDRCDHL TLKEPFSGEP SEEVKEEGGK PQMNSEGEIP 180
SLPSGSQSAK PVSQPRKSTQ PDVCASPQEK PLRTLFHQPE EEIEDGGLFI PMEEQDNEES 240
EKRRKKKKGT KRKRDGRGQE GTLAYDLKLD DMLDRTLEDG AKQHNLTAVN VRNILHEVIT 300
NEHVVAMMKA AISETEDMPM FEPKMTRSKL KEVVEKGVVI PTWNISPIKK ANEIKPPQFV 360
DIHLEEDDSS DEEYQPDDEE EDETAEESLL ESDVESTASS PRGAKKSRLR QSSEMTETDE 420
ESGILSEAEK VTTPAIRHIS AEVVPMGPPP PPKPKQTRDS TFMEKLHAVD EELASSPVCM 480
DSFQPMDDSL IAFRTRSKMP LKDVPLGQLE AELQAPDITP DMYDPNTADD EDWKMWLGGL 540
MNDDVGNEDE ADDDDDPEYN FLEDLDEPDT EDFRTDRAVR ITKKEVNELM EELFETFQDE 600
MGFSNMEDDG PEEEECVAEP RPNFNTPQAL RFEEPLANLL NEQHRTVKEL FEQLKMKKSS 660
AKQLQEVEKV KPQSEKVHQT LILDPAQRKR LQQQMQQHVQ LLTQIHLLAT CNPNLNPEAT 720
TTRIFLKELG TFAQSSIALH HQYNPKFQTL FQPCNLMGAM QLIEDFSTHV SIDCSPHKTV 780
KKTANEFPCL PKQVAWILAT SKVFMYPELL PVCSLKAKNP QDKIVFTKAE DNLLALGLKH 840
FEGTEFPNPL ISKYLLTCKT AHQLTVRIKN LNMNRAPDNI IKFYKKTKQL PVLGKCCEEI 900
QPHQWKPPIE REEHRLPFWL KASLPSIQEE LRHMADGARE VGNMTGTTEI NSDRSLEKDN 960
LELGSESRYP LLLPKGVVLK LKPVATRFPR KAWRQKRSSV LKPLLIQPSP SLQPSFNPGK 1020
TPARSTHSEA PPSKMVLRIP HPIQPATVLQ TVPGVPPLGV SGGESFESPA ALPAVPPEAR 1080
TSFPLSESQT LLSSAPVPKV MLPSLAPSKF RKPYVRRRPS KRRGVKASPC MKPAPVIHHP 1140
ASVIFTVPAT TVKIVSLGGG CNMIQPVNAA VAQSPQTIPI TTLLVNPTSF PCPLNQSLVA 1200
SSVSPLIVSG NSVNLPIPST PEDKAHVNVD IACAVADGEN AFQGLEPKLE PQELSPLSAT 1260
VFPKVEHSPG PPLADAECQE GLSENSACRW TVVKTEEGRQ ALEPLPQGIQ ESLNNPTPGD 1320
LEEIVKMEPE EAREEISGSP ERDICDDIKV EHAVELDTGA PSEELSSAGE VTKQTVLQKE 1380
EERSQPTKTP SSSQEPPDEG TSGTDVNKGS SKNALSSMDP EVRLSSPPGK PEDSSSVDGQ 1440
SVGTPVGPET GGEKNGPEEE EEEDFDDLTQ DEEDEMSSAS EESVLSVPEL QETMEKLTWL 1500
ASERRMSQEG ESEEENSQEE NSEPEEEEEE EAEGMESLQK EDEMTDEAVG DSAEKPPTFA 1560
SPETAPEVET SRTPPGESIK AAGKGRNNHR ARNKRGSRAR ASKDTSKLLL LYDEDILERD 1620
PLREQKDLAF AQAYLTRVRE ALQHIPGKYE DFLQVIYEFE SSTQRRTAVD LYKSLQILLQ 1680
DWPQLLKDFA AFLLPEQALA CGLFEEQQAF EKSRKFLRQL EICFAENPSH HQKIIKVLQG 1740
CADCLPQEIT ELKTQMWQLL KGHDHLQDEF SIFFDHLRPA ASRMGDFEEI NWTEEKEYEF 1800
DGFEEVALPD VEEEEEPPKI PTASKNKRKK EIGVQNHDKE TEWPDGAKDC ACSCHEGGPD 1860
SKLKKSKRRS CSHCSSKVCD SKSYKSKEPH ELVGSSPHRE ASPMPGAKEA GQGKDMMEEE 1920
APEERESTEA TQSRTVRTTR KGEMPVSAGL AVGSTLPSPR EVTVTERLLL DGPPPHSPET 1980
PQFPPTTGAV LYTVKRNQVG PEVRSCPKAS PRLQKEREGQ KAVSESEALM LVWDASETEK 2040
LPGTVEPPAS FLSPVSSKTR DAGRRHVSGK PDTQERWLPS SRARVKTRDR TCPVHESPSG 2100
IDTSETSPKA PRGGLAKDSG TQAKGPEGEQ QPKAAEATVC ANNSKVSSTG EKVVLWTREA 2160
DRVILTMCQE QGAQPQTFNI ISQQLGNKTP AEVSHRFREL MQLFHTACEA SSEDEDDATS 2220
TSNADQLSDH GDLLSEEELD E 2241 
Gene Ontology
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro. 
Interpro
 IPR009057; Homeodomain-like.
 IPR017877; Myb-like_dom.
 IPR003822; PAH.
 IPR001005; SANT/Myb. 
Pfam
 PF02671; PAH 
SMART
 SM00717; SANT 
PROSITE
 PS50090; MYB_LIKE
 PS51477; PAH 
PRINTS