CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-044079
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 PERQ amino acid-rich with GYF domain-containing protein 2 
Protein Synonyms/Alias
  
Gene Name
 GIGYF2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
537DDERLASKLQEHRAKubiquitination[1]
607QPLGDIMKMWGRVPFubiquitination[1]
689LQQFQTLKMRISDQNubiquitination[1]
995RQQRELMKALQQQQQubiquitination[1, 2, 3]
1007QQQQQQQKLSGWGNVubiquitination[1]
1016SGWGNVSKPSGTTKSubiquitination[1, 3]
1037EEARQMQKQQQQQQQubiquitination[1]
1106GFWDDAVKEVGPRNSubiquitination[1]
1125KNNASLSKSVGVSNRubiquitination[1]
1142KKVEEEEKLLKLFQGubiquitination[1]
1145EEEEKLLKLFQGVNKubiquitination[1, 2]
1207LGDTSEAKEFAKQFLubiquitination[1, 4]
1211SEAKEFAKQFLERRAubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [3] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [4] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1321 AA 
Protein Sequence
MAAETQTLNF GPEWLRALSS GGSITSPPLS PALPKYKLAD YRYGREEMLA LFLKDNKIPS 60
DLLDKEFLPI LQEEPLPPLA LVPFTEEEQR NFSMSVNSAA VLRLTGRGGG GTVVGAPRGR 120
SSSRGRGRGR GECGFYQRSF DEVEGVFGRG GGREMHRSQS WEERGDRRFE KPGRKDVGKK 180
NGYYCMYSPV LLLGQPLCQG RPNFEEGGPT SVGRKHEFIR SESENWRIFR EEQNGEDEDG 240
GWRLAGSRRD GERWRPHSPD GPRSAGWREH MERRRRFEFD FRDRDDERGY RRVRSGSGSI 300
DDDRDSLPEW CLEDAEEEMG TFDSSGAFLS LKKVQKEPIP EEQEMDFRPV DEGEECSDSE 360
GSHNEEAKEP DKTNKKEGEK TDRVGVEASE ETPQTSSSSA RPGTPSDHQS QEASQFERKD 420
EPKTEQTEKA EEETRMENSL PAKVPSRGDE MVADVQQPLS QIPSDTASPL LILPPPVPNP 480
SPTLRPVETP VVGAPGMGSV STEPDDEEGL KHLEQQAEKM VAYLQDSALD DERLASKLQE 540
HRAKGVSIPL MHEAMQKWYY KDPQGEIQGP FNNQEMAEWF QAGYFTMSLL VKRACDESFQ 600
PLGDIMKMWG RVPFSPGPAP PPHMGELDQE RLTRQQELTA LYQMQHLQYQ QFLIQQQYAQ 660
VLAQQQKAAL SSQQQQQLAL LLQQFQTLKM RISDQNIIPS VTRSVSVPDT GSIWELQPTA 720
SQPTVWEGGS VWDLPLDTTT PGPALEQLQQ LEKAKAAKLE QERREAEMRA KREEEERKRQ 780
EELRRQQEEI LRRQQEEERK RREEEELARR KQEEALRRQR EQEIALRRQR EEEERQQQEE 840
ALRRLEERRR EEEERRKQEE LLRKQEEEAA KWAREEEEAQ RRLEENRLRM EEEAARLRHE 900
EEERKRKELE VQRQKELMRQ RQQQQEALRR LQQQQQQQQL AQMKLPSSST WGQQSNTTAC 960
QSQATLSLAE IQKLEEERER QLREEQRRQQ RELMKALQQQ QQQQQQKLSG WGNVSKPSGT 1020
TKSLLEIQQE EARQMQKQQQ QQQQHQQPNR ARNNTHSNLH TSIGNSVWGS INTGPPNQWA 1080
SDLVSSIWSN ADTKNSNMGF WDDAVKEVGP RNSTNKNKNN ASLSKSVGVS NRQNKKVEEE 1140
EKLLKLFQGV NKAQDGFTQW CEQMLHALNT ANNLDVPTFV SFLKEVESPY EVHDYIRAYL 1200
GDTSEAKEFA KQFLERRAKQ KANQQRQQQQ LPQQQQQQPP QQPPQQPQQQ DSVWGMNHST 1260
LHSVFQTNQS NNQQSNFEAV QSGKKKKKQK MVRADPSLLG FSVNASSERL NMGEIETLDD 1320
Y 1321 
Gene Ontology
  
Interpro
 IPR003169; GYF. 
Pfam
 PF02213; GYF 
SMART
 SM00444; GYF 
PROSITE
 PS50829; GYF 
PRINTS