CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038092
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Gcn1l1 
Protein Synonyms/Alias
  
Gene Name
 Gcn1l1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
268LILPTIQKSLLRSPEubiquitination[1]
616KTVLNSHKVLPLEALubiquitination[1]
695PALLTRMKIDPDAFIubiquitination[1]
829EEVQLTSKQKEMLQAubiquitination[1]
965TIPSRVGKGEPDAAPubiquitination[1]
1121PSPDTDEKSGLSLLRubiquitination[1]
1390QQLLESDKYAERKGAubiquitination[1]
1553VLTDSHVKVQKAGQQubiquitination[1]
1745EIVATASKVDIAPHVubiquitination[1]
2572DIRLVAEKMIWWANKubiquitination[1]
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2671 AA 
Protein Sequence
MAADTQVSET LKRFAVKVTT ASVKERREIL SELGRCIAGK DLPEGAVKGL CKLFCLTLHR 60
YRDAASRRAL QAAIQQLAEA QPEATAKNLL HSLQSSGVGS KACVPSKSSG SAALLALTWT 120
CLLVRIVFPL KAKRQGDIWN KLVEVQCLLL LEVLGGSHKH AVDGAVKKLT KLWKENPGLV 180
EQYFSAILSL EPSQNYAAML GLLVQFCTNH KEMDAVSQHK STLLEFYVKN ILMSKAKPPK 240
YLLDNCAPLL RFMSHSEFKD LILPTIQKSL LRSPENVIET ISSLLASVTL DLSQYALDIV 300
KGLANQLKSN SPRLMDEAVL ALRNLARQCS DSSATEALTK HLFAILGGSE GKLTIIAQKM 360
SVLSGIGSLS HHVVSGPSGQ VLNGCVAELF IPFLQQEVHE GTLVHAVSIL ALWCNRFTTE 420
VPKKLTDWFK KVFSLKTSTS AVRHAYLQCM LASFRGDTLL QALDFLPLLM QTVEKAASQG 480
TQVPTVTEGV AAALLLSKLS VADAQAEAKL SGFWQLVVDE KRQTFTSEKF LLLASEDALC 540
TVLRLTERLF LDHPHRLTNS KVQQYYRVLV AVLLSRTWHV RRQAQQTVRK LLSSLGGVKL 600
ANGLLDELKT VLNSHKVLPL EALVTDAGEV TEMGKTYVPP RVLQEALCVI SGVPGLKGDI 660
PSTEQLAQEM LIISHHPSLV AVQSGLWPAL LTRMKIDPDA FITRHLDQII PRITTQSPLN 720
QSSMNAMGSL SVLSPDRVLP QLISTITASV QNPALCLVTR EEFSIMQTPA GELFDKSIIQ 780
SAQQDSIKKA NMKRENKAYS FKEQIIEMEL KEEIKKKKGI KEEVQLTSKQ KEMLQAQMDK 840
EAQIRRRLQE LDGELEAALG LLDAIMARNP CGLIQYIPVL VDAFLPLLKS PLAAPRVKGP 900
FLSLAACVMP PRLKTLGTLV SHVTLRLLKP ECALDKSWCQ EELPVAVRRA VSLLHTHTIP 960
SRVGKGEPDA APLSAPAFSL VFPMLKMVLT EMPYHSEEEE EQMAQILQIL TVHAQLRASP 1020
DTPPERVDEN GPELLPRVAM LRLLTWVIGT GSPRLQVLAS DTLTALCASS SGEDGCAFAE 1080
QEEVDVLLAA LQSPCASVRE TALRGLMELR LVLPSPDTDE KSGLSLLRRL WVIKFDKEDE 1140
IRKLAERLWS TMGLDLQSDL CSLLIDDVIY HEAAVRQAGA EALSQAVARY QRQAAEVMGR 1200
LMEIYQEKLY RPPPVLDALG RVISESPPDQ WEARCGLALA LNKLSQYLDS SQVKPLFQFF 1260
VPDALNDRNP DVRKCMLDAA LATLNAHGKE NVNSLLPVFE EFLKDAPNDA SYDAVRQSVV 1320
VLMGSLAKHL DKSDPKVKPI VAKLIAALST PSQQVQESVA SCLPPLVPAV KEDAGGMIQR 1380
LMQQLLESDK YAERKGAAYG LAGLVKGLGI LSLKQQEMMA ALTDAIQDKK NFRRREGALF 1440
AFEMLCTMLG KLFEPYVVHV LPHLLLCFGD GNQYVREAAD DCAKAVMSNL SAHGVKLVLP 1500
SLLAALEEES WRTKAGSVEL LGAMAYCAPK QLSSCLPNIV PKLTEVLTDS HVKVQKAGQQ 1560
ALRQIGSVIR NPEILAIAPV LLDALTDPSR KTQKCLQTLL DTKFVHFIDA PSLALIMPIV 1620
QRAFQDRSTD TRKMAAQIIG NMYSLTDQKD LAPYLPSVTP GLKASLLDPV PEVRTVSAKA 1680
LGAMVKGMGE SCFEDLLPWL METLTYEQSS VDRSGAAQGL AEVMAGLGVE KLEKLMPEIV 1740
ATASKVDIAP HVRDGYIMMF NYLPITFGDK FTPYVGPIIP CILKALADEN EFVRDTALRA 1800
GQRVISMYAE TAIALLLPQL EQGLFDDLWR IRFSSVQLLG DLLFHISGVT GKMTTETASE 1860
DDNFGTAQSN KAIITALGVD RRNRVLAGLY MGRSDTQLVV RQASLHVWKI VVSNTPRTLR 1920
EILPTLFGLL LGFLASTCAD KRTIAARTLG DLVRKLGEKI LPEIIPILEE GLRSQKSDER 1980
QGVCIGLSEI MKSTSRDAVL FFSESLVPTA RKALCDPLEE VREAAAKTFE QLHSTIGHQA 2040
LEDILPFLLK QLDDEEVSEF ALDGLKQVMA VKSRVVLPYL VPKLTTPPVN TRVLAFLSSV 2100
AGDALTRHLG VILPAVMLAL KEKLGTPDEQ LEMANCQAVI LSVEDDTGHR IIIEDLLEAT 2160
RSPEVGMRQA AAIILNMYCS RSKADYSSHL RSLVSGLIRL FNDSSPVVLE ESWDALNAIT 2220
KKLDAGNQLA LIEELHKEIR FIGNECKGEH VPGFCLPKRG VTSILPVLRE GVLTGSPEQK 2280
EEAAKGLGLV IRLTSADALR PSVVSITGPL IRILGDRFNW TVKAALLETL SLLLGKVGIA 2340
LKPFLPQLQT TFTKALQDSN RGVRLKAADA LGKLISIHVK VDPLFTELLN GIRAVEDPGI 2400
RDTMLQALRF VIQGAGSKVD AAIRKNLVSL LLSMLGHDED NTRISTAGCL GELCAFLTDE 2460
ELNTVLQQCL LADVSGIDWM VRHGRSLALS VAVNVAPSRL CAGRYSNEVQ DMILSNAVAD 2520
RIPIAMSGIR GMGFLMKYHI ETGSGQLPPR LSSLLIKCLQ NPCSDIRLVA EKMIWWANKE 2580
PRPPLEPQTI KPILKALLDN TKDKNTVVRA YSDQAIVNLL KMRRGEELLQ SLSKILDVAS 2640
LEALNECSRR SLRKLACQAD SVEQVDDTIL T 2671 
Gene Ontology
 GO:0005737; C:cytoplasm; IEA:Compara. 
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR000225; Armadillo.
 IPR022716; DUF3554.
 IPR026827; ECM29/GCN1.
 IPR021133; HEAT_type_2. 
Pfam
 PF12074; DUF3554 
SMART
 SM00185; ARM 
PROSITE
 PS50077; HEAT_REPEAT 
PRINTS