CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-022531
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 LisH domain and HEAT repeat-containing protein KIAA1468 
Protein Synonyms/Alias
  
Gene Name
 KIAA1468 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
227       ALRANLTKAAEHEVP  ubiquitination  [1]
473       PPSSLSSKKTVHFDK  ubiquitination  [2]
517       SRIADSEKSVMLMLG  ubiquitination  [1]
871       RLCRTFGKIFTNTKV  ubiquitination  [1]
877       GKIFTNTKVKPQFQE  ubiquitination  [3]
1002      MSEALVDKRVAPALV  ubiquitination  [1]
1197      TKTKFLNKMGQLTTS  ubiquitination  [1]
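Each peptide above appears to be a 15-residue window centred on the modified lysine, so the listed position would correspond to the eighth residue of the window. The following is a minimal Python sketch of a consistency check under that assumption; the names `SITES`, `central_residue` and `window_start` are illustrative and are not part of CPLM.

```python
# Rows copied from the Lysine Modification table: (position, peptide).
SITES = [
    (227, "ALRANLTKAAEHEVP"),
    (473, "PPSSLSSKKTVHFDK"),
    (517, "SRIADSEKSVMLMLG"),
    (871, "RLCRTFGKIFTNTKV"),
    (877, "GKIFTNTKVKPQFQE"),
    (1002, "MSEALVDKRVAPALV"),
    (1197, "TKTKFLNKMGQLTTS"),
]


def central_residue(peptide):
    """Return the middle residue of an odd-length peptide window."""
    return peptide[len(peptide) // 2]


def window_start(position, peptide):
    """1-based sequence position of the first residue in the window,
    assuming the modified site sits at the centre (an assumption)."""
    return position - len(peptide) // 2


for position, peptide in SITES:
    # Every ubiquitination site in this record should be a lysine (K).
    assert central_residue(peptide) == "K", (position, peptide)
```

For the first row, `window_start(227, "ALRANLTKAAEHEVP")` gives 220, which matches where the window ALRANLTKAAEHEVP begins in the Protein Sequence field of this entry.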
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [2] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
Functional Description
  
Sequence Annotation
 DOMAIN 255 287 LisH.
 REPEAT 601 639 HEAT 1.
 REPEAT 640 679 HEAT 2.
 REPEAT 1004 1042 HEAT 3.
 MOD_RES 2 2 N-acetylalanine.
 MOD_RES 20 20 Phosphoserine.
 MOD_RES 22 22 Phosphoserine.
 MOD_RES 32 32 Phosphothreonine (By similarity).
 MOD_RES 54 54 Phosphoserine.
 MOD_RES 56 56 Phosphoserine.
 MOD_RES 180 180 Phosphoserine.
 MOD_RES 182 182 Phosphoserine.
 MOD_RES 183 183 Phosphothreonine.
 MOD_RES 186 186 Phosphoserine.  
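The annotation lines above follow a regular layout: a feature key, a start position, an end position, and a free-text description ending in a period. The sketch below shows one way to split such a line into fields in Python; `parse_feature` is a hypothetical helper, and the layout is inferred from this entry alone, not from a published CPLM format specification.

```python
def parse_feature(line):
    """Split one annotation line into (key, start, end, description).

    Assumes whitespace-separated fields with the description last;
    a trailing period on the description is dropped.
    """
    key, start, end, description = line.strip().split(None, 3)
    return key, int(start), int(end), description.rstrip(".")


def feature_length(start, end):
    """Number of residues covered by an inclusive start..end range."""
    return end - start + 1


key, start, end, description = parse_feature(" DOMAIN 255 287 LisH.")
print(key, start, end, description, feature_length(start, end))
```

With the LisH line as input, the parsed range 255 to 287 covers 33 residues.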
Keyword
 Acetylation; Alternative splicing; Coiled coil; Complete proteome; Phosphoprotein; Polymorphism; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1216 AA 
Protein Sequence
MAAMAPGGSG SGGGVNPFLS DSDEDDDEVA ATEERRAVLR LGAGSGLDPG SAGSLSPQDP 60
VALGSSARPG LPGEASAAAV ALGGTGETPA RLSIDAIAAQ LLRDQYLLTA LELHTELLES 120
GRELPRLRDY FSNPGNFERQ SGTPPGMGAP GVPGAAGVGG AGGREPSTAS GGGQLNRAGS 180
ISTLDSLDFA RYSDDGNRET DEKVAVLEFE LRKAKETIQA LRANLTKAAE HEVPLQERKN 240
YKSSPEIQEP IKPLEKRALN FLVNEFLLKN NYKLTSITFS DENDDQDFEL WDDVGLNIPK 300
PPDLLQLYRD FGNHQVTGKD LVDVASGVEE DELEALTPII SNLPPTLETP QPAENSMLVQ 360
KLEDKISLLN SEKWSLMEQI RRLKSEMDFL KNEHFAIPAV CDSVQPPLDQ LPHKDSEDSG 420
QHPDVNSSDK GKNTDIHLSI SDEADSTIPK ENSPNSFPRR EREGMPPSSL SSKKTVHFDK 480
PNRKLSPAFH QALLSFCRMS ADSRLGYEVS RIADSEKSVM LMLGRCLPHI VPNVLLAKRE 540
ELIPLILCTA CLHPEPKERD QLLHILFNLI KRPDDEQRQM ILTGCVAFAR HVGPTRVEAE 600
LLPQCWEQIN HKYPERRLLV AESCGALAPY LPKEIRSSLV LSMLQQMLME DKADLVREAV 660
IKSLGIIMGY IDDPDKYHQG FELLLSALGD PSERVVSATH QVFLPAYAAW TTELGNLQSH 720
LILTLLNKIE KLLREGEHGL DEHKLHMYLS ALQSLIPSLF ALVLQNAPFS SKAKLHGEVP 780
QIEVTRFPRP MSPLQDVSTI IGSREQLAVL LQLYDYQLEQ EGTTGWESLL WVVNQLLPQL 840
IEIVGKINVT STACVHEFSR FFWRLCRTFG KIFTNTKVKP QFQEILRLSE ENIDSSAGNG 900
VLTKATVPIY ATGVLTCYIQ EEDRKLLVGF LEDVMTLLSL SHAPLDSLKA SFVELGANPA 960
YHELLLTVLW YGVVHTSALV RCTAARMFEL TLRGMSEALV DKRVAPALVT LSSDPEFSVR 1020
IATIPAFGTI METVIQRELL ERVKMQLASF LEDPQYQDQH SLHTEIIKTF GRVGPNAEPR 1080
FRDEFVIPHL HKLALVNNLQ IVDSKRLDIA THLFEAYSAL SCCFISEDLM VNHFLPGLRC 1140
LRTDMEHLSP EHEVILSSMI KECEQKVENK TVQEPQGSMS IAASLVSEDT KTKFLNKMGQ 1200
LTTSGAMLAN VFQRKK 1216 
Gene Ontology
  
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR021133; HEAT_type_2.
 IPR006594; LisH_dimerisation. 
Pfam
  
SMART
 SM00667; LisH 
PROSITE
 PS50077; HEAT_REPEAT
 PS50896; LISH 
PRINTS