CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-022548
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 HEAT repeat-containing protein 5B 
Protein Synonyms/Alias
  
Gene Name
 HEATR5B 
Gene Synonyms/Alias
 KIAA1414 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 294       KSGGEMLKVGGSVNR  ubiquitination  [1]
 372       VGSLLGEKAQIAAAK  ubiquitination  [2]
 387       EICQAIGKQMKAVEA  ubiquitination  [2]
 590       NVFPRSLKELEAEKA  ubiquitination  [2]
 841       LKGLAENKSTLGPEE  ubiquitination  [1]
 899       MAQYSFDKLKSARDV  ubiquitination  [2]
 1324      ALEDIIKKFASVPEP  ubiquitination  [2]
 1829      KTEAGVQKQWTALIR  ubiquitination  [3, 4]
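Each modification row in this record carries four fields: the 1-based position of the modified lysine, a 15-residue peptide with that lysine at its center, the modification type, and the reference numbers. A minimal Python sketch for splitting such a row into fields follows. It assumes the peptide is written in upper case and the type in lower case, as in the rows of this entry; the helper name `parse_site_row` and the pattern are illustrative assumptions, not part of CPLM.

```python
import re

# Assumed row layout: digits, upper-case peptide, lower-case type, [refs].
# Whitespace between the fields is optional, so fused rows also parse.
ROW = re.compile(r"^\s*(\d+)\s*([A-Z]+)\s*([a-z]+)\s*\[([\d,\s]+)\]\s*$")

def parse_site_row(row):
    """Split one modification row into position, peptide, type, references."""
    m = ROW.match(row)
    if m is None:
        raise ValueError(f"unrecognized row: {row!r}")
    position, peptide, mod_type, refs = m.groups()
    return {
        "position": int(position),
        "peptide": peptide,
        "type": mod_type,
        "references": [int(r) for r in refs.split(",")],
    }

site = parse_site_row("1829KTEAGVQKQWTALIRubiquitination[3, 4]")
print(site["position"], site["peptide"], site["type"], site["references"])
```

The same pattern accepts rows whose fields are separated by spaces, so it does not depend on how the table was exported.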
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [4] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
Functional Description
  
Sequence Annotation
 REPEAT 848 885 HEAT 1.
 REPEAT 1062 1099 HEAT 2.
 REPEAT 1290 1327 HEAT 3.
 MOD_RES 1123 1123 Phosphoserine.
 MOD_RES 1737 1737 Phosphoserine.  
Keyword
 Alternative splicing; Complete proteome; Phosphoprotein; Polymorphism; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2071 AA 
Protein Sequence
MELAHSLLLN EEALAQITEA KRPVFIFEWL RFLDKVLVAA NKTDVKEKQK KLVEQLTGLI 60
SSSPGPPTRK LLAKNLAALY SIGDTFTVFQ TLDKCNDIIR NKDDTAAYLP TKLAAVACVG 120
AFYEKMGRML GSAFPETVNN LLKSLKSAES QGRSEILMSL QKVLSGLGGA AASSHRDIYK 180
NARSLLTDRS MAVRCAVAKC LLELQNEAVF MWTAELENIA TLCFKALENS NYGVRVAVSK 240
LLGTVMATAL MPKQATVMRQ NVKRATFDEV LELMATGFLR GGSGFLKSGG EMLKVGGSVN 300
REVRVGVTQA YVVFVTTLGG QWLERSFATF LSHVLDLVSH PRATQTHVEA VYSRRCVSFI 360
LRATVGSLLG EKAQIAAAKE ICQAIGKQMK AVEAVVNDTS GENKSGAADI AASQHVMVCA 420
LQELGSLVQS LNATASPLIQ EASIGLLEIV TSVLLHPSMA ARLAAAWCLR CVAVALPFQL 480
TPFLDRCAER LNNLKTSPEA VSGYSFAMAA LLGGVHQCPL GIPHAKGKMV VSIAEDLLRT 540
AAQNSRLSLQ RTQAGWLLLG ALMTLGPSVV RYHLPKMLLL WRNVFPRSLK ELEAEKARGD 600
SFTWQVTLEG RAGALCAMRS FVAHCPELLT EDVIRKLMTP IECAMTMMSH IPSVMKAHGA 660
HLKASAAMVR LRLYDILALL PPKTYEGSFN ALLRELVAEF TLTDNSANTT TSLLRSLCHY 720
DDSVLLGSWL QETDHKSIED QLQPNSASGS GALEHDPSSI YLRIPAGEAV PGPLPLGVSV 780
IDASVALFGV VFPHVSYKHR LQMLDHFAEC VKQAKGVRQQ AVQLNIFTAV LSALKGLAEN 840
KSTLGPEEVR KSALTLVMGP LDNPNPILRC AAGEALGRMA QVVGEATFIA RMAQYSFDKL 900
KSARDVVSRT GHSLALGCLH RYVGGIGSGQ HLKTSVSILL ALAQDGTSPE VQTWSLHSLA 960
LIVDSSGPMY RGYVEPTLSL VLTLLLTVPP SHTEVHQCLG RCLGAIITTV GPELQGNGAT 1020
TSTIRSSCLV GCAITQDHSD SLVQAAAISC LQQLHMFAPR HVNLSSLVPS LCVHLCSSHL 1080
LLRRAAVACL RQLAQREAAE VCEYAMSLAK NTGDKESSSA NVSPFAPGVS SRTDIHCRHQ 1140
GVNITETGLE GLLFGMLDRE TDRKLCSDIH DTLGHMLSSL AVEKLSHWLM LCKDVLAASS 1200
DMSTATLLSS GKDEEAEKKD EMDDDTMFTT LGEEDKSKPF VAPRWATRVF AADCLCRIIN 1260
LCENADQAHF DLALARSAKL RNPTNDLLVL HLSDLIRMAF MAATDHSNQL RMAGLQALED 1320
IIKKFASVPE PEFPGHVILE QYQANVGAAL RPAFSQDTPS DIIAKACQVC STWIGSGVVS 1380
DLNDLRRVHN LLVSSLDKVQ AGKGSSSQLY RESATTMEKL AVLKAWAEVY VVAMNIKKEA 1440
ESKPKRAIKN TDDDDDDCGT IDELPPDSLI TLVQPELPTL SRLWLAALKD YALLTLPAEF 1500
SSQLPPDGGA FYTPETIDTA RLHYRNSWAP ILHAVALWLN STGFTCSEST EAAAISGLQK 1560
RSTSVNLNQA SGAVGSAKSL PEINKDRMHL ILGVSIQFLC SPRPEEPIEH VTACLQALHT 1620
LLDSPYARVH IAEDQLIGVE LLSVLHRLLL TWNPSSVQLL VTGVVQQIVR AAQDYLQEKR 1680
NTLNEDDMEK EACTVLGEGG DSGGLIPGKS LVFATMELLM FILVRHMPHL STKVSDSPSH 1740
IATKTRLSEE SARLVAATVT ILSDLPSLCS PAGCMTILPT ILFLIARILK DTAIKSADNQ 1800
VPPPVSAALQ GIKSIVTLSM AKTEAGVQKQ WTALIRSTLA CILEYSQPED SVPTPDEVSM 1860
LTAIALFLWS ASNEIIGVQS LQNGCMNRFK NALNSCDPWV QAKCYQLLLS VFQHSNRALS 1920
TPYIHSLAPI VVEKLKAVER NRPASNIELL AVQEGIKVLE TLVALGEEQN RVQLLALLVP 1980
TLISYLLDEN SFASASSASK DLHEFALQNL MHIGPLYPHA FKTVMGAAPE LKVRLETAVR 2040
ASQASKAKAA ARQPAPAIHS APTIKLKTSF F 2071 
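The sequence above is laid out in blocks of 10 residues, with a running residue count at the end of each line. The sketch below, in Python, shows one way to rejoin such lines and to extract the 15-residue peptide around a modified lysine. The function names `parse_sequence_lines` and `peptide_window` are hypothetical, and the default flank of 7 residues is an assumption taken from the peptide lengths listed under Lysine Modification in this record.

```python
def parse_sequence_lines(lines):
    """Join block-formatted lines, dropping the trailing residue count."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isalpha():  # skip the numeric end-of-line counter
                residues.append(token)
    return "".join(residues)

def peptide_window(seq, position, first_position=1, flank=7):
    """Return the residues within `flank` of a 1-based position.

    `first_position` is the sequence position of seq[0], which lets the
    function work on a fragment as well as on the full sequence.
    """
    i = position - first_position  # 0-based index into `seq`
    return seq[max(0, i - flank): i + flank + 1]

# Two consecutive lines copied from this record, covering residues 241-360.
lines = [
    "LLGTVMATAL MPKQATVMRQ NVKRATFDEV LELMATGFLR GGSGFLKSGG EMLKVGGSVN 300",
    "REVRVGVTQA YVVFVTTLGG QWLERSFATF LSHVLDLVSH PRATQTHVEA VYSRRCVSFI 360",
]
fragment = parse_sequence_lines(lines)
peptide = peptide_window(fragment, 294, first_position=241)
print(peptide)  # KSGGEMLKVGGSVNR
```

The printed peptide matches the entry for position 294, which gives a quick consistency check between the site list and the stored sequence.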
Gene Ontology
  
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold. 
Pfam
  
SMART
  
PROSITE
 PS50077; HEAT_REPEAT 
PRINTS