CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-035509
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Herc1 
Protein Synonyms/Alias
  
Gene Name
 Herc1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position
Peptide
Type
References
1808KLRPNYDKTEIEKKGubiquitination[1]
2925GKTMVQGKNYGPQITubiquitination[1]
Reference
 [1] Synaptic protein ubiquitination in rat brain revealed by antibody-based ubiquitome analysis.
 Na CH, Jones DR, Yang Y, Wang X, Xu Y, Peng J.
 J Proteome Res. 2012 Sep 7;11(9):4722-32. [PMID: 22871113
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Ligase; Reference proteome; Repeat; Ubl conjugation pathway; WD repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3276 AA 
Protein Sequence
GPMHSQLESL SDSWTRLKHT RDWFYNSSYS FESDFDLTKS LGVHTLIENV VSFVSGDVGN 60
APGFKEPEES MSTSPQASII AMEQQQLRAE LRLEALHQIL TLLSGMEEKG NILLTGSRSS 120
SGFQSSTLLT SVRLQFLAGC FGLGTVGYTG AKGESGRLHH YQDGIRAAKR NIQVEIQVAV 180
HKIYQQLSAT LERALQANKH HIEAQQRLLL VTVFALSVHY QPAXXXXXXX TGLLNVLSQL 240
CGTDTMLGQP LQLLPKTGVS QLSTALKVAS TRLLQILAIT TGTYADKLSP KVVQSLLDLL 300
CSQLKNLLSQ TGVLFMASFG EGEEGEEEDK KVDSSGEAEK RDFRAALRKQ HAAELHLGDF 360
LVFLRRVVSS KAIQSKMASP KWTEVLLNIA SQKCSSGIPL VGNLRTRLLA LHVLEAVLPA 420
CESGVEDDQM AQVVERLFSL LSDCMWETPI AQAKHAIQIK EKEQEIKLQK QGELEEEDEN 480
LPIQEVSFDP EKAQCCIVEN GQILTHGSGG KGYGLASTGV TSGCYQWKFY IVKENRGNEG 540
TCVGVSRWPV HDFNHRTTSD MWLYRAYSGN LYHNGEQTLT LSSFTQGDFI TCVLDMEART 600
ISFGKNGEEP KLAFEDVDAA ELYPCVMFYS SNPGEKVKIC DMQMRGTPRD LLPGDPICSP 660
VAAVLAEATI QLIRILHRTD RWTYCINKKM MERLHKIKIC IRESGQKLKK SRSVQSREEN 720
EMREEKENKE EEKGKHNRHG LADLSEPQLR TLCIEVWPVL AVIGGVDAGL RVGGRCVHKQ 780
TGRHATLLGV VKEGSTSAKV QWDEAEITIS FPTFWSPSDT PLYNLEPCEP LLFDVARFRG 840
LTASVLLDLT YLTGIHEDIG KQSIKRHEKK HRHESEEKGD IEQKPESESI LDVRTGLISD 900
DVKSQGTTSS KSENEIASFS LESTLPGVES QHQITEGKRK NHEHISKTHD IAQSEIRAVQ 960
LSYLYLGAMK SLSALLGCSK YAELLLIPKV LAENGHNSDC ASSPVVHEDV EMRAALQFLM 1020
RHMVKRAVMR SPIKRALGLA DLERAQAMIY KLVVHGLLED QFGGKIKQEI DQQAEESDQA 1080
QQAQTPVTTS PSASSTTSFM SSSLEDTTTA TTPVTDTETV PASESPGVMP LSLLRQMFSS 1140
YPTTTVLPTR RAQTPPISSL PASPSDEVGR RQSLTSPDSQ STRPANRTAL SDPSSRLSTS 1200
PPPPAIAVPL LEMGFSLRQI AKAMEATGAR GEADAQNITV LAMWMIEHPG HEDEEEPQSS 1260
STADSRHGTA VLGSGGKSND PCYLQSPGDI PSADAAEMEE GFSESPDNLD HTENAASGSG 1320
PPTRGRSTVT RRHKFDLAAR TLLARAAGLY RSVQAHRNQS RREGISLQQD PGALYDFNLD 1380
EELEIDLDDE AMEAMFGQDL TSDNDILGMW IPEVLDWPTW HVCESEDREE VVVCELCECN 1440
VVSFNQHMKR NHPGCGRSAN RQGYRSNGSY VDGWFGGECG SGNPYYLLCG SCREKYLALK 1500
TKTKTTNSER YKGQAPDLIG KQDSVYEEDW DMLDVDEDEK LTGEEEFELL AGPLGLNDRR 1560
IVPEPVQFPD SDPLGASVAM VTATNSMEET LMQIGCHGSV EKSSSGRVTL GEQAAALANP 1620
HDRVVALRRV TAAAQVLLAR TMVMRALSLL SVSGSSCSLA AGLESLGLTD IRTLVRLMCL 1680
AAAGRAGLST SPSAIASTSE RSRGGHSKAS KPISCLAYLS TAVGCLASNT PSAAKLLVQL 1740
CTQNLISAAT GVNLTTVDDP IQRKFLPSFL RGIAEENKLV TSPNFVVTQA LVALLADKGA 1800
KLRPNYDKTE IEKKGPLELA NALAACCLSS RLSSQHRQWA AQQLVRTLAA HDRDNQTAPQ 1860
TLADMGGDLR KCSFIKLEAH QNRVMTCVWC NKKGLLATSG NDGTIRVWNV TKKQYSLQQT 1920
CVFNRLEGDA EESLGSPSDP SFSPVSWSIS GKYLAGALEK MVNIWQVNGG KGLVDIQPHW 1980
VSALAWPEEG PATAWSGESP ELLLVGRMDG SLGLIEVVDV STMHRRELEH CYRKDVSVTC 2040
IAWFSEDRPF AVGYFDGKLL MGTKEPLEKG GIVLIDAHKE TLVSMKWDPT GHILMTCAKE 2100
ENVKLWGPVS GCWRCLHSLC HPSTVNGIAW CSLPGKGSKM QLLMATGCQN GLVCVWCIPQ 2160
DTTQTSMTSS EGWWDQESNC QDGFKKSAGA KCVYQLRGHI TPVRTVAFSS DGLALVSGGL 2220
GGLMNIWSLR DGSVLQTVVI GSGAIQTTVW IPEVGVAACS NRSKDVLVVN CTAEWASANH 2280
ILATCRTALK QQGVLGLNMA PCMRAFLERL PMMLQEQYAY EKPHVVCGDQ LVHSPYMQCL 2340
ASLAVGLHLD QLLCSPPVPP HHQNCPPDPA SWNPNEWAWL ECFSTTIKAA EALTNGAQFP 2400
ESFTVPDLEP VPEDELVLLM DNSKWINGMD EQIMSWATSR PEDWHLGGKC DVYLWGAGRH 2460
GQLAEAGRNV MVPATAPSFS QAQQVICGQN CTFVIQANGT VLACGEGSYG RLGQGNSDDL 2520
HVLTVISALQ GFVVTQLVTS CGSDGHSMAL TESGEVFSWG DGDYGKLGHG NSDRQRRPRQ 2580
IEALQGEEVV QMSCGFKHSA VVTSDGKLFT FGNGDYGRLG LGNTSNKKLP ERVTALEGYQ 2640
IGQAWHKVAC GLNHTLAVSA DGSMVWAFGD GDYGKLGLGN STAKSSPQKV DVLCGIGIKK 2700
VACGTQFSVA LTKDGHVYTF GQDRLIGLPE GRARNHNRPQ QIPVLAGVVI EDVAVGAEHT 2760
LALASTGDVY AWGSNSEGQL GLGHTNHVRE PTLVTVLQGK NIRQISAGRC HSAAWTAPPV 2820
PPRAPGVSVP LQLGLPDAVP PQYGALREVS IHTVRARLRL LYHFSDLMYS SWRLLNLSPN 2880
NQNSTSHYNA GTWGIVQGQL RPLLAPRVYT LPMVRSIGKT MVQGKNYGPQ ITVKRISTRG 2940
RKCKPIFVQI ARQVVKLNAS DLRLPSRAWK VKLVGEGADD AGGVFDDTIT EMCQELETGI 3000
VDLLIPSPNA TAEVGYNRDR FLFSPSACLD EHLMQFKFLG ILMGVAIRTK KPLDLHLAPL 3060
VWKQLCCVPL TLEDLEEVDL LYVQTLNSIL HIEDSGITEE SFHEMIPLDS FVGQSADGKM 3120
VPIIPGGNSI PLTFSNRKEY VERAIEYRLH EMDRQVAAVR EGMSWIVPVP LLSLLTAKQL 3180
EQMVCGMPEI CVDVLKKVVR YREVDEQHQL VQWLWRTLEE FSNEERVLFM RFVSGRSRLP 3240
ANTADISQRF QIMKVDRPHA SPPSQALCLF HVRLPP 3276 
Gene Ontology
 GO:0005622; C:intracellular; IEA:InterPro.
 GO:0004842; F:ubiquitin-protein ligase activity; IEA:InterPro.
 GO:0016567; P:protein ubiquitination; IEA:GOC. 
Interpro
 IPR001870; B30.2/SPRY.
 IPR008985; ConA-like_lec_gl_sf.
 IPR000569; HECT.
 IPR009091; RCC1/BLIP-II.
 IPR000408; Reg_chr_condens.
 IPR018355; SPla/RYanodine_receptor_subgr.
 IPR003877; SPRY_rcpt.
 IPR015943; WD40/YVTN_repeat-like_dom.
 IPR001680; WD40_repeat.
 IPR019775; WD40_repeat_CS.
 IPR017986; WD40_repeat_dom. 
Pfam
 PF00632; HECT
 PF00415; RCC1
 PF00622; SPRY
 PF00400; WD40 
SMART
 SM00119; HECTc
 SM00449; SPRY
 SM00320; WD40 
PROSITE
 PS50188; B302_SPRY
 PS50237; HECT
 PS00626; RCC1_2
 PS50012; RCC1_3
 PS00678; WD_REPEATS_1
 PS50082; WD_REPEATS_2
 PS50294; WD_REPEATS_REGION 
PRINTS
 PR00633; RCCNDNSATION.