CPLM 1.0 - Compendium of Protein Lysine Modification
Tag / Content
CPLM ID
 CPLM-041842
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Huntingtin 
Protein Synonyms/Alias
 Huntington disease gene homolog, isoform CRA_a 
Gene Name
 Htt 
Gene Synonyms/Alias
 Hdh; mCG_2547 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position  Peptide          Type            References
794       DCIPLLQKTLKDESS  ubiquitination  [1]
922       LVPKLFYKCDQGQAD  ubiquitination  [1]
1219      YHLPSYLKLHDVLKA  ubiquitination  [1]
1243      DLQNSTEKFGGFLRS  ubiquitination  [1]
1318      GLSSNPSKSQCRAQR  ubiquitination  [1]
1391      TNLTSVTKNRADKNA  ubiquitination  [1]
1412      LFEPLVIKALKQYTT  ubiquitination  [1]
1711      LGECSEGKQKSLPED  ubiquitination  [1]
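In the entries above, each peptide appears to be a 15-residue window with the modified lysine at its centre (7 residues on either side), and positions appear to be 1-based. The following is a minimal sketch of how such a window could be recovered from the protein sequence, under those assumptions. The function name `site_window`, its parameters, and the use of a 60-residue fragment (residues 781 to 840 of the sequence in this record) are illustrative choices, not part of CPLM.

```python
def site_window(sequence, position, flank=7, offset=1):
    """Return the peptide around a modified lysine.

    sequence : residues as a plain string
    position : 1-based position of the lysine in the full protein
    flank    : residues kept on each side (7 gives a 15-mer; an assumption)
    offset   : 1-based position of sequence[0] in the full protein
    """
    index = position - offset
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} lies outside the sequence")
    if sequence[index] != "K":
        raise ValueError(f"residue {position} is {sequence[index]}, not lysine")
    start = max(0, index - flank)  # windows near the N-terminus are truncated
    return sequence[start:index + flank + 1]


# Residues 781-840 of the protein sequence given in this record.
FRAGMENT = "NTFSLVDCIPLLQKTLKDESSVTCKLACTAVRHCVLSLCSSSYSDLGLQLLIDMLPLKNS"
FRAGMENT_START = 781

print(site_window(FRAGMENT, 794, offset=FRAGMENT_START))  # DCIPLLQKTLKDESS
```

The printed window matches the peptide listed for position 794, which is consistent with the centred 15-mer reading of the table.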
Reference
 [1] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3120 AA 
Protein Sequence
MATLEKLMKA FESLKSFQQQ QQQQPPPQAP PPPPPPPPPQ PPQPPPQGQP PPPPPPLPGP 60
AEEPLHRPKK ELSATKKDRV NHCLTICENI VAQSLRNSPE FQKLLGIAME LFLLCSDDAE 120
SDVRMVADEC LNKVIKALMD SNLPRLQLEL YKEIKKNGAP RSLRAALWRF AELAHLVRPQ 180
KCRPYLVNLL PCLTRTSKRP EESVQETLAA AVPKIMASFG NFANDNEIKV LLKAFIANLK 240
SSSPTVRRTA AGSAVSICQH SRRTQYFYNW LLNVLLGLLV PMEEEHSTLL ILGVLLTLRC 300
LVPLLQQQVK DTSLKGSFGV TRKEMEVSPS TEQLVQVYEL TLHHTQHQDH NVVTGALELL 360
QQLFRTPPPE LLQALTTPGG LGQLTLVQEE ARGRGRSGSI VELLAGGGSS CSPVLSRKQK 420
GKVLLGEEEA LEDDSESRSD VSSSAFAASV KSEIGGELAA SSGVSTPGSV GHDIITEQPR 480
SQHTLQADSV DLSGCDLTSA ATDGDEEDIL SHSSSQFSAV PSDPAMDLND GTQASSPISD 540
SSQTTTEGPD SAVTPSDSSE IVLDGADSQY LGMQIGQPQE DDEEGAAGVL SGEVSDVFRN 600
SSLALQQAHL LERMGHSRQP SDSSIDKYVT RDEVAEASDP ESKPCRIKGD IGQPNDDDSA 660
PLVHCVRLLS ASFLLTGEKK ALVPDRDVRV SVKALALSCI GAAVALHPES FFSRLYKVPL 720
NTTESTEEQY VSDILNYIDH GDPQVRGATA ILCGTLVYSI LSRSRLRVGD WLGNIRTLTG 780
NTFSLVDCIP LLQKTLKDES SVTCKLACTA VRHCVLSLCS SSYSDLGLQL LIDMLPLKNS 840
SYWLVRTELL DTLAEIDFRL VSFLEAKAES LHRGAHHYTG FLKLQERVLN NVVIYLLGDE 900
DPRVRHVAAT SLTRLVPKLF YKCDQGQADP VVAVARDQSS VYLKLLMHET QPPSHFSVST 960
ITRIYRGYSL LPSITDVTME NNLSRVVAAV SHELITSTTR ALTFGCCEAL CLLSAAFPVC 1020
TWSLGWHCGV PPLSASDESR KSCTVGMASM ILTLLSSAWF PLDLSAHQDA LILAGNLLAA 1080
SAPKSLRSSW TSEEEANSAA TRQEEIWPAL GDRTLVPLVE QLFSHLLKVI NICAHVLDDV 1140
TPGPAIKAAL PSLTNPPSLS PIRRKGKEKE PGEQASTPMS PKKVGEASAA SRQSDTSGPV 1200
TASKSSSLGS FYHLPSYLKL HDVLKATHAN YKVTLDLQNS TEKFGGFLRS ALDVLSQILE 1260
LATLQDIGKC VEEVLGYLKS CFSREPMMAT VCVQQLLKTL FGTNLASQFD GLSSNPSKSQ 1320
CRAQRLGSSS VRPGLYHYCF MAPYTHFTQA LADASLRNMV QAEQERDASG WFDVLQKVSA 1380
QLKTNLTSVT KNRADKNAIH NHIRLFEPLV IKALKQYTTT TSVQLQKQVL DLLAQLVQLR 1440
VNYCLLDSDQ VFIGFVLKQF EYIEVGQFRE SEAIIPNIFF FLVLLSYERY HSKQIIGIPK 1500
IIQLCDGIMA SGRKAVTHAI PALQPIVHDL FVLRGTNKAD AGKELETQKE VVVSMLLRLI 1560
QYHQVLEMFI LVLQQCHKEN EDKWKRLSRQ VADIILPMLA KQQMHIDSHE ALGVLNTLFE 1620
ILAPSSLRPV DMLLRSMFIT PSTMASVSTV QLWISGILAI LRVLISQSTE DIVLCRIQEL 1680
SFSPHLLSCP VINRLRGGGG NVTLGECSEG KQKSLPEDTF SRFLLQLVGI LLEDIVTKQL 1740
KVDMSEQQHT FYCQELGTLL MCLIHIFKSG MFRRITAAAT RLFTSDGCEG SFYTLESLNA 1800
RVRSMVPTHP ALVLLWCQIL LLINHTDHRW WAEVQQTPKR HSLSCTKSLN PQKSGEEEDS 1860
GSAAQLGMCN REIVRRGALI LFCDYVCQNL HDSEHLTWLI VNHIQDLISL SHEPPVQDFI 1920
SAIHRNSAAS GLFIQAIQSR CENLSTPTTL KKTLQCLEGI HLSQSGAVLT LYVDRLLGTP 1980
FRALARMVDT LACRRVEMLL AANLQSSMAQ LPEEELNRIQ EHLQNSGLAQ RHQRLYSLLD 2040
RFRLSTVQDS LSPLPPVTSH PLDGDGHTSL ETVSPDKDWY LQLVRSQCWT RSDSALLEGA 2100
ELVNRIPAED MNDFMMSSEF NLSLLAPCLS LGMSEIANGQ KSPLFEAARG VILNRVTSVV 2160
QQLPAVHQVF QPFLPIEPTA YWNKLNDLLG DTTSYQSLTI LARALAQYLV VLSKVPAHLH 2220
LPPEKEGDTV KFVVMTVEAL SWHLIHEQIP LSLDLQAGLD CCCLALQVPG LWGVLSSPEY 2280
VTHACSLIHC VRFILEAIAV QPGDQLLGPE SRSHTPRAVR KEEVDSDIQN LSHVTSACEM 2340
VADMVESLQS VLALGHKRNS TLPSFLTAVL KNIVISLARL PLVNSYTRVP PLVWKLGWSP 2400
KPGGDFGTVF PEIPVEFLQE KEILKEFIYR INTLGWTNRT QFEETWATLL GVLVTQPLVM 2460
EQEESPPEED TERTQIHVLA VQAITSLVLS AMTVPVAGNP AVSCLEQQPR NKPLKALDTR 2520
FGRKLSMIRG IVEQEIQEMV SQRENTATHH SHQAWDPVPS LLPATTGALI SHDKLLLQIN 2580
PEREPGNMSY KLGQVSIHSV WLGNNITPLR EEEWDEEEEE ESDVPAPTSP PVSPVNSRKH 2640
RAGVDIHSCS QFLLELYSRW ILPSSAARRT PVILISEVVR SLLVVSDLFT ERTQFEMMYL 2700
TLTELRRVHP SEDEILIQYL VPATCKAAAV LGMDKTVAEP VSRLLESTLR SSHLPSQIGA 2760
LHGILYVLEC DLLDDTAKQL IPVVSDYLLS NLKGIAHCVN IHSQQHVLVM CATAFYLMEN 2820
YPLDVGPEFS ASVIQMCGVM LSGSEESTPS IIYHCALRGL ERLLLSEQLS RLDTESLVKL 2880
SVDRVNVQSP HRAMAALGLM LTCMYTGKEK ASPGRASDPS PATPDSESVI VAMERVSVLF 2940
DRIRKGFPCE ARVVARILPQ FLDDFFPPQD VMNKVIGEFL SNQQPYPQFM ATVVYKVFQT 3000
LHSAGQSSMV RDWVMLSLSN FTQRTPVAMA MWSLSCFLVS ASTSPWVSAI LPHVISRMGK 3060
LEQVDVNLFC LVATDFYRHQ IEEEFDRRAF QSVFEVVAAP GSPYHRLLAC LQNVHKVTTC 3120 
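The sequence above is laid out in blocks of 10 residues, with a running residue count at the end of each line. The sketch below shows one way to turn that layout back into a plain string while checking the running counts. The function name `parse_numbered_sequence` and the `start_count` parameter are hypothetical, and the two example lines are the second and third lines of the sequence in this record (residues 61 to 180).

```python
def parse_numbered_sequence(lines, start_count=0):
    """Join 10-residue blocks into one string and verify trailing counts.

    lines       : iterable of lines such as "AEEPLHRPKK ELSATKKDRV ... 120"
    start_count : residues that precede the first line (0 for a full record)
    """
    chunks = []
    count = start_count
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        # A trailing all-digit token is taken to be the running residue count.
        expected = int(tokens.pop()) if tokens[-1].isdigit() else None
        chunk = "".join(tokens)
        count += len(chunk)
        if expected is not None and expected != count:
            raise ValueError(
                f"running count {expected} does not match {count} residues read"
            )
        chunks.append(chunk)
    return "".join(chunks)


EXAMPLE_LINES = [
    "AEEPLHRPKK ELSATKKDRV NHCLTICENI VAQSLRNSPE FQKLLGIAME LFLLCSDDAE 120",
    "SDVRMVADEC LNKVIKALMD SNLPRLQLEL YKEIKKNGAP RSLRAALWRF AELAHLVRPQ 180",
]

fragment = parse_numbered_sequence(EXAMPLE_LINES, start_count=60)
print(len(fragment))  # 120
```

Applied to all lines of the sequence with `start_count=0`, the same function would be expected to return a string whose length equals the stated protein length.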
Gene Ontology
 GO:0005776; C:autophagic vacuole; IEA:Compara.
 GO:0030424; C:axon; IDA:MGI.
 GO:0016023; C:cytoplasmic membrane-bounded vesicle; IDA:MGI.
 GO:0030659; C:cytoplasmic vesicle membrane; IEA:Compara.
 GO:0005829; C:cytosol; IEA:Compara.
 GO:0030425; C:dendrite; IEA:Compara.
 GO:0005783; C:endoplasmic reticulum; IEA:Compara.
 GO:0005794; C:Golgi apparatus; IEA:Compara.
 GO:0016234; C:inclusion body; IDA:MGI.
 GO:0005770; C:late endosome; IEA:Compara.
 GO:0005634; C:nucleus; IBA:RefGenome.
 GO:0043234; C:protein complex; IEA:Compara.
 GO:0050809; F:diazepam binding; IMP:MGI.
 GO:0009952; P:anterior/posterior pattern specification; IMP:MGI.
 GO:0008088; P:axon cargo transport; IMP:MGI.
 GO:0007569; P:cell aging; IMP:MGI.
 GO:0000052; P:citrulline metabolic process; IMP:MGI.
 GO:0008340; P:determination of adult lifespan; IMP:MGI.
 GO:0007212; P:dopamine receptor signaling pathway; IMP:MGI.
 GO:0007029; P:endoplasmic reticulum organization; IMP:MGI.
 GO:0016197; P:endosomal transport; IMP:MGI.
 GO:0006888; P:ER to Golgi vesicle-mediated transport; IMP:MGI.
 GO:0000132; P:establishment of mitotic spindle orientation; IEA:Compara.
 GO:0007030; P:Golgi organization; IEA:Compara.
 GO:0007625; P:grooming behavior; IMP:MGI.
 GO:0042445; P:hormone metabolic process; IMP:MGI.
 GO:0030073; P:insulin secretion; IMP:MGI.
 GO:0055072; P:iron ion homeostasis; IMP:MGI.
 GO:0051938; P:L-glutamate import; IMP:MGI.
 GO:0019244; P:lactate biosynthetic process from pyruvate; IMP:MGI.
 GO:0007626; P:locomotory behavior; IMP:MGI.
 GO:2001237; P:negative regulation of extrinsic apoptotic signaling pathway; IEA:Compara.
 GO:0043524; P:negative regulation of neuron apoptotic process; IMP:MGI.
 GO:0021990; P:neural plate formation; IMP:MGI.
 GO:0051402; P:neuron apoptotic process; IMP:MGI.
 GO:0048666; P:neuron development; IMP:MGI.
 GO:0021988; P:olfactory lobe development; IMP:MGI.
 GO:0048341; P:paraxial mesoderm formation; IMP:MGI.
 GO:0006606; P:protein import into nucleus; IMP:MGI.
 GO:0019805; P:quinolinate biosynthetic process; IMP:MGI.
 GO:0046902; P:regulation of mitochondrial membrane permeability; IMP:MGI.
 GO:0051881; P:regulation of mitochondrial membrane potential; IMP:MGI.
 GO:0034047; P:regulation of protein phosphatase type 2A activity; IEA:Compara.
 GO:0048167; P:regulation of synaptic plasticity; IMP:MGI.
 GO:0051592; P:response to calcium ion; IMP:MGI.
 GO:0006890; P:retrograde vesicle-mediated transport, Golgi to ER; IEA:Compara.
 GO:0035176; P:social behavior; IMP:MGI.
 GO:0007283; P:spermatogenesis; IMP:MGI.
 GO:0021756; P:striatum development; IMP:MGI.
 GO:0000050; P:urea cycle; IMP:MGI.
 GO:0047496; P:vesicle transport along microtubule; IDA:MGI.
 GO:0008542; P:visual learning; IMP:MGI. 
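Each Gene Ontology line above has three semicolon-separated fields: the GO identifier, an aspect letter with the term name, and an evidence code with its source. The letters C, F, and P denote cellular component, molecular function, and biological process. A minimal sketch of splitting one such line into its parts follows; the function name `parse_go_line` and the dictionary keys are illustrative, and the sketch assumes term names contain no semicolons.

```python
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line):
    """Split a line such as ' GO:0030424; C:axon; IDA:MGI.' into named fields."""
    text = line.strip().rstrip(".")
    parts = [part.strip() for part in text.split(";")]
    if len(parts) != 3:
        raise ValueError(f"expected 3 fields, found {len(parts)}: {line!r}")
    go_id, aspect_term, evidence_source = parts
    # Split only on the first colon so colons inside a term name survive.
    aspect, term = aspect_term.split(":", 1)
    evidence, source = evidence_source.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect],
        "term": term,
        "evidence": evidence,
        "source": source,
    }


record = parse_go_line(" GO:0030424; C:axon; IDA:MGI.")
print(record["aspect"], "|", record["term"], "|", record["evidence"])
```

Grouping the parsed records by `aspect` or `evidence` then gives a quick view of, for example, which annotations are experimentally supported (IDA, IMP) and which are electronically inferred (IEA).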
Interpro
 IPR011989; ARM-like.
 IPR016024; ARM-type_fold.
 IPR000091; Huntingtin.
 IPR024613; Huntingtin_middle-repeat. 
Pfam
 PF12372; DUF3652 
SMART
  
PROSITE
  
PRINTS
 PR00375; HUNTINGTIN.