CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-034980
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Ash1 (Absent, small, or homeotic)-like (Drosophila) (Predicted) 
Protein Synonyms/Alias
 Protein Ash1l 
Gene Name
 Ash1l 
Gene Synonyms/Alias
 Ash1l_predicted; rCG_62830 
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
93        LKIGLQAKRTKKPPK  acetylation  [1]
1816      YDKILATKKNLDHVN  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Metal-binding; Methyltransferase; Reference proteome; Transferase; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2918 AA 
Protein Sequence
MDPRNTAMLG LGSDSEGFSR KSPSAISSGT LASKREVEIE GNTDEEDPRK RNRERAIEAG 60
KDDGLTDAQQ QFSVKETNFS EGNLKLKIGL QAKRTKKPPK NLENYVCRPA IKTTVKHSRK 120
ALKSGKMTDE KNEHCPSKWD SSKLFKKAGD ATAIECQSEE SVHLHSQGEN NPLSKKLSPV 180
HSQMADYIPA APPLVGSRDP DIKDRALLNG GTSVTEKLAQ LIATCPPSKS SKAKPKKLGT 240
GTTVGLVSKD LIRKPGVGSI AGIIHKDLIK KPALSTAVGL VTKDPGKKPM FNAAVSLINK 300
DSVKKLGTGT TAVFINKDLG KKPGTVTTVG LLSKDPGKKL GIGIVPGLVN KESGKKLGLG 360
TVVGLVNKEL GKKLSSTVGL VAKDVTKKIV ASSAMGLVNK DIGKKLLSCP IAGQLGSKDA 420
LNLKSEALLP TQEQLKASCS ANISNHESQE LPESLKDSAT SKTFEKNVMR HSKESMLEKF 480
SVRKEITNLE KEMFNEGTCI QQDSFSSNER GAFETSKHEK QPPVYCTSPD FQIGGASDAS 540
TAKSPFGAVG ESNLPSSSPT VSVNPLTRNP PETSSQLVPN PLLLNSTAEQ MEEISESIGK 600
NQFTAESTHL NVGHRSLGHS ISIECKGIDK ELNESKSTHL DISRISSSLG KKPSLASDSG 660
IHTITPSVVN FTSLFSNKPF LKLGAVAAPD KHCQVAESLS TSFQSKPLKK RKGRKARWTK 720
VVARSTCRSP KGLDLERSEL FKNVSCSSLS NSSEPAKFMK TIGASSFVDH DFLKRRLPKL 780
SKSSAPSLAL LADSEKASHK SFITHKLSSS MCVTSDLLSD IYKPKRGRPK SKEMPQLEGP 840
PKRTLKIPAS KVFSLQSKEE QEPPILQPEI EIPSFKQSLS VSPFPKKRGR PKRQMRSPVK 900
MKPPVLSVAP FVATESPSKL ESESENHRSS SDFFESEDQL QDTDDLEDSH RQSVCSVSDL 960
EMEPDKKISK RNNGQLMKTI IRKINKMKTL KRKKLLNQIL SSSVESSNKG KVQSKLHNTV 1020
SSLAATFGSK LGQQINVSKK GTIYIGKRRG RKPKTVLNGL LSGSPASLAV LEQTAQQAAG 1080
SALGQILPPL LPSPASSSEI LPSPICSQSS GTSGGQSPVS SDAGFVEPSS VPYLHVHSRQ 1140
GSMIQTLAMK KAAKGRRRLS PPTLLPNSPS HLSELTSLKE ATPSPVSESH SDETIPSDSG 1200
IGTDNNSTSD RAEKFCGQKK RRHSFEHISL IPPETSTVLN SLKEKHKHKC KRRSHDYLSY 1260
DKMKRQKRKR KKKYPQLRNR QDPDFIAELE ELISRLSEIR ITHRSHHFIP RDLLPTIFRI 1320
NFNSFYTHPS FPLDPLHYIR KPDLKKKRGR PPKMREAMAE MPFMHSLSFP LSSTGFYPSY 1380
GMPYSPSPLT AAPIGLGYYG RYPPTLYPPP PSPSFTTPLP PPSYMHAGHL LLNPTKYHKK 1440
KHKLLRQEAF LTTSRTPLLS MSTYPSVPPE MAYGWMVEHK HRHRHKHREH RSEQPQVSMD 1500
TGSSRSVLES LKRYRFGKDT VGDRYKHKEK HRCHMSCPHL SPSKSLINRE EQWVSREPSE 1560
SSSLALGLQT PLQIDCSESS PSLSLGGFTP NSEPASSDEH MNLFTSAIGS CRVSNPNSSC 1620
RKKLTDSPGL FPVQDTALSR PHRKEPLPST ERAIQSLAGS QSASDKPSQR SSESTNCSPT 1680
RKRSSSESTS STVNGIPSRS PRLVASLDDS VDCLLQRIVQ HDEQESVEKN GDAPITTVSA 1740
PPSSSPGHSY SKERTLGKSD SLLVPAVPSD SCNSIPLLSE KLASRCSPHH IKRSVVEAMQ 1800
RQARKMCNYD KILATKKNLD HVNKILKAKK LQRQARTGNN FVKRRPGRPR KCPLQAVVSM 1860
QAFQAAQFVN PELNEGEEVS LHLSPDTVTD VIEAVVQSVN LNSEHKKGWK RKNWLLEEQT 1920
RKKQKTVPEE EEQENNKSFI EKPVEIPSPL ETPAEPSEPE SNLQPVLALI PREKKAPRPP 1980
KKKYQRAGLY SDVYKTTDPK SRLIQLKKEK LEYTPGEHEY GLFPAPIHVD VYVDVKPLSG 2040
YEATTCNCKK PDDDTRKGCG DDCLNRMIFA ECSPNTCPCG EQCCNQRIQR HEWVQCLERF 2100
RAEEKGWGIR TKEPLKAGQF IIEYLGEVVS EQEFRNRMIE QYHNHSDHYC LNLDSGMVID 2160
SYRMGNEARF INHSCDPNCE MQKWSVNGVY RIGLYALKDV PAGTELTYDY NFHSFNVEKQ 2220
QLCKCGFEKC RGIIGGKSQR MNGLPSHKGS QPASTHRKSA RSKEKRKSKH KLKKRRGHPS 2280
EEPSENINTP TRLTPQLQMK PMSNRERNFV LKHHVFLVRN WEKIHQKQEE VKHSRDIHST 2340
SLYTRWNGIC RDDGNIKSDV FMTQFSALQT ARSVRTRRLA AAEENLEVAR AARLAQIFKE 2400
ICDGIISYKD SSQQALAAPL LNLPPKKKNA DYYEKISDPL DLSTIEKQIL TGYYKTVEAF 2460
DADMLKVFRN AEKYYGRKSP IGRDVCRLRK AYYSARHEAS AQIDEIVGET ASEADSSETS 2520
VSEKENSHEK DDDVIRCICG LYKDEGLMIQ CDKCMVWQHC DCMGVNTDVE HYLCEQCDPR 2580
PVDREVPMIP RPHYAQPGCV YFICLLRDDL LLRQGDCVYL MRDSRRTPDG HPVRQSYRLL 2640
SHINRDKLDI FRIEKLWKNE KEERFAFGHH YFRPHETHHS PSRRFYHNEL FRVPLYEIIP 2700
LEAVVGTCCV LDLYTYCKGR PKGVKEQDVY ICDYRLDKSA HLFYKIHRNR YPVCTKPYAF 2760
DHFPKKLTPK RDFSPHYVPD NYKRNGGRSS WKSERSKPLL KDLGQDDDAL PLIEEVLASQ 2820
EQAANEMPSP EEPDQERVTG DVSDAEKKPE ESSQEPQLAS TPEERRHSQR ERLNQILLNL 2880
LEKIPGKNAI DVTYLLEEGS GRKLRRRTLF IPENSFRK 2918 
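The positions listed under Lysine Modification appear to be 1-based indices into this sequence, and each peptide appears to be the modified lysine plus seven flanking residues on either side. The following is a minimal Python sketch of how that could be checked, assuming the layout shown here (groups of ten residues followed by a running residue count). The helper names `parse_block` and `window` are hypothetical and not part of CPLM; only the row covering residues 61-120 is copied in to keep the example short.

```python
def parse_block(lines):
    """Join the residue groups of a sequence block, dropping the running counts."""
    residues = []
    for line in lines:
        for token in line.split():
            # Residue groups are alphabetic; the trailing count (e.g. "120") is not.
            if token.isalpha():
                residues.append(token)
    return "".join(residues)


def window(seq, position, seq_start=1, flank=7):
    """Return the residues within `flank` of a 1-based `position`.

    `seq_start` is the 1-based position of seq[0], so a fragment of the
    full protein can be used instead of the whole sequence.
    """
    idx = position - seq_start
    if not 0 <= idx < len(seq):
        raise ValueError(f"position {position} is outside the given fragment")
    return seq[max(0, idx - flank): idx + flank + 1]


# Second row of the sequence block in this record (residues 61-120).
row = "KDDGLTDAQQ QFSVKETNFS EGNLKLKIGL QAKRTKKPPK NLENYVCRPA IKTTVKHSRK 120"
fragment = parse_block([row])

site = 93  # first acetylation site listed in this record
peptide = window(fragment, site, seq_start=61)
print(peptide)  # LKIGLQAKRTKKPPK
```

The same check applies to the full 2918-residue sequence by passing every row to `parse_block` and leaving `seq_start` at its default of 1.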
Gene Ontology
 GO:0005634; C:nucleus; IEA:InterPro.
 GO:0003677; F:DNA binding; IEA:InterPro.
 GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:InterPro.
 GO:0008270; F:zinc ion binding; IEA:InterPro.
 GO:0034968; P:histone lysine methylation; IEA:GOC. 
Interpro
 IPR017956; AT_hook_DNA-bd_motif.
 IPR006560; AWS.
 IPR001025; BAH_dom.
 IPR001487; Bromodomain.
 IPR003616; Post-SET_dom.
 IPR001214; SET_dom.
 IPR019786; Zinc_finger_PHD-type_CS.
 IPR011011; Znf_FYVE_PHD.
 IPR001965; Znf_PHD.
 IPR019787; Znf_PHD-finger.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF01426; BAH
 PF00439; Bromodomain
 PF00628; PHD
 PF00856; SET 
SMART
 SM00384; AT_hook
 SM00570; AWS
 SM00439; BAH
 SM00297; BROMO
 SM00249; PHD
 SM00508; PostSET
 SM00317; SET 
PROSITE
 PS51215; AWS
 PS51038; BAH
 PS50014; BROMODOMAIN_2
 PS50868; POST_SET
 PS50280; SET
 PS01359; ZF_PHD_1 
PRINTS