CPLM 1.0 - Compendium of Protein Lysine Modification
Tag	Content
CPLM ID CPLM-045673
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Rhinoceros, isoform D 
Protein Synonyms/Alias
  
Gene Name
 rno 
Gene Synonyms/Alias
 Dmel_CG7036 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
Position  Peptide          Type         References
105       TSKSKSTKLAKSASK  acetylation  [1]
108       SKSTKLAKSASKCKS  acetylation  [1]
112       KLAKSASKCKSQGAS  acetylation  [1]
133       ARSVADIKMSSIYNR  acetylation  [1]
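In this record, each listed peptide appears to be the 15-residue window centered on the modified lysine (seven residues on each side of the stated position). The sketch below checks that reading against the first 180 residues of the Protein Sequence field. The function names and the `flank` parameter are illustrative assumptions, not part of CPLM.

```python
import re

# First three lines of this record's Protein Sequence field (residues 1-180),
# copied with their spacing and running residue counts.
SEQUENCE_BLOCK = """
MSQRGKRGNQ QHHQSHHPPP QQHQRKDVEP QPPPTKRRKG RPPNGATTAA VAEVTGSGPA 60
TGSERVPVLP LCKSKHEEPG AEAGGGGQGR AAAGATSTSK SKSTKLAKSA SKCKSQGASS 120
SSSWQARSVA DIKMSSIYNR SSTEAPAELY RKDLISAMKL PDSEPLANYE YLIVTDPWKQ 180
"""

# Position -> peptide pairs as listed under Lysine Modification.
SITES = {
    105: "TSKSKSTKLAKSASK",
    108: "SKSTKLAKSASKCKS",
    112: "KLAKSASKCKSQGAS",
    133: "ARSVADIKMSSIYNR",
}


def parse_sequence_block(block: str) -> str:
    """Drop whitespace and residue counts, keeping only amino-acid letters."""
    return re.sub(r"[^A-Z]", "", block)


def peptide_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return residues position - flank .. position + flank (1-based position).

    The window is truncated at the start or end of the sequence.
    """
    start = max(position - 1 - flank, 0)
    return sequence[start:position + flank]


def check_sites(sequence: str, sites: dict) -> dict:
    """For each site, report whether the residue is K and the window matches."""
    return {
        pos: sequence[pos - 1] == "K" and peptide_window(sequence, pos) == pep
        for pos, pep in sites.items()
    }


SEQUENCE = parse_sequence_block(SEQUENCE_BLOCK)

if __name__ == "__main__":
    for pos, ok in check_sites(SEQUENCE, SITES).items():
        print(pos, peptide_window(SEQUENCE, pos), "match" if ok else "mismatch")
```

The same check could be run on the full 3241-residue sequence by pasting the whole Protein Sequence field into `SEQUENCE_BLOCK`; only the first 180 residues are needed for the four sites listed here.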
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3241 AA 
Protein Sequence
MSQRGKRGNQ QHHQSHHPPP QQHQRKDVEP QPPPTKRRKG RPPNGATTAA VAEVTGSGPA 60
TGSERVPVLP LCKSKHEEPG AEAGGGGQGR AAAGATSTSK SKSTKLAKSA SKCKSQGASS 120
SSSWQARSVA DIKMSSIYNR SSTEAPAELY RKDLISAMKL PDSEPLANYE YLIVTDPWKQ 180
EWEKGVQVPV NPDSLPEPCV YVLPEPVVSP AHDFKLPKNR YLRITKDEHY SPDLHYLTNV 240
VALAENTCAY DIDPIDEAWL RLYNSDRAQC GAFPINATQF ERVIEELEVR CWEQIQVILK 300
LEEGLGIEFD ENVICDVCRS PDSEEANEMV FCDNCNICVH QACYGITAIP SGQWLCRTCS 360
MGIKPDCVLC PNKGGAMKSN KSGKHWAHVS CALWIPEVSI GCVDRMEPIT KISSIPQSRW 420
SLICVLCRKR VGSCIQCSVK PCKTAYHVTC AFQHGLEMRA IIEEGNAEDG VKLRSYCQKH 480
SMSKGKKENA GSHGGGSASV ASAMQKANRY GSGAGGGADD GNNACGTTGE DPRRRKNHRK 540
TELTSEERNQ ARAQRLQEVE AEFDKHVNFN DISCHLFDVD DDAIVAIYNY WKLKRKSRHN 600
RELIPPKSED VEMIARKQEQ QDMENHKLVV HLRQDLERVR NLCYMVSRRE KLSRSLFKLR 660
EQVFYKQLGV LDEMRLEKQQ TKQEQQQPVM DLNAVIYAND GPTLYDRFYS SVGGQTVPAQ 720
YQDLKYILEQ LMGKLQSGKQ GRGRASQSPN KRKQPAKASP NKKLNNGILS SRTSSPEKTV 780
AGSKVGTTTS KVRSPPGKNP TGRRASKSSA AAATSTHNKS QFHSNIRSST TSHSSSGTIS 840
SGNSSSANGT SSSDSSSGSD SGSESGSSSA GSGVSKRKSS SGSPLKKQSY ARSVEQRQKQ 900
RQRRQNEAVA GASATYPDSR SASSSSDGED ERCRNRQEPE RGARRGPIQS KSVPNRSQAS 960
RSKPTTEADV GEGTGASARR KLSTTTRGLA QMDKDADESV SSDESEELLP LRGERQREST 1020
TTSGLATTGS AIGRNLGQHI YSDSESSSSE QEKDQEEQAT VESNVSDSQN QQTIRTKAAM 1080
KEFVPGTAAT TSSTSQAASS TSKAKNTREG KEGAASIGNS TKTKPNPNAK LYPADLLVVP 1140
QRQAAKKASE NMRSTNLATT LQPDVSDRVR EPDINSISGT AKSKVKDSSS RVSNEADKSS 1200
LEKVRPKEHL QKTVGKTSES APAERGKRGR PPKVPKDARP PSITENDKPA LPTHTQSKPP 1260
SVVATPVSAK SNFAVSLVPQ RQAAKKAAEQ LKSSKPVLES FSTGNDISDK ETVTSATISG 1320
SGSSVPAAST PVKPTRRSSI KEAPITPKEP LSGRRKSKED LLATPIKTTP LVKRRVVVPN 1380
LSSSSSGDSE SSSSSSSSGS SSSSGGSDSD SESQASNSEN PSSREPPVAP AKVPSDSSLV 1440
PKRSPRKSMD KPSALTIAPA SVNVLNIPST RSRQNSTTKS TKVALQKAVQ SVEDDVKCTP 1500
KTNRLQGSMD ECGKQVQLEQ ATKRATRGSK SRPPSPTAKS SPEKTVSRCK SRAEESPKKV 1560
ANLEQEISQR KVASGKGTSS LDKLLNKKQQ QMNHSAQATP PPISPTPPAS ETRIVKDQCD 1620
LKPDEVSIQQ INLGADAQPE PDLDPESAAE AGELPMDIDE ELTTAPTRTQ LSASASKLAD 1680
IIDDERPPAA PLPASPTPTP TSNDEMSDAG SDLSERRRMR WRSRRRRRRR SHEPDEEHTH 1740
HTQHLLNEME MARELEEERK NELLANASKY SASTSSPAVT VIPPDPPEII ELDSNSAEQQ 1800
QQHLHDQPLP PPLVVQSPAA DVVPTVMQQQ LLPSQRPLIE QLPVEHLPIV ETILEMEDSK 1860
FANNFASNLA SVLNPPNQMS LIGSSIDRSK QISEEDSIQA TRNLLEKLRK TKRKAQDDCS 1920
SKEAVDLLPP TPAIPSVFPF HNAADPEDII HAQKEQQHQQ QQQLQSSQTC IYGNSSGPNS 1980
VASLTIKDSP MTANSGSYAN SLTNTPNATP TNATMNNLGY QVNFPNSQPP PTLGLFLEKS 2040
PHQKGACPLS SNGGANVGQP APTPDFVDLA AAAVKNTLGS FRGAATVPTQ SGTGVNAKIN 2100
DYDESTRMQS PFGGMPWNES DLIAERRSSS PSSVSESNDP PQPPPVVTAT ATTARSLAQL 2160
ESCKNFFNSY PSGNAGPGTA ANATAPFNHP PMVNGIDSIP MFNNTNTTQH QPTTPAHQQQ 2220
QQRTPNNQYN GTIYPQLAGI MHPQTTPTEP PSSLYGNGGV GGAVQSTTLP PPAQVNQYPG 2280
TPYSATTLGM ISVQQPALST VPVQTATTPN NPFTLTSPID GKMPTYPAQL LSSCAEAVVA 2340
SMMPPTPPVT ATAKDSPSKR TSVSGSNLSK KQTHKSPQLP QGKSPGKSPR QPLQPPTPPA 2400
PVPVVALPPT KYDPQTHTLQ GKPRQRAPRG SGGSGAPGRG RGRGRGRGRG GGVTSGMAMV 2460
LPPPMSDYGS NTHIVNNLVG TPFEFNNEFD DMAGPGVENL QSLRDRRRSF ELRAPRVQNK 2520
PTTTPTTATT TNPLLHPVLP GPVDMRTYNL GFEAPHSTAS QEAYQNNLLG AFDSGTADQT 2580
LSEFNEEDER QFQSALRATG TGTSPSKQHS GPTALVAPPT GPNPTPAPNL LLHCTEANQM 2640
APNVAATGAA THLVEGSLVE ASLEATSEEV SIDSDSTIPH SKTSTSDARS QIKLKIKSPM 2700
AYPEHYNAMT NSSSLTLTST LVQSSNVVQT TVSTSTVVSA SSAVSGNSRR MRKKELLSLY 2760
VVQKDNHNDD SSCGLPAASD TLPLENLRKS EEEDELSGGN GTKRFKKNSS SRELRALDAN 2820
LALVEEQLLS SGAGACGGGS SGDGRRRSAC SSGSNNDNNG KTGAASSAGK RRGRSKTLES 2880
SEDDHQAPKL KIKIRGLTAN ETPSGVSSVD EGQNYSYEMT RRACPPKKRL TSNFSTLTLE 2940
EIKRDSMNYR KKVMQDFVKG EDSNKRGVVV KDGESLIMPQ PPTKRPKSSK PKKEKKEKKR 3000
QKQQQLILSS STTTMTTTLI ENTASASPGD KPKLILRFGK RKAETTTRTA SLEQPPTLEA 3060
PAPLRFKIAR NSSGGGYIIG TKAEKKDEST ADNTSPITEL PLISPLREAS PQGRLLNSFT 3120
PHSQNANTSP ALLGKDTGTP SPPCLVIDSS KSADVHDSTS LPESGEAAMG VQSSLVNATT 3180
PLCVNVGNYE NSNNSLPSAS GTGSASSNSC NSNSINNNGS GGGRASGEGG LLPLKKDCEV 3240
R 3241 
Gene Ontology
  
Interpro
  
Pfam
  
SMART
  
PROSITE
  
PRINTS