CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-031914 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Cut, isoform C 
Protein Synonyms/Alias
  
Gene Name
 ct 
Gene Synonyms/Alias
 CG11387; Dmel_CG11387 
Created Date
 July 27, 2013 
Organism
 Drosophila melanogaster (Fruit fly) 
NCBI Taxa ID
 7227 
Lysine Modification
 Position  Peptide          Type         References
 1630      QYKIAPEKLMRTGSY  acetylation  [1]
Reference
 [1] Proteome-wide mapping of the Drosophila acetylome demonstrates a high degree of conservation of lysine acetylation.
 Weinert BT, Wagner SA, Horn H, Henriksen P, Liu WR, Olsen JV, Jensen LJ, Choudhary C.
 Sci Signal. 2011 Jul 26;4(183):ra48. [PMID: 21791702]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; DNA-binding; Homeobox; Nucleus; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2383 AA 
Protein Sequence
MQPTLPQAAG TADMDLTAVQ SINDWFFKKE QIYLLAQFWQ QMPIHDIHPI TQQPTPLDKV 60
ILPLHCSHSE FRATLAEKEV NTLKEQLSTG NPDSNLNSEN SDTAAAAATA AAVAAVVAGA 120
TATNDIEDEQ QQQLQQTASG GILESDSDKL LNSSIVAAAI TLQQQNGSNL LANTNTPSPS 180
PPLLSAEQQQ QLQSSLQQSG GVGGACLNPK LFFNHAQQMM MMEAAAAAAA AALQQQQQQQ 240
SPLHSPANEV AIPTEQPAAT VATGAAAAAA AAATPIATGN VKSGSTTSNA NHTNSNNSHQ 300
DEEELDDEEE DEEEDEDEDD EEENASMQSN ADDMELDAQQ ETRTEPSATT QQQHQQQDTE 360
DLEENKDAGE ASLNVSNNHN TTDSNNSCSR KNNNGGNESE QHVASSAEDD DCANNNTNTS 420
NNNNTSNTAT SNTNNNNNNN SSSGNSEKRK KKNNNNNNGQ PAVLLAAKDK EIKALLDELQ 480
RLRAQEQTHL VQIQRLEEHL EVKRQHIIRL EARLDKQQIN EALAEATALS AAASTNNNNN 540
SQSSDNNKKL NTAAERPMDA SSNADLPEST KAPVPAEDDE EDEDQAMLVD SEEAEDKPED 600
SHHDDDEDED EDREAVNATT TDSNELKIKK EQHSPLDLNV LSPNSAIAAA AAAAAAAACA 660
NDPNKFQALL IERTKALAAE ALKNGASDAL SEETNRNHED EDEVSPAATA TATLATPATV 720
ATPIETLATD SEAKASNDMA PKQEIEEVDD EDATDEPVIL PEQLVNDYWR RGFVESGNST 780
NTSTAAMPGS KTPTIYQNFK VEQQAQLLPP PPPPPPPPQQ QHQQQQQQQH LHFSQQQQQQ 840
HQQQHGPSST ALLSQLVNST FSNSSNSSRL DAHHQQQQHH QQQHQHQQQH HQQQHLHQQH 900
HHHLQQQPNS GSNSNPASND HHHGHHLHGH GLLHPSSAHH LHHQTTESNS NSSTPTAAGN 960
NNGSNNSSSN TNANSTAQLA ASLASTLNGT KSLMQEDSNG LAAVAMAAHA QHAAALGPGF 1020
LPGLPAFQFA AAQVAAGGDG RGHYRFADSE LQLPPGASMA GRLGESLIPK GDPMEAKLQE 1080
MLRYNMDKYA NQALDTLHIS RRVRELLSVH NIGQRLFAKY ILGLSQGTVS ELLSKPKPWD 1140
KLTEKGRDSY RKMHAWACDD NAVMLLKSLI PKKDSGLPQY AGRGAGGAGG DDSMSEDRIA 1200
HILSEASSLM KQSSVAQHRE QERRSHGGED SHSNEDSKSP PQSCTSPFFK VENQLKQHQH 1260
LNPEQAAAQQ REREREQRER EQQQRLRHDD QDKMARLYQE LIARTPRETA FPSFLFSPSL 1320
FGGAAGMPGA ASNAFPAMAD ENMRHVFERE IAKLQQHQQQ QQAAQAQAQF PNFSSLMALQ 1380
QQVLNGAQDL SLAAAAAKDI KLNGQRSSLE HSAGSSSCSK DGERDDAYPS SLHGRKSEGG 1440
GTPAPPAPPS GPGTGAGAPP TAAPPTGGAS SNSAAPSPLS NSILPPALSS QGEEFAATAS 1500
PLQRMASITN SLITQPPVTP HHSTPQRPTK AVLPPITQQQ FDMFNNLNTE DIVRRVKEAL 1560
SQYSISQRLF GESVLGLSQG SVSDLLARPK PWHMLTQKGR EPFIRMKMFL EDENAVHKLV 1620
ASQYKIAPEK LMRTGSYSGS PQMPQGLASK MQAASLPMQK MMSELKLQEP AQAQHLMQQM 1680
QAAAMSAAMQ QQQVAQAQQQ AQQAQQAQQH LQQQAQQHLQ QQQHLAQQQH PHQQHHQAAA 1740
AAAALHHQSM LLTSPGLPPQ HAISLPPSAG GAQPGGPGGN QGSSNPSNSE KKPMLMPVHG 1800
TNAMRSLHQH MSPTVYEMAA LTQDLDTHDI TTKIKEALLA NNIGQKIFGE AVLGLSQGSV 1860
SELLSKPKPW HMLSIKGREP FIRMQLWLSD ANNVERLQLL KNERREASKR RRSTGPNQQD 1920
NSSDTSSNDT NDFYTSSPGP GSVGSGVGGA PPSKKQRVLF SEEQKEALRL AFALDPYPNV 1980
GTIEFLANEL GLATRTITNW FHNHRMRLKQ QVPHGPAGQD NPIPSRESTS ATPFDPVQFR 2040
ILLQQRLLEL HKERMGMSGA PIPYPPYFAA AAILGRSLAG IPGAAAAAGA AAAAAAVGAS 2100
GGDELQALNQ AFKEQMSGLD LSMPTLKRER SDDYQDDLEL EGGGHNLSDN ESLEGQEPED 2160
KTTDYEKVLH KSALAAAAAY MSNAVRSSRR KPAAPQWVNP AGAVTNPSAV VAAVAAAAAA 2220
AADNERIING VCVMQASEYG RDDTDSNKPT DGGNDSDHEH AQLEIDQRFM EPEVHIKQEE 2280
DDDEEQSGSV NLDNEDNATS EQKLKVINEE KLRMVRVRRL SSTGGGSSEE MPAPLAPPPP 2340
PPAASSSIVS GESTTSSSSS SNTSSSTPAV TTAAATAAAG WNY 2383 
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro. 
Interpro
 IPR003350; Homeo_CUT.
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR010982; Lambda_DNA-bd_dom. 
Pfam
 PF02376; CUT
 PF00046; Homeobox 
SMART
 SM00389; HOX 
PROSITE
 PS51042; CUT
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2 
PRINTS