CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-007195 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Homeobox protein cut-like 1 
Protein Synonyms/Alias
 CCAAT displacement protein; CDP; Homeobox protein cux-1 
Gene Name
 CUX1 
Gene Synonyms/Alias
 CUTL1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position  Peptide          Type            References
 94        AAFLNVYKRLIDVPD  ubiquitination  [1]
 217       SLQTALEKTRTELFD  ubiquitination  [1]
 399       LEVLLLEKNRSLQSE  ubiquitination  [1, 2]
 622       KEPFHKMKQFLSDEQ  ubiquitination  [1]
Reference
 [1] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [2] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
Functional Description
 Probably has a broad role in mammalian development as a repressor of developmentally regulated gene expression. May act by preventing binding of positively acting CCAAT factors to promoters. Component of the NF-muNR repressor; binds to the matrix attachment regions (MARs) (5' and 3') of the immunoglobulin heavy chain enhancer. Represses T-cell receptor (TCR) beta enhancer function by binding to MARbeta, an ATC-rich DNA sequence located upstream of the TCR beta enhancer (By similarity). 
Sequence Annotation
 DNA_BIND 542 629 CUT 1.
 DNA_BIND 934 1021 CUT 2.
 DNA_BIND 1117 1204 CUT 3.
 DNA_BIND 1244 1303 Homeobox.
 MOD_RES 763 763 Phosphoserine.  
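The annotation lines above follow a simple "type, start, end, note" layout, so they can be read into records and queried by position. The sketch below is illustrative only: `Feature`, `parse_feature`, `features_at`, and `span_length` are hypothetical names, not part of CPLM or UniProt tooling. It also assumes the annotation coordinates and the lysine modification positions use the same residue numbering, which this record does not state.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    kind: str   # feature type, e.g. DNA_BIND or MOD_RES
    start: int  # first residue, 1-based inclusive
    end: int    # last residue, 1-based inclusive
    note: str   # free-text description


def parse_feature(line: str) -> Feature:
    """Split one annotation line into type, start, end and description."""
    kind, start, end, *rest = line.split()
    return Feature(kind, int(start), int(end), " ".join(rest).rstrip("."))


# Lines copied from the Sequence Annotation field of this record.
FEATURE_LINES = [
    "DNA_BIND 542 629 CUT 1.",
    "DNA_BIND 934 1021 CUT 2.",
    "DNA_BIND 1117 1204 CUT 3.",
    "DNA_BIND 1244 1303 Homeobox.",
    "MOD_RES 763 763 Phosphoserine.",
]
FEATURES = [parse_feature(line) for line in FEATURE_LINES]


def features_at(position: int) -> List[Feature]:
    """Return every annotated feature whose range contains the position."""
    return [f for f in FEATURES if f.start <= position <= f.end]


def span_length(feature: Feature) -> int:
    """Number of residues covered by an inclusive 1-based range."""
    return feature.end - feature.start + 1
```

Under the shared-numbering assumption, `features_at(622)` returns the CUT 1 record, while `features_at(94)` returns an empty list.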
Keyword
 Alternative splicing; Coiled coil; Complete proteome; Developmental protein; DNA-binding; Homeobox; Nucleus; Phosphoprotein; Reference proteome; Repeat; Repressor; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1505 AA 
Protein Sequence
MAANVGSMFQ YWKRFDLQQL QRELDATATV LANRQDESEQ SRKRLIEQSR EFKKNTPEDL 60
RKQVAPLLKS FQGEIDALSK RSKEAEAAFL NVYKRLIDVP DPVPALDLGQ QLQLKVQRLH 120
DIETENQKLR ETLEEYNKEF AEVKNQEVTI KALKEKIREY EQTLKNQAET IALEKEQKLQ 180
NDFAEKERKL QETQMSTTSK LEEAEHKVQS LQTALEKTRT ELFDLKTKYD EETTAKADEI 240
EMIMTDLERA NQRAEVAQRE AETLREQLSS ANHSLQLASQ IQKAPDVEQA IEVLTRSSLE 300
VELAAKEREI AQLVEDVQRL QASLTKLREN SASQISQLEQ QLSAKNSTLK QLEEKLKGQA 360
DYEEVKKELN ILKSMEFAPS EGAGTQDAAK PLEVLLLEKN RSLQSENAAL RISNSDLSGS 420
ARRKGKDQPE SRRPGSLPAP PPSQLPRNPG EQASNTNGTH QFSPAGLSQD FFSSSLASPS 480
LPLASTGKFA LNSLLQRQLM QSFYSKAMQE AGSTSMIFST GPYSTNSISS QSPLQQSPDV 540
NGMAPSPSQS ESAGSVSEGE EMDTAEIARQ VKEQLIKHNI GQRIFGHYVL GLSQGSVSEI 600
LARPKPWNKL TVRGKEPFHK MKQFLSDEQN ILALRSIQGR QRENPGQSLN RLFQEVPKRR 660
NGSEGNITTR IRASETGSDE AIKSILEQAK RELQVQKTAE PAQPSSASGS GNSDDAIRSI 720
LQQARREMEA QQAALDPALK QAPLSQSDIT ILTPKLLSTS PMPTVSSYPP LAISLKKPSA 780
APEAGASALP NPPALKKEAQ DAPGLDPQGA ADCAQGVLRQ VKNEVGRSGA WKDHWWSAVQ 840
PERRNAASSE EAKAEETGGG KEKGSGGSGG GSQPRAERSQ LQGPSSSEYW KEWPSAESPY 900
SQSSELSLTG ASRSETPQNS PLPSSPIVPM SKPTKPSVPP LTPEQYEVYM YQEVDTIELT 960
RQVKEKLAKN GICQRIFGEK VLGLSQGSVS DMLSRPKPWS KLTQKGREPF IRMQLWLNGE 1020
LGQGVLPVQG QQQGPVLHSV TSLQDPLQQG CVSSESTPKT SASCSPAPES PMSSSESVKS 1080
LTELVQQPCP PIEASKDSKP PEPSDPPASD SQPTTPLPLS GHSALSIQEL VAMSPELDTY 1140
GITKRVKEVL TDNNLGQRLF GETILGLTQG SVSDLLARPK PWHKLSLKGR EPFVRMQLWL 1200
NDPNNVEKLM DMKRMEKKAY MKRRHSSVSD SQPCEPPSVG TEYSQGASPQ PQHQLKKPRV 1260
VLAPEEKEAL KRAYQQKPYP SPKTIEDLAT QLNLKTSTVI NWFHNYRSRI RRELFIEEIQ 1320
AGSQGQAGAS DSPSARSGRA APSSEGDSCD GVEATEGPGS ADTEEPKSQG EAEREEVPRP 1380
AEQTEPPPSG TPGPDDARDD DHEGGPVEGP GPLPSPASAT ATAAPAAPED AATSAAAAPG 1440
EGPAAPSSAP PPSNSSSSSA PRRPSSLQSL FGLPEAAGAR DSRDNPLRKK KAANLNSIIH 1500
RLEKAASREE PIEWEF 1516 
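The sequence is printed in blocks of ten residues with a running count at the end of each line, so it needs a small cleanup step before positions can be looked up. The sketch below strips the spacing and counts, then extracts the peptide around a modified lysine. The function names are hypothetical, and the flank of 7 residues is an inference from the 15-residue peptides listed under Lysine Modification, not a documented CPLM parameter. Only the first two sequence lines are included to keep the example short.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Keep only letters, dropping spaces, newlines and running counts."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def lysine_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the peptide centred on a 1-based lysine position.

    Raises ValueError if the residue at that position is not K.
    The window is truncated at the sequence ends.
    """
    residue = sequence[position - 1]
    if residue != "K":
        raise ValueError(f"position {position} is {residue}, not K")
    start = max(0, position - 1 - flank)
    return sequence[start:position + flank]


# First two lines of the Protein Sequence field (residues 1-120).
BLOCK = """\
MAANVGSMFQ YWKRFDLQQL QRELDATATV LANRQDESEQ SRKRLIEQSR EFKKNTPEDL 60
RKQVAPLLKS FQGEIDALSK RSKEAEAAFL NVYKRLIDVP DPVPALDLGQ QLQLKVQRLH 120
"""
SEQUENCE = parse_sequence_block(BLOCK)

print(lysine_window(SEQUENCE, 94))  # AAFLNVYKRLIDVPD
```

The same call with the full sequence and positions 217, 399, or 622 can be used to check the remaining peptides in the modification table.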
Gene Ontology
 GO:0005829; C:cytosol; TAS:Reactome.
 GO:0000139; C:Golgi membrane; IEA:Compara.
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:Compara.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0042491; P:auditory receptor cell differentiation; IEA:Compara.
 GO:0030324; P:lung development; IEA:Compara.
 GO:0007275; P:multicellular organismal development; TAS:ProtInc.
 GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; TAS:ProtInc.
 GO:0000301; P:retrograde transport, vesicle recycling within Golgi; IEA:Compara.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR003350; Homeo_CUT.
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR010982; Lambda_DNA-bd_dom. 
Pfam
 PF02376; CUT
 PF00046; Homeobox 
SMART
 SM00389; HOX 
PROSITE
 PS51042; CUT
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2 
PRINTS