CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-007194
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Homeobox protein cut-like 1 
Protein Synonyms/Alias
 CCAAT displacement protein; CDP; Homeobox protein cux-1 
Gene Name
 CUX1 
Gene Synonyms/Alias
 CUTL1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
83        AAFLNVYKRLIDVPD  ubiquitination  [1]
217       ELFDLKTKYDEETTA  ubiquitination  [1]
388       LEVLLLEKNRSLQSE  ubiquitination  [1]
611       KEPFHKMKQFLSDEQ  ubiquitination  [1]
850       AEETGGGKEKGSGGS  acetylation     [2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] Integrated proteomic analysis of post-translational modifications by serial enrichment.
 Mertins P, Qiao JW, Patel J, Udeshi ND, Clauser KR, Mani DR, Burgess MW, Gillette MA, Jaffe JD, Carr SA.
 Nat Methods. 2013 Jul;10(7):634-7. [PMID: 23749302]
Functional Description
 Probably has a broad role in mammalian development as a repressor of developmentally regulated gene expression. May act by preventing binding of positively acting CCAAT factors to promoters. Component of the NF-muNR repressor; binds to the matrix attachment regions (MARs) (5' and 3') of the immunoglobulin heavy chain enhancer. Represses T-cell receptor (TCR) beta enhancer function by binding to MARbeta, an ATC-rich DNA sequence located upstream of the TCR beta enhancer (By similarity). 
Sequence Annotation
 DNA_BIND 542 629 CUT 1.
 DNA_BIND 934 1021 CUT 2.
 DNA_BIND 1117 1204 CUT 3.
 DNA_BIND 1244 1303 Homeobox.
 MOD_RES 763 763 Phosphoserine.  
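The annotation lines above follow a "KEY start end description." layout with 1-based, inclusive coordinates. As a minimal sketch, assuming that layout holds for every line, the following Python parses a line into its fields and derives the feature length. The names `Feature` and `parse_feature` are hypothetical helpers introduced for illustration and are not part of CPLM or UniProt tooling.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One Sequence Annotation line (hypothetical container)."""
    key: str
    start: int          # 1-based, inclusive
    end: int            # 1-based, inclusive
    description: str

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


def parse_feature(line: str) -> Feature:
    """Split 'KEY start end description.' into typed fields."""
    key, start, end, description = line.split(None, 3)
    # Drop surrounding whitespace and the trailing period of the description.
    return Feature(key, int(start), int(end), description.strip().rstrip("."))


lines = [
    " DNA_BIND 542 629 CUT 1.",
    " DNA_BIND 1244 1303 Homeobox.",
    " MOD_RES 763 763 Phosphoserine.  ",
]
features = [parse_feature(line) for line in lines]
for f in features:
    print(f.key, f.start, f.end, f.description, f.length)
```

Under these assumptions, a site position from the Lysine Modification table can be compared with a feature range using `f.start <= position <= f.end`.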
Keyword
 Alternative splicing; Coiled coil; Complete proteome; Developmental protein; DNA-binding; Homeobox; Nucleus; Phosphoprotein; Reference proteome; Repeat; Repressor; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1505 AA 
Protein Sequence
MLCVAGARLK RELDATATVL ANRQDESEQS RKRLIEQSRE FKKNTPEDLR KQVAPLLKSF 60
QGEIDALSKR SKEAEAAFLN VYKRLIDVPD PVPALDLGQQ LQLKVQRLHD IETENQKLRE 120
TLEEYNKEFA EVKNQEVTIK ALKEKIREYE QTLKNQAETI ALEKEQKLQN DFAEKERKLQ 180
ETQMSTTSKL EEAEHKVQSL QTALEKTRTE LFDLKTKYDE ETTAKADEIE MIMTDLERAN 240
QRAEVAQREA ETLREQLSSA NHSLQLASQI QKAPDVEQAI EVLTRSSLEV ELAAKEREIA 300
QLVEDVQRLQ ASLTKLRENS ASQISQLEQQ LSAKNSTLKQ LEEKLKGQAD YEEVKKELNI 360
LKSMEFAPSE GAGTQDAAKP LEVLLLEKNR SLQSENAALR ISNSDLSGSA RRKGKDQPES 420
RRPGSLPAPP PSQLPRNPGE QASNTNGTHQ FSPAGLSQDF FSSSLASPSL PLASTGKFAL 480
NSLLQRQLMQ SFYSKAMQEA GSTSMIFSTG PYSTNSISSQ SPLQQSPDVN GMAPSPSQSE 540
SAGSVSEGEE MDTAEIARQV KEQLIKHNIG QRIFGHYVLG LSQGSVSEIL ARPKPWNKLT 600
VRGKEPFHKM KQFLSDEQNI LALRSIQGRQ RENPGQSLNR LFQEVPKRRN GSEGNITTRI 660
RASETGSDEA IKSILEQAKR ELQVQKTAEP AQPSSASGSG NSDDAIRSIL QQARREMEAQ 720
QAALDPALKQ APLSQSDITI LTPKLLSTSP MPTVSSYPPL AISLKKPSAA PEAGASALPN 780
PPALKKEAQD APGLDPQGAA DCAQGVLRQV KNEVGRSGAW KDHWWSAVQP ERRNAASSEE 840
AKAEETGGGK EKGSGGSGGG SQPRAERSQL QGPSSSEYWK EWPSAESPYS QSSELSLTGA 900
SRSETPQNSP LPSSPIVPMS KPTKPSVPPL TPEQYEVYMY QEVDTIELTR QVKEKLAKNG 960
ICQRIFGEKV LGLSQGSVSD MLSRPKPWSK LTQKGREPFI RMQLWLNGEL GQGVLPVQGQ 1020
QQGPVLHSVT SLQDPLQQGC VSSESTPKTS ASCSPAPESP MSSSESVKSL TELVQQPCPP 1080
IEASKDSKPP EPSDPPASDS QPTTPLPLSG HSALSIQELV AMSPELDTYG ITKRVKEVLT 1140
DNNLGQRLFG ETILGLTQGS VSDLLARPKP WHKLSLKGRE PFVRMQLWLN DPNNVEKLMD 1200
MKRMEKKAYM KRRHSSVSDS QPCEPPSVGT EYSQGASPQP QHQLKKPRVV LAPEEKEALK 1260
RAYQQKPYPS PKTIEDLATQ LNLKTSTVIN WFHNYRSRIR RELFIEEIQA GSQGQAGASD 1320
SPSARSGRAA PSSEGDSCDG VEATEGPGSA DTEEPKSQGE AEREEVPRPA EQTEPPPSGT 1380
PGPDDARDDD HEGGPVEGPG PLPSPASATA TAAPAAPEDA ATSAAAAPGE GPAAPSSAPP 1440
PSNSSSSSAP RRPSSLQSLF GLPEAAGARD SRDNPLRKKK AANLNSIIHR LEKAASREEP 1500
IEWEF 1505 
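The positions in the Lysine Modification table can be checked against the sequence as displayed. The sketch below strips the spacing and running residue counts from the block layout, confirms that a reported position is a lysine (K), and extracts the surrounding peptide. The 7-residue flank is an assumption inferred from the 15-residue peptides shown in the table, not a documented CPLM rule, and `parse_sequence` and `site_window` are hypothetical helper names. Only the first two sequence lines are included to keep the example short.

```python
import re

# First two lines of the Protein Sequence field, copied as displayed.
RAW = """
MLCVAGARLK RELDATATVL ANRQDESEQS RKRLIEQSRE FKKNTPEDLR KQVAPLLKSF 60
QGEIDALSKR SKEAEAAFLN VYKRLIDVPD PVPALDLGQQ LQLKVQRLHD IETENQKLRE 120
"""


def parse_sequence(raw: str) -> str:
    """Remove spaces, newlines and running residue counts; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def site_window(seq: str, position: int, flank: int = 7) -> str:
    """Return the peptide around a 1-based lysine position.

    The flank of 7 is an assumption based on the 15-residue peptides
    listed in the record. Windows are truncated at the sequence ends.
    """
    idx = position - 1  # convert to 0-based index
    if not 0 <= idx < len(seq):
        raise ValueError(f"position {position} is outside the sequence")
    if seq[idx] != "K":
        raise ValueError(f"residue {position} is {seq[idx]}, not K")
    start = max(0, idx - flank)
    return seq[start:idx + flank + 1]


seq = parse_sequence(RAW)
print(len(seq))
print(site_window(seq, 83))
```

With the full 1505-residue sequence in `RAW`, the same call could be repeated for positions 217, 388, 611 and 850.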
Gene Ontology
 GO:0005829; C:cytosol; TAS:Reactome.
 GO:0000139; C:Golgi membrane; IEA:Compara.
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003682; F:chromatin binding; IEA:Compara.
 GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
 GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:InterPro.
 GO:0042491; P:auditory receptor cell differentiation; IEA:Compara.
 GO:0030324; P:lung development; IEA:Compara.
 GO:0007275; P:multicellular organismal development; TAS:ProtInc.
 GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; TAS:ProtInc.
 GO:0000301; P:retrograde transport, vesicle recycling within Golgi; IEA:Compara.
 GO:0006351; P:transcription, DNA-dependent; IEA:UniProtKB-KW. 
Interpro
 IPR003350; Hmoeo_CUT.
 IPR017970; Homeobox_CS.
 IPR001356; Homeodomain.
 IPR009057; Homeodomain-like.
 IPR010982; Lambda_DNA-bd_dom. 
Pfam
 PF02376; CUT
 PF00046; Homeobox 
SMART
 SM00389; HOX 
PROSITE
 PS51042; CUT
 PS00027; HOMEOBOX_1
 PS50071; HOMEOBOX_2 
PRINTS