CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-022571
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Cingulin 
Protein Synonyms/Alias
  
Gene Name
 CGN 
Gene Synonyms/Alias
 KIAA1319 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
461       PAKEVLLKDLLETRE  ubiquitination  [1, 2]
573       QSMFQKNKEDLRATK  acetylation     [3]
580       KEDLRATKQELLQLR  ubiquitination  [1, 2]
658       HRDRELEKQLAVLRV  ubiquitination  [1, 2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
 [3] Lysine acetylation targets protein complexes and co-regulates major cellular functions.
 Choudhary C, Kumar C, Gnad F, Nielsen ML, Rehman M, Walther TC, Olsen JV, Mann M.
 Science. 2009 Aug 14;325(5942):834-40. [PMID: 19608861]
Functional Description
 Probably plays a role in the formation and regulation of the tight junction (TJ) paracellular permeability barrier. 
Sequence Annotation
 REGION 1 351 Head.
 REGION 106 400 Interacts with ZO-2.
 REGION 1155 1197 Tail.
 MOTIF 42 56 ZIM.
 MOD_RES 111 111 Phosphoserine (By similarity).
 MOD_RES 112 112 Phosphoserine (By similarity).
 MOD_RES 129 129 Phosphoserine (By similarity).
 MOD_RES 131 131 Phosphoserine.
 MOD_RES 134 134 Phosphoserine.
 MOD_RES 149 149 Phosphoserine.
 MOD_RES 159 159 Phosphoserine.
 MOD_RES 208 208 Phosphoserine.
 MOD_RES 211 211 Phosphoserine.
 MOD_RES 252 252 Phosphoserine.
 MOD_RES 573 573 N6-acetyllysine.
 MOD_RES 706 706 Phosphothreonine.
 MOD_RES 1176 1176 Phosphoserine.  
Keyword
 Acetylation; Alternative splicing; Cell junction; Coiled coil; Complete proteome; Direct protein sequencing; Phosphoprotein; Polymorphism; Reference proteome; Tight junction. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1197 AA 
Protein Sequence
MAEPRGPVDH GVQIRFITEP VSGAEMGTLR RGGRRPAKDA RASTYGVAVR VQGIAGQPFV 60
VLNSGEKGGD SFGVQIKGAN DQGASGALSS DLELPENPYS QVKGFPAPSQ SSTSDEEPGA 120
YWNGKLLRSH SQASLAGPGP VDPSNRSNSM LELAPKVASP GSTIDTAPLS SVDSLINKFD 180
SQLGGQARGR TGRRTRMLPP EQRKRSKSLD SRLPRDTFEE RERQSTNHWT SSTKYDNHVG 240
TSKQPAQSQN LSPLSGFSRS RQTQDWVLQS FEEPRRSAQD PTMLQFKSTP DLLRDQQEAA 300
PPGSVDHMKA TIYGILREGS SESETSVRRK VSLVLEKMQP LVMVSSGSTK AVAGQGELTR 360
KVEELQRKLD EEVKKRQKLE PSQVGLERQL EEKTEECSRL QELLERRKGE AQQSNKELQN 420
MKRLLDQGED LRHGLETQVM ELQNKLKHVQ GPEPAKEVLL KDLLETRELL EEVLEGKQRV 480
EEQLRLRERE LTALKGALKE EVASRDQEVE HVRQQYQRDT EQLRRSMQDA TQDHAVLEAE 540
RQKMSALVRG LQRELEETSE ETGHWQSMFQ KNKEDLRATK QELLQLRMEK EEMEEELGEK 600
IEVLQRELEQ ARASAGDTRQ VEVLKKELLR TQEELKELQA ERQSQEVAGR HRDRELEKQL 660
AVLRVEADRG RELEEQNLQL QKTLQQLRQD CEEASKAKMV AEAEATVLGQ RRAAVETTLR 720
ETQEENDEFR RRILGLEQQL KETRGLVDGG EAVEARLRDK LQRLEAEKQQ LEEALNASQE 780
EEGSLAAAKR ALEARLEEAQ RGLARLGQEQ QTLNRALEEE GKQREVLRRG KAELEEQKRL 840
LDRTVDRLNK ELEKIGEDSK QALQQLQAQL EDYKEKARRE VADAQRQAKD WASEAEKTSG 900
GLSRLQDEIQ RLRQALQASQ AERDTARLDK ELLAQRLQGL EQEAENKKRS QDDRARQLKG 960
LEEKVSRLET ELDEEKNTVE LLTDRVNRGR DQVDQLRTEL MQERSARQDL ECDKISLERQ 1020
NKDLKTRLAS SEGFQKPSAS LSQLESQNQL LQERLQAEER EKTVLQSTNR KLERKVKELS 1080
IQIEDERQHV NDQKDQLSLR VKALKRQVDE AEEEIERLDG LRKKAQREVE EQHEVNEQLQ 1140
ARIKSLEKDS WRKASRSAAE SALKNEGLSS DEEFDSVYDP SSIASLLTES NLQTSSC 1197 
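The peptides listed under Lysine Modification look like 15-residue windows centred on the modified lysine (7 residues on each side). This is inferred from the four rows of this record and is not a documented CPLM rule. The Python sketch below shows one way to check that reading against the numbered sequence block. The function names `parse_sequence_block` and `peptide_window` are hypothetical helpers, not part of any CPLM or UniProt tool, and only residues 541-600 are copied in to keep the example short.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Strip spaces, line breaks and running residue counts from a numbered block."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def peptide_window(seq: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position, clipped at the termini."""
    if not 1 <= position <= len(seq):
        raise ValueError(f"position {position} is outside 1..{len(seq)}")
    start = max(0, position - 1 - flank)
    end = min(len(seq), position + flank)
    return seq[start:end]


# Residues 541-600, copied from the Protein Sequence block of this record.
FRAGMENT_START = 541
fragment = parse_sequence_block(
    "RQKMSALVRG LQRELEETSE ETGHWQSMFQ KNKEDLRATK QELLQLRMEK EEMEEELGEK 600"
)


def window_in_fragment(position: int) -> str:
    """Map a full-length protein position onto the fragment, then take the window."""
    return peptide_window(fragment, position - FRAGMENT_START + 1)


for site in (573, 580):
    residue = fragment[site - FRAGMENT_START]
    print(site, residue, window_in_fragment(site))
```

Under these assumptions, both sites in the fragment resolve to a lysine and reproduce the peptides given for positions 573 and 580. With the full 1197-residue sequence, `peptide_window` can be called directly with the reported position.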
Gene Ontology
 GO:0016459; C:myosin complex; IEA:InterPro.
 GO:0005923; C:tight junction; NAS:UniProtKB.
 GO:0003779; F:actin binding; ISS:UniProtKB.
 GO:0003774; F:motor activity; IEA:InterPro.
 GO:0007179; P:transforming growth factor beta receptor signaling pathway; TAS:Reactome. 
Interpro
 IPR002928; Myosin_tail. 
Pfam
 PF01576; Myosin_tail_1 
SMART
  
PROSITE
  
PRINTS