CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-041289
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Tight junction protein 1 (Zona occludens 1), isoform CRA_a 
Protein Synonyms/Alias
 Tight junction protein ZO-1 
Gene Name
 TJP1 
Gene Synonyms/Alias
 hCG_27621 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
183VASSQPAKPTKVTLVubiquitination[1]
348RDEERISKPGAVSTPubiquitination[2]
553VDTLYNGKLGSWLAIubiquitination[3]
592SVQYTLPKTAGGDRAubiquitination[1]
695IIRLHTIKQIIDQDKubiquitination[3]
702KQIIDQDKHALLDVTubiquitination[3]
1184PHPSAGPKPAESKQYubiquitination[4]
1189GPKPAESKQYFEQYSubiquitination[3, 4]
1357EDEEYYRKQLSYFDRubiquitination[2]
1601SIHAEKPKYQINNISubiquitination[3]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Systems-wide analysis of ubiquitylation dynamics reveals a key role for PAF15 ubiquitylation in DNA-damage bypass.
 Povlsen LK, Beli P, Wagner SA, Poulsen SL, Sylvestersen KB, Poulsen JW, Nielsen ML, Bekker-Jensen S, Mailand N, Choudhary C.
 Nat Cell Biol. 2012 Oct;14(10):1089-98. [PMID: 23000965]
 [3] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [4] Global identification of modular cullin-RING ligase substrates.
 Emanuele MJ, Elia AE, Xu Q, Thoma CR, Izhar L, Leng Y, Guo A, Chen YN, Rush J, Hsu PW, Yen HC, Elledge SJ.
 Cell. 2011 Oct 14;147(2):459-74. [PMID: 21963094
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome; SH3 domain. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1768 AA 
Protein Sequence
MSARAAAAKS TAMEETAIWE QHTVTLHRAP GFGFGIAISG GRDNPHFQSG ETSIVISDVL 60
KGGPAEGQLQ ENDRVAMVNG VSMDNVEHAF AVQQLRKSGK NAKITIRRKK KVQIPVSRPD 120
PEPVSDNEED SYDEEIHDPR SGRSGVVNRR SEKIWPRDRS ASRERSLSPR SDRRSVASSQ 180
PAKPTKVTLV KSRKNEEYGL RLASHIFVKE ISQDSLAARD GNIQEGDVVL KINGTVTENM 240
SLTDAKTLIE RSKGKLKMVV QRDERATLLN VPDLSDSIHS ANASERDDIS EIQSLASDHS 300
GRSHDRPPRR SRSRSPDQRS EPSDHSRHSP QQPSNGSLRS RDEERISKPG AVSTPVKHAD 360
DHTPKTVEEV TVERNEKQTP SLPEPKPVYA QVGQPDVDLP VSPSDGVLPN STHEDGILRP 420
SMKLVKFRKG DSVGLRLAGG NDVGIFVAGV LEDSPAAKEG LEEGDQILRV NNVDFTNIIR 480
EEAVLFLLDL PKGEEVTILA QKKKDVYRRI VESDVGDSFY IRTHFEYEKE SPYGLSFNKG 540
EVFRVVDTLY NGKLGSWLAI RIGKNHKEVE RGIIPNKNRA EQLASVQYTL PKTAGGDRAD 600
FWRFRGLRSS KRNLRKSRED LSAQPVQTKF PAYERVVLRE AGFLRPVTIF GPIADVAREK 660
LAREEPDIYQ IAKSEPRDAG TDQRSSGIIR LHTIKQIIDQ DKHALLDVTP NAVDRLNYAQ 720
WYPIVVFLNP DSKQGVKTMR MRLCPESRKS ARKLYERSHK LRKNNHHLFT TTINLNSMND 780
GWYGALKEAI QQQQNQLVWV SEGKADGATS DDLDLHDDRL SYLSAPGSEY SMYSTDSRHT 840
SDYEDTDTEG GAYTDQELDE TLNDEVGTPP ESAITRSSEP VREDSSGMHH ENQTYPPYSP 900
QAQPQPIHRI DSPGFKPASQ QKAEASSPVP YLSPETNPAS STSAVNHNVN LTNVRLEEPT 960
PAPSTSYSPQ ADSLRTPSTE AAHIMLRDQE PSLSSHVDPT KVYRKDPYPE EMMRQNHVLK 1020
QPAVSHPGHR PDKEPNLTYE PQLPYVEKQA SRDLEQPTYR YESSSYTDQF SRNYEHRLRY 1080
EDRVPMYEEQ WSYYDDKQPY PSRPPFDNQH SQDLDSRQHP EESSERGYFP RFEEPAPLSY 1140
DSRPRYEQAP RASALRHEEQ PAPGYDTHGR LRPEAQPHPS AGPKPAESKQ YFEQYSRSYE 1200
QVPPQGFTSR AGHFEPLHGA AAVPPLIPSS QHKPEALPSN TKPLPPPPTQ TEEEEDPAMK 1260
PQSVLTRVKM FENKRSASLE TKKDVNDTGS FKPPEVASKP SGAPIIGPKP TSQNQFSEHD 1320
KTLYRIPEPQ KPQLKPPEDI VRSNHYDPEE DEEYYRKQLS YFDRRSFENK PPAHIAASHL 1380
SEPAKPAHSQ NQSNFSSYSS KGKPPEADGV DRSFGEKRYE PIQATPPPPP LPSQYAQPSQ 1440
PVTSASLHIH SKGAHGEGNS VSLDFQNSLV SKPDPPPSQN KPATFRPPNR EDTAQAAFYP 1500
QKSFPDKAPV NGTEQTQKTV TPAYNRFTPK PYTSSARPFE RKFESPKFNH NLLPSETAHK 1560
PDLSSKTPTS PKTLVKSHSL AQPPEFDSGV ETFSIHAEKP KYQINNISTV PKAIPVSPSA 1620
VEEDEDEDGH TVVATARGIF NSNGGVLSSI ETGVSIIIPQ GAIPEGVEQE IYFKVCRDNS 1680
ILPPLDKEKG ETLLSPLVMC GPHGLKFLKP VELRLPHCAS MTPDGWSFAL KSSDSSSGDP 1740
KTWQNKCLPG DPNYLVGANC VSVLIDHF 1768 
Gene Ontology
 GO:0030054; C:cell junction; IDA:HPA.
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005886; C:plasma membrane; IDA:HPA.
 GO:0005923; C:tight junction; IEA:InterPro. 
Interpro
 IPR008144; Guanylate_kin.
 IPR008145; Guanylate_kin/L-typ_Ca_channel.
 IPR027417; P-loop_NTPase.
 IPR001478; PDZ.
 IPR011511; SH3_2.
 IPR001452; SH3_domain.
 IPR005417; ZonOcculdens.
 IPR005418; ZonOcculS1.
 IPR000906; ZU5. 
Pfam
 PF00625; Guanylate_kin
 PF00595; PDZ
 PF07653; SH3_2
 PF00791; ZU5 
SMART
 SM00072; GuKc
 SM00228; PDZ
 SM00326; SH3
 SM00218; ZU5 
PROSITE
 PS50052; GUANYLATE_KINASE_2
 PS50106; PDZ
 PS50002; SH3
 PS51145; ZU5 
PRINTS
 PR01597; ZONOCCLUDNS.
 PR01598; ZONOCCLUDNS1.