CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-019386
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protocadherin-16 
Protein Synonyms/Alias
 Cadherin-19; Cadherin-25; Fibroblast cadherin-1; Protein dachsous homolog 1 
Gene Name
 DCHS1 
Gene Synonyms/Alias
 CDH19; CDH25; FIB1; KIAA1773; PCDH16 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
2967LVRARSRKAEAAPGPubiquitination[1]
2988LASDSLQKLGREPPSubiquitination[1]
Reference
 [1] Landscape of the PARKIN-dependent ubiquitylome in response to mitochondrial depolarization.
 Sarraf SA, Raman M, Guarani-Pereira V, Sowa ME, Huttlin EL, Gygi SP, Harper JW.
 Nature. 2013 Apr 18;496(7445):372-6. [PMID: 23503661
Functional Description
 Calcium-dependent cell-adhesion protein (Potential). 
Sequence Annotation
 DOMAIN 43 143 Cadherin 1.
 DOMAIN 144 255 Cadherin 2.
 DOMAIN 256 362 Cadherin 3.
 DOMAIN 367 472 Cadherin 4.
 DOMAIN 474 578 Cadherin 5.
 DOMAIN 579 685 Cadherin 6.
 DOMAIN 686 790 Cadherin 7.
 DOMAIN 791 894 Cadherin 8.
 DOMAIN 895 1000 Cadherin 9.
 DOMAIN 1001 1111 Cadherin 10.
 DOMAIN 1112 1211 Cadherin 11.
 DOMAIN 1218 1324 Cadherin 12.
 DOMAIN 1333 1436 Cadherin 13.
 DOMAIN 1437 1546 Cadherin 14.
 DOMAIN 1547 1649 Cadherin 15.
 DOMAIN 1650 1751 Cadherin 16.
 DOMAIN 1752 1855 Cadherin 17.
 DOMAIN 1856 1960 Cadherin 18.
 DOMAIN 1965 2068 Cadherin 19.
 DOMAIN 2069 2171 Cadherin 20.
 DOMAIN 2172 2277 Cadherin 21.
 DOMAIN 2278 2376 Cadherin 22.
 DOMAIN 2377 2482 Cadherin 23.
 DOMAIN 2483 2602 Cadherin 24.
 DOMAIN 2603 2706 Cadherin 25.
 DOMAIN 2707 2813 Cadherin 26.
 DOMAIN 2814 2933 Cadherin 27.
 CARBOHYD 217 217 N-linked (GlcNAc...) (Potential).
 CARBOHYD 256 256 N-linked (GlcNAc...) (Potential).
 CARBOHYD 402 402 N-linked (GlcNAc...) (Potential).
 CARBOHYD 584 584 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1249 1249 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1521 1521 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1718 1718 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1996 1996 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2361 2361 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2428 2428 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2569 2569 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2761 2761 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2792 2792 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2862 2862 N-linked (GlcNAc...) (Potential).  
Keyword
 Calcium; Cell adhesion; Cell membrane; Complete proteome; Glycoprotein; Membrane; Polymorphism; Reference proteome; Repeat; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3298 AA 
Protein Sequence
MQKELGIVPS CPGMKSPRPH LLLPLLLLLL LLLGAGVPGA WGQAGSLDLQ IDEEQPAGTL 60
IGDISAGLPA GTAAPLMYFI SAQEGSGVGT DLAIDEHSGV VRTARVLDRE QRDRYRFTAV 120
TPDGATVEVT VRVADINDHA PAFPQARAAL QVPEHTAFGT RYPLEPARDA DAGRLGTQGY 180
ALSGDGAGET FRLETRPGPD GTPVPELVVT GELDRENRSH YMLQLEAYDG GSPPRRAQAL 240
LDVTLLDIND HAPAFNQSRY HAVVSESLAP GSPVLQVFAS DADAGVNGAV TYEINRRQSE 300
GDGPFSIDAH TGLLQLERPL DFEQRRVHEL VVQARDGGAH PELGSAFVTV HVRDANDNQP 360
SMTVIFLSAD GSPQVSEAAP PGQLVARISV SDPDDGDFAH VNVSLEGGEG HFALSTQDSV 420
IYLVCVARRL DREERDAYNL RVTATDSGSP PLRAEAAFVL HVTDVNDNAP AFDRQLYRPE 480
PLPEVALPGS FVVRVTARDP DQGTNGQVTY SLAPGAHTHW FSIDPTSGII TTAASLDYEL 540
EPQPQLIVVA TDGGLPPLAS SATVSVALQD VNDNEPQFQR TFYNASLPEG TQPGTCFLQV 600
TATDADSGPF GLLSYSLGAG LGSSGSPPFR IDAHSGDVCT TRTLDRDQGP SSFDFTVTAV 660
DGGGLKSMVY VKVFLSDEND NPPQFYPREY AASISAQSPP GTAVLRLRAH DPDQGSHGRL 720
SYHILAGNSP PLFTLDEQSG LLTVAWPLAR RANSVVQLEI GAEDGGGLQA EPSARVDISI 780
VPGTPTPPIF EQLQYVFSVP EDVAPGTSVG IVQAHNPPGR LAPVTLSLSG GDPRGLFSLD 840
AVSGLLQTLR PLDRELLGPV LELEVRAGSG VPPAFAVARV RVLLDDVNDN SPAFPAPEDT 900
VLLPPNTAPG TPIYTLRALD PDSGVNSRVT FTLLAGGGGA FTVDPTTGHV RLMRPLGPSG 960
GPAHELELEA RDGGSPPRTS HFRLRVVVQD VGTRGLAPRF NSPTYRVDLP SGTTAGTQVL 1020
QVQAQAPDGG PITYHLAAEG ASSPFGLEPQ SGWLWVRAAL DREAQELYIL KVMAVSGSKA 1080
ELGQQTGTAT VRVSILNQNE HSPRLSEDPT FLAVAENQPP GTSVGRVFAT DRDSGPNGRL 1140
TYSLQQLSED SKAFRIHPQT GEVTTLQTLD REQQSSYQLL VQVQDGGSPP RSTTGTVHVA 1200
VLDLNDNSPT FLQASGAAGG GLPIQVPDRV PPGTLVTTLQ AKDPDEGENG TILYTLTGPG 1260
SELFSLHPHS GELLTAAPLI RAERPHYVLT LSAHDQGSPP RSASLQLLVQ VLPSARLAEP 1320
PPDLAERDPA APVPVVLTVT AAEGLRPGSL LGSVAAPEPA GVGALTYTLV GGADPEGTFA 1380
LDAASGRLYL ARPLDFEAGP PWRALTVRAE GPGGAGARLL RVQVQVQDEN EHAPAFARDP 1440
LALALPENPE PGAALYTFRA SDADGPGPNS DVRYRLLRQE PPVPALRLDA RTGALSAPRG 1500
LDRETTPALL LLVEATDRPA NASRRRAARV SARVFVTDEN DNAPVFASPS RVRLPEDQPP 1560
GPAALHVVAR DPDLGEAARV SYRLASGGDG HFRLHSSTGA LSVVRPLDRE QRAEHVLTVV 1620
ASDHGSPPRS ATQVLTVSVA DVNDEAPTFQ QQEYSVLLRE NNPPGTSLLT LRATDPDVGA 1680
NGQVTYGGVS SESFSLDPDT GVLTTLRALD REEQEEINLT VYAQDRGSPP QLTHVTVRVA 1740
VEDENDHAPT FGSAHLSLEV PEGQDPQTLT MLRASDPDVG ANGQLQYRIL DGDPSGAFVL 1800
DLASGEFGTM RPLDREVEPA FQLRIEARDG GQPALSATLL LTVTVLDAND HAPAFPVPAY 1860
SVEVPEDVPA GTLLLQLQAH DPDAGANGHV TYYLGAGTAG AFLLEPSSGE LRTAAALDRE 1920
QCPSYTFSVS AVDGAAAGPL STTVSVTITV RDVNDHAPTF PTSPLRLRLP RPGPSFSTPT 1980
LALATLRAED RDAGANASIL YRLAGTPPPG TTVDSYTGEI RVARSPVALG PRDRVLFIVA 2040
TDLGRPARSA TGVIIVGLQG EAERGPRFPR ASSEATIREN APPGTPIVSP RAVHAGGTNG 2100
PITYSILSGN EKGTFSIQPS TGAITVRSAE GLDFEVSPRL RLVLQAESGG AFAFTVLTLT 2160
LQDANDNAPR FLRPHYVAFL PESRPLEGPL LQVEADDLDQ GSGGQISYSL AASQPARGLF 2220
HVDPTTGTIT TTAILDREIW AETRLVLMAT DRGSPALVGS ATLTVMVIDT NDNRPTIPQP 2280
WELRVSEDAL LGSEIAQVTG NDVDSGPVLW YVLSPSGPQD PFSVGRYGGR VSLTGPLDFE 2340
QCDRYQLQLL AHDGPHEGRA NLTVLVEDVN DNAPAFSQSL YQVMLLEHTP PGSAILSVSA 2400
TDRDSGANGH ISYHLASPAD GFSVDPNNGT LFTIVGTVAL GHDGSGAVDV VLEARDHGAP 2460
GRAARATVHV QLQDQNDHAP SFTLSHYRVA VTEDLPPGST LLTLEATDAD GSRSHAAVDY 2520
SIISGNWGRV FQLEPRLAEA GESAGPGPRA LGCLVLLEPL DFESLTQYNL TVAAADRGQP 2580
PQSSVVPVTV TVLDVNDNPP VFTRASYRVT VPEDTPVGAE LLHVEASDAD PGPHGLVRFT 2640
VSSGDPSGLF ELDESSGTLR LAHALDCETQ ARHQLVVQAA DPAGAHFALA PVTIEVQDVN 2700
DHGPAFPLNL LSTSVAENQP PGTLVTTLHA IDGDAGAFGR LRYSLLEAGP GPEGREAFAL 2760
NSSTGELRAR VPFDYEHTES FRLLVGAADA GNLSASVTVS VLVTGEDEYD PVFLAPAFHF 2820
QVPEGARRGH SLGHVQATDE DGGADGLVLY SLATSSPYFG INQTTGALYL RVDSRAPGSG 2880
TATSGGGGRT RREAPRELRL EVIARGPLPG SRSATVPVTV DITHTALGLA PDLNLLLVGA 2940
VAASLGVVVV LALAALVLGL VRARSRKAEA APGPMSQAAP LASDSLQKLG REPPSPPPSE 3000
HLYHQTLPSY GGPGAGGPYP RGGSLDPSHS SGRGSAEAAE DDEIRMINEF PRVASVASSL 3060
AARGPDSGIQ QDADGLSDTS CEPPAPDTWY KGRKAGLLLP GAGATLYREE GPPATATAFL 3120
GGCGLSPAPT GDYGFPADGK PCVAGALTAI VAGEEELRGS YNWDYLLSWC PQFQPLASVF 3180
TEIARLKDEA RPCPPAPRID PPPLITAVAH PGAKSVPPKP ANTAAARAIF PPASHRSPIS 3240
HEGSLSSAAM SPSFSPSLSP LAARSPVVSP FGVAQGPSAS ALSAESGLEP PDDTELHI 3298 
Gene Ontology
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0016020; C:membrane; NAS:UniProtKB.
 GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
 GO:0005509; F:calcium ion binding; IEA:InterPro.
 GO:0016339; P:calcium-dependent cell-cell adhesion; NAS:UniProtKB.
 GO:0007156; P:homophilic cell adhesion; IEA:InterPro. 
Interpro
 IPR002126; Cadherin.
 IPR015919; Cadherin-like.
 IPR020894; Cadherin_CS. 
Pfam
 PF00028; Cadherin 
SMART
 SM00112; CA 
PROSITE
 PS00232; CADHERIN_1
 PS50268; CADHERIN_2 
PRINTS
 PR00205; CADHERIN.