CPLM 1.0 - Compendium of Protein Lysine Modification
Tag	Content
CPLM ID CPLM-023271
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein bassoon 
Protein Synonyms/Alias
 Zinc finger protein 231 
Gene Name
 BSN 
Gene Synonyms/Alias
 KIAA0434; ZNF231 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position	Peptide	Type	References
398	ASKEAGPKPLGSGPG	ubiquitination	[1]
2554	EASRSGIKKRHSMPR	acetylation	[2]
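The peptides listed for each site appear to be 15-residue windows centered on the modified lysine. The sketch below checks that reading for the two sites in this record and derives the sequence coordinates each window would span. The function names `central_residue` and `window_bounds` are hypothetical helpers introduced for illustration; they are not part of CPLM.

```python
# Minimal sketch, assuming each peptide is a window centered on the modified lysine.
# central_residue and window_bounds are illustrative helpers, not a CPLM API.

def central_residue(peptide):
    """Return the middle residue of an odd-length peptide window."""
    return peptide[len(peptide) // 2]

def window_bounds(position, peptide):
    """Return the 1-based (start, end) coordinates of a window centered on `position`."""
    half = len(peptide) // 2
    return position - half, position + half

# Sites copied from the Lysine Modification table of this record.
sites = [
    (398, "ASKEAGPKPLGSGPG", "ubiquitination"),
    (2554, "EASRSGIKKRHSMPR", "acetylation"),
]

for position, peptide, mod_type in sites:
    start, end = window_bounds(position, peptide)
    print(position, central_residue(peptide), start, end, mod_type)
# 398 K 391 405 ubiquitination
# 2554 K 2547 2561 acetylation
```

Both windows have K at their center, and the derived coordinates match the Protein Sequence field of this record (residues 391-405 and 2547-2561).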
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Regulation of cellular metabolism by protein lysine acetylation.
 Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, Guan KL.
 Science. 2010 Feb 19;327(5968):1000-4. [PMID: 20167786]
Functional Description
 Is thought to be involved in the organization of the cytomatrix at the nerve terminal active zone (CAZ), which regulates neurotransmitter release. Seems to act through binding to ERC2/CAST1. Essential in regulated neurotransmitter release from a subset of brain glutamatergic synapses. Involved in the formation of the retinal photoreceptor ribbon synapses (By similarity). 
Sequence Annotation
 REPEAT 571 577 1.
 REPEAT 578 584 2.
 REPEAT 585 591 3.
 ZN_FING 170 193 C4-type (Potential).
 ZN_FING 198 220 C4-type (Potential).
 ZN_FING 465 488 C4-type (Potential).
 ZN_FING 493 515 C4-type (Potential).
 REGION 23 32 5 X 2 AA tandem repeats of P-G.
 REGION 61 74 7 X 2 AA tandem repeats of P-G.
 REGION 571 591 3 X 7 AA tandem repeats of K-A-S-P-[LQ]-
 MOD_RES 6 6 Phosphoserine (By similarity).
 MOD_RES 108 108 Phosphoserine (By similarity).
 MOD_RES 145 145 Phosphoserine (By similarity).
 MOD_RES 970 970 Phosphoserine (By similarity).
 MOD_RES 1009 1009 Phosphoserine (By similarity).
 MOD_RES 1092 1092 Phosphothreonine (By similarity).
 MOD_RES 1095 1095 Phosphoserine (By similarity).
 MOD_RES 1098 1098 Phosphoserine (By similarity).
 MOD_RES 1104 1104 Phosphoserine (By similarity).
 MOD_RES 1106 1106 Phosphothreonine (By similarity).
 MOD_RES 1126 1126 Phosphotyrosine (By similarity).
 MOD_RES 1226 1226 Phosphoserine (By similarity).
 MOD_RES 1476 1476 Phosphoserine (By similarity).
 MOD_RES 1477 1477 Phosphoserine (By similarity).
 MOD_RES 1481 1481 Phosphoserine (By similarity).
 MOD_RES 1488 1488 Phosphoserine (By similarity).
 MOD_RES 1490 1490 Phosphothreonine (By similarity).
 MOD_RES 1492 1492 Phosphoserine (By similarity).
 MOD_RES 1505 1505 Phosphoserine (By similarity).
 MOD_RES 1550 1550 Phosphoserine (By similarity).
 MOD_RES 1551 1551 Phosphoserine (By similarity).
 MOD_RES 2024 2024 Phosphoserine (By similarity).
 MOD_RES 2039 2039 Phosphotyrosine (By similarity).
 MOD_RES 2068 2068 Phosphotyrosine (By similarity).
 MOD_RES 2117 2117 Phosphoserine (By similarity).
 MOD_RES 2570 2570 Phosphoserine (By similarity).
 MOD_RES 2587 2587 Phosphothreonine (By similarity).
 MOD_RES 2614 2614 Phosphothreonine (By similarity).
 MOD_RES 2685 2685 Phosphoserine (By similarity).
 MOD_RES 2694 2694 Phosphothreonine (By similarity).
 MOD_RES 2799 2799 Phosphoserine (By similarity).
 MOD_RES 2802 2802 Phosphoserine (By similarity).
 MOD_RES 2813 2813 Phosphoserine (By similarity).
 MOD_RES 2849 2849 Phosphoserine (By similarity).
 MOD_RES 2851 2851 Phosphoserine (By similarity).
 MOD_RES 2857 2857 Phosphoserine (By similarity).
 MOD_RES 2899 2899 Phosphoserine (By similarity).
 MOD_RES 3013 3013 Phosphoserine (By similarity).
 MOD_RES 3291 3291 Phosphoserine (By similarity).
 MOD_RES 3373 3373 Phosphoserine (By similarity).
 MOD_RES 3422 3422 Phosphotyrosine (By similarity).
 MOD_RES 3423 3423 Phosphotyrosine (By similarity).
 MOD_RES 3449 3449 Phosphotyrosine (By similarity).
 LIPID 2 2 N-myristoyl glycine (By similarity).
 CARBOHYD 1343 1343 O-linked (GlcNAc) (By similarity).
 CARBOHYD 1384 1384 O-linked (GlcNAc) (By similarity).
 CARBOHYD 2314 2314 O-linked (GlcNAc) (By similarity).
 CARBOHYD 2691 2691 O-linked (GlcNAc) (By similarity).
 CARBOHYD 2936 2936 O-linked (GlcNAc) (By similarity).  
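Each Sequence Annotation line above follows the pattern "feature key, start, end, description". The sketch below shows one way such lines could be split into structured records. The `Feature` type and `parse_feature` function are hypothetical names chosen for illustration, and the parser assumes the key never contains whitespace.

```python
from typing import NamedTuple

class Feature(NamedTuple):
    """One annotation line: feature key, 1-based start and end, free-text note."""
    key: str
    start: int
    end: int
    note: str

def parse_feature(line):
    """Split a 'KEY start end description' line into a Feature (illustrative helper)."""
    parts = line.split(None, 3)  # at most 4 fields; the note keeps its inner spaces
    note = parts[3].strip() if len(parts) > 3 else ""
    return Feature(parts[0], int(parts[1]), int(parts[2]), note)

# Lines copied from this record.
lines = [
    " ZN_FING 170 193 C4-type (Potential).",
    " REGION 23 32 5 X 2 AA tandem repeats of P-G.",
    " MOD_RES 1092 1092 Phosphothreonine (By similarity).",
]

for feature in map(parse_feature, lines):
    length = feature.end - feature.start + 1
    print(feature.key, feature.start, feature.end, length, feature.note)
```

Limiting the split to three separators keeps descriptions that begin with digits, such as "5 X 2 AA tandem repeats of P-G.", in the note field rather than in the coordinates.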
Keyword
 Cell junction; Coiled coil; Complete proteome; Cytoplasm; Cytoskeleton; Glycoprotein; Lipoprotein; Metal-binding; Myristate; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Synapse; Synaptosome; Zinc; Zinc-finger. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 3926 AA 
Protein Sequence
MGNEVSLEGG AGDGPLPPGG AGPGPGPGPG PGAGKPPSAP AGGGQLPAAG AARSTAVPPV 60
PGPGPGPGPG PGPGSTSRRL DPKEPLGNQR AASPTPKQAS ATTPGHESPR ETRAQGPAGQ 120
EADGPRRTLQ VDSRTQRSGR SPSVSPDRGS TPTSPYSVPQ IAPLPSSTLC PICKTSDLTS 180
TPSQPNFNTC TQCHNKVCNQ CGFNPNPHLT QVKEWLCLNC QMQRALGMDM TTAPRSKSQQ 240
QLHSPALSPA HSPAKQPLGK PDQERSRGPG GPQPGSRQAE TARATSVPGP AQAAAPPEVG 300
RVSPQPPQPT KPSTAEPRPP AGEAPAKSAT AVPAGLGATE QTQEGLTGKL FGLGASLLTQ 360
ASTLMSVQPE ADTQGQPAPS KGTPKIVFND ASKEAGPKPL GSGPGPGPAP GAKTEPGARM 420
GPGSGPGALP KTGGTTSPKH GRAEHQAASK AAAKPKTMPK ERAICPLCQA ELNVGSKSPA 480
NYNTCTTCRL QVCNLCGFNP TPHLVEKTEW LCLNCQTKRL LEGSLGEPTP LPPPTSQQPP 540
VGAPHRASGT SPLKQKGPQG LGQPSGPLPA KASPLSTKAS PLPSKASPQA KPLRASEPSK 600
TPSSVQEKKT RVPTKAEPMP KPPPETTPTP ATPKVKSGVR RAEPATPVVK AVPEAPKGGE 660
AEDLVGKPYS QDASRSPQSL SDTGYSSDGI SSSQSEITGV VQQEVEQLDS AGVTGPHPPS 720
PSEIHKVGSS MRPLLQAQGL APSERSKPLS SGTGEEQKQR PHSLSITPEA FDSDEELEDI 780
LEEDEDSAEW RRRREQQDTA ESSDDFGSQL RHDYVEDSSE GGLSPLPPQP PARAAELTDE 840
DFMRRQILEM SAEEDNLEED DTATSGRGLA KHGTQKGGPR PRPEPSQEPA ALPKRRLPHN 900
ATTGYEELLP EGGSAEATDG SGTLQGGLRR FKTIELNSTG SYGHELDLGQ GPDPSLDREP 960
ELEMESLTGS PEDRSRGEHS STLPASTPSY TSGTSPTSLS SLEEDSDSSP SRRQRLEEAK 1020
QQRKARHRSH GPLLPTIEDS SEEEELREEE ELLREQEKMR EVEQQRIRST ARKTRRDKEE 1080
LRAQRRRERS KTPPSNLSPI EDASPTEELR QAAEMEELHR SSCSEYSPSP SLDSEAEALD 1140
GGPSRLYKSG SEYNLPTFMS LYSPTETPSG SSTTPSSGRP LKSAEEAYEE MMRKAELLQR 1200
QQGQAAGARG PHGGPSQPTG PRGLGSFEYQ DTTDREYGQA AQPAAEGTPA SLGAAVYEEI 1260
LQTSQSIVRM RQASSRDLAF AEDKKKEKQF LNAESAYMDP MKQNGGPLTP GTSPTQLAAP 1320
VSFSTPTSSD SSGGRVIPDV RVTQHFAKET QDPLKLHSSP ASPSSASKEI GMPFSQGPGT 1380
PATTAVAPCP AGLPRGYMTP ASPAGSERSP SPSSTAHSYG HSPTTANYGS QTEDLPQAPS 1440
GLAAAGRAAR EKPLSASDGE GGTPQPSRAY SYFASSSPPL SPSSPSESPT FSPGKMGPRA 1500
TAEFSTQTPS PAPASDMPRS PGAPTPSPMV AQGTQTPHRP STPRLVWQES SQEAPFMVIT 1560
LASDASSQTR MVHASASTSP LCSPTETQPT THGYSQTTPP SVSQLPPEPP GPPGFPRVPS 1620
AGADGPLALY GWGALPAENI SLCRISSVPG TSRVEPGPRT PGTAVVDLRT AVKPTPIILT 1680
DQGMDLTSLA VEARKYGLAL DPIPGRQSTA VQPLVINLNA QEHTFLATAT TVSITMASSV 1740
FMAQQKQPVV YGDPYQSRLD FGQGGGSPVC LAQVKQVEQA VQTAPYRSGP RGRPREAKFA 1800
RYNLPNQVAP LARRDVLITQ MGTAQSIGLK PGPVPEPGAE PHRATPAELR SHALPGARKP 1860
HTVVVQMGEG TAGTVTTLLP EEPAGALDLT GMRPESQLAC CDMVYKLPFG SSCTGTFHPA 1920
PSVPEKSMAD AAPPGQSSSP FYGPRDPEPP EPPTYRAQGV VGPGPHEEQR PYPQGLPGRL 1980
YSSMSDTNLA EAGLNYHAQR IGQLFQGPGR DSAMDLSSLK HSYSLGFADG RYLGQGLQYG 2040
SVTDLRHPTD LLAHPLPMRR YSSVSNIYSD HRYGPRGDAV GFQEASLAQY SATTAREISR 2100
MCAALNSMDQ YGGRHGSGGG GPDLVQYQPQ HGPGLSAPQS LVPLRPGLLG NPTFPEGHPS 2160
PGNLAQYGPA AGQGTAVRQL LPSTATVRAA DGMIYSTINT PIAATLPITT QPASVLRPMV 2220
RGGMYRPYAS GGITAVPLTS LTRVPMIAPR VPLGPTGLYR YPAPSRFPIA SSVPPAEGPV 2280
YLGKPAAAKA PGAGGPSRPE MPVGAAREEP LPTTTPAAIK EAAGAPAPAP LAGQKPPADA 2340
APGGGSGALS RPGFEKEEAS QEERQRKQQE QLLQLERERV ELEKLRQLRL QEELERERVE 2400
LQRHREEEQL LVQRELQELQ TIKHHVLQQQ QEERQAQFAL QREQLAQQRL QLEQIQQLQQ 2460
QLQQQLEEQK QRQKAPFPAA CEAPGRGPPL AAAELAQNGQ YWPPLTHAAF IAMAGPEGLG 2520
QPREPVLHRG LPSSASDMSL QTEEQWEASR SGIKKRHSMP RLRDACELES GTEPCVVRRI 2580
ADSSVQTDDE DGESRYLLSR RRRARRSADC SVQTDDEDSA EWEQPVRRRR SRLPRHSDSG 2640
SDSKHDATAS SSSAAATVRA MSSVGIQTIS DCSVQTEPDQ LPRVSPAIHI TAATDPKVEI 2700
VRYISAPEKT GRGESLACQT EPDGQAQGVA GPQLVGPTAI SPYLPGIQIV TPGPLGRFEK 2760
KKPDPLEIGY QAHLPPESLS QLVSRQPPKS PQVLYSPVSP LSPHRLLDTS FASSERLNKA 2820
HVSPQKHFTA DSALRQQTLP RPMKTLQRSL SDPKPLSPTA EESAKERFSL YQHQGGLGSQ 2880
VSALPPNSLV RKVKRTLPSP PPEEAHLPLA GQASPQLYAA SLLQRGLTGP TTVPATKASL 2940
LRELDRDLRL VEHESTKLRK KQAELDEEEK EIDAKLKYLE LGITQRKESL AKDRGGRDYP 3000
PLRGLGEHRD YLSDSELNQL RLQGCTTPAG QFVDFPATAA APATPSGPTA FQQPRFQPPA 3060
PQYSAGSGGP TQNGFPAHQA PTYPGPSTYP APAFPPGASY PAEPGLPNQQ AFRPTGHYAG 3120
QTPMPTTQST LFPVPADSRA PLQKPRQTSL ADLEQKVPTN YEVIASPVVP MSSAPSETSY 3180
SGPAVSSGYE QGKVPEVPRA GDRGSVSQSP APTYPSDSHY TSLEQNVPRN YVMIDDISEL 3240
TKDSTSTAPD SQRLEPLGPG SSGRPGKEPG EPGVLDGPTL PCCYARGEEE SEEDSYDPRG 3300
KGGHLRSMES NGRPASTHYY GDSDYRHGAR VEKYGPGPMG PKHPSKSLAP AAISSKRSKH 3360
RKQGMEQKIS KFSPIEEAKD VESDLASYPP PAVSSSLVSR GRKFQDEITY GLKKNVYEQQ 3420
KYYGMSSRDA VEDDRIYGGS SRSRAPSAYS GEKLSSHDFS GWGKGYERER EAVERLQKAG 3480
PKPSSLSMAH SRVRPPMRSQ ASEEESPVSP LGRPRPAGGP LPPGGDTCPQ FCSSHSMPDV 3540
QEHVKDGPRA HAYKREEGYI LDDSHCVVSD SEAYHLGQEE TDWFDKPRDA RSDRFRHHGG 3600
HAVSSSSQKR GPARHSYHDY DEPPEEGLWP HDEGGPGRHA SAKEHRHGDH GRHSGRHTGE 3660
EPGRRAAKPH ARDLGRHEAR PHSQPSSAPA MPKKGQPGYP SSAEYSQPSR ASSAYHHASD 3720
SKKGSRQAHS GPAALQSKAE PQAQPQLQGR QAAPGPQQSQ SPSSRQIPSG AASRQPQTQQ 3780
QQQGLGLQPP QQALTQARLQ QQSQPTTRGS APAASQPAGK PQPGPSTATG PQPAGPPRAE 3840
QTNGSKGTAK APQQGRAPQA QPAPGPGPAG VKAGARPGGT PGAPAGQPGA DGESVFSKIL 3900
PGGAAEQAGK LTEAVSAFGK KFSSFW 3926 
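The Protein Sequence field is laid out in blocks of 10 residues, with a running residue count at the end of each line. The sketch below shows one way to turn that layout into a plain string and look up residues by their 1-based position, using only the first two lines of the field as input. The names `parse_sequence_block` and `residue_at` are hypothetical helpers, not part of any database tooling.

```python
# Minimal sketch: rebuild a contiguous sequence from the blocked layout.
# parse_sequence_block and residue_at are illustrative helpers.

def parse_sequence_block(text):
    """Join the 10-residue blocks, dropping the running counts at line ends."""
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():  # numeric tokens are position counters
                blocks.append(token)
    return "".join(blocks)

def residue_at(sequence, position):
    """Return the residue at a 1-based position."""
    return sequence[position - 1]

# First two lines of the Protein Sequence field of this record.
excerpt = """MGNEVSLEGG AGDGPLPPGG AGPGPGPGPG PGAGKPPSAP AGGGQLPAAG AARSTAVPPV 60
PGPGPGPGPG PGPGSTSRRL DPKEPLGNQR AASPTPKQAS ATTPGHESPR ETRAQGPAGQ 120"""

sequence = parse_sequence_block(excerpt)
print(len(sequence))              # 120
print(residue_at(sequence, 2))    # G, the annotated N-myristoyl glycine
print(residue_at(sequence, 108))  # S, an annotated phosphoserine
```

Run over the whole field, the same approach would allow each annotated position to be checked against the residue type its annotation names.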
Gene Ontology
 GO:0030054; C:cell junction; IEA:UniProtKB-KW.
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0030425; C:dendrite; ISS:BHF-UCL.
 GO:0015630; C:microtubule cytoskeleton; IDA:HPA.
 GO:0044306; C:neuron projection terminus; IEA:Compara.
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0048786; C:presynaptic active zone; IEA:Compara.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0007268; P:synaptic transmission; TAS:ProtInc. 
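The Gene Ontology lines above each carry three semicolon-separated fields: the GO identifier, a term prefixed with an aspect letter (C, F, or P), and an evidence code with its source. The sketch below splits one such line into named parts. The function `parse_go_line` and the dictionary keys are hypothetical, and the parser assumes term names contain no semicolons.

```python
# Minimal sketch for splitting a 'GO:id; A:term; EVIDENCE:source.' line.
# parse_go_line and the ASPECTS mapping are illustrative, not a library API.

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}

def parse_go_line(line):
    """Return the GO id, aspect, term name, evidence code, and source of one line."""
    fields = [field.strip() for field in line.strip().rstrip(".").split(";")]
    go_id, term, evidence = fields
    aspect_code, name = term.split(":", 1)
    evidence_code, source = evidence.split(":", 1)
    return {
        "id": go_id,
        "aspect": ASPECTS[aspect_code],
        "term": name,
        "evidence": evidence_code,
        "source": source,
    }

entry = parse_go_line(" GO:0048786; C:presynaptic active zone; IEA:Compara.")
print(entry["id"], entry["aspect"], entry["term"], entry["evidence"], entry["source"])
# GO:0048786 cellular component presynaptic active zone IEA Compara
```

Separating the evidence code from the term makes it straightforward to distinguish electronically inferred annotations (IEA) from those with direct experimental support (IDA) in this record.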
Interpro
 IPR011011; Znf_FYVE_PHD.
 IPR008899; Znf_piccolo.
 IPR013083; Znf_RING/FYVE/PHD. 
Pfam
 PF05715; zf-piccolo 
SMART
  
PROSITE
  
PRINTS