CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID
 CPLM-020430 
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Sperm flagellar protein 2 
Protein Synonyms/Alias
 Protein KPL2 
Gene Name
 SPEF2 
Gene Synonyms/Alias
 KIAA1770; KPL2 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
885       LTTEIAKKKNKVEKK  acetylation     [1]
891       KKKNKVEKKLEEKEA  acetylation     [1]
892       KKNKVEKKLEEKEAE  acetylation     [1]
982       KGKSSGGKVPVKKSP  ubiquitination  [2, 3]
Reference
 [1] Regulation of cellular metabolism by protein lysine acetylation.
 Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, Guan KL.
 Science. 2010 Feb 19;327(5968):1000-4. [PMID: 20167786]
 [2] Tryptic digestion of ubiquitin standards reveals an improved strategy for identifying ubiquitinated proteins by mass spectrometry.
 Denis NJ, Vasilescu J, Lambert JP, Smith JC, Figeys D.
 Proteomics. 2007 Mar;7(6):868-74. [PMID: 17370265]
 [3] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789]
Functional Description
 Required for correct axoneme development (By similarity). 
Sequence Annotation
 DOMAIN 1 105 CH.
 CROSSLNK 982 982 Glycyl lysine isopeptide (Lys-Gly)  
Keyword
 Alternative splicing; Coiled coil; Complete proteome; Isopeptide bond; Polymorphism; Reference proteome; Ubl conjugation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1822 AA 
Protein Sequence
MSEILCQWLN KELKVSRTVS PKSFAKAFSS GYLLGEVLHK FELQDDFSEF LDSRVSSAKL 60
NNFSRLEPTL NLLGVQFDQN VAHGIITEKP GVATKLLYQL YIALQKKKKS GLTGVEMQTM 120
QRLTNLRLQN MKSDTFQERL RHMIPRQTDF NLMRITYRFQ EKYKHVKEDL AHLHFEKLER 180
FQKLKEEQRC FDIEKQYLNR RRQNEIMAKI QAAIIQIPKP ASNRTLKALE AQKMMKKKKE 240
AEDVADEIKK FEALIKKDLQ AKESASKTSL DTAGQTTTDL LNTYSDDEYI KKIQKRLEED 300
AFAREQREKR RRKLLMDQLI AHEAQEEAYR EEQLINRLMR QSQQERRIAV QLMHVRHEKE 360
VLWQNRIFRE KQHEERRLKD FQDALDREAA LAKQAKIDFE EQFLKEKRFH DQIAVERAQA 420
RYEKHYSVCA EILDQIVDLS TKVADYRMLT NNLIPYKLMH DWKELFFNAK PIYEQASVKT 480
LPANPSREQL TELEKRDLLD TNDYEEYKNM VGEWALPEEM VDNLPPSNNC ILGHILHRLA 540
EKSLPPRAES TTPELPSFAV KGCLLGKTLS GKTTILRSLQ KDFPIQILSI DTLVQEAIQA 600
FHDNEKVSEV LPIQKNDEED ALPVLQEEIK ESQDPQHVFS AGPVSDEVLP ETEGETMLSA 660
NADKTPKAEE VKSSDSFLKL TTRAQLGAKS EQLLKKGKSI PDVLLVDIIV NAINEIPVNQ 720
DCILDGFPMT LNQAQLLEEA LTGCNRNLTE VERKKAQKST LAIDPATSKE IPLPSPAFDF 780
VILLDVSDTS SMSRMNDIIA EELSYKTAHE DISQRVAAEN QDKDGDQNLR DQIQHRIIGF 840
LDNWPLLEQW FSEPENILIK INAEIDKESL CEKVKEILTT EIAKKKNKVE KKLEEKEAEK 900
KAAASLAELP LPTPPPAPPP EPEKEKEIHQ SHVASKTPTA KGKPQSEAPH GKQESLQEGK 960
GKKGETALKR KGSPKGKSSG GKVPVKKSPA DSTDTSPVAI VPQPPKPGSE EWVYVNEPVP 1020
EEMPLFLVPY WELIENSYIN TIKTVLRHLR EDQHTVLAYL YEIRTSFQEF LKRPDHKQDF 1080
VAQWQADFNS LPDDLWDDEE TKAELHQRVN DLRDRLWDIC DARKEEAEQE RLDIINESWL 1140
QDTLGMTMNH FFSLMQAELN RFQDTKRLLQ DYYWGMESKI PVEDNKRFTR IPLVQLDSKD 1200
NSESQLRIPL VPRISISLET VTPKPKTKSV LKGKMDNSLE NVESNFEADE KLVMDTWQQA 1260
SLAVSHMVAA EIHQRLMEEE KENQPADPKE KSPQMGANKK VKKEPPKKKQ EDKKPKGKSP 1320
PMAEATPVIV TTEEIAEIKR KNELRVKIKE EHLAALQFEE IATQFRLELI KTKALALLED 1380
LVTKVVDVYK LMEKWLGERY LNEMASTEKL TDVARYHIET STKIQNELYL SQEDFFINGN 1440
IKVFPDPPPS IRPPPVEKEE DGTLTIEQLD SLRDQFLDMA PKGIIGNKAF TDILIDLVTL 1500
NLGTNNFPSN WMHLTQPELQ ELTSLLTVNS EFVDWRKFLL VTSMPWPIPL EEELLETLQK 1560
FKAVDKEQLG TITFEQYMQA GLWFTGDEDI KIPENPLEPL PFNRQEHLIE FFFRLFADYE 1620
KDPPQLDYTQ MLLYFACHPD TVEGVYRALS VAVGTHVFQQ VKASIPSAEK TSSTDAGPAE 1680
EFPEPEENAA REERKLKDDT EKREQKDEEI PENANNEKMS METLLKVFKG GSEAQDSNRF 1740
ASHLKIENIY AEGFIKTFQD LGAKNLEPIE VAVLLKHPFI QDLISNYSDY KFPDIKIILQ 1800
RSEHVQGSDG ERSPSRHTEE KK 1822 
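Each peptide in the Lysine Modification table appears to be a 15-residue window centred on the modified lysine (7 residues on either side). The following is a minimal Python sketch for checking a listed position against the numbered sequence block, under that assumption. The function names `parse_sequence_block` and `site_window` and the `offset` parameter are hypothetical helpers introduced for illustration only; they are not part of CPLM. The example uses only the row of the sequence covering residues 841 to 900.

```python
def parse_sequence_block(block: str) -> str:
    """Drop spaces, line breaks and residue counters, keeping only letters."""
    return "".join(ch for ch in block if ch.isalpha())


def site_window(sequence: str, position: int, flank: int = 7, offset: int = 0):
    """Return (residue, peptide) for a 1-based protein position.

    `offset` is the number of residues that precede `sequence` in the full
    protein (0 when the full sequence is supplied). The 7-residue flank is an
    assumption inferred from the 15-residue peptides in the table.
    """
    index = position - 1 - offset
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} is outside the supplied sequence")
    start = max(0, index - flank)  # windows near the N-terminus are truncated
    return sequence[index], sequence[start:index + flank + 1]


# Row of the sequence block covering residues 841-900, copied from the record.
ROW_841_900 = "LDNWPLLEQW FSEPENILIK INAEIDKESL CEKVKEILTT EIAKKKNKVE KKLEEKEAEK 900"
FRAGMENT = parse_sequence_block(ROW_841_900)

for pos in (885, 891):
    residue, peptide = site_window(FRAGMENT, pos, offset=840)
    print(pos, residue, peptide)
```

With the full 1822-residue sequence passed in and `offset` left at 0, the same call can be used for position 982. A residue other than `K` at a listed position would suggest a numbering mismatch between the record and the sequence version in use.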
Gene Ontology
 GO:0005794; C:Golgi apparatus; IEA:Compara.
 GO:0002177; C:manchette; IEA:Compara.
 GO:0097225; C:sperm midpiece; IEA:Compara.
 GO:0005524; F:ATP binding; IEA:InterPro.
 GO:0019205; F:nucleobase-containing compound kinase activity; IEA:InterPro.
 GO:0006139; P:nucleobase-containing compound metabolic process; IEA:InterPro. 
Interpro
 IPR000850; Adenylate_kin.
 IPR001715; CH-domain.
 IPR010441; DUF1042.
 IPR008906; HATC.
 IPR027417; P-loop_NTPase. 
Pfam
 PF00406; ADK
 PF05699; Dimer_Tnp_hAT
 PF06294; DUF1042 
SMART
  
PROSITE
 PS50021; CH 
PRINTS