CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-012357
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 UPF0378 protein KIAA0100 
Protein Synonyms/Alias
 Antigen MLAA-22; Breast cancer-overexpressed gene 1 protein 
Gene Name
 KIAA0100 
Gene Synonyms/Alias
 BCOX1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
555LMGPQSGKSAVSRHSubiquitination[1, 2]
846DEAVGVQKWLKGLHQubiquitination[1]
849VGVQKWLKGLHQGTRubiquitination[2]
898HDNYELMKDESKESAubiquitination[1]
914RLQLLDAKVAALRKQubiquitination[1]
920AKVAALRKQHGELLPubiquitination[2]
930GELLPARKIEELYASubiquitination[2]
1309GKLFNNLKPSKKKLGubiquitination[1]
1655RQLWLEVKNIEEHRQubiquitination[1]
1847KSLQDDSKNENLLDLubiquitination[1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
 [2] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
  
Sequence Annotation
 MOD_RES 1846 1846 Phosphoserine.
 CARBOHYD 730 730 N-linked (GlcNAc...) (Potential).  
Keyword
 Alternative splicing; Coiled coil; Complete proteome; Glycoprotein; Phosphoprotein; Polymorphism; Reference proteome; Secreted; Signal. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2235 AA 
Protein Sequence
MPLFFSALLV LLLVALSALF LGRWLVVRLA TKWCQRKLQA ELKIGSFRFF WIQNVSLKFQ 60
QHQQTVEIDN LWISSKLLSH DLPHYVALCF GEVRIRTDLQ KVSDLSAPFS QSAGVDQKEL 120
SFSPSLLKIF CQLFSIHVDA INIMVLKVDT SESLWHIQIS RSRFLLDSDG KRLICEVSLC 180
KINSKVLKSG QLEDTCLVEL SLALDLCLKV GISSRHLTAI TVDVWTLHAE LHEGLFQSQL 240
LCQGPSLASK PVPCSEVTEN LVEPTLPGLF LLQQLPDQVK VKMENTSVVL SMNSQKRHLT 300
WTLKLLQFLY HRDEDQLPLR SFTANSDMAQ MSTELLLEDG LLLSQSRQRI VCLNSLKASV 360
QVTTIDLSAS LVLNTCIIHY RHQEFSHWLH LLALETQGSS SPVLKQRKKR TFPQILAPII 420
FSTSISNVNI SIQLGDTPPF ALGFNSISLD YQHLRPQSIH QRGVLTVDHL CWRVGSDSHI 480
QRAPHPPNMH VWGEALVLDS FTLQGSYNQP LGLSSTQSDT LFLDCTIRGL QVEASDTCAQ 540
CLSRILSLMG PQSGKSAVSR HSSFGESVSL LWKVDLKVED MNLFTLSALV GASEVRLDTL 600
TILGSAETST VGIQGLVLAL VKSVTEKMQP CCKAPDIPTP VLSLSMLSIT YHSSIRSLEV 660
QCGAGLTLLW SPPDHMYLYQ HVLATLQCRD LLRATVFPET VPSLALETSG TTSELEGRAP 720
EPLPPKRLLN LTLEVSTAKL TAFVAEDKFI TLAAESVSLS RHGGSLQAYC PELAAGFDGN 780
SIFNFKEVEV QLLPELEEMI LHRNPFPALQ TLRNRVWLLS FGSVSVEFPY QYDFSRTLDE 840
AVGVQKWLKG LHQGTRAWAS PSPVPLPPDL LLKVEHFSWV FLDDVFEVKL HDNYELMKDE 900
SKESAKRLQL LDAKVAALRK QHGELLPARK IEELYASLER KNIEIYIQRS RRLYGNTPMR 960
RALLTWSLAG LELVALADAS FHGPEHVVEQ VQELDPGSPF PPEGLDLVIQ WCRMLKCNVK 1020
SFLVRIRDYP RYLFEIRDWR LMGRLVGTEQ SGQPCSRRRQ ILHLGLPWGN VAVERNMPPL 1080
KFYHDFHSEI FQYTVVWGPC WDPAWTLIGQ CVDLLTKPSA DPSPPLPWWD KSRLLFHGDW 1140
HMDIEQANLH QLATEDPYNT TENMHWEWSH LSFHWKPGQF VFKGDLDINV RTASKYDDCC 1200
FLHLPDLCMT LDLQWLCHGN PHDHHSVTLR APEFLPEVPL GQLHDSYRAF RSENLNLSIK 1260
MDLTRHSGTI SQPRILLYSS TLRWMQNFWA TWTSVTRPIC RGKLFNNLKP SKKKLGQHYK 1320
QLSYTALFPQ LQVHYWASFA QQRGIQIECS QGHVFTRGTQ RLIPQAGTVM RRLISDWSVT 1380
QMVSDLSQVT VHLMASPTEE NADHCLDPLV TKTHLLSLSS LTYQRHSNRT AEEELSARDG 1440
DPTFHTHQLH LVDLRISWTT TNRDIAFGLY DGYKKAAVLK RNLSTEALKG LKIDPQMPAK 1500
KPKRGVPTSA SAPPRVNTPS FSGQPDKGSS GGAYMLQKLI EETDRFVVFT EEESGMSDQL 1560
CGIAACQTDD IYNRNCLIEL VNCQMVLRGA ETEGCVIVSA AKAQLLQCQH HPAWYGDTLK 1620
QKTSWTCLLD GMQYFATTES SPTEQDGRQL WLEVKNIEEH RQRSLDSVQE LMESGQAVGG 1680
MVTTTTDWNQ PAEAQQAQQV QRIISRCNCR MYYISYSHDI DPELATQIKP PEVLENQEKE 1740
DLLKKQEGAV DTFTLIHHEL EISTNPAQYA MILDIVNNLL LHVEPKRKEH SEKKQRVRFQ 1800
LEISSNPEEQ RSSILHLQEA VRQHVAQIRQ LEKQMYSIMK SLQDDSKNEN LLDLNQKLQL 1860
QLNQEKANLQ LESEELNILI RCFKDFQLQR ANKMELRKQQ EDVSVVRRTE FYFAQARWRL 1920
TEEDGQLGIA ELELQRFLYS KVNKSDDTAE HLLELGWFTM NNLLPNAVYK VVLRPQSSCQ 1980
SGRQLALRLF SKVRPPVGGI SVKEHFEVNV VPLTIQLTHQ FFHRMMGFFF PGRSVEDDEV 2040
GDEEDKSKLV TTGIPVVKPR QLIATDDAVP LGPGKGVAQG LTRSSGVRRS FRKSPEHPVD 2100
DIDKMKERAA MNNSFIYIKI PQVPLCVSYK GEKNSVDWGD LNLVLPCLEY HNNTWTWLDF 2160
AMAVKRDSRK ALVAQVIKEK LRLKSATGSE VRGKLETKSD LNMQQQEEEE KARLLIGLSV 2220
GDKNPGKKSI FGRRK 2235 
Gene Ontology
 GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell. 
Interpro
 IPR019443; FMP27_C.
 IPR019441; FMP27_GFWDK_dom.
 IPR019439; FMP27_N. 
Pfam
 PF10351; Apt1
 PF10344; Fmp27
 PF10347; Fmp27_GFWDK 
SMART
  
PROSITE
  
PRINTS