CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-038060
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Maskbp3 
Protein Synonyms/Alias
  
Gene Name
 Ankhd1 
Gene Synonyms/Alias
 Maskbp3 
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
Position
Peptide
Type
References
226GDVNAVRKLLDEGRSacetylation[1]
1775ELEDLIPKNHIRTPAacetylation[2]
1959RTPSSVRKQLFACVPubiquitination[3]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] SIRT5-Mediated Lysine Desuccinylation Impacts Diverse Metabolic Pathways.
 Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y.
 Mol Cell. 2013 Jun 27;50(6):919-30. [PMID: 23806337]
 [3] Proteomic analyses reveal divergent ubiquitylation site patterns in murine tissues.
 Wagner SA, Beli P, Weinert BT, Schölz C, Kelstrup CD, Young C, Nielsen ML, Olsen JV, Brakebusch C, Choudhary C.
 Mol Cell Proteomics. 2012 Dec;11(12):1578-85. [PMID: 22790023
Functional Description
  
Sequence Annotation
  
Keyword
 ANK repeat; Complete proteome; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2548 AA 
Protein Sequence
MLTDSGGGGT SFEEDLDSVA PRSAPAGASE PPPPGGVGLG IRTVRLFGEA GPAPGVGGGG 60
GGGGGSSSSS SAGGGDAALD FKLAAAVLRT GGGGGASGSD EDEVSEVESF ILDQEDLDNP 120
VLKTSEIFLS STAEGADLRT VDPETQARLE ALLEAAGIGK LSTADGKAFA DPEVLRRLTS 180
SVSCALDEAA AALTRMRAEN THSTGQVDTR SLAEACSDGD VNAVRKLLDE GRSVNEHTEE 240
GESLLCLACS AGYYELAQVL LAMHANVEDR GNKGDITPLM AASSGGYLDI VKLLLLHDAD 300
VNSQSATGNT ALTYACAGGF IDIVKVLLNE GANIEDHNEN GHTPLMEAAS AGHVEVARVL 360
LDHGAGINTH SNEFKESALT LACYKGHLDM VRFLLEAGAD QEHKTDEMHT ALMEACMDGH 420
VEVARLLLDS GAQVNMPADS FESPLTLAAC GGHVELAALL IERGANLEEV NDEGYTPLME 480
AAREGHEEMV ALLLAQGANI NAQTEETQET ALTLACCGGF SEVADFLIKA GADIELGCST 540
PLMEASQEGH LELVKYLLAA GANVHATTAT GDTALTYACE NGHTDVADVL LQAGAHLEHE 600
SEGGRTPLMK AARAGHLCTV QFLISKGANV NRATANNDHT VVSLACAGGH LAVVELLLAH 660
GADPTHRLKD GSTMLIEAAK GGHTNVVSYL LDYPNNVLSV PTTDVSQLTS PSQDESQVPR 720
VPIHTLAMVV PPQEPDRTSQ ETSTALLGVQ KGASKQKSSS LQVADQDLLP PFHPYQPLEC 780
IVEETEGKLN ELGQRISAIE KAQLKSLELI QGEPLNKDKI EELKKNREEQ VQKKKKILKE 840
LQKVERQLQM KTQQQFTKEY LETKGQRDTE SPHQQCSNRG VFMAGEEDGS LPQDHSSESP 900
QLDTVLFKDH DIDDKQQSPP SAEQIDFVPV QPLSSPQCNF FSDLGSNGTN SLVLQKVSGN 960
QQIVGQPQIA IAGHEQGLLV QEPDGLMVAT PAQTLTDTLD DLIAAVSTRV PVGSNNPSQT 1020
TECPTPESCY QTPSNMATPS TPPVYPSVDI DAHTESNHDT ALTLACAGGH EELVSVLIAR 1080
DAKIEHRDKK GFTPLILAAT AGHVGVVEIL LDKGGDIEAQ SERTKDTPLS LACSGGRQEV 1140
VDLLLARGAN KEHRNVSDYT PLSLAASGGY VNIIKILLNA GAEINSRTGS KLGISPLMLA 1200
AMNGHVPAVK LLLDMGSDIN AQIETNRNTA LTLACFQGRA EVVSLLLDRK ANVEHRAKTG 1260
LTPLMEAASG GYAEVGRVLL DKGADVNAPP VPSSRDTALT IAADKGHYKF CELLINRGAH 1320
IDVRNKKGNT PLWLASNGGH FDVVQLLVQA GADVDAADNR KITPLMSAFR KGHVKVVQYL 1380
VKEVSQFPSD IECMRYIATI TDKELLKKCH QCVETIVKAK DQQAAEANKN ASILLKELDL 1440
EKSREESRKQ ALAAKREKRK EKRKKKKEEQ KRKQEDEENK PKENSEQPEG EDEENDEDVE 1500
QEIPIEPPSA TTTTTIGISA TSTTFTNVFG KKRANVVTTP STNRKNKKNK TKESPPTAHL 1560
ILPEPHISLA QQKADKNKIN GEPRGGGAGG NSDSDNIDST DCNSESSSGG KSQEFSFPVD 1620
VNPASDKRCS TVVSSQEEKA VTTTSKTQTR LDGEVNSMST SYKSLPLSSP TMKLNLTSPK 1680
RGQKREEGWK EVVRRSKKLS VPASVVSRIM GRGGCNITAI QDVTGAHIDV DKQKDKNGER 1740
MITIRGGTES TRYAVQLINA LIQDPAKELE DLIPKNHIRT PASTKSIHTN FSSGVGTTAT 1800
SSKNAFPLGA PALVTSQATT LSTFQPTNKL SKNVPTNVRS PFPVSLPLAY PHPHFALLAA 1860
QTMQQIRHPR LPMAQFGGTF SPSPNTWGPF PVRPVNPGNT SSSPKHNNTA RLPNQNGPVL 1920
PSESPGLATT GCPITVSSVV AASQQLCMTN SRTPSSVRKQ LFACVPKTSP PATVISSVTS 1980
TSSSLPSVSS TSITSGHVTT TFMPAPTQVP LSSQKVESFS VIPPPKEKVS TQDQPLTNLC 2040
TPSPAATSCN SSASNTSGAP EAHPSSTPPP PPGNTQEEGQ PSKASDLSPV SMPFASNSET 2100
APLTLASPRL VAADNRDTGS LPQLTVPAPR VSHRMQPRGS FYSVVPNATM HQDPQSIFVT 2160
NPVPLTPPQG PPAAVQLSSA VNIMNGSQVH INPANKSLQP TFGPATLFNH FSSLFDSGQV 2220
PANQGWGDGP LPSRVAADAS FTVQSAFLSN SVLGHLENVH PDNSKAPGFR PPSQRVSTSP 2280
VGLPSIDPSG NSPSAAAPLT SFSGIPGTRV FLQGPAPVGT PSFNRQHFSP HPWTSASNTC 2340
DSPIPSVSSG SSSPLSATSA PPTLGQQPKG NSASQDRKIP PPIGTERLAR IRQGGSVAQA 2400
PVGTSFVAPV GHGGIWSFGV NAMSEGLSGW SQSVIGNHPM HQQLSDPSTF SQHQPMERDD 2460
SGMVAPTNIF HQPMGLPISM YGGTIIPSHP QLADVPGGPL FNGLHNPDPA WNPMIKVIQN 2520
SAECTEAQQI WPGTWAPHIG NMHLKYVN 2548 
Gene Ontology
 GO:0003723; F:RNA binding; IEA:InterPro. 
Interpro
 IPR002110; Ankyrin_rpt.
 IPR020683; Ankyrin_rpt-contain_dom.
 IPR004087; KH_dom.
 IPR004088; KH_dom_type_1. 
Pfam
 PF00023; Ank
 PF12796; Ank_2
 PF00013; KH_1 
SMART
 SM00248; ANK
 SM00322; KH 
PROSITE
 PS50297; ANK_REP_REGION
 PS50088; ANK_REPEAT
 PS50084; KH_TYPE_1 
PRINTS
 PR01415; ANKYRIN.