CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-017043
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Ankyrin repeat and KH domain-containing protein 1 
Protein Synonyms/Alias
 HIV-1 Vpr-binding ankyrin repeat protein; Multiple ankyrin repeats single KH domain; hMASK 
Gene Name
 ANKHD1 
Gene Synonyms/Alias
 KIAA1085; MASK; VBARP; PP2500 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1830STFQPANKLNKNVPTubiquitination[1]
1833QPANKLNKNVPTNVRubiquitination[1]
Reference
 [1] Methods for quantification of in vivo changes in protein ubiquitination following proteasome and deubiquitinase inhibition.
 Udeshi ND, Mani DR, Eisenhaure T, Mertins P, Jaffe JD, Clauser KR, Hacohen N, Carr SA.
 Mol Cell Proteomics. 2012 May;11(5):148-59. [PMID: 22505724
Functional Description
 May play a role as a scaffolding protein that may be associated with the abnormal phenotype of leukemia cells. Isoform 2 may possess an antiapoptotic effect and protect cells during normal cell survival through its regulation of caspases. 
Sequence Annotation
 REPEAT 204 233 ANK 1.
 REPEAT 237 266 ANK 2.
 REPEAT 271 300 ANK 3.
 REPEAT 304 333 ANK 4.
 REPEAT 337 366 ANK 5.
 REPEAT 371 400 ANK 6.
 REPEAT 404 433 ANK 7.
 REPEAT 437 466 ANK 8.
 REPEAT 470 499 ANK 9.
 REPEAT 504 533 ANK 10.
 REPEAT 534 563 ANK 11.
 REPEAT 567 596 ANK 12.
 REPEAT 600 629 ANK 13.
 REPEAT 634 663 ANK 14.
 REPEAT 667 696 ANK 15.
 REPEAT 1054 1083 ANK 16.
 REPEAT 1087 1116 ANK 17.
 REPEAT 1121 1150 ANK 18.
 REPEAT 1154 1183 ANK 19.
 REPEAT 1189 1218 ANK 20.
 REPEAT 1223 1252 ANK 21.
 REPEAT 1256 1285 ANK 22.
 REPEAT 1291 1320 ANK 23.
 REPEAT 1324 1353 ANK 24.
 REPEAT 1357 1386 ANK 25.
 DOMAIN 1695 1759 KH.
 MOD_RES 101 101 Phosphoserine.
 MOD_RES 1653 1653 Phosphothreonine.  
Keyword
 Alternative splicing; ANK repeat; Coiled coil; Complete proteome; Cytoplasm; Phosphoprotein; Polymorphism; Reference proteome; Repeat; RNA-binding. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2542 AA 
Protein Sequence
MLTDSGGGGT SFEEDLDSVA PRSAPAGASE PPPPGGVGLG IRTVRLFGEA GPASGVGSSG 60
GGGSGSGTGG GDAALDFKLA AAVLRTGGGG GASGSDEDEV SEVESFILDQ EDLDNPVLKT 120
TSEIFLSSTA EGADLRTVDP ETQARLEALL EAAGIGKLST ADGKAFADPE VLRRLTSSVS 180
CALDEAAAAL TRMKAENSHN AGQVDTRSLA EACSDGDVNA VRKLLDEGRS VNEHTEEGES 240
LLCLACSAGY YELAQVLLAM HANVEDRGNK GDITPLMAAS SGGYLDIVKL LLLHDADVNS 300
QSATGNTALT YACAGGFVDI VKVLLNEGAN IEDHNENGHT PLMEAASAGH VEVARVLLDH 360
GAGINTHSNE FKESALTLAC YKGHLDMVRF LLEAGADQEH KTDEMHTALM EACMDGHVEV 420
ARLLLDSGAQ VNMPADSFES PLTLAACGGH VELAALLIER GANLEEVNDE GYTPLMEAAR 480
EGHEEMVALL LAQGANINAQ TEETQETALT LACCGGFSEV ADFLIKAGAD IELGCSTPLM 540
EASQEGHLEL VKYLLASGAN VHATTATGDT ALTYACENGH TDVADVLLQA GADLEHESEG 600
GRTPLMKAAR AGHLCTVQFL ISKGANVNRA TANNDHTVVS LACAGGHLAV VELLLAHGAD 660
PTHRLKDGST MLIEAAKGGH TNVVSYLLDY PNNVLSVPTT DVSQLPPPSQ DQSQVPRVPT 720
HTLAMVVPPQ EPDRTSQENS PALLGVQKGT SKQKSSSLQV ADQDLLPSFH PYQPLECIVE 780
ETEGKLNELG QRISAIEKAQ LKSLELIQGE PLNKDKIEEL KKNREEQVQK KKKILKELQK 840
VERQLQMKTQ QQFTKEYLET KGQKDTVSLH QQCSHRGVFP EGEGDGSLPE DHFSELPQVD 900
TILFKDNDVD DEQQSPPSAE QIDFVPVQPL SSPQCNFSSD LGSNGTNSLE LQKVSGNQQI 960
VGQPQIAITG HDQGLLVQEP DGLMVATPAQ TLTDTLDDLI AAVSTRVPTG SNSSSQTTEC 1020
LTPESCSQTT SNVASQSMPP VYPSVDIDAH TESNHDTALT LACAGGHEEL VSVLIARDAK 1080
IEHRDKKGFT PLILAATAGH VGVVEILLDK GGDIEAQSER TKDTPLSLAC SGGRQEVVDL 1140
LLARGANKEH RNVSDYTPLS LAASGGYVNI IKILLNAGAE INSRTGSKLG ISPLMLAAMN 1200
GHVPAVKLLL DMGSDINAQI ETNRNTALTL ACFQGRAEVV SLLLDRKANV EHRAKTGLTP 1260
LMEAASGGYA EVGRVLLDKG ADVNAPPVPS SRDTALTIAA DKGHYKFCEL LIHRGAHIDV 1320
RNKKGNTPLW LASNGGHFDV VQLLVQAGAD VDAADNRKIT PLMSAFRKGH VKVVQYLVKE 1380
VNQFPSDIEC MRYIATITDK ELLKKCHQCV ETIVKAKDQQ AAEANKNASI LLKELDLEKS 1440
REESRKQALA AKREKRKEKR KKKKEEQKRK QEEDEENKPK ENSELPEDED EEENDEDVEQ 1500
EVPIEPPSAT TTTTIGISAT SATFTNVFGK KRANVVTTPS TNRKNKKNKT KETPPTAHLI 1560
LPEQHMSLAQ QKADKNKING EPRGGGAGGN SDSDNLDSTD CNSESSSGGK SQELNFVMDV 1620
NSSKYPSLLL HSQEEKTSTA TSKTQTRLEG EVTPNSLSTS YKTVSLPLSS PNIKLNLTSP 1680
KRGQKREEGW KEVVRRSKKL SVPASVVSRI MGRGGCNITA IQDVTGAHID VDKQKDKNGE 1740
RMITIRGGTE STRYAVQLIN ALIQDPAKEL EDLIPKNHIR TPASTKSIHA NFSSGVGTTA 1800
ASSKNAFPLG APTLVTSQAT TLSTFQPANK LNKNVPTNVR SSFPVSLPLA YPHPHFALLA 1860
AQTMQQIRHP RLPMAQFGGT FSPSPNTWGP FPVRPVNPGN TNSSPKHNNT SRLPNQNGTV 1920
LPSESAGLAT ASCPITVSSV VAASQQLCVT NTRTPSSVRK QLFACVPKTS PPATVISSVT 1980
STCSSLPSVS SAPITSGQAP TTFLPASTSQ AQLSSQKMES FSAVPPTKEK VSTQDQPMAN 2040
LCTPSSTANS CSSSASNTPG APETHPSSSP TPTSSNTQEE AQPSSVSDLS PMSMPFASNS 2100
EPAPLTLTSP RMVAADNQDT SNLPQLAVPA PRVSHRMQPR GSFYSMVPNA TIHQDPQSIF 2160
VTNPVTLTPP QGPPAAVQLS SAVNIMNGSQ MHINPANKSL PPTFGPATLF NHFSSLFDSS 2220
QVPANQGWGD GPLSSRVATD ASFTVQSAFL GNSVLGHLEN MHPDNSKAPG FRPPSQRVST 2280
SPVGLPSIDP SGSSPSSSSA PLASFSGIPG TRVFLQGPAP VGTPSFNRQH FSPHPWTSAS 2340
NSCDSPIPSV SSGSSSPLSA TSAPPTLGQP KGVSASQDRK IPPPIGTERL ARIRQGGSVA 2400
QAPAGTSFVA PVGHSGIWSF GVNAVSEGLS GWSQSVMGNH PMHQQLSDPS TFSQHQPMER 2460
DDSGMVAPSN IFHQPMASGF VDFSKGLPIS MYGGTIIPSH PQLADVPGGP LFNGLHNPDP 2520
AWNPMIKVIQ NSTECTDAQQ VKWA 2544 
Gene Ontology
 GO:0005737; C:cytoplasm; IDA:HPA.
 GO:0005634; C:nucleus; IDA:HPA.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW. 
Interpro
 IPR002110; Ankyrin_rpt.
 IPR020683; Ankyrin_rpt-contain_dom.
 IPR004087; KH_dom.
 IPR004088; KH_dom_type_1. 
Pfam
 PF00023; Ank
 PF12796; Ank_2
 PF00013; KH_1 
SMART
 SM00248; ANK
 SM00322; KH 
PROSITE
 PS50297; ANK_REP_REGION
 PS50088; ANK_REPEAT
 PS50084; KH_TYPE_1 
PRINTS
 PR01415; ANKYRIN.