CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-044066
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 SH3 and multiple ankyrin repeat domains protein 1 
Protein Synonyms/Alias
  
Gene Name
 SHANK1 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
86IPDLHQTKCLRFNPDubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
  
Sequence Annotation
  
Keyword
 ANK repeat; Complete proteome; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2169 AA 
Protein Sequence
MTHSPATSED EERHSASECP EGGSESDSSP DGPGRGPRGT RGQGSGAPGS LASVRGLQGR 60
SMSVPDDAHF SMMVFRIGIP DLHQTKCLRF NPDATIWTAK QQVLCALSES LQDVLNYGLF 120
QPATSGRDAN FLEEERLLRE YPQSFEKGVP YLEFRYKTRV YKQTNLDEKQ LAKLHTKTGL 180
KKFLEYVQLG TSDKVARLLD KGLDPNYHDS DSGETPLTLA AQTEGSVEVI RTLCLGGAHI 240
DFRARDGMTA LHKAACARHC LALTALLDLG GSPNYKDRRG LTPLFHTAMV GGDPRCCELL 300
LFNRAQLGIA DENGWQEIHQ ACQRGHSQHL EHLLFYGAEP GAQNASGNTA LHICALYNKE 360
TCARILLYRG ADKDVKNNNG QTPFQVAVIA GNFELGELIR NHREQDVVPF QESPKYAARR 420
RGPPGTGLTV PPALLRANSD TSMALPDWMV FSAPGAASSG APGPTSGSQG QSQPSAPTTK 480
LSSGTLRSAS SPRGARARSP SRGRHPEDAK RQPRGRPSSS GTPREGPAGG TGGSGGPGGS 540
LGSRGRRRKL YSAVPGRSFM AVKSYQAQAE GEISLSKGEK IKVLSIGEGG FWEGQVKGRV 600
GWFPSDCLEE VANRSQESKQ ESRSDKAKRL FRHYTVGSYD SFDAPSLMDG IGPGSDYIIK 660
EKTVLLQKKD SEGFGFVLRG AKAQTPIEEF TPTPAFPALQ YLESVDEGGV AWRAGLRMGD 720
FLIEVNGQNV VKVGHRQVVN MIRQGGNTLM VKVVMVTRHP DMDEAVHKKA PQQAKRLPPP 780
TISLRSKSMT SELEEMVSPW KKKSEYEQQP APVPSMEKKR TVYQMALNKL DEILAAAQQT 840
ISASESPGPG GLASLGKHRP KGFFATESSF DPHHRAQPSY ERPSFLPPGP GLMLRQKSIG 900
AAEDDRPYLA PPAMKFSRSL SVPGSEDIPP PPTTSPPEPP YSTPPVPSSS GRLTPSPRGG 960
PFNPGSGGPL PASSPASFDG PSPPDTRVGS REKSLYHSGP LPPAHHHPPH HHHHHAPPPQ 1020
PHHHHAHPPH PPEMETGGSP DDPPPRLALG PQPSLRGWRG GGPSPTPGAP SPSHHGSAGG 1080
GGGSSQGPAL RYFQLPPRAA SAAMYVPARS GRGRKGPLVK QTKVEGEPQK GGGLPPAPSP 1140
TSPASPQPPP AVAAPSEKNS IPIPTIIIKA PSTSSSGRSS QGSSTEAEPP TQPEPTGGGG 1200
GGGSSPSPAP AMSPVPPSPS PVPTPASPSG PATLDFTSQF GAALVGAARR EGGWQNEARR 1260
RSTLFLSTDA GDEDGGDGGL GTGAAPGPRL RHSKSIDEGM FSAEPYLRLE SAGSGAGYGG 1320
YGAGSRAYGG GGGSSAFTSF LPPRPLVHPL TGKALDPASP LGLALAARER ALKESSEGGG 1380
APQPPPRPPS PRYEAPPPTP HHHSPHAHHE PVLRLWGASP PDPARRELGY RAGLGSQEKS 1440
LPASPPAARR SLLHRLPPTA PGVGPLLLQL GTEPPAPHPG VSKPWRSAAP EEPERLPLHV 1500
RFLENCQPRA PVTSGRGPPS EDGPGVPPPS PRRSVPPSPT SPRASEENGL PLLVLPPPAP 1560
SVDVEDGEFL FVEPLPPPLE FSNSFEKPES PLTPGPPHPL PDTPAPATPL PPVPPPAVAA 1620
APPTLDSTAS SLTSYDSEVA TLTQGASAAP GDPHPPGPPA PAAPAPAAPQ PGPDPPPGTD 1680
SGIEEVDSRS SSDHPLETIS SASTLSSLSA EGGGSAGGGG GAGAGVASGP ELLDTYVAYL 1740
DGQAFGGSST PGPPYPPQLM TPSKLRGRAL GASGGLRPGP SGGLRDPVTP TSPTVSVTGA 1800
GTDGLLALRA CSGPPTAGVA GGPVAVEPEV PPVPLPTASS LPRKLLPWEE GPGPPPPPLP 1860
GPLAQPQASA LATVKASIIS ELSSKLQQFG GSSAAGGALP WARGGSGGGG DSHHGGASYV 1920
PERTSSLQRQ RLSDDSQSSL LSKPVSSLFQ NWPKPPLPPL PTGTGVSPTA AAAPGATSPS 1980
ASSSSTSTRH LQGVEFEMRP PLLRRAPSPS LLPASEHKVS PAPRPSSLPI LPSGPLYPGL 2040
FDIRGSPTGG AGGSADPFAP VFVPPHPGIS GGLGGALSGA SRSLSPTRLL SLPPDKPFGA 2100
KPLGFWTKFD VADWLEWLGL AEHRAQFLDH EIDGSHLPAL TKEDYVDLGV TRVGHRMNID 2160
RALKFFLER 2169 
Gene Ontology
  
Interpro
 IPR002110; Ankyrin_rpt.
 IPR020683; Ankyrin_rpt-contain_dom.
 IPR001478; PDZ.
 IPR001660; SAM.
 IPR013761; SAM/pointed.
 IPR021129; SAM_type1.
 IPR011511; SH3_2.
 IPR001452; SH3_domain. 
Pfam
 PF12796; Ank_2
 PF00595; PDZ
 PF00536; SAM_1
 PF07653; SH3_2 
SMART
 SM00248; ANK
 SM00228; PDZ
 SM00454; SAM
 SM00326; SH3 
PROSITE
 PS50297; ANK_REP_REGION
 PS50088; ANK_REPEAT
 PS50106; PDZ
 PS50105; SAM_DOMAIN
 PS50002; SH3 
PRINTS