CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-035594
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein LOC100911833 
Protein Synonyms/Alias
  
Gene Name
 LOC100911833 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
Position  Peptide          Type         References
72        ISQRGTRKLFDELVV  acetylation  [1]
111       DINIEGAKHKFSERR  acetylation  [1]
113       NIEGAKHKFSERRVV  acetylation  [1]
123       ERRVVLVKNKESVVF  acetylation  [1]
145       YKPGQSVKFRVVSMD  acetylation  [1]
171       LAYIEDPKMNRIIQW  acetylation  [1]
188       VKTENGLKQLSFSLS  acetylation  [1]
214       ILKQSGVKEEHSFTV  acetylation  [1]
341       EVERTRNKFLFLKAD  acetylation  [1]
346       RNKFLFLKADSHFRH  acetylation  [1]
556       SVTFQVEKCLRNKVH  acetylation  [1]
561       VEKCLRNKVHLSFSP  acetylation  [1]
658       SVPYGREKDVYRYVR  acetylation  [1]
716       MMPLGVNKSPLPKEP  acetylation  [1]
918       IVEPEGIKKEHTFSS  acetylation  [1]
1010      LTEKIKSKALGYLRA  acetylation  [1]
1026      YQRELNYKHKDGSYS  acetylation  [1]
1175      EKRNEILKSLDKEAI  acetylation  [1]
1278      YGAATFSKSQKTPLV  acetylation  [1]
1343      RYNMPLEKQQPAFAL  acetylation  [1]
1400      LSGFIPLKPTVKKLE  acetylation  [1]
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1481 AA 
Protein Sequence
MKKNREAQLC LFSALLAFLP FASLLNGNSK YMVLVPSQLY TETPEKICLH LYHLNETVTV 60
TASLISQRGT RKLFDELVVD KDLFHCLSFT IPRLPSSEEE ESLDINIEGA KHKFSERRVV 120
LVKNKESVVF VQTDKPMYKP GQSVKFRVVS MDKNLHPLNE LFPLAYIEDP KMNRIIQWQD 180
VKTENGLKQL SFSLSAEPIQ GPYKIVILKQ SGVKEEHSFT VMEFVLPRFG VDVKVPNAIS 240
VYDEIINVTA CAIYTYGKPI SLCHGNPSFS SETKSACKEE DSELDNNGCS TQEVNITEFQ 300
LKENYLKMHQ AFHVNATVTE EGTGSEFSGS GRIEVERTRN KFLFLKADSH FRHGIPFFVK 360
IRLVDIKGDP IPNEQVFIKA QEAGYTNATT TDQHGLAKFS IDTSSISGYS LNIKVYHKEE 420
SSCIHSSCTA ERHAEEHHTA YAVYSLSKSY IYLDTEAGVL PCNQIHTVQA HFILKGQVLG 480
VLPQIVFHYL VMAQGSILQT GNHTHQVEPG VSQVQGNFAL EIPVEFSMVP VAKMLIYTIL 540
PDGEVIADSV TFQVEKCLRN KVHLSFSPSQ SLPASQTHMR VTASPQSLCG LRAVDQSVLL 600
LKPEAELSPS LIYDLPGMQD SNFIPSSYHP FEDEYDCLMY QPRDTEELTY SVPYGREKDV 660
YRYVRDMGLT AFTNLKIKHP TYCYEMNMVV LSAPAVESEL SPRGGEFEMM PLGVNKSPLP 720
KEPPRKDPPP KDPVIETIRN YFPETWIWDL VTVNSSGVTE VEMTVPDTIT EWKAGALCLS 780
NDTGLGLSSV ATLQAFQPFF VELTMPYSVI RGEAFMLKAT VMNYLPTSLP MAVQLEASPD 840
FTAVPVGNDQ DSYCLGANGR HTSSWLVTPK SLGNVNFSVS VEAQQSPELC GSQVATVPET 900
GRKDTVVKVL IVEPEGIKKE HTFSSLLCAS DAELSETLSL LLPPTVVKDS ARAHFSVMGD 960
ILSSAIKNTQ NLIQMPYGCG EQNMVLFAPN IYVLKYLNET QQLTEKIKSK ALGYLRAGYQ 1020
RELNYKHKDG SYSAFGDHNG QGQGNTWLTA FVLKSFAQAR AFIFIDESHI TDAFTWLSKQ 1080
QKDSGCFRSS GSLFNNAMKG GVDDEITLSA YITMALLESS LPDTDPVVSK ALGCLEASWE 1140
TIEQGRNGSF VYTKTLMAYA FALAGNQEKR NEILKSLDKE AIREDNSIHW ERPQKPTKSE 1200
GYLYTPQASS AEVEMSAYVV LARLTAQPAP SPEDLALSMG TIKWLTKQQN SHGGFSSTQD 1260
TVVALDALSK YGAATFSKSQ KTPLVTIQSS GSFSQKFQVD NSNRLLLQQV SLPDIPGNYT 1320
VSVSGEGCVY AQTTLRYNMP LEKQQPAFAL KVQTVPLTCN NPKGQNSFQI SLEISYTGSR 1380
PASNMVIADV KMLSGFIPLK PTVKKLERLE HVSRTEVTTN NVLLYLDQVT NQTLSFSFII 1440
QQDIPVKNLQ PAIVKVYDYY ETDEVAFAEY SSPCSSDKQN V 1481 
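
The positions in the Lysine Modification table appear to be 1-based indices into this sequence, and each listed peptide appears to be the 15-residue window centred on the modified lysine (7 residues on either side). The sketch below is one way to check that reading. It is not part of CPLM: the helper names `parse_sequence_block` and `window` are hypothetical, the 7-residue flank is inferred from the entries above, and only the first two lines of the sequence block are copied in to keep the example short.

```python
import re

# First two lines of the Protein Sequence block, copied verbatim
# (ten-residue groups with a running residue count at the end of each line).
SEQUENCE_BLOCK = """\
MKKNREAQLC LFSALLAFLP FASLLNGNSK YMVLVPSQLY TETPEKICLH LYHLNETVTV 60
TASLISQRGT RKLFDELVVD KDLFHCLSFT IPRLPSSEEE ESLDINIEGA KHKFSERRVV 120
"""

# Three rows from the Lysine Modification table: (position, peptide).
SITES = [
    (72, "ISQRGTRKLFDELVV"),
    (111, "DINIEGAKHKFSERR"),
    (113, "NIEGAKHKFSERRVV"),
]


def parse_sequence_block(block: str) -> str:
    """Strip spaces, newlines and running counts, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def window(sequence: str, position: int, flank: int = 7) -> str:
    """Return residues position-flank .. position+flank (1-based, inclusive).

    Near either terminus the slice is simply shorter; how CPLM itself
    handles that case is not shown in this record, so no padding is added.
    """
    start = max(position - 1 - flank, 0)
    end = position + flank
    return sequence[start:end]


sequence = parse_sequence_block(SEQUENCE_BLOCK)

for position, peptide in SITES:
    residue = sequence[position - 1]
    matches = window(sequence, position) == peptide
    print(position, residue, matches)
```

Applying the same check to the full 1481-residue sequence would cover the remaining sites; a row where the residue is not `K` or the window does not match would suggest a numbering offset or a different sequence version.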
Gene Ontology
 GO:0005615; C:extracellular space; IEA:InterPro.
 GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
 GO:0010951; P:negative regulation of endopeptidase activity; IEA:GOC. 
Interpro
 IPR009048; A-macroglobulin_rcpt-bd.
 IPR011626; A2M_comp.
 IPR002890; A2M_N.
 IPR011625; A2M_N_2.
 IPR001599; Macroglobln_a2.
 IPR019742; MacrogloblnA2_CS.
 IPR019565; MacrogloblnA2_thiol-ester-bond.
 IPR008930; Terpenoid_cyclase/PrenylTrfase.
 IPR010916; TonB_box_CS. 
Pfam
 PF00207; A2M
 PF07678; A2M_comp
 PF01835; A2M_N
 PF07703; A2M_N_2
 PF07677; A2M_recep
 PF10569; Thiol-ester_cl 
SMART
  
PROSITE
 PS00477; ALPHA_2_MACROGLOBULIN
 PS00430; TONB_DEPENDENT_REC_1 
PRINTS