CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-035104
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
  
Protein Name
 Protein Ly75 
Protein Synonyms/Alias
  
Gene Name
 Ly75 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Rattus norvegicus (Rat) 
NCBI Taxa ID
 10116 
Lysine Modification
 Position  Peptide          Type         References
 702       KDLVHLLKDQFSGQR  acetylation  [1]
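
The site annotation can be cross-checked against the sequence in this record. The sketch below copies the Protein Sequence row that ends at residue 720 (so it starts at 661) and extracts the residue at position 702 with its flanking residues. The helper name `window_around` and the flank width of 7 are assumptions made for illustration; the record does not state how the peptide window is defined.

```python
# Sequence row covering residues 661-720, copied from the Protein Sequence block.
ROW_661_720 = "HIERIVRKRN WEEAERFCRA LGAHLPSFSH RSEIKDLVHL LKDQFSGQRW LWIGLNKRSP"
ROW_START = 661  # 1-based position of the first residue in this row


def window_around(row, row_start, position, flank=7):
    """Return (residue, peptide) for a 1-based position within one sequence row.

    `flank` is an assumed window half-width, not a documented CPLM parameter.
    """
    residues = row.replace(" ", "")
    idx = position - row_start  # 0-based index inside the row
    if not 0 <= idx < len(residues):
        raise ValueError("position lies outside this row")
    lo = max(0, idx - flank)
    return residues[idx], residues[lo:idx + flank + 1]


residue, peptide = window_around(ROW_661_720, ROW_START, 702)
print(residue, peptide)
```

With these inputs the residue at 702 is a lysine and the extracted window equals the peptide listed in the table, which places the modified lysine at the center of the 15-residue peptide.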
Reference
 [1] Proteomic analysis of lysine acetylation sites in rat tissues reveals organ specificity and subcellular patterns.
 Lundby A, Lage K, Weinert BT, Bekker-Jensen DB, Secher A, Skovgaard T, Kelstrup CD, Dmytriyev A, Choudhary C, Lundby C, Olsen JV.
 Cell Rep. 2012 Aug 30;2(2):419-31. [PMID: 22902405]
Functional Description
  
Sequence Annotation
  
Keyword
 Complete proteome; Disulfide bond; Reference proteome; Repeat. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1722 AA 
Protein Sequence
MGTRWATPGR ATGLLVLLLR CFELVEPSGK GGNNPFTIVN ENTGKCLQPL SDWIVAQDCS 60
ETKNMLWKWV SQHRLFHLES QKCLGLDITK ATDNLRMFNC DSKAMLWWKC EHHSLYTAAQ 120
YSLALKDGYA TASTNSTDVW KKGGSKENLC DQPYHEIYTR DGNSYGRPCE FPFLIGKTWH 180
HDCIRDEDHK GPWCATTLNY EYDQKWGICL QPENGCEGNW EKNGQIGSCY QFNNQEVLTW 240
KEAYVSCQNQ GADLLSIHST AELAYITGKE DIARIVWIGL NQLYSARGWE WSDFKPLKFL 300
NWDPGMPIAP VIGGSSCARM DTESGLWQSV SCESQQPYIC KKPLNSTAER PDDWTDSDTL 360
CDGGWLPNKG FCYLLANESG SWDAARMKCK ALGADLTSIH SLADVELVVT KLHSGDVKEE 420
IWTGLKNTNS PTLFQWSDGT EVTVTHWNEN EPNVPYNKTP NCVSYLGKLG QWKVQSCEKE 480
LRYVCKKKGE ITKDARSDEL CPPNEGWKRH GETCYKIYEK EVPFGTNCNL TITSRFEQEY 540
LNDMMKNYDK SLQKYFWTGL RDADSRGEYS WAVAGGAKQA VTFSNWNFLE PASPGGCVAM 600
STGKTLGKWE VKNCRSFRAL SICKKMRGPQ ASEEAAPKPD DPCPEGWHTF PSSLSCYKVF 660
HIERIVRKRN WEEAERFCRA LGAHLPSFSH RSEIKDLVHL LKDQFSGQRW LWIGLNKRSP 720
DLQGSWQWSD RTPVSAVIMD WEFQQDYDIR DCAAVKVLDT PWHRAWHFYD EREYAYLKPF 780
ACDAKLEWVC QIPKGNTPQM PDWYNPERTG IHGPPVIIEG SEYWFVADPH LNYEEAVLYC 840
ASNQSFLATI TSFTGLKAIK NKIANISGDD QKWWVKTSEN PIDRYFLGSR RMWHRFPMTF 900
GDECLHMSAK TWILDLSKRA NCNAKLPFIC EKYNVSSLEK YSPDPSAKVR CTGKWIPFQN 960
KCFLKVNSEP VTFSQASSIC HSYGGTLPSV LNRNEQDFII SLLPEMEASL WLGLRWSAYE 1020
RINKWTDGQE LTYSNFHPLL VGRRLSIPTN FFDDESHFHC ALILNLKKSP LTGTWNFTSC 1080
SERHSLSLCQ KYSETEDRQP WENTSETVRY LNNLYKIVSK PLTWHAALKE CLTEGMRLVS 1140
ITDPYQQAFL AVQAALRNTS FWIGLSSQDD ELNFGWSDGK RLHFSNWAGS NEQLDDCVIL 1200
DTDGFWKTAD CDDSQPAAIC YYPGNETEEE VRPLDSAKCP SPVQSTPWIP FQNACYNFMI 1260
TKNRHKTVTP EEVQSMCKQL HSKAHSLSVR TEEENTFVVE QLLYYNYIAS WVMLGITYEN 1320
NSLMWFDKTA LSYTHWRKGR PTVKNGKFLA GLSTDGFWDI QSFNIIEETL HFYQHSIFAC 1380
KIEMVDYEDK HNSTLPQVIP YEDGVYNVIQ KKVTWYEALK ECSQSGGELA SVHNPNGKLF 1440
LEDVVNRDGF PLWVGLSSHD GSESSFEWSD GSAFDYVPWN SPQSPGDCVV LYPKGIWRHE 1500
RCPSVKDGAV CYKPTKPKEL SSHTQSSKCP VAKREDVQWI QYGGHCYSSD QALHSFSEAK 1560
QLCQELDHSA TVVTIADENE NKFVSRLMRE NFNITMRVWL GLSQHSLDQS WSWLDGLDVT 1620
FVKWENKSKD GDGKCSVLIA SNETWRKVEC SRGYARAVCK IPLSPDYRGI AILFAVLSVL 1680
ALISGLIWFL LQRTHFHWTG FSPVRYEHGA NEDEVMLPSF HD 1722 
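
Each row of the sequence block above consists of space-separated residue groups followed by the running position of the last residue in that row. A minimal sketch of reading that layout is shown here, using the last two rows of this record as input. The function name `parse_sequence_row` is hypothetical and is not part of any CPLM or UniProt tooling.

```python
def parse_sequence_row(line):
    """Split one sequence row into (residues, end_position).

    Assumes the last whitespace-separated token is the running residue count.
    """
    tokens = line.split()
    end_position = int(tokens[-1])
    residues = "".join(tokens[:-1])
    return residues, end_position


# Last two rows of the Protein Sequence block, copied from this record.
rows = [
    "FVKWENKSKD GDGKCSVLIA SNETWRKVEC SRGYARAVCK IPLSPDYRGI AILFAVLSVL 1680",
    "ALISGLIWFL LQRTHFHWTG FSPVRYEHGA NEDEVMLPSF HD 1722",
]

parsed = [parse_sequence_row(r) for r in rows]

# The count on the final row should exceed the previous row's count by
# exactly the number of residues on the final row.
last_residues, last_end = parsed[-1]
prev_end = parsed[-2][1]
consistent = (last_end - prev_end) == len(last_residues)
print(last_end, consistent)
```

The final running count agrees with the stated Protein Length of 1722 AA.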
Gene Ontology
 GO:0030246; F:carbohydrate binding; IEA:InterPro. 
Interpro
 IPR001304; C-type_lectin.
 IPR016186; C-type_lectin-like.
 IPR018378; C-type_lectin_CS.
 IPR016187; C-type_lectin_fold.
 IPR000562; FN_type2_col-bd.
 IPR013806; Kringle-like.
 IPR000772; Ricin_B_lectin. 
Pfam
 PF00040; fn2
 PF00059; Lectin_C 
SMART
 SM00034; CLECT
 SM00059; FN2
 SM00458; RICIN 
PROSITE
 PS00615; C_TYPE_LECTIN_1
 PS50041; C_TYPE_LECTIN_2
 PS00023; FN2_1
 PS51092; FN2_2
 PS50231; RICIN_B_LECTIN 
PRINTS