CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID
 CPLM-016317
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Extracellular matrix protein FRAS1 
Protein Synonyms/Alias
  
Gene Name
 FRAS1 
Gene Synonyms/Alias
 KIAA1500 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
 Position   Peptide           Type        References
 3543       IEFKTHAKFRGQFVM   glycation   [1]
Reference
 [1] Proteomic profiling of nonenzymatically glycated proteins in human plasma and erythrocyte membranes.
 Zhang Q, Tang N, Schepmoes AA, Phillips LS, Smith RD, Metz TO.
 J Proteome Res. 2008 May;7(5):2025-32. [PMID: 18396901]
Functional Description
  
Sequence Annotation
 DOMAIN 27 88 VWFC 1.
 DOMAIN 93 153 VWFC 2.
 DOMAIN 157 217 VWFC 3.
 DOMAIN 219 279 VWFC 4.
 DOMAIN 283 343 VWFC 5.
 DOMAIN 347 417 VWFC 6.
 REPEAT 409 460 FU 1.
 REPEAT 462 505 FU 2.
 REPEAT 507 553 FU 3.
 REPEAT 555 599 FU 4.
 REPEAT 602 647 FU 5.
 REPEAT 649 705 FU 6.
 REPEAT 708 753 FU 7.
 REPEAT 755 800 FU 8.
 REPEAT 803 852 FU 9.
 REPEAT 854 900 FU 10.
 REPEAT 903 948 FU 11.
 REPEAT 952 997 FU 12.
 REPEAT 999 1042 FU 13.
 REPEAT 1046 1089 FU 14.
 REPEAT 1090 1201 CSPG 1.
 REPEAT 1202 1310 CSPG 2.
 REPEAT 1311 1445 CSPG 3.
 REPEAT 1446 1574 CSPG 4.
 REPEAT 1575 1691 CSPG 5.
 REPEAT 1692 1821 CSPG 6.
 REPEAT 1822 1939 CSPG 7.
 REPEAT 1940 2059 CSPG 8.
 REPEAT 2060 2178 CSPG 9.
 REPEAT 2179 2293 CSPG 10.
 REPEAT 2294 2415 CSPG 11.
 REPEAT 2418 2542 CSPG 12.
 DOMAIN 2543 2646 Calx-beta 1.
 DOMAIN 2659 2770 Calx-beta 2.
 DOMAIN 2784 2890 Calx-beta 3.
 DOMAIN 2905 3007 Calx-beta 4.
 DOMAIN 3025 3129 Calx-beta 5.
 MOD_RES 344 344 Phosphoserine.
 CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).
 CARBOHYD 728 728 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1093 1093 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1108 1108 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1504 1504 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1777 1777 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1948 1948 N-linked (GlcNAc...) (Potential).
 CARBOHYD 1978 1978 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2563 2563 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2664 2664 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2682 2682 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2908 2908 N-linked (GlcNAc...) (Potential).
 CARBOHYD 2985 2985 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3070 3070 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3218 3218 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3676 3676 N-linked (GlcNAc...) (Potential).
 CARBOHYD 3875 3875 N-linked (GlcNAc...) (Potential).  
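The annotation lines above appear to follow a fixed layout: feature key, start position, end position, then a free-text description ending in a period. As a minimal sketch under that assumption, they can be parsed into records and queried by residue position. The names `Feature`, `parse_feature`, and `features_at` are illustrative only and are not part of CPLM or UniProt.

```python
from typing import NamedTuple

# A few lines copied from the Sequence Annotation field above.
SAMPLE = [
    "DOMAIN 347 417 VWFC 6.",
    "REPEAT 409 460 FU 1.",
    "MOD_RES 344 344 Phosphoserine.",
    "CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).",
]


class Feature(NamedTuple):
    key: str    # feature type, e.g. DOMAIN or CARBOHYD
    start: int  # first residue, 1-based
    end: int    # last residue, 1-based, inclusive
    note: str   # description without the trailing period


def parse_feature(line):
    """Split one annotation line into key, start, end, and description."""
    key, start, end, note = line.split(None, 3)
    return Feature(key, int(start), int(end), note.strip().rstrip("."))


def features_at(features, position):
    """Return the features whose inclusive range covers the position."""
    return [f for f in features if f.start <= position <= f.end]


features = [parse_feature(line) for line in SAMPLE]
for f in features_at(features, 361):
    print(f.key, f.start, f.end, f.note)
```

With only the sample lines loaded, position 361 is covered by the VWFC 6 domain and by the N-linked glycosylation site at 361.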
Keyword
 Alternative splicing; Calcium; Cell membrane; Complete proteome; Glycoprotein; Membrane; Metal-binding; Phosphoprotein; Polymorphism; Reference proteome; Repeat; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 4008 AA 
Protein Sequence
MGVLKVWLGL ALALAEFAVL PHHSEGACVY QDSLLADATI WKPDSCQSCR CHGDIVICKP 60
AVCRNPQCAF EKGEVLQIAA NQCCPECVLR TPGSCHHEKK IHEHGTEWAS SPCSVCSCNH 120
GEVRCTPQPC PPLSCGHQEL AFIPEGSCCP VCVGLGKPCS YEGHVFQDGE DWRLSRCAKC 180
LCRNGVAQCF TAQCQPLFCN QDETVVRVPG KCCPQCSARS CSAAGQVYEH GEQWSENACT 240
TCICDRGEVR CHKQACLPLR CGKGQSRARR HGQCCEECVS PAGSCSYDGV VRYQDEMWKG 300
SACEFCMCDH GQVTCQTGEC AKVECARDEE LIHLDGKCCP ECISRNGYCV YEETGEFMSS 360
NASEVKRIPE GEKWEDGPCK VCECRGAQVT CYEPSCPPCP VGTLALEVKG QCCPDCTSVH 420
CHPDCLTCSQ SPDHCDLCQD PTKLLQNGWC VHSCGLGFYQ AGSLCLACQP QCSTCTSGLE 480
CSSCQPPLLM RHGQCVPTCG DGFYQDRHSC AVCHESCAGC WGPTEKHCLA CRDPLHVLRD 540
GGCESSCGKG FYNRQGTCSA CDQSCDSCGP SSPRCLTCTE KTVLHDGKCM SECPGGYYAD 600
ATGRCKVCHN SCASCSGPTP SHCTACSPPK ALRQGHCLPR CGEGFYSDHG VCKACHSSCL 660
ACMGPAPSHC TGCKKPEEGL QVEQLSDVGI PSGECLAQCR AHFYLESTGI CEACHQSCFR 720
CAGKSPHNCT DCGPSHVLLD GQCLSQCPDG YFHQEGSCTE CHPTCRQCHG PLESDCISCY 780
PHISLTNGNC RTSCREEQFL NLVGYCADCH HLCQHCAADL HNTGSICLRC QNAHYLLLGD 840
HCVPDCPSGY YAERGACKKC HSSCRTCQGR GPFSCSSCDT NLVLSHTGTC STTCFPGHYL 900
DDNHVCQPCN THCGSCDSQA SCTSCRDPNK VLLFGECQYE SCAPQYYLDF STNTCKECDW 960
SCSACSGPLK TDCLQCMDGY VLQDGACVEQ CLSSFYQDSG LCKNCDSYCL QCQGPHECTR 1020
CKGPFLLLEA QCVQECGKGY FADHAKHKCT ACPQGCLQCS HRDRCHLCDH GFFLKSGLCV 1080
YNCVPGFSVH TSNETCSGKI HTPSLHVNGS LILPIGSIKP LDFSLLNVQD QEGRVEDLLF 1140
HVVSTPTNGQ LVLSRNGKEV QLDKAGRFSW KDVNEKKVRF VHSKEKLRKG YLFLKISDQQ 1200
FFSEPQLINI QAFSTQAPYV LRNEVLHISR GERATITTQM LDIRDDDNPQ DVVIEIIDPP 1260
LHGQLLQTLQ SPATPIYQFQ LDELSRGLLH YAHDGSDSTS DVAVLQANDG HSFHNILFQV 1320
KTVPQNDRGL QLVANSMVWV PEGGMLQITN RILQAEAPGA SAEEIIYKIT QDYPQFGEVV 1380
LLVNMPADSP ADEGQHLPDG RTATPTSTFT QQDINEGIVW YRHSGAPAQS DSFRFEVSSA 1440
SNAQTRLESH MFNIAILPQT PEAPKVSLEA SLHMTAREDG LTVIQPHSLS FINSEKPSGK 1500
IVYNITLPLH PNQGIIEHRD HPHSPIRYFT QEDINQGKVM YRPPPAAPHL QELMAFSFAG 1560
LPESVKFHFT VSDGEHTSPE MVLTIHLLPS DQQLPVFQVT APRLAVSPGG STSVGLQVVV 1620
RDAETAPKEL FFELRRPPQH GVLLKHTAEF RRPMATGDTF TYEDVEKNAL QYIHDGSSTR 1680
EDSMEISVTD GLTVTMLEVR VEVSLSEDRG PRLAAGSSLS ITVASKSTAI ITRSHLAYVD 1740
DSSPDPEIWI QLNYLPSYGT LLRISGSEVE ELSEVSNFTM EDINNKKIRY SAVFETDGHL 1800
VTDSFYFSVS DMDHNHLDNQ IFTIMITPAE NPPPVIAFAD LITVDEGGRA PLSFHHFFAT 1860
DDDDNLQRDA IIKLSALPKY GCIENTGTGD RFGPETASDL EASFPIQDVL ENYIYYFQSV 1920
HESIEPTHDI FSFYVSDGTS RSEIHSINIT IERKNDEPPR MTLQPLRVQL SSGVVISNSS 1980
LSLQDLDTPD NELIFVLTKK PDHGHVLWRQ TASEPLENGR VLVQGSTFTY QDILAGLVGY 2040
VPSVPGMVVD EFQFSLTDGL HVDTGRMKIY TELPASDTPH LAINQGLQLS AGSVARITEQ 2100
HLKVTDIDSD DHQVMYIMKE DPGAGRLQMM KHGNLEQISI KGPIRSFTQA DISQGQPEYS 2160
HGTGEPGGSF AFKFDVVDGE GNRLIDKSFS ISISEDKSPP VITTNKGLVL DENSVKKITT 2220
LQLSATDQDS GPTELIYRIT RQPQLGHLEH AASPGIQISS FTQADLTSRN VQYVHSSEAE 2280
KHSDAFSFTL SDGVSEVTQT FHITLHPVDD SLPVVQNLGM RVQEGMRKTI TEFELKAVDA 2340
DTEAESVTFT IVQPPRHGTI ERTSNGQHFH LTSTFTMKDI YQNRVSYSHD GSNSLKDRFT 2400
FTVSDGTNPF FIIEEGGKEI MTAAPQPFRV DILPVDDGTP RIVTNLGLQW LEYMDGKATN 2460
LITKKELLTM DPDTEDAQLV YEITTGPKHG FVENKLQPGR AAATFTQEDV NLGLIRYVLH 2520
KEKIREMMDS FQFLVKDSKP NVVSDNVFHI QWSLISFKYT SYNVSEKAGS VSVTVQRTGN 2580
LNQYAIVLCR TEQGTASSSS QPGQQDYVEY AGQVQFDERE DTKSCTIVIN DDDVFENVES 2640
FTVELSMPAY ALLGEFTQAK VIINDTEDEP TLEFDKKIYW VNESAGFLFA PIERKGDASS 2700
IVSAICYTVP KSAMGSLFYA LESGSDFKSR GMSAASRVIF GPGVTMSTCD VMLIDDSEYE 2760
EEEEFEIALA DASDNARIGR VATAKVLISG PNDASTVSLG NTAFTVSEDA GTVKIPVIRH 2820
GTDLSTFASV WCATRPSDPA SATPGVDYVP SSRKVEFGPG VIEQYCTLTI LDDTQYPVIE 2880
GLETFVVFLS SAQGAELTKP FQAVIAINDT FQDVPSMQFA KDLLLVKEKE GVLHVPITRS 2940
GDLSYESSVR CYTQSHSAQV MEDFEERQNA DSSRITFLKG DKVKNCTVYI HDDSMFEPEE 3000
QFRVYLGLPL GNHWSGARIG KNNMATITIS NDEDAPTIEF EEAAYQVREP AGPDAIAILN 3060
IKVIRRGDQN RTSKVRCSTR DGSAQSGVDY YPKSRVLKFS PGVDHIFFKV EILSNEDREW 3120
HESFSLVLGP DDPVEAVLGD VTTATVTILD QEAAGSLILP APPIVVTLAD YDHVEEVTKE 3180
GVKKSPSPGY PLVCVTPCDP HFPRYAVMKE RCSEAGINQT SVQFSWEVAA PTDGNGARSP 3240
FETITDNTPF TSVNHMVLDS IYFSRRFHVR CVAKAVDKVG HVGTPLRSNI VTIGTDSAIC 3300
HTPVVAGTSR GFQAQSFIAT LKYLDVKHKE HPNRIHISVQ IPHQDGMLPL ISTMPLHNLH 3360
FLLSESIYRH QHVCSNLVTT YDLRGLAEAG FLDDVVYDST ALGPGYDRPF QFDPSVREPK 3420
TIQLYKHLNL KSCVWTFDAY YDMTELIDVC GGSVTADFQV RDSAQSFLTV HVPLYVSYIY 3480
VTAPRGWASL EHHTEMEFSF FYDTVLWRTG IQTDSVLSAR LQIIRIYIRE DGRLVIEFKT 3540
HAKFRGQFVM EHHTLPEVKS FVLTPDHLGG IEFDLQLLWS AQTFDSPHQL WRATSSYNRK 3600
DYSGEYTIYL IPCTVQPTQP WVDPGEKPLA CTAHAPERFL IPIAFQQTNR PVPVVYSLNT 3660
EFQLCNNEKV FLMDPNTSDM SLAEMDYKGA FSKGQILYGR VLWNPEQNLN SAYKLQLEKV 3720
YLCTGKDGYV PFFDPTGTIY NEGPQYGCIQ PNKHLKHRFL LLDRNQPEVT DKYFHDVPFE 3780
AHFASELPDF HVVSNMPGVD GFTLKVDALY KVEAGHQWYL QVIYIIGPDT ISGPRVQRSL 3840
TAPLRRNRRD LVEPDGQLIL DDSLIYDNEG DQVKNGTNMK SLNLEMQELA VAASLSQTGA 3900
SIGSALAAIM LLLLVFLVAC FINRKCQKQR KKKPAEDILE EYPLNTKVEV PKRHPDRVEK 3960
NVNRHYCTVR NVNILSEPEA AYTFKGAKVK RLNLEVRVHN NLQDGTEV 4008 
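The sequence above is printed in blocks of ten residues, and the number ending each line appears to be the 1-based index of that line's last residue. As a minimal sketch under that assumption, the two lines spanning the listed glycation site can be indexed to check that position 3543 is a lysine and that the 15-residue window around it matches the peptide given under Lysine Modification. The helper names `index_residues` and `peptide_window` and the 7-residue flank are illustrative assumptions, not CPLM definitions.

```python
# Two lines copied from the Protein Sequence field above; the trailing
# number is assumed to be the 1-based index of the last residue on the line.
RECORD_LINES = [
    "VTAPRGWASL EHHTEMEFSF FYDTVLWRTG IQTDSVLSAR LQIIRIYIRE DGRLVIEFKT 3540",
    "HAKFRGQFVM EHHTLPEVKS FVLTPDHLGG IEFDLQLLWS AQTFDSPHQL WRATSSYNRK 3600",
]


def index_residues(lines):
    """Map 1-based sequence positions to residues for the given lines."""
    residues = {}
    for line in lines:
        *blocks, counter = line.split()
        chunk = "".join(blocks)
        # Derive the first position from the end counter and the chunk length.
        start = int(counter) - len(chunk) + 1
        for offset, aa in enumerate(chunk):
            residues[start + offset] = aa
    return residues


def peptide_window(residues, position, flank=7):
    """Join the residues from position - flank to position + flank that are present."""
    span = range(position - flank, position + flank + 1)
    return "".join(residues[i] for i in span if i in residues)


residues = index_residues(RECORD_LINES)
site = 3543
print(residues[site])                  # residue at the listed position
print(peptide_window(residues, site))  # 15-residue window around it
```

Because the start of each line is derived from its end counter, the same approach should also handle the shorter final line of the sequence.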
Gene Ontology
 GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
 GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
 GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
 GO:0007154; P:cell communication; IEA:InterPro. 
Interpro
 IPR003644; Calx_beta.
 IPR000742; EG-like_dom.
 IPR006212; Furin_repeat.
 IPR009030; Growth_fac_rcpt_N_dom.
 IPR001007; VWF_C. 
Pfam
 PF03160; Calx-beta
 PF00093; VWC 
SMART
 SM00237; Calx_beta
 SM00181; EGF
 SM00261; FU
 SM00214; VWC 
PROSITE
 PS01208; VWFC_1
 PS50184; VWFC_2 
PRINTS