CPLM 1.0 - Compendium of Protein Lysine Modification
Tag
Content
CPLM ID
CPLM-040784
UniProt Accession
F8W7G7_HUMAN
;
F8W7G7
Genbank Protein ID
AC012462
;
AC073284
Genbank Nucleotide ID
Protein Name
Ugl-Y3
Protein Synonyms/Alias
Gene Name
FN1
Gene Synonyms/Alias
Created Date
July 27, 2013
Organism
Homo sapiens (Human)
NCBI Taxa ID
9606
Lysine Modification
Position
Peptide
Type
References
1050
NVGPSVS
K
YPLRNLQ
ubiquitination
[1]
Reference
[1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [
PMID: 21890473
]
Functional Description
Sequence Annotation
Keyword
Complete proteome; Disulfide bond; Reference proteome; Repeat.
Sequence Source
UniProt (SWISSPROT/TrEMBL); GenBank; EMBL
Protein Length
2211 AA
Protein Sequence
MLRGPGPGLL LLAVQCLGTA VPSTGASKSK RQAQQMVQPQ SPVAVSQSKP GCYDNGKHYQ 60
INQQWERTYL GNALVCTCYG GSRGFNCESK PEAEETCFDK YTGNTYRVGD TYERPKDSMI 120
WDCTCIGAGR GRISCTIANR CHEGGQSYKI GDTWRRPHET GGYMLECVCL GNGKGEWTCK 180
PIAEKCFDHA AGTSYVVGET WEKPYQGWMM VDCTCLGEGS GRITCTSRNR CNDQDTRTSY 240
RIGDTWSKKD NRGNLLQCIC TGNGRGEWKC ERHTSVQTTS SGSGPFTDVR AAVYQPQPHP 300
QPPPYGHCVT DSGVVYSVGM QWLKTQGNKQ MLCTCLGNGV SCQETAVTQT YGGNSNGEPC 360
VLPFTYNGRT FYSCTTEGRQ DGHLWCSTTS NYEQDQKYSF CTDHTVLVQT RGGNSNGALC 420
HFPFLYNNHN YTDCTSEGRR DNMKWCGTTQ NYDADQKFGF CPMAAHEEIC TTNEGVMYRI 480
GDQWDKQHDM GHMMRCTCVG NGRGEWTCIA YSQLRDQCIV DDITYNVNDT FHKRHEEGHM 540
LNCTCFGQGR GRWKCDPVDQ CQDSETGTFY QIGDSWEKYV HGVRYQCYCY GRGIGEWHCQ 600
PLQTYPSSSG PVEVFITETP SQPNSHPIQW NAPQPSHISK YILRWRPKNS VGRWKEATIP 660
GHLNSYTIKG LKPGVVYEGQ LISIQQYGHQ EVTRFDFTTT STSTPVTSNT VTGETTPFSP 720
LVATSESVTE ITASSFVVSW VSASDTVSGF RVEYELSEEG DEPQYLDLPS TATSVNIPDL 780
LPGRKYIVNV YQISEDGEQS LILSTSQTTA PDAPPDTTVD QVDDTSIVVR WSRPQAPITG 840
YRIVYSPSVE GSSTELNLPE TANSVTLSDL QPGVQYNITI YAVEENQEST PVVIQQETTG 900
TPRSDTVPSP RDLQFVEVTD VKVTIMWTPP ESAVTGYRVD VIPVNLPGEH GQRLPISRNT 960
FAEVTGLSPG VTYYFKVFAV SHGRESKPLT AQQTTKLDAP TNLQFVNETD STVLVRWTPP 1020
RAQITGYRLT VGLTRRGQPR QYNVGPSVSK YPLRNLQPAS EYTVSLVAIK GNQESPKATG 1080
VFTTLQPGSS IPPYNTEVTE TTIVITWTPA PRIGFKLGVR PSQGGEAPRE VTSDSGSIVV 1140
SGLTPGVEYV YTIQVLRDGQ ERDAPIVNKV VTPLSPPTNL HLEANPDTGV LTVSWERSTT 1200
PDITGYRITT TPTNGQQGNS LEEVVHADQS SCTFDNLSPG LEYNVSVYTV KDDKESVPIS 1260
DTIIPAVPPP TDLRFTNIGP DTMRVTWAPP PSIDLTNFLV RYSPVKNEED VAELSISPSD 1320
NAVVLTNLLP GTEYVVSVSS VYEQHESTPL RGRQKTGLDS PTGIDFSDIT ANSFTVHWIA 1380
PRATITGYRI RHHPEHFSGR PREDRVPHSR NSITLTNLTP GTEYVVSIVA LNGREESPLL 1440
IGQQSTVSDV PRDLEVVAAT PTSLLISWDA PAVTVRYYRI TYGETGGNSP VQEFTVPGSK 1500
STATISGLKP GVDYTITVYA VTGRGDSPAS SKPISINYRT EIDKPSQMQV TDVQDNSISV 1560
KWLPSSSPVT GYRVTTTPKN GPGPTKTKTA GPDQTEMTIE GLQPTVEYVV SVYAQNPSGE 1620
SQPLVQTAVT NIDRPKGLAF TDVDVDSIKI AWESPQGQVS RYRVTYSSPE DGIHELFPAP 1680
DGEEDTAELQ GLRPGSEYTV SVVALHDDME SQPLIGTQST AIPAPTDLKF TQVTPTSLSA 1740
QWTPPNVQLT GYRVRVTPKE KTGPMKEINL APDSSSVVVS GLMVATKYEV SVYALKDTLT 1800
SRPAQGVVTT LENVSPPRRA RVTDATETTI TISWRTKTET ITGFQVDAVP ANGQTPIQRT 1860
IKPDVRSYTI TGLQPGTDYK IYLYTLNDNA RSSPVVIDAS TAIDAPSNLR FLATTPNSLL 1920
VSWQPPRARI TGYIIKYEKP GSPPREVVPR PRPGVTEATI TGLEPGTEYT IYVIALKNNQ 1980
KSEPLIGRKK TGQEALSQTT ISWAPFQDTS EYIISCHPVG TDEEPLQFRV PGTSTSATLT 2040
GLTRGATYNV IVEALKDQQR HKVREEVVTV GNSGWCHDNG VNYKIGEKWD RQGENGQMMS 2100
CTCLGNGKGE FKCDPHEATC YDDGKTYHVG EQWQKEYLGA ICSCTCFGGQ RGWRCDNCRR 2160
PGGEPSPEGT TGQSYNQYSQ RYHQRTNTNV NCPIECFMPL DVQADREDSR E 2211
Gene Ontology
GO:0005576
; C:extracellular region; IEA:InterPro.
Interpro
IPR000083
; Fibronectin_type1.
IPR003961
; Fibronectin_type3.
IPR000562
; FN_type2_col-bd.
IPR013783
; Ig-like_fold.
IPR013806
; Kringle-like.
Pfam
PF00039
; fn1
PF00040
; fn2
PF00041
; fn3
SMART
SM00058
; FN1
SM00059
; FN2
SM00060
; FN3
PROSITE
PS01253
; FN1_1
PS51091
; FN1_2
PS00023
; FN2_1
PS51092
; FN2_2
PS50853
; FN3
PRINTS