CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-018851
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Neogenin 
Protein Synonyms/Alias
 Immunoglobulin superfamily DCC subclass member 2 
Gene Name
 NEO1 
Gene Synonyms/Alias
 IGDCC2; NGN 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
1173HHERLELKPIDKSPDubiquitination[1, 2]
Reference
 [1] A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles.
 Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, Choudhary C.
 Mol Cell Proteomics. 2011 Oct;10(10):M111.013284. [PMID: 21890473]
 [2] hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.
 Chen Z, Zhou Y, Song J, Zhang Z.
 Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. [PMID: 23603789
Functional Description
 May be involved as a regulatory protein in the transition of undifferentiated proliferating cells to their differentiated state. May also function as a cell adhesion molecule in a broad spectrum of embryonic and adult tissues. 
Sequence Annotation
 DOMAIN 52 141 Ig-like C2-type 1.
 DOMAIN 152 238 Ig-like C2-type 2.
 DOMAIN 243 336 Ig-like C2-type 3.
 DOMAIN 341 426 Ig-like C2-type 4.
 DOMAIN 439 532 Fibronectin type-III 1.
 DOMAIN 539 627 Fibronectin type-III 2.
 DOMAIN 633 728 Fibronectin type-III 3.
 DOMAIN 735 827 Fibronectin type-III 4.
 DOMAIN 853 949 Fibronectin type-III 5.
 DOMAIN 954 1051 Fibronectin type-III 6.
 CARBOHYD 73 73 N-linked (GlcNAc...) (Potential).
 CARBOHYD 210 210 N-linked (GlcNAc...).
 CARBOHYD 326 326 N-linked (GlcNAc...) (Potential).
 CARBOHYD 470 470 N-linked (GlcNAc...).
 CARBOHYD 489 489 N-linked (GlcNAc...).
 CARBOHYD 639 639 N-linked (GlcNAc...).
 CARBOHYD 715 715 N-linked (GlcNAc...) (Potential).
 CARBOHYD 909 909 N-linked (GlcNAc...) (Potential).
 DISULFID 74 129 By similarity.
 DISULFID 173 221 By similarity.
 DISULFID 270 320 By similarity.
 DISULFID 362 410 By similarity.  
Keyword
 3D-structure; Alternative splicing; Cell adhesion; Cell membrane; Complete proteome; Disulfide bond; Glycoprotein; Immunoglobulin domain; Membrane; Polymorphism; Reference proteome; Repeat; Signal; Transmembrane; Transmembrane helix. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1461 AA 
Protein Sequence
MAAERGARRL LSTPSFWLYC LLLLGRRAPG AAAARSGSAP QSPGASIRTF TPFYFLVEPV 60
DTLSVRGSSV ILNCSAYSEP SPKIEWKKDG TFLNLVSDDR RQLLPDGSLF ISNVVHSKHN 120
KPDEGYYQCV ATVESLGTII SRTAKLIVAG LPRFTSQPEP SSVYAGNNAI LNCEVNADLV 180
PFVRWEQNRQ PLLLDDRVIK LPSGMLVISN ATEGDGGLYR CVVESGGPPK YSDEVELKVL 240
PDPEVISDLV FLKQPSPLVR VIGQDVVLPC VASGLPTPTI KWMKNEEALD TESSERLVLL 300
AGGSLEISDV TEDDAGTYFC IADNGNETIE AQAELTVQAQ PEFLKQPTNI YAHESMDIVF 360
ECEVTGKPTP TVKWVKNGDM VIPSDYFKIV KEHNLQVLGL VKSDEGFYQC IAENDVGNAQ 420
AGAQLIILEH APATTGPLPS APRDVVASLV STRFIKLTWR TPASDPHGDN LTYSVFYTKE 480
GIARERVENT SHPGEMQVTI QNLMPATVYI FRVMAQNKHG SGESSAPLRV ETQPEVQLPG 540
PAPNLRAYAA SPTSITVTWE TPVSGNGEIQ NYKLYYMEKG TDKEQDVDVS SHSYTINGLK 600
KYTEYSFRVV AYNKHGPGVS TPDVAVRTLS DVPSAAPQNL SLEVRNSKSI MIHWQPPAPA 660
TQNGQITGYK IRYRKASRKS DVTETLVSGT QLSQLIEGLD RGTEYNFRVA ALTINGTGPA 720
TDWLSAETFE SDLDETRVPE VPSSLHVRPL VTSIVVSWTP PENQNIVVRG YAIGYGIGSP 780
HAQTIKVDYK QRYYTIENLD PSSHYVITLK AFNNVGEGIP LYESAVTRPH TDTSEVDLFV 840
INAPYTPVPD PTPMMPPVGV QASILSHDTI RITWADNSLP KHQKITDSRY YTVRWKTNIP 900
ANTKYKNANA TTLSYLVTGL KPNTLYEFSV MVTKGRRSST WSMTAHGTTF ELVPTSPPKD 960
VTVVSKEGKP KTIIVNWQPP SEANGKITGY IIYYSTDVNA EIHDWVIEPV VGNRLTHQIQ 1020
ELTLDTPYYF KIQARNSKGM GPMSEAVQFR TPKADSSDKM PNDQASGSGG KGSRLPDLGS 1080
DYKPPMSGSN SPHGSPTSPL DSNMLLVIIV SVGVITIVVV VIIAVFCTRR TTSHQKKKRA 1140
ACKSVNGSHK YKGNSKDVKP PDLWIHHERL ELKPIDKSPD PNPIMTDTPI PRNSQDITPV 1200
DNSMDSNIHQ RRNSYRGHES EDSMSTLAGR RGMRPKMMMP FDSQPPQPVI SAHPIHSLDN 1260
PHHHFHSSSL ASPARSHLYH PGSPWPIGTS MSLSDRANST ESVRNTPSTD TMPASSSQTC 1320
CTDHQDPEGA TSSSYLASSQ EEDSGQSLPT AHVRPSHPLK SFAVPAIPPP GPPTYDPALP 1380
STPLLSQQAL NHHIHSVKTA SIGTLGRSRP PMPVVVPSAP EVQETTRMLE DSESSYEPDE 1440
LTKEMAHLEG LMKDLNAITT A 1461 
Gene Ontology
 GO:0005887; C:integral to plasma membrane; TAS:ProtInc.
 GO:0004872; F:receptor activity; IEA:Compara.
 GO:0007411; P:axon guidance; TAS:Reactome.
 GO:0007155; P:cell adhesion; NAS:ProtInc.
 GO:0055072; P:iron ion homeostasis; IGI:MGI.
 GO:0042692; P:muscle cell differentiation; TAS:Reactome.
 GO:0007520; P:myoblast fusion; IEA:Compara.
 GO:0051149; P:positive regulation of muscle cell differentiation; TAS:Reactome.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:Compara. 
Interpro
 IPR003961; Fibronectin_type3.
 IPR007110; Ig-like_dom.
 IPR013783; Ig-like_fold.
 IPR013098; Ig_I-set.
 IPR003599; Ig_sub.
 IPR003598; Ig_sub2.
 IPR010560; Neogenin_C. 
Pfam
 PF00041; fn3
 PF07679; I-set
 PF06583; Neogenin_C 
SMART
 SM00060; FN3
 SM00409; IG
 SM00408; IGc2 
PROSITE
 PS50853; FN3
 PS50835; IG_LIKE 
PRINTS