CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-023821
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Protein SOGA2 
Protein Synonyms/Alias
 Coiled-coil domain-containing protein 165 
Gene Name
 SOGA2 
Gene Synonyms/Alias
 CCDC165; KIAA0802 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position
Peptide
Type
References
941PDLQSRLKEQLEWQLubiquitination[1]
1377PEEEENHKGNLQRAVubiquitination[1]
Reference
 [1] Refined preparation and use of anti-diglycine remnant (K-ε-GG) antibody enables routine quantification of 10,000s of ubiquitination sites in single proteomics experiments.
 Udeshi ND, Svinkina T, Mertins P, Kuhn E, Mani DR, Qiao JW, Carr SA.
 Mol Cell Proteomics. 2013 Mar;12(3):825-31. [PMID: 23266961
Functional Description
  
Sequence Annotation
 MOD_RES 77 77 Phosphoserine.
 MOD_RES 263 263 Phosphoserine.
 MOD_RES 549 549 Phosphoserine.
 MOD_RES 618 618 Phosphoserine.
 MOD_RES 621 621 Phosphothreonine.
 MOD_RES 685 685 Phosphoserine.
 MOD_RES 749 749 Phosphoserine (By similarity).
 MOD_RES 776 776 Phosphoserine.
 MOD_RES 901 901 Phosphoserine.
 MOD_RES 923 923 Phosphoserine.
 MOD_RES 1385 1385 Phosphoserine.
 MOD_RES 1388 1388 Phosphoserine.
 MOD_RES 1399 1399 Phosphoserine.
 MOD_RES 1417 1417 Phosphothreonine.
 MOD_RES 1421 1421 Phosphoserine.
 MOD_RES 1427 1427 Phosphotyrosine.
 MOD_RES 1561 1561 Phosphoserine.
 MOD_RES 1578 1578 Phosphoserine.
 MOD_RES 1583 1583 Phosphoserine.
 MOD_RES 1592 1592 Phosphoserine.
 MOD_RES 1661 1661 Phosphoserine.
 MOD_RES 1667 1667 Phosphothreonine.
 MOD_RES 1675 1675 Phosphothreonine.
 MOD_RES 1679 1679 Phosphoserine.
 MOD_RES 1683 1683 Phosphoserine.
 MOD_RES 1812 1812 Phosphoserine.
 MOD_RES 1814 1814 Phosphoserine.  
Keyword
 Alternative splicing; Coiled coil; Complete proteome; Phosphoprotein; Polymorphism; Reference proteome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1905 AA 
Protein Sequence
METLNGPAGG GAPDAKLQPP GQHHRHHHLH PVAERRRLHR APSPARPFLK DLHARPAAPG 60
PAVPSSGRAP APAAPRSPNL AGKAPPSPGS LAAPGRLSRR SGGVPGAKDK PPPGAGARAA 120
GGAKAALGSR RAARVAPAEP LSRAGKPPGA EPPSAAAKGR KAKRGSRAPP ARTVGPPTPA 180
ARIPAVTLAV TSVAGSPARC SRISHTDSSS DLSDCPSEPL SDEQRLLPAA SSDAESGTGS 240
SDREPPRGAP TPSPAARGAP PGSPEPPALL AAPLAAGACP GGRSIPSGVS GGFAGPGVAE 300
DVRGRSPPER PVPGTPKEPS LGEQSRLVPA AEEEELLREM EELRSENDYL KDELDELRAE 360
MEEMRDSYLE EDVYQLQELR RELDRANKNC RILQYRLRKA EQKSLKVAET GQVDGELIRS 420
LEQDLKVAKD VSVRLHHELK TVEEKRAKAE DENETLRQQM IEVEISKQAL QNELERLKES 480
SLKRRSTREM YKEKKTFNQD DSADLRCQLQ FAKEEAFLMR KKMAKLGREK DELEQELQKY 540
KSLYGDVDSP LPTGEAGGPP STREAELKLR LKLVEEEANI LGRKIVELEV ENRGLKAEME 600
DMRGQQEREG PGRDHAPSIP TSPFGDSLES STELRRHLQF VEEEAELLRR SISEIEDHNR 660
QLTHELSKFK FEPPREPGWL GEGASPGAGG GAPLQEELKS ARLQISELSG KVLKLQHENH 720
ALLSNIQRCD LAAHLGLRAP SPRDSDAESD AGKKESDGEE SRLPQPKREG PVGGESDSEE 780
MFEKTSGFGS GKPSEASEPC PTELLKARED SEYLVTLKHE AQRLERTVER LITDTDSFLH 840
DAGLRGGAPL PGPGLQGEEE QGEGDQQEPQ LLGTINAKMK AFKKELQAFL EQVNRIGDGL 900
SPLPHLTESS SFLSTVTSVS RDSPIGNLGK ELGPDLQSRL KEQLEWQLGP ARGDERESLR 960
LRAARELHRR ADGDTGSHGL GGQTCFSLEM EEEHLYALRW KELEMHSLAL QNTLHERTWS 1020
DEKNLMQQEL RSLKQNIFLF YVKLRWLLKH WRQGKQMEEE GEEFTEGEHP ETLSRLGELG 1080
VQGGHQADGP DHDSDRGCGF PVGEHSPHSR VQIGDHSLRL QTADRGQPHK QVVENQQLFS 1140
AFKALLEDFR AELREDERAR LRLQQQYASD KAAWDVEWAV LKCRLEQLEE KTENKLGELG 1200
SSAESKGALK KEREVHQKLL ADSHSLVMDL RWQIHHSEKN WNREKVELLD RLDRDRQEWE 1260
RQKKEFLWRI EQLQKENSPR RGGSFLCDQK DGNVRPFPHQ GSLRMPRPVA MWPCADADSI 1320
PFEDRPLSKL KESDRCSASE NLYLDALSLD DEPEEPPAHR PEREFRNRLP EEEENHKGNL 1380
QRAVSVSSMS EFQRLMDISP FLPEKGLPST SSKEDVTPPL SPDDLKYIEE FNKSWDYTPN 1440
RGHNGGGPDL WADRTEVGRA GHEDSTEPFP DSSWYLTTSV TMTTDTMTSP EHCQKQPLRS 1500
HVLTEQSGLR VLHSPPAVRR VDSITAAGGE GPFPTSRARG SPGDTKGGPP EPMLSRWPCT 1560
SPRHSRDYVE GARRPLDSPL CTSLGFASPL HSLEMSKNLS DDMKEVAFSV RNAICSGPGE 1620
LQVKDMACQT NGSRTMGTQT VQTISVGLQT EALRGSGVTS SPHKCLTPKA GGGATPVSSP 1680
SRSLRSRQVA PAIEKVQAKF ERTCCSPKYG SPKLQRKPLP KADQPNNRTS PGMAQKGYSE 1740
SAWARSTTTR ESPVHTTIND GLSSLFNIID HSPVVQDPFQ KGLRAGSRSR SAEPRPELGP 1800
GQETGTNSRG RSPSPIGVGS EMCREEGGEG TPVKQDLSAP PGYTLTENVA RILNKKLLEH 1860
ALKEERRQAA HGPPGLHSDS HSLGDTAEPG PMENQTVLLT APWGL 1905 
Gene Ontology
  
Interpro
 IPR021507; DUF3166. 
Pfam
 PF11365; DUF3166 
SMART
  
PROSITE
  
PRINTS