CPLM 1.0 - Compendium of Protein Lysine Modification
Tag Content
CPLM ID CPLM-010794
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Heterogeneous nuclear ribonucleoprotein U 
Protein Synonyms/Alias
 hnRNP U; Scaffold attachment factor A; SAF-A; p120; pp120 
Gene Name
 HNRNPU 
Gene Synonyms/Alias
 HNRPU; SAFA; U21.1 
Created Date
 July 27, 2013 
Organism
 Homo sapiens (Human) 
NCBI Taxa ID
 9606 
Lysine Modification
Position  Peptide          Type            References
215       QQAGGDGKTEQKGGD  ubiquitination  [1]
Reference
 [1] Systematic and quantitative assessment of the ubiquitin-modified proteome.
 Kim W, Bennett EJ, Huttlin EL, Guo A, Li J, Possemato A, Sowa ME, Rad R, Rush J, Comb MJ, Harper JW, Gygi SP.
 Mol Cell. 2011 Oct 21;44(2):325-40. [PMID: 21906983]
Functional Description
 Component of the CRD-mediated complex that promotes MYC mRNA stabilization. Binds to pre-mRNA. Has high affinity for scaffold-attached region (SAR) DNA. Binds to double- and single-stranded DNA and RNA. 
Sequence Annotation
 DOMAIN 8 42 SAP.
 DOMAIN 267 464 B30.2/SPRY.
 NP_BIND 504 511 ATP (Potential).
 REGION 714 739 RNA-binding RGG-box.
 MOD_RES 2 2 N-acetylserine; partial.
 MOD_RES 4 4 Phosphoserine.
 MOD_RES 59 59 Phosphoserine.
 MOD_RES 66 66 Phosphoserine.
 MOD_RES 265 265 N6-acetyllysine.
 MOD_RES 271 271 Phosphoserine.
 MOD_RES 352 352 N6-acetyllysine.
 MOD_RES 516 516 N6-acetyllysine.
 MOD_RES 524 524 N6-acetyllysine.
 MOD_RES 551 551 N6-acetyllysine.
 MOD_RES 565 565 N6-acetyllysine.
 MOD_RES 635 635 N6-acetyllysine.
 MOD_RES 739 739 Dimethylated arginine; in A2780 ovarian
 MOD_RES 739 739 Omega-N-methylated arginine.
 MOD_RES 814 814 N6-acetyllysine.  
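The annotation lines above appear to follow a simple pattern: a feature key, a start position, an end position, and a free-text description ending in a period. The following is a minimal Python sketch for reading such lines, assuming that whitespace-separated layout holds for every line. The names `Feature` and `parse_feature` are hypothetical and not part of CPLM or UniProt tooling, and only three of the lines are reproduced.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str          # feature key, e.g. DOMAIN or MOD_RES
    start: int        # 1-based start position
    end: int          # 1-based end position (equal to start for a single residue)
    description: str  # free text with the trailing period removed


def parse_feature(line: str) -> Feature:
    """Split one annotation line into key, start, end and description.

    Assumes the layout 'KEY START END DESCRIPTION.' seen in the lines above.
    """
    key, start, end, description = line.strip().split(maxsplit=3)
    return Feature(key, int(start), int(end), description.rstrip("."))


# Three lines copied from the Sequence Annotation block.
lines = [
    " DOMAIN 8 42 SAP.",
    " MOD_RES 265 265 N6-acetyllysine.",
    " NP_BIND 504 511 ATP (Potential).",
]
features = [parse_feature(line) for line in lines]

# Positions annotated as acetylated lysines among the parsed lines.
acetyl_sites = [
    f.start for f in features
    if f.key == "MOD_RES" and "acetyllysine" in f.description
]
print(features)
print(acetyl_sites)
```

A line whose description is cut off, or that has fewer than four fields, would need separate handling and is not covered by this sketch.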
Keyword
 3D-structure; Acetylation; Alternative splicing; ATP-binding; Complete proteome; Cytoplasm; Direct protein sequencing; DNA-binding; Methylation; mRNA processing; mRNA splicing; Nucleotide-binding; Nucleus; Phosphoprotein; Polymorphism; Reference proteome; Ribonucleoprotein; RNA-binding; Spliceosome. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 825 AA 
Protein Sequence
MSSSPVNVKK LKVSELKEEL KKRRLSDKGL KAELMERLQA ALDDEEAGGR PAMEPGNGSL 60
DLGGDSAGRS GAGLEQEAAA GGDEEEEEEE EEEEGISALD GDQMELGEEN GAAGAADSGP 120
MEEEEAASED ENGDDQGFQE GEDELGDEEE GAGDENGHGE QQPQPPATQQ QQPQQQRGAA 180
KEAAGKSSGP TSLFAVTVAP PGARQGQQQA GGDGKTEQKG GDKKRGVKRP REDHGRGYFE 240
YIEENKYSRA KSPQPPVEEE DEHFDDTVVC LDTYNCDLHF KISRDRLSAS SLTMESFAFL 300
WAGGRASYGV SKGKVCFEMK VTEKIPVRHL YTKDIDIHEV RIGWSLTTSG MLLGEEEFSY 360
GYSLKGIKTC NCETEDYGEK FDENDVITCF ANFESDEVEL SYAKNGQDLG VAFKISKEVL 420
AGRPLFPHVL CHNCAVEFNF GQKEKPYFPI PEEYTFIQNV PLEDRVRGPK GPEEKKDCEV 480
VMMIGLPGAG KTTWVTKHAA ENPGKYNILG TNTIMDKMMV AGFKKQMADT GKLNTLLQRA 540
PQCLGKFIEI AARKKRNFIL DQTNVSAAAQ RRKMCLFAGF QRKAVVVCPK DEDYKQRTQK 600
KAEVEGKDLP EHAVLKMKGN FTLPEVAECF DEITYVELQK EEAQKLLEQY KEESKKALPP 660
EKKQNTGSKK SNKNKSGKNQ FNRGGGHRGR GGFNMRGGNF RGGAPGNRGG YNRRGNMPQR 720
GGGGGGSGGI GYPYPRAPVF PGRGSYSNRG NYNRGGMPNR GNYNQNFRGR GNNRGYKNQS 780
QGYNQWQQGQ FWGQKPWSQH YHQGYY 806 
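The Lysine Modification entry lists position 215 with the peptide QQAGGDGKTEQKGGD, which appears to be a 15-residue window centered on the modified lysine. The following is a minimal Python sketch of that relationship, assuming 1-based positions and 7 flanking residues on each side. The function name `peptide_window` is hypothetical, and only the fourth line of the sequence block (residues 181 to 240) is reproduced.

```python
def peptide_window(sequence: str, position: int, flank: int = 7) -> str:
    """Return the residues within `flank` of a 1-based position.

    The window is clipped at the sequence ends rather than padded.
    """
    start = max(position - 1 - flank, 0)
    end = min(position + flank, len(sequence))
    return sequence[start:end]


# Residues 181-240 as printed in the sequence block, with spaces removed.
segment_181_240 = (
    "KEAAGKSSGP TSLFAVTVAP PGARQGQQQA GGDGKTEQKG GDKKRGVKRP REDHGRGYFE"
).replace(" ", "")
offset = 180  # number of residues that precede this segment

position = 215
local = position - offset  # 1-based position within the segment

residue = segment_181_240[local - 1]
window = peptide_window(segment_181_240, local)
print(residue, window)
```

Working on the single segment keeps the example short. With the full sequence joined into one string, `peptide_window(full_sequence, 215)` would be called directly and no offset would be needed.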
Gene Ontology
 GO:0071013; C:catalytic step 2 spliceosome; IDA:UniProtKB.
 GO:0009986; C:cell surface; IEA:UniProtKB-SubCell.
 GO:0070937; C:CRD-mediated mRNA stability complex; IDA:UniProtKB.
 GO:0005654; C:nucleoplasm; TAS:Reactome.
 GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
 GO:0003677; F:DNA binding; TAS:ProtInc.
 GO:0003723; F:RNA binding; TAS:ProtInc.
 GO:0070934; P:CRD-mediated mRNA stabilization; IMP:UniProtKB.
 GO:0000398; P:mRNA splicing, via spliceosome; IC:UniProtKB. 
Interpro
 IPR001870; B30.2/SPRY.
 IPR008985; ConA-like_lec_gl_sf.
 IPR026745; hnRNP_U.
 IPR027417; P-loop_NTPase.
 IPR003034; SAP_dom.
 IPR018355; SPla/RYanodine_receptor_subgr.
 IPR003877; SPRY_rcpt. 
Pfam
 PF02037; SAP
 PF00622; SPRY 
SMART
 SM00513; SAP
 SM00449; SPRY 
PROSITE
 PS50188; B302_SPRY
 PS50800; SAP 
PRINTS