CPLM 1.0 - Compendium of Protein Lysine Modification
TagContent
CPLM ID CPLM-008702
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 THO complex subunit 2 
Protein Synonyms/Alias
 Low dye-binding protein 5; THO complex subunit RLR1; Zinc-regulated gene 13 protein 
Gene Name
 THO2 
Gene Synonyms/Alias
 LDB5; RLR1; ZRG13; YNL139C; N1209; N1835 
Created Date
 July 27, 2013 
Organism
 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) 
NCBI Taxa ID
 559292 
Lysine Modification
Position
Peptide
Type
References
57ALESNDEKEDWLRTLubiquitination[1]
1415YSGNAGGKDGYGSSNacetylation[2]
1547KDETIRNKFQTQDYRacetylation[2]
Reference
 [1] Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation.
 Swaney DL, Beltrao P, Starita L, Guo A, Rush J, Fields S, Krogan NJ, VillĂ©n J.
 Nat Methods. 2013 Jul;10(7):676-82. [PMID: 23749301]
 [2] Proteome-wide analysis of lysine acetylation suggests its broad regulatory scope in Saccharomyces cerevisiae.
 Henriksen P, Wagner SA, Weinert BT, Sharma S, Bacinskaja G, Rehman M, Juffer AH, Walther TC, Lisby M, Choudhary C.
 Mol Cell Proteomics. 2012 Nov;11(11):1510-22. [PMID: 22865919
Functional Description
 Component the THO subcomplex of the TREX complex, which operates in coupling transcription elongation to mRNA export. The THO complex is recruited to transcribed genes and moves along the gene with the elongating polymerase during transcription. THO is important for stabilizing nascent RNA in the RNA polymerase II elongation complex by preventing formation of DNA:RNA hybrids behind the elongating polymerase. It functions in cotranscriptional formation of an export-competent messenger ribonucleoprotein particle (mRNP) by facilitating the loading of ATP-dependent RNA helicase SUB2 and the mRNA export factor YRA1 along the nascent mRNA. 
Sequence Annotation
  
Keyword
 Complete proteome; Nucleus; Reference proteome; Transcription; Transcription regulation. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1597 AA 
Protein Sequence
MAEQTLLSKL NALSQKVIPP ASPSQASILT EEVIRNWPER SKTLCSDFTA LESNDEKEDW 60
LRTLFIELFD FINKNDENSP LKLSDVASFT NELVNHERQV SQASIVGKMF IAVSSTVPNI 120
NDLTTISLCK LIPSLHEELF KFSWISSKLL NKEQTTLLRH LLKKSKYELK KYNLLVENSV 180
GYGQLVALLI LAYYDPDNFS KVSAYLKEIY HIMGKYSLDS IRTLDVILNV SSQFITEGYK 240
FFIALLRKSD SWPSSHVANN SNYSSLNEGG NMIAANIISF NLSQYNEEVD KENYERYMDM 300
CCILLKNGFV NFYSIWDNVK PEMEFLQEYI QNLETELEEE STKGVENPLA MAAALSTENE 360
TDEDNALVVN DDVNMKDKIS EETNADIESK GKQKTQQDIL LFGKIKLLER LLIHGCVIPV 420
IHVLKQYPKV LYVSESLSRY LGRVFEYLLN PLYTSMTSSG ESKDMATALM ITRIDNGILA 480
HKPRLIHKYK THEPFESLEL NSSYVFYYSE WNSNLTPFAS VNDLFENSHI YLSIIGPYLG 540
RIPTLLSKIS RIGVADIQKN HGSESLHVTI DKWIDYVRKF IFPATSLLQN NPIATSEVYE 600
LMKFFPFEKR YFIYNEMMTK LSQDILPLKV SFNKAEREAK SILKALSIDT IAKESRRFAK 660
LISTNPLASL VPAVKQIENY DKVSELVVYT TKYFNDFAYD VLQFVLLLRL TYNRPAVQFD 720
GVNQAMWVQR LSIFIAGLAK NCPNMDISNI ITYILKTLHN GNIIAVSILK ELIITVGGIR 780
DLNEVNMKQL LMLNSGSPLK QYARHLIYDF RDDNSVISSR LTSFFTDQSA ISEIILLLYT 840
LNLKANTQNS HYKILSTRCD EMNTLLWSFI ELIKHCLKGK AFEENVLPFV ELNNRFHLST 900
PWTFHIWRDY LDNQLNSNEN FSIDELIEGA EFSDVDLTKI SKDLFTTFWR LSLYDIHFDK 960
SLYDERKNAL SGENTGHMSN RKKHLIQNQI KDILVTGISH QRAFKKTSEF ISEKSNVWNK 1020
DCGEDQIKIF LQNCVVPRVL FSPSDALFSS FFIFMAFRTE NLMSILNTCI TSNILKTLLF 1080
CCTSSEAGNL GLFFTDVLKK LEKMRLNGDF NDQASRKLYE WHSVITEQVI DLLSEKNYMS 1140
IRNGIEFMKH VTSVFPVVKA HIQLVYTTLE ENLINEERED IKLPSSALIG HLKARLKDAL 1200
ELDEFCTLTE EEAEQKRIRE MELEEIKNYE TACQNEQKQV ALRKQLELNK SQRLQNDPPK 1260
SVASGSAGLN SKDRYTYSRN EPVIPTKPSS SQWSYSKVTR HVDDINHYLA TNHLQKAISL 1320
VENDDETRNL RKLSKQNMPI FDFRNSTLEI FERYFRTLIQ NPQNPDFAEK IDSLKRYIKN 1380
ISREPYPDTT SSYSEAAAPE YTKRSSRYSG NAGGKDGYGS SNYRGPSNDR SAPKNIKPIS 1440
SYAHKRSELP TRPSKSKTYN DRSRALRPTG PDRGDGFDQR DNRLREEYKK NSSQRSQLRF 1500
PEKPFQEGKD SSKANPYQAS SYKRDSPSEN EEKPNKRFKK DETIRNKFQT QDYRNTRDSG 1560
AAHRANENQR YNGNRKSNTQ ALPQGPKGGN YVSRYQR 1597 
Gene Ontology
 GO:0000446; C:nucleoplasmic THO complex; IMP:SGD.
 GO:0000445; C:THO complex part of transcription export complex; IPI:SGD.
 GO:0003677; F:DNA binding; IPI:SGD.
 GO:0006310; P:DNA recombination; IMP:SGD.
 GO:0031124; P:mRNA 3'-end processing; IMP:SGD.
 GO:0006406; P:mRNA export from nucleus; IMP:SGD.
 GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW.
 GO:0006368; P:transcription elongation from RNA polymerase II promoter; IMP:SGD.
 GO:0006283; P:transcription-coupled nucleotide-excision repair; IMP:SGD. 
Interpro
 IPR021418; THO_THOC2_C.
 IPR021726; THO_THOC2_N. 
Pfam
 PF11262; Tho2
 PF11732; Thoc2 
SMART
  
PROSITE
  
PRINTS