CPLM 1.0 - Compendium of Protein Lysine Modification
Tag  Content
CPLM ID CPLM-024624
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 THO complex subunit 2 
Protein Synonyms/Alias
 Tho2 
Gene Name
 Thoc2 
Gene Synonyms/Alias
  
Created Date
 July 27, 2013 
Organism
 Mus musculus (Mouse) 
NCBI Taxa ID
 10090 
Lysine Modification
 Position  Peptide          Type         References
 253       HILGFKFKFYQEPSG  acetylation  [1]
 457       AKVVRIGKSFMKEFQ  acetylation  [2]
Reference
 [1] Quantitative acetylome analysis reveals the roles of SIRT1 in regulating diverse substrates and cellular pathways.
 Chen Y, Zhao W, Yang JS, Cheng Z, Luo H, Lu Z, Tan M, Gu W, Zhao Y.
 Mol Cell Proteomics. 2012 Oct;11(10):1048-62. [PMID: 22826441]
 [2] Label-free quantitative proteomics of the lysine acetylome in mitochondria identifies substrates of SIRT3 in metabolic pathways.
 Rardin MJ, Newman JC, Held JM, Cusack MP, Sorensen DJ, Li B, Schilling B, Mooney SD, Kahn CR, Verdin E, Gibson BW.
 Proc Natl Acad Sci U S A. 2013 Apr 16;110(16):6601-6. [PMID: 23576753]
Functional Description
 Component of the THO subcomplex of the TREX complex. The TREX complex specifically associates with spliced mRNA and not with unspliced pre-mRNA. It is recruited to spliced mRNAs by a transcription-independent mechanism. Binds to mRNA upstream of the exon-junction complex (EJC) and is recruited in a splicing- and cap-dependent manner to a region near the 5' end of the mRNA where it functions in mRNA export. The recruitment occurs via an interaction between ALYREF/THOC4 and the cap-binding protein NCBP1. DDX39B functions as a bridge between ALYREF/THOC4 and the THO complex (By similarity). 
Sequence Annotation
 MOTIF 923 928 Nuclear localization signal (Potential).
 MOD_RES 1385 1385 Phosphothreonine (By similarity).
 MOD_RES 1393 1393 Phosphoserine (By similarity).
 MOD_RES 1417 1417 Phosphoserine (By similarity).
 MOD_RES 1486 1486 Phosphoserine (By similarity).  
Keyword
 Coiled coil; Complete proteome; mRNA processing; mRNA splicing; mRNA transport; Nucleus; Phosphoprotein; Reference proteome; RNA-binding; Transport. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 1594 AA 
Protein Sequence
MAAAAVVVPA EWIKNWEKSG RGEFLHLCRI LSENKSHDSS TYRDFQQALY ELSYHVIKGN 60
LKHEQASSVL NDISEFREDM PSILADVFCI LDIETNCLEE KSKRDYFTQL VLACLYLVSD 120
TVLKERLDPE TLESLGLIKQ SQQFNQKSVK IKTKLFYKQQ KFNLLREENE GYAKLIAELG 180
QDLSGNITSD LILENIKSLI GCFNLDPNRV LDVILEVFEC RPEHDDFFIS LLESYMSMCE 240
PQTLCHILGF KFKFYQEPSG ETPSSLYRVA AVLLQFNLID LDDLYVHLLP ADNCIMDEYK 300
REIVEAKQIV RKLTMVVLSS EKLDERDKEK DKDDEKVEKP PDNQKLGLLE ALLKVGDWQH 360
AQNIMDQMPP YYAASHKLIA LAICKLIHIT VEPLYRRVGV PKGAKGSPVS ALQNKRAPKQ 420
VESFEDLRRD VFNMFCYLGP HLSHDPILFA KVVRIGKSFM KEFQSDGSKQ EDKEKTEVIL 480
SCLLSITDQV LLPSLSLMDC NACMSEELWG MFKTFPYQHR YRLYGQWKNE TYNGHPLLVK 540
VKAQTIDRAK YIMKRLTKEN VKPSGRQIGK LSHSNPTILF DYILSQIQKY DNLITPVVDS 600
LKYLTSLNYD VLAYCIIEAL ANPEKERMKH DDTTISSWLQ SLASFCGAVF RKYPIDLAGL 660
LQYVANQLKA GKSFDLLILK EVVQKMAGIE ITEEMTMEQL EAMTGGEQLK AEGGYFGQIR 720
NTKKSSQRLK DALLDHDLAL PLCLLMAQQR NGVIFQEGGE KHLKLVGKLY DQCHDTLVQF 780
GGFLASNLST EDYIKRVPSI DVLCNEFHTP HDAAFFLSRP MYAHHISSKY DELKKSEKGS 840
KQQHKVHKYI TSCEMVMAPV HEAVVSLHVS KVWDDISPQF YATFWSLTMY DLAVPHTSYE 900
REVNKLKVQM KAIDDNQEMP PNKKKKEKER CTALQDKLLE EEKKQMEHVQ RVLQRLKLEK 960
DNWLLAKSTK NETITKFLQL CIFPRCIFSA IDAVYCARFV ELVHQQKTPN FSTLLCYDRV 1020
FSDIIYTVAS CTENEASRYG RFLCCMLETV TRWHSDRATY EKECGNYPGF LTILRATGFD 1080
GGNKADQLDY ENFRHVVHKW HYKLTKASVH CLETGEYTHI RNILIVLTKI LPWYPKVLNL 1140
GQALERRVNK ICQEEKEKRP DLYALAMGYS GQLKSRKSHM IPENEFHHKD PPPRNAVASV 1200
QNGPGGGTSS SSIGNASKSD ESGAEETDKS RERSQCGTKA VNKASSTTPK GNSSNGNSGS 1260
NSNKAVKEND KEKVKEKEKE KKEKTPATTP EARALGKDSK EKPKEERPNK EDKARETKER 1320
TPKSDKEKEK FKKEEKAKDE KFKTTVPIVE SKSTQERERE KEPSRERDVA KEMKSKENVK 1380
GGEKTPVSGS LKSPVPRSDI SEPDREQKRR KIDSHPSPSH SSTVKDSLID LKDSSAKLYI 1440
NHNPPPLSKS KEREMDKKDL DKSRERSRER EKKDEKDRKE RKRDHSNNDR EVPPDITKRR 1500
KEENGTMGVS KHKSESPCES QYPNEKDKEK NKSKSSGKEK SSSDSFKSEK MDKISSGGKK 1560
ESRHDKEKIE KKEKRDSSGG KEEKKHHKSS DKHR 1594 
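
The sequence block above is printed in groups of 10 residues with a running position counter at the end of each line, so it has to be cleaned before positions in the Lysine Modification table can be looked up. The following is a minimal sketch of that lookup, assuming the block is handled as plain text. The helper names `parse_sequence_block` and `site_window` are hypothetical and not part of CPLM or any library, and the 7-residue flank is an assumption inferred from the 15-residue peptides in this record. Only the first five lines of the block (residues 1-300) are used, to keep the example short.

```python
import re

# First five lines of the Protein Sequence block (residues 1-300),
# copied verbatim including the trailing position counters.
RAW_BLOCK = """\
MAAAAVVVPA EWIKNWEKSG RGEFLHLCRI LSENKSHDSS TYRDFQQALY ELSYHVIKGN 60
LKHEQASSVL NDISEFREDM PSILADVFCI LDIETNCLEE KSKRDYFTQL VLACLYLVSD 120
TVLKERLDPE TLESLGLIKQ SQQFNQKSVK IKTKLFYKQQ KFNLLREENE GYAKLIAELG 180
QDLSGNITSD LILENIKSLI GCFNLDPNRV LDVILEVFEC RPEHDDFFIS LLESYMSMCE 240
PQTLCHILGF KFKFYQEPSG ETPSSLYRVA AVLLQFNLID LDDLYVHLLP ADNCIMDEYK 300
"""


def parse_sequence_block(block: str) -> str:
    """Drop spaces, newlines and position counters; return bare residues."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def site_window(seq: str, position: int, flank: int = 7) -> str:
    """Return residues position-flank .. position+flank (1-based, clipped to the sequence)."""
    start = max(position - 1 - flank, 0)   # convert to a 0-based slice start
    end = min(position + flank, len(seq))  # slice end is exclusive
    return seq[start:end]


sequence = parse_sequence_block(RAW_BLOCK)
window = site_window(sequence, 253)

# Residue 253 should be a lysine, and the window should match the
# peptide listed for position 253 in the Lysine Modification table.
print(len(sequence), sequence[252], window)
```

Run as written, this prints `300 K HILGFKFKFYQEPSG`, which agrees with the peptide reported for position 253. The same check applies to position 457 once the later lines of the block are included.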
Gene Ontology
 GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
 GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
 GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
 GO:0051028; P:mRNA transport; IEA:UniProtKB-KW.
 GO:0008380; P:RNA splicing; IEA:UniProtKB-KW. 
Interpro
 IPR021418; THO_THOC2_C.
 IPR021726; THO_THOC2_N. 
Pfam
 PF11262; Tho2
 PF11732; Thoc2 
SMART
  
PROSITE
  
PRINTS