CPLM 1.0 - Compendium of Protein Lysine Modification
Tag    Content
CPLM ID CPLM-010759
UniProt Accession
Genbank Protein ID
Genbank Nucleotide ID
Protein Name
 Helicase SEN1 
Protein Synonyms/Alias
 tRNA-splicing endonuclease positive effector 
Gene Name
 SEN1 
Gene Synonyms/Alias
 YLR430W; L9576.1 
Created Date
 July 27, 2013 
Organism
 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) 
NCBI Taxa ID
 559292 
Lysine Modification
Position  Peptide          Type            References
871       TSDLAHEKHVKVDDS  ubiquitination  [1]
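The relationship between the Position and Peptide columns can be checked against the Protein Sequence field. The sketch below is an illustration only: it assumes the peptide is a 15-residue window with the modified lysine at its center (7 flanking residues per side), which is consistent with this entry but is not stated by the record. The names `FRAGMENT`, `FRAGMENT_START` and `site_window` are hypothetical helpers, not part of any CPLM tool.

```python
# Residues 861-880, copied from the Protein Sequence field of this record.
FRAGMENT = "IISTSDLAHEKHVKVDDSLV"
FRAGMENT_START = 861  # 1-based position of the first residue in FRAGMENT


def site_window(fragment, fragment_start, position, flank=7):
    """Return (residue, window) for a 1-based protein position.

    `flank` is the assumed number of residues shown on each side of the site.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    start = max(0, index - flank)
    end = min(len(fragment), index + flank + 1)
    return fragment[index], fragment[start:end]


residue, peptide = site_window(FRAGMENT, FRAGMENT_START, 871)
print(residue, peptide)
```

For position 871 this yields a lysine and a window that matches the Peptide column above.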
Reference
 [1] Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation.
 Swaney DL, Beltrao P, Starita L, Guo A, Rush J, Fields S, Krogan NJ, Villén J.
 Nat Methods. 2013 Jul;10(7):676-82. [PMID: 23749301]
Functional Description
 ATP-dependent 5'->3' DNA/RNA helicase required for the expression and maturation of diverse classes of non-protein-coding RNAs like precursor tRNAs, rRNAs and small nuclear (snRNA) and nucleolar (snoRNA) RNAs. Directs RNA polymerase II transcription termination on snoRNAs as well as on several short protein-coding genes. May also play a role in transcription-coupled nucleotide excision repair. 
Sequence Annotation
 NP_BIND 1360 1364 ATP (By similarity).
 MOTIF 1909 1927 Nuclear localization signal (Potential).
 BINDING 1339 1339 ATP (By similarity).
 BINDING 1619 1619 ATP (By similarity).
 BINDING 1655 1655 ATP (By similarity).
 BINDING 1787 1787 ATP (By similarity).  
Keyword
 ATP-binding; Complete proteome; Helicase; Hydrolase; mRNA processing; Nucleotide-binding; Nucleus; Reference proteome; rRNA processing; tRNA processing. 
Sequence Source
 UniProt (SWISSPROT/TrEMBL); GenBank; EMBL 
Protein Length
 2231 AA 
Protein Sequence
MNSNNPDNNN SNNINNNNKD KDIAPNSDVQ LATVYTKAKS YIPQIEQVYQ GTNPNIQEAK 60
LLGELLQVLA EVPKGTHLFC DPILEPISIF SLTIFSFNEE ATATWLKNHF NPILSVCDKC 120
ILNFARGKCK MLQHFAIQRH VPHEHVAKFN DIVCQWRVEA VFPILRNISV NDNTGINITN 180
EIETAMYECL CNPHMLRLNK QLKATFEAIF KFFYDTKHRL LDVTNPLSIK TFISGVIFCW 240
CEGSKEENEW SRAFLKDLYS RNFHINLSNL TPDIIEEVYI HILFLQNPAN WTEIVVSQFW 300
SRLLPVFNLF DKDVFIEYFQ VPKNVESLKK TFKFPLEPIF KMWYNHLSKS YHDKPLDFLL 360
RGLTMFLNKF GSEFWSKIEP FTFHSILDII FNRDSFPIKL IKIQDNPIVE HQTEVYFQLT 420
GSVTDLLSWT LPFYHALSPS KRIQMVRKVS MAFLRIIANY PSLKSIPKAC LMNSATALLR 480
AVLTIKENER AMLYKNDEFE TVLLTKTDSR ALLNNPLIQD IIIRSASNPN DFYPGLGAAS 540
ASVATSTMMV LAECIDFDIL LLCHRTFKLY SGKPISEIPI STNVLENVTN KIDLRSFHDG 600
PLLAKQLLVS LKNINGLLIV PSNTAVAEAH NALNQKFLLL STRLMEKFAD ILPGQLSKIL 660
ADEDASQGFW SCIFSSDKHL YQAATNILYN TFDVEGRLEG ILAILNSNLT VNLKNINVML 720
QRLINCEFYE PCPRAVRVLM DVVSAFVDPI SGVFANFQTL KSQNTEKEFL KFWESCWLFL 780
DTIYKFTLKW ASKYDYSELE NFTKDTLDLS RSLVDSFREF SDILHDQTKN LLLNVLETFK 840
NMLYWLRLSD EVLLESCVRL IISTSDLAHE KHVKVDDSLV EMMAKYASKA KRFSNKLTEQ 900
QASEILQKAK IFNKALTEEV ATEAENYRKE KELSRLGKVI DLTDSVPASP SLSPSLSSTI 960
ASSSAESRAD YLQRKALSSS ITGRPRVAQP KITSFGTFQS SANAKLHRTK PVKPLSKMEL 1020
ARMQLLNNRV VHPPSAPAFH TKSRGLSNKN DDSSSEESDN DIESARELFA IAKAKGKGIQ 1080
TVDINGKVVK RQTAAELAKQ ELEHMRKRLN VDMNPLYEII LQWDYTRNSE YPDDEPIGNY 1140
SDVKDFFNSP ADYQKVMKPL LLLESWQGLC SSRDREDYKP FSIIVGNRTA VSDFYDVYAS 1200
VAKQVIQDCG ISESDLIVMA YLPDFRPDKR LSSDDFKKAQ HTCLAKVRTL KNTKGGNVDV 1260
TLRIHRNHSF SKFLTLRSEI YCVKVMQMTT IEREYSTLEG LEYYDLVGQI LQAKPSPPVN 1320
VDAAEIETVK KSYKLNTSQA EAIVNSVSKE GFSLIQGPPG TGKTKTILGI IGYFLSTKNA 1380
SSSNVIKVPL EKNSSNTEQL LKKQKILICA PSNAAVDEIC LRLKSGVYDK QGHQFKPQLV 1440
RVGRSDVVNV AIKDLTLEEL VDKRIGERNY EIRTDPELER KFNNAVTKRR ELRGKLDSES 1500
GNPESPMSTE DISKLQLKIR ELSKIINELG RDRDEMREKN SVNYRNRDLD RRNAQAHILA 1560
VSDIICSTLS GSAHDVLATM GIKFDTVIID EACQCTELSS IIPLRYGGKR CIMVGDPNQL 1620
PPTVLSGAAS NFKYNQSLFV RMEKNSSPYL LDVQYRMHPS ISKFPSSEFY QGRLKDGPGM 1680
DILNKRPWHQ LEPLAPYKFF DIISGRQEQN AKTMSYTNME EIRVAIELVD YLFRKFDNKI 1740
DFTGKIGIIS PYREQMQKMR KEFARYFGGM INKSIDFNTI DGFQGQEKEI ILISCVRADD 1800
TKSSVGFLKD FRRMNVALTR AKTSIWVLGH QRSLAKSKLW RDLIEDAKDR SCLAYACSGF 1860
LDPRNNRAQS ILRKFNVPVP SEQEDDYKLP MEYITQGPDE VKSNKDTKKR RVVDEGEEAD 1920
KAVKKKKKEK KKEKKKSKAD DKKKNNKKAE SPSTSSGTKK KSSIFGGMSV PSAVVPKTFP 1980
DVDSNKKAAA VVGKKKNNKH VCFSDDVSFI PRNDEPEIKV TRSLSSVLKE KQLGLKETRT 2040
ISPPEISNNE DDDDEDDYTP SISDSSLMKS EANGRNNRVA SHNQNFSASI YDDPQVSQAK 2100
QTQVPAAITK HRSSNSVLSG GSSRILTASD YGEPNQNGQN GANRTLSQHV GNANQYSTAP 2160
VGTGELHETL PAHPQDSYPA EAEDPYDLNP HPQPQSSAFK GPGSGPTGTR NSSRRNASSS 2220
PFIPKKRKPR S 2231 
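The sequence above is laid out in blocks of 10 residues, with a running residue count at the end of each line. A minimal sketch of turning that layout back into a plain string, assuming every line ends with such a counter, might look like the following. `parse_numbered_sequence` is a hypothetical helper name, and the two sample lines are the first two lines of this record.

```python
def parse_numbered_sequence(lines):
    """Join space-separated residue blocks and verify each line's counter."""
    residues = []
    total = 0
    for line in lines:
        # The last whitespace-separated token is the running residue count.
        *blocks, counter = line.split()
        chunk = "".join(blocks)
        total += len(chunk)
        if total != int(counter):
            raise ValueError(
                f"counter {counter} does not match {total} residues read so far"
            )
        residues.append(chunk)
    return "".join(residues)


sample_lines = [
    "MNSNNPDNNN SNNINNNNKD KDIAPNSDVQ LATVYTKAKS YIPQIEQVYQ GTNPNIQEAK 60",
    "LLGELLQVLA EVPKGTHLFC DPILEPISIF SLTIFSFNEE ATATWLKNHF NPILSVCDKC 120",
]
sequence = parse_numbered_sequence(sample_lines)
print(len(sequence))
```

Applied to all lines of the record, the same check should end at the value given in the Protein Length field; a mismatch would point to a copy or extraction error.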
Gene Ontology
 GO:0035649; C:Nrd1 complex; IDA:SGD.
 GO:0005634; C:nucleus; IDA:SGD.
 GO:0005657; C:replication fork; IDA:SGD.
 GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
 GO:0043141; F:ATP-dependent 5'-3' DNA helicase activity; ISS:SGD.
 GO:0032575; F:ATP-dependent 5'-3' RNA helicase activity; ISS:SGD.
 GO:0019904; F:protein domain specific binding; IDA:SGD.
 GO:0045005; P:maintenance of fidelity involved in DNA-dependent DNA replication; IMP:SGD.
 GO:0006378; P:mRNA polyadenylation; IMP:SGD.
 GO:0006364; P:rRNA processing; IMP:SGD.
 GO:0031126; P:snoRNA 3'-end processing; IMP:SGD.
 GO:0016180; P:snRNA processing; IMP:SGD.
 GO:0006369; P:termination of RNA polymerase II transcription; IMP:SGD.
 GO:0008033; P:tRNA processing; IMP:SGD. 
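Each Gene Ontology line above follows the pattern `GO ID; aspect:term; evidence:source`, where the aspect letters C, F and P stand for cellular component, molecular function and biological process. The sketch below splits one such line into its parts. It assumes the fields are separated by "; " and that the line ends with a period, as in this record; `GoAnnotation` and `parse_go_line` are hypothetical names chosen for illustration.

```python
from typing import NamedTuple

# Standard one-letter GO aspect codes.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str
    evidence: str
    source: str


def parse_go_line(line):
    """Split 'GO:ID; X:term; EVIDENCE:SOURCE.' into a GoAnnotation."""
    text = line.strip().rstrip(".")
    go_id, rest = text.split("; ", 1)
    # Split from the right so a term containing '; ' stays intact.
    qualified_term, evidence_field = rest.rsplit("; ", 1)
    aspect_code, term = qualified_term.split(":", 1)
    evidence, source = evidence_field.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[aspect_code], term, evidence, source)


annotation = parse_go_line(
    " GO:0006369; P:termination of RNA polymerase II transcription; IMP:SGD."
)
print(annotation)
```

Grouping the parsed annotations by `aspect` or `evidence` then separates, for example, the experimentally supported entries (IDA, IMP) from the inferred ones (ISS, IEA).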
Interpro
 IPR024714; Helicase_Sen1.
 IPR024481; Helicase_Sen1_N.
 IPR027417; P-loop_NTPase. 
Pfam
 PF12726; SEN1_N 
SMART
  
PROSITE
  
PRINTS