{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T05:09:38Z","timestamp":1724735378848},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>The splicing of RNA transcripts is thought to be partly promoted and regulated by sequences embedded within exons. Known sequences include binding sites for SR proteins, which are thought to mediate interactions between splicing factors bound to the 5' and 3' splice sites. It would be useful to identify further candidate sequences, however identifying them computationally is hard since exon sequences are also constrained by their functional role in coding for proteins.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>This strategy identified a collection of motifs including several previously reported splice enhancer elements. Although only trained on coding exons, the model discriminates both coding and non-coding exons from intragenic sequence.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>We have trained a computational model able to detect signals in coding exons which seem to be orthogonal to the sequences' primary function of coding for proteins. We believe that many of the motifs detected here represent binding sites for both previously unrecognized proteins which influence RNA splicing as well as other regulatory elements.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-419","type":"journal-article","created":{"date-parts":[[2006,9,26]],"date-time":"2006-09-26T18:26:05Z","timestamp":1159295165000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["A machine learning strategy to identify candidate binding sites in human protein-coding sequence"],"prefix":"10.1186","volume":"7","author":[{"given":"Thomas","family":"Down","sequence":"first","affiliation":[]},{"given":"Bernard","family":"Leong","sequence":"additional","affiliation":[]},{"given":"Tim JP","family":"Hubbard","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2006,9,26]]},"reference":[{"issue":"20","key":"1158_CR1","doi-asserted-by":"publisher","first-page":"11193","DOI":"10.1073\/pnas.201407298","volume":"98","author":"LP Lim","year":"2001","unstructured":"Lim LP, Burge CB: A computational analysis of sequence features involved in recognition of short introns. Proc Natl Acad Sci USA 2001, 98(20):11193\u201311198.","journal-title":"Proc Natl Acad Sci USA"},{"key":"1158_CR2","doi-asserted-by":"publisher","first-page":"1197","DOI":"10.1017\/S1355838200000960","volume":"6","author":"BR Graveley","year":"2000","unstructured":"Graveley BR: Sorting out the complexities of SR protein functions. RNA 2000, 6: 1197.","journal-title":"RNA"},{"key":"1158_CR3","doi-asserted-by":"publisher","first-page":"7347","DOI":"10.1128\/MCB.19.11.7347","volume":"19","author":"CF Bourgeois","year":"1999","unstructured":"Bourgeois CF, Popielarz M, Hildwein G, Stevenin J: Identification of a bidirectional splicing enhancer: differential involvement of SR proteins in 5' or 3' splice site activation. Mol Cell Biol 1999, 19: 7347\u20137356.","journal-title":"Mol Cell Biol"},{"key":"1158_CR4","doi-asserted-by":"publisher","first-page":"1998","DOI":"10.1101\/gad.12.13.1998","volume":"12","author":"HX Liu","year":"1998","unstructured":"Liu HX, Zhang M, Krainer AR: Identification of functional exonic splicing enhancers motifs recognized by individual SR proteins. Genes Dev 1998, 12: 1998\u20132012.","journal-title":"Genes Dev"},{"key":"1158_CR5","doi-asserted-by":"publisher","first-page":"2089","DOI":"10.1101\/gad.10.16.2089","volume":"10","author":"KW Lynch","year":"1996","unstructured":"Lynch KW, Maniatis T: Assembly of specific SR protein complexes on distinct regulatory elements of the Drosophila doublesex splicing enhancer. Genes Dev 1996, 10: 2089\u20132101.","journal-title":"Genes Dev"},{"key":"1158_CR6","doi-asserted-by":"publisher","first-page":"1705","DOI":"10.1128\/MCB.19.3.1705","volume":"19","author":"TD Schaal","year":"1999","unstructured":"Schaal TD, Maniatis T: Selection and characterization of pre-mRNA splicing enhancers: identification of novel SR protein-specific enhancers sequences. Mol Cell Biol 1999, 19: 1705\u20131719.","journal-title":"Mol Cell Biol"},{"key":"1158_CR7","doi-asserted-by":"crossref","first-page":"3540","DOI":"10.1002\/j.1460-2075.1995.tb07360.x","volume":"14","author":"R Tacke","year":"1995","unstructured":"Tacke R, Manley JL: The human splicing factors ASF\/SF2 and SC35 possess distinct functionally significant RNA binding specificities. EMBO J 1995, 14: 3540\u20133551.","journal-title":"EMBO J"},{"key":"1158_CR8","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1016\/S0092-8674(00)81153-8","volume":"93","author":"R Tacke","year":"1999","unstructured":"Tacke R, Tohyama M, Ogawa S, Manley JL: Human Tra2 proteins are sequence specific activators of pre-mRNA splicing. Cell 1999, 93: 139\u2013148.","journal-title":"Cell"},{"key":"1158_CR9","doi-asserted-by":"publisher","first-page":"33833","DOI":"10.1074\/jbc.M102957200","volume":"276","author":"H Tian","year":"2001","unstructured":"Tian H, Kole R: Strong RNA splicing enhancers identified by a modified method of cycled selection interact with SR protein. J Biol Chem 2001, 276: 33833\u201333839.","journal-title":"J Biol Chem"},{"key":"1158_CR10","doi-asserted-by":"publisher","first-page":"14088","DOI":"10.1073\/pnas.95.24.14088","volume":"95","author":"ZM Zheng","year":"1998","unstructured":"Zheng ZM, Huynen M, Baker CC: A pyrimidine-rich exonic splicing suppressor binds multiple RNA splicing factors and inhibits spliceosome assembly. Proc Natl Acad Sci USA 1998, 95: 14088\u201393.","journal-title":"Proc Natl Acad Sci USA"},{"key":"1158_CR11","doi-asserted-by":"publisher","first-page":"468","DOI":"10.1017\/S1355838299981967","volume":"5","author":"Y Cavaloc","year":"1999","unstructured":"Cavaloc Y, Bourgeois CF, Kister L, Stevenin J: The splicing factors 9G8 and SRp20 transactivate splicing through different and specific enhancers. RNA 1999, 5: 468\u2013483.","journal-title":"RNA"},{"issue":"3","key":"1158_CR12","doi-asserted-by":"publisher","first-page":"1063","DOI":"10.1128\/MCB.20.3.1063-1071.2000","volume":"20","author":"HX Liu","year":"2000","unstructured":"Liu HX, Cartegni L, Zhang M, Krainer A: Exonic Splicing enhancer motif recognized by SC35 under splicing conditions. Mol Cell Bio 2000, 20(3):1063\u20131071.","journal-title":"Mol Cell Bio"},{"key":"1158_CR13","doi-asserted-by":"publisher","first-page":"1233","DOI":"10.1017\/S1355838202028030","volume":"8","author":"BJ Lam","year":"2002","unstructured":"Lam BJ, Hertel KJ: A general role for splicing enhancers in exon definition. RNA 2002, 8: 1233\u20131241.","journal-title":"RNA"},{"key":"1158_CR14","doi-asserted-by":"publisher","first-page":"D459","DOI":"10.1093\/nar\/gki135","volume":"33","author":"JL Ashurst","year":"2005","unstructured":"Ashurst JL, Chen C, Gilbert J, Jekosch K, Keenan S, Meidl P, Searle S, Stalker J, Storey R, Trevanion S, Wilming L, Hubbard T: The Vertebrate Genome Annotation Database. Nucleic Acids Res 2005, 33: D459\u2013465.","journal-title":"Nucleic Acids Res"},{"key":"1158_CR15","first-page":"144","volume":"5","author":"TA Down","year":"2002","unstructured":"Down TA, Hubbard TJP: What can we learn from noncoding regions of similarity between genomes. BMC Bioinformatics 2002, 5: 144.","journal-title":"BMC Bioinformatics"},{"key":"1158_CR16","doi-asserted-by":"publisher","first-page":"563","DOI":"10.1016\/0022-2836(90)90223-9","volume":"212","author":"P Bucher","year":"1990","unstructured":"Bucher P: Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. Journal of Molecular Biology 1990, 212: 563\u2013578.","journal-title":"Journal of Molecular Biology"},{"issue":"6","key":"1158_CR17","doi-asserted-by":"publisher","first-page":"768","DOI":"10.1101\/gr.3217705","volume":"15","author":"XHF Zhang","year":"2005","unstructured":"Zhang XHF, Leslie CS, Chasin LA: Dichotomous Splicing Signals in Exon Flanks. Genome Res 2005, 15(6):768\u201379.","journal-title":"Genome Res"},{"key":"1158_CR18","doi-asserted-by":"publisher","first-page":"2042","DOI":"10.1101\/gr.1257503","volume":"13","author":"L Katz","year":"2003","unstructured":"Katz L, Burge CB: Widespread Selection for Local RNA Secondary Structure in Coding Regions of Bacterial Genes. Genome Res 2003, 13: 2042\u20132051.","journal-title":"Genome Res"},{"key":"1158_CR19","doi-asserted-by":"publisher","first-page":"1007","DOI":"10.1126\/science.1073774","volume":"297","author":"WG Fairbrother","year":"2002","unstructured":"Fairbrother WG, Yen RF, Sharp PA, Burge CB: Predictive Identification of Exonic Splicing Enhancers in human genes. Science 2002, 297: 1007\u20131013.","journal-title":"Science"},{"issue":"12","key":"1158_CR20","doi-asserted-by":"publisher","first-page":"2637","DOI":"10.1101\/gr.1679003","volume":"13","author":"HF Zhang","year":"2003","unstructured":"Zhang HF, Heller KA, I H, Leslie CS, Chasin LA: Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine Classification. Genome Res 2003, 13(12):2637\u20132650.","journal-title":"Genome Res"},{"key":"1158_CR21","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1145\/640075.640082","volume-title":"Proceedings of the seventh annual international conference on Research in computational molecular biology","author":"M Blanchette","year":"2003","unstructured":"Blanchette M: A comparative analysis method for detecting binding sites in coding regions. In Proceedings of the seventh annual international conference on Research in computational molecular biology. Edited by: M V, S I, P P, M W. 2003, 57\u201366."},{"issue":"7","key":"1158_CR22","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1093\/bioinformatics\/bti132","volume":"21","author":"G Dror","year":"2005","unstructured":"Dror G, Sorek R, Shamir R: Accurate identification of alternatively spliced exons using support vector machine. Bioinformatics 2005, 21(7):897.","journal-title":"Bioinformatics"},{"key":"1158_CR23","volume-title":"PhD thesis","author":"TA Down","year":"2003","unstructured":"Down TA: Computational Localization of Promoters and Transcription Start Sites in Mammalian Genomes. PhD thesis. University of Cambridge; 2003."},{"key":"1158_CR24","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1093\/hmg\/11.4.451","volume":"11","author":"F Clark","year":"2002","unstructured":"Clark F, Thanaraj TA: Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human. Human Molecular Genetics 2002, 11: 451\u2013464.","journal-title":"Human Molecular Genetics"},{"key":"1158_CR25","doi-asserted-by":"publisher","first-page":"925","DOI":"10.1101\/gr.1860604","volume":"14","author":"E Birney","year":"2004","unstructured":"Birney E, et al.: An Overview of Ensembl. Genome Res 2004, 14: 925\u2013928.","journal-title":"Genome Res"},{"key":"1158_CR26","unstructured":"The Biojava Project[http:\/\/www.biojava.org]"},{"key":"1158_CR27","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25: 3389\u20133402.","journal-title":"Nucleic Acids Res"},{"key":"1158_CR28","doi-asserted-by":"publisher","first-page":"1087","DOI":"10.1063\/1.1699114","volume":"21","author":"N Metropolis","year":"1953","unstructured":"Metropolis N, Rosenbluth A, Rosenbluth M, Teller A, Teller E: Equation of state calculations by fast computing machines. J Chemical Physics 1953, 21: 1087\u20131092.","journal-title":"J Chemical Physics"},{"key":"1158_CR29","first-page":"211","volume":"1","author":"ME Tipping","year":"2000","unstructured":"Tipping ME: Sparse Bayesian learning and the relevance vector machine. Journal of Machine Learning Research 2000, 1: 211\u2013244.","journal-title":"Journal of Machine Learning Research"},{"key":"1158_CR30","first-page":"298","volume-title":"Proceedings of the Thirteenth Annual Conference on Computational Learning Theory","author":"T Graepel","year":"2000","unstructured":"Graepel T, Herbrich R, Shawe-Taylor J: Generalisation Error Bounds for Sparse Linear Classifiers. Proceedings of the Thirteenth Annual Conference on Computational Learning Theory 2000, 298\u2013303."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-419.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T20:40:13Z","timestamp":1630528813000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-419"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,9,26]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1158"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-419","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,9,26]]},"assertion":[{"value":"16 August 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 September 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 September 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"419"}}