{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T07:00:23Z","timestamp":1776322823120,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T00:00:00Z","timestamp":1688083200000},"content-version":"vor","delay-in-days":29,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R35 GM145283"],"award-info":[{"award-number":["R35 GM145283"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-2009071"],"award-info":[{"award-number":["IIS-2009071"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>RNA design is the search for a sequence or set of sequences that will fold to desired structure, also known as the inverse problem of RNA folding. However, the sequences designed by existing algorithms often suffer from low ensemble stability, which worsens for long sequence design. Additionally, for many methods only a small number of sequences satisfying the MFE criterion can be found by each run of design. These drawbacks limit their use cases.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose an innovative optimization paradigm, SAMFEO, which optimizes ensemble objectives (equilibrium probability or ensemble defect) by iterative search and yields a very large number of successfully designed RNA sequences as byproducts. We develop a search method which leverages structure level and ensemble level information at different stages of the optimization: initialization, sampling, mutation, and updating. Our work, while being less complicated than others, is the first algorithm that is able to design thousands of RNA sequences for the puzzles from the Eterna100 benchmark. In addition, our algorithm solves the most Eterna100 puzzles among all the general optimization based methods in our study. The only baseline solving more puzzles than our work is dependent on handcrafted heuristics designed for a specific folding model. Surprisingly, our approach shows superiority on designing long sequences for structures adapted from the database of 16S Ribosomal RNAs.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Our source code and data used in this article is available at https:\/\/github.com\/shanry\/SAMFEO.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad252","type":"journal-article","created":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:19:49Z","timestamp":1688113189000},"page":"i563-i571","source":"Crossref","is-referenced-by-count":21,"title":["RNA design via structure-aware multifrontier ensemble optimization"],"prefix":"10.1093","volume":"39","author":[{"given":"Tianshuo","family":"Zhou","sequence":"first","affiliation":[{"name":"School of Electrical Engineering and Computer Science, Oregon State University , Corvalli OR 97330, United States"}]},{"given":"Ning","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, Oregon State University , Corvalli OR 97330, United States"}]},{"given":"Sizhen","family":"Li","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, Oregon State University , Corvalli OR 97330, United States"}]},{"given":"Max","family":"Ward","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Software Engineering, The University of Western Australia , Perth, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2907-6557","authenticated-orcid":false,"given":"David H","family":"Mathews","sequence":"additional","affiliation":[{"name":"Department of Biochemistry and Biophysics , University of Rochester Medical Center , Rochester, NY 14642, United States"},{"name":"Center for RNA Biology , University of Rochester Medical Center , Rochester, NY 14642, United States"},{"name":"Department of Biostatistics & Computational Biology, University of Rochester Medical Center , Rochester, NY 14642, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6444-7045","authenticated-orcid":false,"given":"Liang","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, Oregon State University , Corvalli OR 97330, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,6,30]]},"reference":[{"key":"2023063008164113900_btad252-B1","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1016\/j.jmb.2015.11.013","article-title":"Principles for predicting RNA secondary structure design difficulty","volume":"428","author":"Anderson-Lee","year":"2016","journal-title":"J Mol Biol"},{"key":"2023063008164113900_btad252-B2","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1016\/j.jmb.2003.12.041","article-title":"A new algorithm for RNA secondary structure design","volume":"336","author":"Andronescu","year":"2004","journal-title":"J Mol Biol"},{"key":"2023063008164113900_btad252-B3","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.jbiotec.2005.12.006","article-title":"Engineered riboswitches as novel tools in molecular biology","volume":"124","author":"Bauer","year":"2006","journal-title":"J Biotechnol"},{"key":"2023063008164113900_btad252-B4","doi-asserted-by":"crossref","first-page":"1555","DOI":"10.1261\/rna.066324.118","article-title":"Accelerated RNA secondary structure design using preselected sequences for helices and loops","volume":"24","author":"Bellaousov","year":"2018","journal-title":"RNA"},{"key":"2023063008164113900_btad252-B5","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1089\/cmb.2019.0420","article-title":"Designing RNA secondary structures is hard","volume":"27","author":"Bonnet","year":"2020","journal-title":"J Comput Biol"},{"key":"2023063008164113900_btad252-B6","doi-asserted-by":"crossref","first-page":"1823","DOI":"10.1093\/bioinformatics\/btl194","article-title":"Info-RNA-a fast approach to inverse RNA folding","volume":"22","author":"Busch","year":"2006","journal-title":"Bioinformatics"},{"key":"2023063008164113900_btad252-B7","first-page":"350","article-title":"Design of RNAs: comparing programs for inverse RNA folding","volume":"19","author":"Churkin","year":"2018","journal-title":"Brief Bioinform"},{"key":"2023063008164113900_btad252-B8","doi-asserted-by":"crossref","first-page":"1392","DOI":"10.1093\/nar\/gkh291","article-title":"Paradigms for computational nucleic acid design","volume":"32","author":"Dirks","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023063008164113900_btad252-B9","doi-asserted-by":"crossref","first-page":"e90","DOI":"10.1093\/bioinformatics\/btl246","article-title":"CONTRAfold: RNA secondary structure prediction without physics-based models","volume":"22","author":"Do","year":"2006","journal-title":"Bioinformatics"},{"key":"2023063008164113900_btad252-B10","doi-asserted-by":"crossref","first-page":"e1006176","DOI":"10.1371\/journal.pcbi.1006176","article-title":"Solving the RNA design problem with reinforcement learning","volume":"14","author":"Eastman","year":"2018","journal-title":"PLoS Comput Biol"},{"key":"2023063008164113900_btad252-B11","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s12859-014-0444-5","article-title":"Erd: a fast and reliable tool for RNA design including constraints","volume":"16","author":"Esmaili-Taheri","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2023063008164113900_btad252-B12","doi-asserted-by":"crossref","first-page":"1990","DOI":"10.3390\/s17091990","article-title":"Design of artificial riboswitches as biosensors","volume":"17","author":"Findei\u00df","year":"2017","journal-title":"Sensors"},{"key":"2023063008164113900_btad252-B13","doi-asserted-by":"crossref","first-page":"e0137859","DOI":"10.1371\/journal.pone.0137859","article-title":"RNA thermodynamic structural entropy","volume":"10","author":"Garcia-Martin","year":"2015","journal-title":"PLoS One"},{"key":"2023063008164113900_btad252-B14","doi-asserted-by":"crossref","first-page":"1350001","DOI":"10.1142\/S0219720013500017","article-title":"RNAiFOLD: a constraint programming algorithm for RNA inverse folding and molecular design","volume":"11","author":"Garcia-Martin","year":"2013","journal-title":"J Bioinform Comput Biol"},{"key":"2023063008164113900_btad252-B15","first-page":"231","author":"Hale\u0161","year":"2015"},{"key":"2023063008164113900_btad252-B16","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1016\/j.biochi.2017.10.005","article-title":"In silico approaches to RNA aptamer design","volume":"145","author":"Hamada","year":"2018","journal-title":"Biochimie"},{"key":"2023063008164113900_btad252-B17","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1007\/BF00818163","article-title":"Fast folding and comparison of RNA secondary structures","volume":"125","author":"Hofacker","year":"1994","journal-title":"Monatsh Chem"},{"key":"2023063008164113900_btad252-B18","first-page":"53","author":"Huang","year":"2005"},{"key":"2023063008164113900_btad252-B19","doi-asserted-by":"crossref","first-page":"i295","DOI":"10.1093\/bioinformatics\/btz375","article-title":"LinearFold: linear-time approximate RNA folding by 5\u2032-to-3\u2032 dynamic programming and beam search","volume":"35","author":"Huang","year":"2019","journal-title":"Bioinformatics"},{"key":"2023063008164113900_btad252-B20","doi-asserted-by":"crossref","first-page":"3114","DOI":"10.1093\/bioinformatics\/btv319","article-title":"Antarna: ant colony-based RNA sequence design","volume":"31","author":"Kleinkauf","year":"2015","journal-title":"Bioinformatics"},{"key":"2023063008164113900_btad252-B21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1748-7188-6-26","article-title":"ViennaRNA package 2.0","volume":"6","author":"Lorenz","year":"2011","journal-title":"Algorithms Mol Biol"},{"key":"2023063008164113900_btad252-B22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-13-260","article-title":"Frnakenstein: multiple target inverse RNA folding","volume":"13","author":"Lyngs\u00f8","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023063008164113900_btad252-B23","doi-asserted-by":"crossref","first-page":"3663","DOI":"10.1021\/ct300267j","article-title":"Dynamics in sequence space for RNA secondary structure design","volume":"8","author":"Matthies","year":"2012","journal-title":"J Chem Theory Comput"},{"key":"2023063008164113900_btad252-B24","first-page":"345587","author":"Portela","year":"2018"},{"key":"2023063008164113900_btad252-B25","doi-asserted-by":"crossref","first-page":"i308","DOI":"10.1093\/bioinformatics\/btt217","article-title":"A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution","volume":"29","author":"Reinharz","year":"2013","journal-title":"Bioinformatics"},{"key":"2023063008164113900_btad252-B26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-11-129","article-title":"RNAstructure: software for RNA secondary structure prediction and analysis","volume":"11","author":"Reuter","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023063008164113900_btad252-B27","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1109\/TEVC.2018.2844116","article-title":"Multiobjective metaheuristic to design RNA sequences","volume":"23","author":"Rubio-Largo","year":"2019","journal-title":"IEEE Trans Evol Comput"},{"key":"2023063008164113900_btad252-B28","year":"2018"},{"key":"2023063008164113900_btad252-B29","doi-asserted-by":"crossref","first-page":"1121","DOI":"10.1105\/tpc.105.039834","article-title":"Highly specific gene silencing by artificial microRNAs in Arabidopsis","volume":"18","author":"Schwab","year":"2006","journal-title":"Plant Cell"},{"key":"2023063008164113900_btad252-B30","author":"Shi","year":"2018"},{"key":"2023063008164113900_btad252-B31","first-page":"1","article-title":"Modena: a multi-objective RNA inverse folding","volume":"4","author":"Taneda","year":"2011","journal-title":"Adv Appl Bioinform Chem"},{"key":"2023063008164113900_btad252-B32","doi-asserted-by":"crossref","first-page":"D280","DOI":"10.1093\/nar\/gkp892","article-title":"NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure","volume":"38","author":"Turner","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023063008164113900_btad252-B33","volume-title":"Nucleic Acids Research","author":"Ward"},{"key":"2023063008164113900_btad252-B34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-017-1882-7","article-title":"RNA inverse folding using Monte Carlo tree search","volume":"18","author":"Yang","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2023063008164113900_btad252-B35","first-page":"289","author":"Yao","year":"2019"},{"key":"2023063008164113900_btad252-B36","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1002\/jcc.21633","article-title":"Nucleic acid sequence design via efficient ensemble defect optimization","volume":"32","author":"Zadeh","year":"2011","journal-title":"J Comput Chem"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i563\/50741920\/btad252.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i563\/50741920\/btad252.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:20:21Z","timestamp":1688113221000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/39\/Supplement_1\/i563\/7210514"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,1]]},"references-count":36,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2023,6,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad252","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,6,1]]},"published":{"date-parts":[[2023,6,1]]}}}