{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:44:01Z","timestamp":1753875841977,"version":"3.41.2"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2023,9,15]],"date-time":"2023-09-15T00:00:00Z","timestamp":1694736000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1901324","2027655"],"award-info":[{"award-number":["1901324","2027655"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,10,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>DNA-based data storage is a quickly growing field that hopes to harness the massive theoretical information density of DNA molecules to produce a competitive next-generation storage medium suitable for archival data. In recent years, many DNA-based storage system designs have been proposed. Given that no common infrastructure exists for simulating these storage systems, comparing many different designs along with many different error models is increasingly difficult. To address this challenge, we introduce FrameD, a simulation infrastructure for DNA storage systems that leverages the underlying modularity of DNA storage system designs to provide a framework to express different designs while being able to reuse common components.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We demonstrate the utility of FrameD and the need for a common simulation platform using a case study. Our case study compares designs that utilize strand copies differently, some that align strand copies using multiple sequence alignment algorithms and others that do not. We found that the choice to include multiple sequence alignment in the pipeline is dependent on the error rate and the type of errors being injected and is not always beneficial. In addition to supporting a wide range of designs, FrameD provides the user with transparent parallelism to deal with a large number of reads from sequencing and the need for many fault injection iterations. We believe that FrameD fills a void in the tools publicly available to the DNA storage community by providing a modular and extensible framework with support for massive parallelism. As a result, it will help accelerate the design process of future DNA-based storage systems.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code for FrameD along with the data generated during the demonstration of FrameD is available in a public Github repository at https:\/\/github.com\/dna-storage\/framed, (https:\/\/dx.doi.org\/10.5281\/zenodo.7757762).<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad572","type":"journal-article","created":{"date-parts":[[2023,9,15]],"date-time":"2023-09-15T17:42:44Z","timestamp":1694799764000},"source":"Crossref","is-referenced-by-count":3,"title":["FrameD: framework for DNA-based data storage design, verification, and validation"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5200-5909","authenticated-orcid":false,"given":"Kevin D","family":"Volkel","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, North Carolina State University , Raleigh, NC, 27606, United States"}]},{"given":"Kevin N","family":"Lin","sequence":"additional","affiliation":[{"name":"Department of Chemical and Biomolecular Engineering, North Carolina State University , Raleigh, NC, 27695, United States"}]},{"given":"Paul W","family":"Hook","sequence":"additional","affiliation":[{"name":"Department of Biomedical Engineering, Johns Hopkins University , Baltimore, MD, 21218, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2083-6027","authenticated-orcid":false,"given":"Winston","family":"Timp","sequence":"additional","affiliation":[{"name":"Department of Biomedical Engineering, Johns Hopkins University , Baltimore, MD, 21218, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8958-1232","authenticated-orcid":false,"given":"Albert J","family":"Keung","sequence":"additional","affiliation":[{"name":"Department of Chemical and Biomolecular Engineering, North Carolina State University , Raleigh, NC, 27695, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8975-0294","authenticated-orcid":false,"given":"James M","family":"Tuck","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, North Carolina State University , Raleigh, NC, 27606, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,9,15]]},"reference":[{"key":"2023101010525431300_btad572-B1","doi-asserted-by":"crossref","first-page":"1229","DOI":"10.1038\/s41587-019-0240-x","article-title":"Data storage in DNA with fewer synthesis cycles using composite DNA letters","volume":"37","author":"Anavy","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2023101010525431300_btad572-B2","doi-asserted-by":"crossref","first-page":"5345","DOI":"10.1038\/s41467-020-19148-3","article-title":"Low cost DNA data storage using photolithographic synthesis and advanced information reconstruction and error correction","volume":"11","author":"Antkowiak","year":"2020","journal-title":"Nat Commun"},{"first-page":"39","year":"2017","author":"Bishop","key":"2023101010525431300_btad572-B3"},{"first-page":"637","year":"2016","author":"Bornholt","key":"2023101010525431300_btad572-B4"},{"year":"2022","author":"Chaykin","key":"2023101010525431300_btad572-B5"},{"key":"2023101010525431300_btad572-B6","doi-asserted-by":"crossref","first-page":"6582","DOI":"10.1038\/s41598-019-43105-w","article-title":"High information capacity DNA-based data storage with augmented encoding characters using degenerate bases","volume":"9","author":"Choi","year":"2019","journal-title":"Sci Rep"},{"key":"2023101010525431300_btad572-B7","doi-asserted-by":"crossref","first-page":"1628","DOI":"10.1126\/science.1226355","article-title":"Next-generation digital information storage in DNA","volume":"337","author":"Church","year":"2012","journal-title":"Science"},{"key":"2023101010525431300_btad572-B8","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1186\/1471-2105-5-113","article-title":"MUSCLE: a multiple sequence alignment method with reduced time and space complexity","volume":"5","author":"Edgar","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023101010525431300_btad572-B9","doi-asserted-by":"crossref","first-page":"950","DOI":"10.1126\/science.aaj2038","article-title":"DNA fountain enables a robust and efficient storage architecture","volume":"355","author":"Erlich","year":"2017","journal-title":"Science"},{"key":"2023101010525431300_btad572-B10","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1038\/nature11875","article-title":"Towards practical, high-capacity, low-maintenance information storage in synthesized DNA","volume":"494","author":"Goldman","year":"2013","journal-title":"Nature"},{"key":"2023101010525431300_btad572-B11","doi-asserted-by":"crossref","first-page":"2552","DOI":"10.1002\/anie.201411378","article-title":"Robust chemical preservation of digital information on DNA in silica with error-correcting codes","volume":"54","author":"Grass","year":"2015","journal-title":"Angew Chem Int Ed Engl"},{"first-page":"1","year":"2021","author":"Hamoum","key":"2023101010525431300_btad572-B12"},{"first-page":"2157","year":"2017","author":"Heckel","key":"2023101010525431300_btad572-B13"},{"key":"2023101010525431300_btad572-B14","doi-asserted-by":"crossref","first-page":"2981","DOI":"10.1038\/s41467-020-16797-2","article-title":"Dynamic and scalable DNA-based information storage","volume":"11","author":"Lin","year":"2020","journal-title":"Nat Commun"},{"key":"2023101010525431300_btad572-B15","doi-asserted-by":"crossref","first-page":"2621","DOI":"10.3390\/electronics12122621","article-title":"Integrating FPGA acceleration in the DNAssim framework for faster DNA-based data storage simulations","volume":"12","author":"Marelli","year":"2023","journal-title":"Electronics"},{"key":"2023101010525431300_btad572-B16","doi-asserted-by":"crossref","first-page":"1358","DOI":"10.1038\/s41467-021-21587-5","article-title":"DNA stability: a central design consideration for DNA data storage systems","volume":"12","author":"Matange","year":"2021","journal-title":"Nat Commun"},{"key":"2023101010525431300_btad572-B17","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1038\/nbt.4079","article-title":"Random access in large-scale DNA data storage","volume":"36","author":"Organick","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2023101010525431300_btad572-B18","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1038\/s41467-020-14319-8","article-title":"Probing the physical limits of reliable DNA data retrieval","volume":"11","author":"Organick","year":"2020","journal-title":"Nat Commun"},{"key":"2023101010525431300_btad572-B19","doi-asserted-by":"crossref","first-page":"18489","DOI":"10.1073\/pnas.2004821117","article-title":"HEDGES error-correcting code for DNA storage corrects indels and allows sequence constraints","volume":"117","author":"Press","year":"2020","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2023101010525431300_btad572-B20","first-page":"3360","volume-title":"Advances in Neural Information Processing Systems 30","author":"Rashtchian","year":"2017"},{"key":"2023101010525431300_btad572-B21","doi-asserted-by":"crossref","first-page":"720","DOI":"10.1093\/bioinformatics\/btaa740","article-title":"SOLQC: synthetic oligo library quality control tool","volume":"37","author":"Sabary","year":"2021","journal-title":"Bioinformatics"},{"key":"2023101010525431300_btad572-B22","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1021\/acssynbio.9b00100","article-title":"Driving the scalability of DNA-based information storage systems","volume":"8","author":"Tomek","year":"2019","journal-title":"ACS Synth Biol"},{"key":"2023101010525431300_btad572-B23","doi-asserted-by":"crossref","first-page":"3518","DOI":"10.1038\/s41467-021-23669-w","article-title":"Promiscuous molecules for smarter file operations in DNA-based data storage","volume":"12","author":"Tomek","year":"2021","journal-title":"Nat Commun"},{"key":"2023101010525431300_btad572-B24","doi-asserted-by":"crossref","first-page":"5011","DOI":"10.1038\/s41598-017-05188-1","article-title":"Portable and error-free DNA-based data storage","volume":"7","author":"Yazdi","year":"2017","journal-title":"Sci Rep"},{"key":"2023101010525431300_btad572-B25","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1186\/s12859-022-04723-w","article-title":"DeSP: a systematic DNA storage error simulation pipeline","volume":"23","author":"Yuan","year":"2022","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad572\/51606634\/btad572.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad572\/51972107\/btad572.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad572\/51972107\/btad572.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,10]],"date-time":"2023-10-10T10:54:01Z","timestamp":1696935241000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad572\/7274858"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,9,15]]},"references-count":25,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2023,10,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad572","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2023,10,1]]},"published":{"date-parts":[[2023,9,15]]},"article-number":"btad572"}}