{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:22Z","timestamp":1772138062199,"version":"3.50.1"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2023,9,27]],"date-time":"2023-09-27T00:00:00Z","timestamp":1695772800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Research Council Horizon 2020","award":["802567"],"award-info":[{"award-number":["802567"]}]},{"DOI":"10.13039\/501100003977","name":"Israel Science Foundation","doi-asserted-by":"publisher","award":["1782\/22"],"award-info":[{"award-number":["1782\/22"]}],"id":[{"id":"10.13039\/501100003977","id-type":"DOI","asserted-by":"publisher"}]},{"name":"European Research Council consolidator","award":["817811"],"award-info":[{"award-number":["817811"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,10,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Optical genome mapping (OGM) is a technique that extracts partial genomic information from optically imaged and linearized DNA fragments containing fluorescently labeled short sequence patterns. This information can be used for various genomic analyses and applications, such as the detection of structural variations and copy-number variations, epigenomic profiling, and microbial species identification. Currently, the choice of labeled patterns is based on the available biochemical methods and is not necessarily optimized for the application.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In this work, we develop a model of OGM based on information theory, which enables the design of optimal labeling patterns for specific applications and target organism genomes. We validated the model through experimental OGM on human DNA and simulations on bacterial DNA. Our model predicts up to 10-fold improved accuracy by optimal choice of labeling patterns, which may guide future development of OGM biochemical labeling methods and significantly improve its accuracy and yield for applications such as epigenomic profiling and cultivation-free pathogen identification in clinical samples.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/yevgenin\/PatternCode<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad601","type":"journal-article","created":{"date-parts":[[2023,9,26]],"date-time":"2023-09-26T16:11:46Z","timestamp":1695744706000},"source":"Crossref","is-referenced-by-count":8,"title":["Design of optimal labeling patterns for optical genome mapping via information theory"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7484-727X","authenticated-orcid":false,"given":"Yevgeni","family":"Nogin","sequence":"first","affiliation":[{"name":"Russell Berrie Nanotechnology Institute, Technion , Haifa 320003, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6766-1450","authenticated-orcid":false,"given":"Daniella","family":"Bar-Lev","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Technion , Haifa 320003, Israel"}]},{"given":"Dganit","family":"Hanania","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Technion , Haifa 320003, Israel"}]},{"given":"Tahir","family":"Detinis Zur","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University , Tel Aviv 6997801, Israel"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7107-7529","authenticated-orcid":false,"given":"Yuval","family":"Ebenstein","sequence":"additional","affiliation":[{"name":"Department of Chemistry, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University , Tel Aviv 6997801, Israel"},{"name":"Department of Biomedical Engineering, Faculty of Engineering, Tel Aviv University , Tel Aviv 6997801, Israel"}]},{"given":"Eitan","family":"Yaakobi","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Technion , Haifa 320003, Israel"}]},{"given":"Nir","family":"Weinberger","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, Technion , Haifa 320003, Israel"}]},{"given":"Yoav","family":"Shechtman","sequence":"additional","affiliation":[{"name":"Russell Berrie Nanotechnology Institute, Technion , Haifa 320003, Israel"},{"name":"Department of Biomedical Engineering, Technion , Haifa 320003, Israel"},{"name":"Lorry I. Lokey Center for Life Sciences and Engineering, Technion , Haifa 320003, Israel"}]}],"member":"286","published-online":{"date-parts":[[2023,9,27]]},"reference":[{"key":"2023101010441669100_btad601-B1","doi-asserted-by":"crossref","first-page":"e8","DOI":"10.1093\/nar\/gkaa1088","article-title":"Customized optical mapping by CRISPR\u2013Cas9 mediated DNA labeling with multiple sgRNAs","volume":"49","author":"Abid","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B2","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1007\/3-540-44696-6_3","volume-title":"Algorithms in Bioinformatics: First International Workshop, WABI 2001, \u00c5rhus, Denmark, August 28\u201331, 2001 Proceedings","author":"Anantharaman","year":"2001"},{"key":"2023101010441669100_btad601-B3","doi-asserted-by":"crossref","first-page":"lqz007","DOI":"10.1093\/nargab\/lqz007","article-title":"Identifying microbial species by single-molecule DNA optical mapping and resampling statistics","volume":"2","author":"Bouwens","year":"2020","journal-title":"NAR Genom Bioinform"},{"key":"2023101010441669100_btad601-B4","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/biomet\/26.4.404","article-title":"The use of confidence or fiducial limits illustrated in the case of the binomial","volume":"26","author":"Clopper","year":"1934","journal-title":"Biometrika"},{"key":"2023101010441669100_btad601-B5","volume-title":"Elements of Information Theory","author":"Cover"},{"key":"2023101010441669100_btad601-B6","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/nchembio754","article-title":"Direct transfer of extended groups from synthetic cofactors by DNA methyltransferases","volume":"2","author":"Dalhoff","year":"2006","journal-title":"Nat Chem Biol"},{"key":"2023101010441669100_btad601-B7","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1021\/nn5063497","article-title":"Combing of genomic DNA from droplets containing picograms of material","volume":"9","author":"Deen","year":"2015","journal-title":"ACS Nano"},{"key":"2023101010441669100_btad601-B8","doi-asserted-by":"crossref","first-page":"5182","DOI":"10.1002\/anie.201608625","article-title":"Methyltransferase-directed labeling of biomolecules and its applications","volume":"56","author":"Deen","year":"2017","journal-title":"Angew Chem Int Ed Engl"},{"key":"2023101010441669100_btad601-B9","doi-asserted-by":"crossref","first-page":"100248","DOI":"10.1016\/j.patter.2021.100248","article-title":"Fandom: fast nested distance-based seeding of optical maps","volume":"2","author":"Dehkordi","year":"2021","journal-title":"Patterns (N Y)"},{"key":"2023101010441669100_btad601-B10","doi-asserted-by":"crossref","first-page":"eabf7117","DOI":"10.1126\/science.abf7117","article-title":"Haplotype-resolved diverse human genomes and integrated analysis of structural variation","volume":"372","author":"Ebert","year":"2021","journal-title":"Science"},{"key":"2023101010441669100_btad601-B11","doi-asserted-by":"crossref","first-page":"e92","DOI":"10.1093\/nar\/gkac460","article-title":"Chemoenzymatic labeling of DNA methylation patterns for single-molecule epigenetic mapping","volume":"50","author":"Gabrieli","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B12","doi-asserted-by":"crossref","first-page":"7148","DOI":"10.1021\/acsnano.8b03023","article-title":"Epigenetic optical mapping of 5-hydroxymethylcytosine in nanochannel arrays","volume":"12","author":"Gabrieli","year":"2018","journal-title":"ACS Nano"},{"key":"2023101010441669100_btad601-B13","volume-title":"Information Theory and Reliable Communication","author":"Gallager","year":"1968"},{"key":"2023101010441669100_btad601-B14","doi-asserted-by":"crossref","first-page":"e117","DOI":"10.1093\/nar\/gkv563","article-title":"Bacteriophage strain typing by rapid single molecule analysis","volume":"43","author":"Grunwald","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B15","doi-asserted-by":"crossref","first-page":"4947","DOI":"10.1109\/TIT.2009.2030478","article-title":"Information spectrum approach to second-order coding rate in channel coding","volume":"55","author":"Hayashi","year":"2009","journal-title":"IEEE Trans Inform Theory"},{"key":"2023101010441669100_btad601-B16","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1042\/EBC20200021","article-title":"Single-molecule optical genome mapping in nanochannels: multidisciplinarity at the nanoscale","volume":"65","author":"Jeffet","year":"2021","journal-title":"Essays Biochem"},{"key":"2023101010441669100_btad601-B17","doi-asserted-by":"crossref","first-page":"690","DOI":"10.1016\/j.copbio.2013.01.009","article-title":"Beyond sequencing: optical mapping of DNA in the age of nanotechnology and nanoscopy","volume":"24","author":"Levy-Sakin","year":"2013","journal-title":"Curr Opin Biotechnol"},{"key":"2023101010441669100_btad601-B18","doi-asserted-by":"crossref","first-page":"3216","DOI":"10.1109\/TIT.2018.2809001","article-title":"Models and information-theoretic bounds for nanopore sequencing","volume":"64","author":"Mao","year":"2018","journal-title":"IEEE Trans Inform Theory"},{"key":"2023101010441669100_btad601-B19","doi-asserted-by":"crossref","first-page":"i327","DOI":"10.1093\/bioinformatics\/btab306","article-title":"Long reads capture simultaneous enhancer\u2013promoter methylation status for cell-type deconvolution","volume":"37","author":"Margalit","year":"2021","journal-title":"Bioinformatics"},{"key":"2023101010441669100_btad601-B20","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1186\/2047-217X-3-33","article-title":"Computational methods for optical mapping","volume":"3","author":"Mendelowitz","year":"2014","journal-title":"Gigascience"},{"key":"2023101010441669100_btad601-B21","first-page":"1635","author":"Mohajer","year":"2013"},{"key":"2023101010441669100_btad601-B22","doi-asserted-by":"crossref","first-page":"6273","DOI":"10.1109\/TIT.2013.2270273","article-title":"Information theory of DNA shotgun sequencing","volume":"59","author":"Motahari","year":"2013","journal-title":"IEEE Trans Inform Theory"},{"key":"2023101010441669100_btad601-B23","doi-asserted-by":"crossref","first-page":"e89","DOI":"10.1093\/nar\/gkz489","article-title":"Enzyme-free optical DNA mapping of the human genome using competitive binding","volume":"47","author":"M\u00fcller","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B24","doi-asserted-by":"crossref","first-page":"1076","DOI":"10.1021\/acsinfecdis.9b00464","article-title":"Cultivation-free typing of bacteria using optical DNA mapping","volume":"6","author":"M\u00fcller","year":"2020","journal-title":"ACS Infect Dis"},{"key":"2023101010441669100_btad601-B25","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1039\/c0sc00277a","article-title":"DNA fluorocode: a single molecule, optical map of DNA with nanometre resolution","volume":"1","author":"Neely","year":"2010","journal-title":"Chem Sci"},{"key":"2023101010441669100_btad601-B26","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1002\/bip.21579","article-title":"Optical mapping of DNA: single-molecule-based methods for mapping genomes","volume":"95","author":"Neely","year":"2011","journal-title":"Biopolymers"},{"key":"2023101010441669100_btad601-B27","doi-asserted-by":"crossref","first-page":"btad137","DOI":"10.1093\/bioinformatics\/btad137","article-title":"DeepOM: single-molecule optical genome mapping via deep learning","volume":"39","author":"Nogin","year":"2023","journal-title":"Bioinformatics"},{"key":"2023101010441669100_btad601-B28","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/s43856-023-00259-z","article-title":"Strain-level bacterial typing directly from patient samples using optical DNA mapping","volume":"3","author":"Nyblom","year":"2023","journal-title":"Commun Med (Lond)"},{"key":"2023101010441669100_btad601-B29","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1002\/cbic.200300739","article-title":"Sequence-specific methyltransferase-induced labeling of DNA (smiling Dna)","volume":"5","author":"Pljevalj\u010di\u0107","year":"2004","journal-title":"Chembiochem"},{"key":"2023101010441669100_btad601-B30","author":"Polyanskiy","year":"2010"},{"key":"2023101010441669100_btad601-B31","doi-asserted-by":"crossref","first-page":"2307","DOI":"10.1109\/TIT.2010.2043769","article-title":"Channel coding rate in the finite blocklength regime","volume":"56","author":"Polyanskiy","year":"2010","journal-title":"IEEE Trans Inform Theory"},{"key":"2023101010441669100_btad601-B32","doi-asserted-by":"crossref","first-page":"D298","DOI":"10.1093\/nar\/gku1046","article-title":"Rebase\u2014a database for DNA restriction and modification: enzymes, genes and genomes","volume":"43","author":"Roberts","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B33","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1126\/science.8211116","article-title":"Ordered restriction maps of Saccharomyces cerevisiae chromosomes constructed by optical mapping","volume":"262","author":"Schwartz","year":"1993","journal-title":"Science"},{"key":"2023101010441669100_btad601-B34","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syste Tech J"},{"key":"2023101010441669100_btad601-B35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/0100000086","article-title":"Asymptotic estimates in information theory with non-vanishing error probabilities","volume":"10","author":"Tan","year":"2014","journal-title":"FNT Commun Inform Theory"},{"key":"2023101010441669100_btad601-B36","doi-asserted-by":"crossref","first-page":"11414","DOI":"10.1039\/C9CC05198H","article-title":"Simultaneous detection of multiple DNA damage types by multi-colour fluorescent labelling","volume":"55","author":"Torchinsky","year":"2019","journal-title":"Chem Commun (Camb)"},{"key":"2023101010441669100_btad601-B37","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1089\/cmb.2006.13.442","article-title":"Alignment of optical maps","volume":"13","author":"Valouev","year":"2006","journal-title":"J Comput Biol"},{"key":"2023101010441669100_btad601-B38","doi-asserted-by":"crossref","first-page":"e68","DOI":"10.1093\/nar\/gkz212","article-title":"DNA barcodes for rapid, whole genome, single-molecule analyses","volume":"47","author":"Wand","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023101010441669100_btad601-B39","author":"Weinberger","year":"2023"},{"key":"2023101010441669100_btad601-B40","doi-asserted-by":"crossref","first-page":"045101","DOI":"10.1088\/1361-6528\/aaeddc","article-title":"Microfluidic DNA combing for parallel single-molecule analysis","volume":"30","author":"Wu","year":"2019","journal-title":"Nanotechnology"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad601\/51779691\/btad601.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad601\/51972115\/btad601.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad601\/51972115\/btad601.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,10]],"date-time":"2023-10-10T06:46:05Z","timestamp":1696920365000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad601\/7284110"}},"subtitle":[],"editor":[{"given":"Tobias","family":"Marschall","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,9,27]]},"references-count":40,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2023,10,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad601","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.05.23.541882","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,10,1]]},"published":{"date-parts":[[2023,9,27]]},"article-number":"btad601"}}