{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T03:04:00Z","timestamp":1771211040020,"version":"3.50.1"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T00:00:00Z","timestamp":1719532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Ken Kennedy Institute Recruiting"},{"name":"Rice University Wagoner Foreign Study Scholarship"},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["P01-AI152999"],"award-info":[{"award-number":["P01-AI152999"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["IIS-2239114"],"award-info":[{"award-number":["IIS-2239114"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"name":"MIM Universal Rules of Live","award":["EF-2126387"],"award-info":[{"award-number":["EF-2126387"]}]},{"name":"European Union\u2019s Horizon 2020"},{"name":"Marie Sk\u0142odowska-Curie","award":["872539"],"award-info":[{"award-number":["872539"]}]},{"name":"Marie Sk\u0142odowska-Curie","award":["956229"],"award-info":[{"award-number":["956229"]}]},{"DOI":"10.13039\/100016662","name":"Carnegie Institution for Science","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100016662","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Department of Energy Joint Genome Institute"},{"DOI":"10.13039\/100006132","name":"Office of Science","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006132","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"Department of Energy","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["2023333162"],"award-info":[{"award-number":["2023333162"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,6,28]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The study of bacterial genome dynamics is vital for understanding the mechanisms underlying microbial adaptation, growth, and their impact on host phenotype. Structural variants (SVs), genomic alterations of 50 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to the absence of clear reference genomes and the presence of mixed strains. In response, our proposed method rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by encompassing all metagenomic samples in a series (time or other metric) into a single co-assembly graph. The log fold change in graph coverage between successive samples is then calculated to call SVs that are thriving or declining.<\/jats:p><jats:p>Results: We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, particularly as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between successive time and temperature samples, suggesting host advantage. Our approach leverages previous work in assembly graph structural and coverage patterns to provide versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial gene flux.<\/jats:p><jats:p>Availability and implementation: rhea is open source and available at: https:\/\/github.com\/treangenlab\/rhea.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btae224","type":"journal-article","created":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T09:36:24Z","timestamp":1719567384000},"page":"i58-i67","source":"Crossref","is-referenced-by-count":4,"title":["Reference-free structural variant detection in microbiomes via long-read co-assembly graphs"],"prefix":"10.1093","volume":"40","author":[{"given":"Kristen D","family":"Curry","sequence":"first","affiliation":[{"name":"Department of Computer Science, Rice University , 6100 Main St. , Houston, TX 77005, United States"},{"name":"Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Universit\u00e9 Paris Cit\u00e9, CNRS, UMR3525 , Paris 75015, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3416-3046","authenticated-orcid":false,"given":"Feiqiao Brian","family":"Yu","sequence":"additional","affiliation":[{"name":"Arc Institute , Palo Alto, CA 94304, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Summer E","family":"Vance","sequence":"additional","affiliation":[{"name":"Department of Environmental Science, Policy, and Management, University of California , Berkeley, CA 94720, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Santiago","family":"Segarra","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, Rice University , Houston, TX 77005, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Devaki","family":"Bhaya","sequence":"additional","affiliation":[{"name":"Carnegie Institution for Science, Department of Plant Biology , Stanford, CA 94305, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rayan","family":"Chikhi","sequence":"additional","affiliation":[{"name":"Department of Computational Biology, Institut Pasteur, Universit\u00e9 Paris Cit\u00e9 , Paris 75015, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eduardo P C","family":"Rocha","sequence":"additional","affiliation":[{"name":"Department of Genomes and Genetics, Microbial Evolutionary Genomics, Institut Pasteur, Universit\u00e9 Paris Cit\u00e9, CNRS, UMR3525 , Paris 75015, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Todd J","family":"Treangen","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Rice University , 6100 Main St. , Houston, TX 77005, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2024,6,28]]},"reference":[{"key":"2024062809083510600_btae224-B1","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1186\/s13059-023-03038-0","article-title":"DIVE: a reference-free statistical approach to diversity-generating and mobile genetic element discovery","volume":"24","author":"Abante","year":"2023","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B2","doi-asserted-by":"crossref","first-page":"1143","DOI":"10.1038\/s41592-023-01932-w","article-title":"A survey of algorithms for the detection of genomic structural variants from long-read sequencing data","volume":"20","author":"Ahsan","year":"2023","journal-title":"Nat Methods"},{"key":"2024062809083510600_btae224-B3","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol"},{"key":"2024062809083510600_btae224-B4","author":"Balaji","year":"2023"},{"key":"2024062809083510600_btae224-B5","first-page":"1","article-title":"High-quality metagenome assembly from long accurate reads with metaMDBG","author":"Benoit","year":"2024","journal-title":"Nat Biotechnol"},{"key":"2024062809083510600_btae224-B6","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1038\/ismej.2007.46","article-title":"Population level functional diversity in a microbial community revealed by comparative genomic and metagenomic analyses","volume":"1","author":"Bhaya","year":"2007","journal-title":"ISME J"},{"key":"2024062809083510600_btae224-B7","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1186\/s13059-019-1760-x","article-title":"Assignment of virus and antimicrobial resistance genes to microbial hosts in a complex microbial community by combined long-read assembly and proximity ligation","volume":"20","author":"Bickhart","year":"2019","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B8","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1038\/s41579-021-00534-7","article-title":"Examining horizontal gene transfer in microbial communities","volume":"19","author":"Brito","year":"2021","journal-title":"Nat Rev Microbiol"},{"key":"2024062809083510600_btae224-B9","doi-asserted-by":"crossref","first-page":"5315","DOI":"10.1093\/bioinformatics\/btac672","article-title":"GTDB-Tk v2: memory friendly classification with the genome taxonomy database","volume":"38","author":"Chaumeil","year":"2022","journal-title":"Bioinformatics"},{"key":"2024062809083510600_btae224-B10","doi-asserted-by":"crossref","first-page":"3175","DOI":"10.1038\/s41467-022-30857-9","article-title":"Short- and long-read metagenomics expand individualized structural variations in gut microbiomes","volume":"13","author":"Chen","year":"2022","journal-title":"Nat Commun"},{"key":"2024062809083510600_btae224-B11","doi-asserted-by":"crossref","first-page":"912","DOI":"10.1038\/s41564-019-0473-y","article-title":"Microbiome genome structure drives function","volume":"4","author":"Durrant","year":"2019","journal-title":"Nat Microbiol"},{"key":"2024062809083510600_btae224-B12","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-019-1791-3","article-title":"MetaCarvel: linking assembly graph motifs to biological variants","volume":"20","author":"Ghurye","year":"2019","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B13","first-page":"353","article-title":"Metagenomic assembly: overview, challenges and applications","volume":"89","author":"Ghurye","year":"2016","journal-title":"Yale J Biol Med"},{"key":"2024062809083510600_btae224-B14","author":"Gupta"},{"key":"2024062809083510600_btae224-B15","doi-asserted-by":"crossref","first-page":"11","DOI":"10.25080\/TCWV9851","article-title":"Exploring network structure, dynamics, and function using networkx","author":"Hagberg","year":"2008","journal-title":"Proceedings of the 7th Python in Science Conference (SciPy2008)"},{"key":"2024062809083510600_btae224-B16","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1038\/ng.1028","article-title":"De novo assembly and genotyping of variants using colored de bruijn graphs","volume":"44","author":"Iqbal","year":"2012","journal-title":"Nat Genet"},{"key":"2024062809083510600_btae224-B17","doi-asserted-by":"crossref","first-page":"14061","DOI":"10.1038\/ncomms14061","article-title":"Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast","volume":"8","author":"Jeffares","year":"2017","journal-title":"Nat Commun"},{"key":"2024062809083510600_btae224-B18","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1126\/science.aau5238","article-title":"Invertible promoters mediate bacterial phase variation, antibiotic resistance, and host adaptation in the gut","volume":"363","author":"Jiang","year":"2019","journal-title":"Science"},{"key":"2024062809083510600_btae224-B19","doi-asserted-by":"crossref","first-page":"e7359","DOI":"10.7717\/peerj.7359","article-title":"MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies","volume":"7","author":"Kang","year":"2019","journal-title":"PeerJ"},{"key":"2024062809083510600_btae224-B20","doi-asserted-by":"crossref","first-page":"e16695","DOI":"10.7717\/peerj.16695","article-title":"Metagenomic assembly is the main bottleneck in the identification of mobile genetic elements","volume":"12","author":"Kerkvliet","year":"2024","journal-title":"PeerJ"},{"key":"2024062809083510600_btae224-B21","doi-asserted-by":"crossref","first-page":"1103","DOI":"10.1038\/s41592-020-00971-x","article-title":"metaFlye: scalable long-read metagenome assembly using repeat graphs","volume":"17","author":"Kolmogorov","year":"2020","journal-title":"Nat Methods"},{"key":"2024062809083510600_btae224-B22","doi-asserted-by":"crossref","first-page":"2103","DOI":"10.1093\/bioinformatics\/btw152","article-title":"Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences","volume":"32","author":"Li","year":"2016","journal-title":"Bioinformatics"},{"key":"2024062809083510600_btae224-B23","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1186\/s13059-020-02168-z","article-title":"The design and construction of reference pangenome graphs with minigraph","volume":"21","author":"Li","year":"2020","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B24","doi-asserted-by":"crossref","first-page":"e139","DOI":"10.1002\/imt2.139","article-title":"MetaSVs: a pipeline combining long and short reads for analysis and visualization of structural variants in metagenomes","volume":"2","author":"Li","year":"2023","journal-title":"iMeta"},{"key":"2024062809083510600_btae224-B25","doi-asserted-by":"crossref","first-page":"7421","DOI":"10.1038\/s41467-023-42997-7","article-title":"Gut microbial structural variation associates with immune checkpoint inhibitor response","volume":"14","author":"Liu","year":"2023","journal-title":"Nat Commun"},{"key":"2024062809083510600_btae224-B26","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1186\/s13059-019-1828-7","article-title":"Structural variant calling: the long and the short of it","volume":"20","author":"Mahmoud","year":"2019","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B27","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1038\/s42003-018-0023-9","article-title":"Genome-wide somatic variant calling using localized colored de bruijn graphs","volume":"1","author":"Narzisi","year":"2018","journal-title":"Commun Biol"},{"key":"2024062809083510600_btae224-B28","doi-asserted-by":"crossref","first-page":"5458","DOI":"10.1128\/AEM.05090-11","article-title":"Analysis of insertion sequences in thermophilic cyanobacteria: exploring the mechanisms of establishing, maintaining, and withstanding high insertion sequence abundance","volume":"77","author":"Nelson","year":"2011","journal-title":"Applied and Environmental Microbiology"},{"key":"2024062809083510600_btae224-B29","doi-asserted-by":"crossref","first-page":"2826","DOI":"10.1093\/bioinformatics\/btt502","article-title":"Exploring variation-aware contig graphs for (comparative) metagenomics using MaryGold","volume":"29","author":"Nijkamp","year":"2013","journal-title":"Bioinformatics"},{"key":"2024062809083510600_btae224-B30","doi-asserted-by":"crossref","first-page":"3242","DOI":"10.1093\/bioinformatics\/btaa115","article-title":"MUM&Co: accurate detection of all SV types through whole-genome alignment","volume":"36","author":"O\u2019Donnell","year":"2020","journal-title":"Bioinformatics"},{"key":"2024062809083510600_btae224-B31","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1038\/35012500","article-title":"Lateral gene transfer and the nature of bacterial innovation","volume":"405","author":"Ochman","year":"2000","journal-title":"Nature"},{"key":"2024062809083510600_btae224-B32","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1186\/s13059-021-02419-7","article-title":"STRONG: metagenomics strain resolution on assembly graphs","volume":"22","author":"Quince","year":"2021","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B33","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1016\/j.mib.2004.08.006","article-title":"Order and disorder in bacterial genomes","volume":"7","author":"Rocha","year":"2004","journal-title":"Curr Opin Microbiol"},{"key":"2024062809083510600_btae224-B34","doi-asserted-by":"crossref","first-page":"1338","DOI":"10.1093\/molbev\/msy078","article-title":"Neutral theory, microbial practice: challenges in bacterial population genetics","volume":"35","author":"Rocha","year":"2018","journal-title":"Mol Biol Evol"},{"key":"2024062809083510600_btae224-B35","doi-asserted-by":"crossref","first-page":"954","DOI":"10.1101\/gr.170431.113","article-title":"Polymerase theta-mediated end joining of replication-associated DNA breaks in C. elegans","volume":"24","author":"Roerink","year":"2014","journal-title":"Genome Res"},{"key":"2024062809083510600_btae224-B36","doi-asserted-by":"crossref","first-page":"e00701\u201322","DOI":"10.1128\/msystems.00701-22","article-title":"Longitudinal, multi-platform metagenomics yields a high-quality genomic catalog and guides an in vitro model for cheese communities","volume":"8","author":"Saak","year":"2023","journal-title":"mSystems"},{"key":"2024062809083510600_btae224-B37","doi-asserted-by":"crossref","first-page":"e4015","DOI":"10.7717\/peerj.4015","article-title":"HgtSIM: a simulator for horizontal gene transfer (HGT) in microbial communities","volume":"5","author":"Song","year":"2017","journal-title":"PeerJ"},{"key":"2024062809083510600_btae224-B38","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1186\/s40168-019-0649-y","article-title":"MetaCHIP: community-level horizontal gene transfer identification through the combination of best-match and phylogenetic approaches","volume":"7","author":"Song","year":"2019","journal-title":"Microbiome"},{"key":"2024062809083510600_btae224-B39","doi-asserted-by":"crossref","first-page":"102192","DOI":"10.1016\/j.mib.2022.102192","article-title":"From genome structure to function: insights into structural variation in microbiology","volume":"69","author":"West","year":"2022","journal-title":"Curr Opin Microbiol"},{"key":"2024062809083510600_btae224-B40","doi-asserted-by":"crossref","first-page":"3350","DOI":"10.1093\/bioinformatics\/btv383","article-title":"Bandage: interactive visualization of de novo genome assemblies","volume":"31","author":"Wick","year":"2015","journal-title":"Bioinformatics"},{"key":"2024062809083510600_btae224-B41","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1186\/s13059-019-1891-0","article-title":"Improved metagenomic analysis with kraken 2","volume":"20","author":"Wood","year":"2019","journal-title":"Genome Biol"},{"key":"2024062809083510600_btae224-B42","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/gix010","article-title":"NanoSim: nanopore sequence read simulator based on statistical characterization","volume":"6","author":"Yang","year":"2017","journal-title":"Gigascience"},{"key":"2024062809083510600_btae224-B43","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1038\/nm.4002","article-title":"Systematic discovery of complex indels in human cancers","volume":"22","author":"Ye","year":"2016","journal-title":"Nat Med"},{"key":"2024062809083510600_btae224-B44","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/s41586-019-1065-y","article-title":"Structural variation in the gut microbiome associates with host health","volume":"568","author":"Zeevi","year":"2019","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i58\/58354930\/btae224.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i58\/58354930\/btae224.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,16]],"date-time":"2024-11-16T13:43:21Z","timestamp":1731764601000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/40\/Supplement_1\/i58\/7700881"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,28]]},"references-count":44,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2024,6,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae224","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,6,28]]}}}