{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:48:21Z","timestamp":1775638101345,"version":"3.50.1"},"reference-count":39,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,11,15]],"date-time":"2024-11-15T00:00:00Z","timestamp":1731628800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>The implementation of Hi-C reads in the <jats:italic>de novo<\/jats:italic> genome assembly process allows the ordering of large regions of the genome in scaffolds and the generation of chromosome-level assemblies. Several bioinformatics tools have been developed for genome scaffolding with Hi-C, and each tool has advantages and disadvantages that need to be carefully evaluated before their adoption. We generated two <jats:italic>de novo<\/jats:italic> assemblies of <jats:italic>Arabidopsis thaliana<\/jats:italic> obtained from the same raw PacBio HiFi and Oxford Nanopore Technologies data. We scaffolded the assemblies implementing Hi-C reads with the scaffolders 3D-DNA, SALSA2, and YaHS, with the aim of identifying the tool providing the most accurate assembly. The scaffolded assemblies were evaluated according to contiguity, completeness, accuracy, and structural correctness. In our analysis, YaHS proved to be the best-performing bioinformatics tool for scaffolding <jats:italic>de novo<\/jats:italic> genome assemblies in <jats:italic>Arabidopsis thaliana<\/jats:italic>.<\/jats:p>","DOI":"10.3389\/fbinf.2024.1462923","type":"journal-article","created":{"date-parts":[[2024,11,15]],"date-time":"2024-11-15T06:15:32Z","timestamp":1731651332000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Benchmarking of Hi-C tools for scaffolding plant genomes obtained from PacBio HiFi and ONT reads"],"prefix":"10.3389","volume":"4","author":[{"given":"Lia","family":"Obinu","sequence":"first","affiliation":[]},{"given":"Urmi","family":"Trivedi","sequence":"additional","affiliation":[]},{"given":"Andrea","family":"Porceddu","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,11,15]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"giz068","DOI":"10.1093\/gigascience\/giz068","article-title":"How can a high-quality genome assembly help plant breeders?","volume":"8","author":"Benevenuto","year":"2019","journal-title":"GigaScience"},{"key":"B2","doi-asserted-by":"publisher","first-page":"1119","DOI":"10.1038\/nbt.2727","article-title":"Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions","volume":"31","author":"Burton","year":"2013","journal-title":"Nat. Biotechnol."},{"key":"B3","doi-asserted-by":"publisher","first-page":"1361","DOI":"10.1534\/G3.119.400908","article-title":"Blobtoolkit \u2013 interactive quality assessment of genome assemblies","volume":"10","author":"Challis","year":"2020","journal-title":"G3 Genes\u2014 Genomes\u2014 Genet."},{"key":"B4","doi-asserted-by":"publisher","first-page":"170","DOI":"10.1038\/s41592-020-01056-5","article-title":"Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm","volume":"18","author":"Cheng","year":"2021","journal-title":"Nat. Methods"},{"key":"B5","doi-asserted-by":"publisher","first-page":"2666","DOI":"10.1093\/bioinformatics\/bty149","article-title":"Nanopack: visualizing and processing long-read sequencing data","volume":"34","author":"Coster","year":"2018","journal-title":"Bioinformatics"},{"key":"B6","doi-asserted-by":"publisher","first-page":"92","DOI":"10.1126\/science.aal3327","article-title":"De novo assembly of the aedes aegypti genome using hi-c yields chromosome-length scaffolds","volume":"356","author":"Dudchenko","year":"2017","journal-title":"Science"},{"key":"B7","doi-asserted-by":"publisher","DOI":"10.1101\/254797","article-title":"The juicebox assembly tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under $1000","author":"Dudchenko","year":"2018","journal-title":"bioRxiv"},{"key":"B8","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1016\/j.cels.2016.07.002","article-title":"Juicer provides a one-click system for analyzing loop-resolution hi-c experiments","volume":"3","author":"Durand","year":"2016","journal-title":"Cell Syst."},{"key":"B9","doi-asserted-by":"publisher","first-page":"10069944","DOI":"10.1371\/journal.pcbi.1006994","article-title":"Modern technologies and algorithms for scaffolding assembled genomes","volume":"15","author":"Ghurye","year":"2019","journal-title":"PLoS Comput. Biol."},{"key":"B10","doi-asserted-by":"publisher","first-page":"527","DOI":"10.1186\/s12864-017-3879-z","article-title":"Scaffolding of long read assemblies using long range contact information","volume":"18","author":"Ghurye","year":"2017","journal-title":"BMC Genomics"},{"key":"B11","doi-asserted-by":"publisher","first-page":"10072733","DOI":"10.1371\/journal.pcbi.1007273","article-title":"Integrating hi-c links with assembly graphs for chromosome-scale assembly","volume":"15","author":"Ghurye","year":"2019","journal-title":"PLoS Comput. Biol."},{"key":"B12","doi-asserted-by":"publisher","first-page":"2896","DOI":"10.1093\/BIOINFORMATICS\/BTAA025","article-title":"Identifying and removing haplotypic duplication in primary genome assemblies","volume":"36","author":"Guan","year":"2020","journal-title":"Bioinforma. Oxf. Engl."},{"key":"B13","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1186\/s12859-021-04453-5","article-title":"Efficient iterative hi-c scaffolder based on n-best neighbors","volume":"22","author":"Guan","year":"2021","journal-title":"BMC Bioinforma."},{"key":"B14","doi-asserted-by":"publisher","first-page":"2147","DOI":"10.3390\/genes14122147","article-title":"Comparison of hi-c-based scaffolding tools on plant genomes","volume":"14","author":"Hou","year":"2023","journal-title":"Genes"},{"key":"B15","doi-asserted-by":"publisher","first-page":"700","DOI":"10.1016\/J.TPLANTS.2019.05.003","article-title":"Tools and strategies for long-read sequencing and de novo assembly of plant genomes","volume":"24","author":"Jung","year":"2019","journal-title":"Trends Plant Sci."},{"key":"B16","doi-asserted-by":"publisher","first-page":"344","DOI":"10.1093\/BIOINFORMATICS\/BTAB672","article-title":"Tiara: deep learning-based classification system for eukaryotic sequences","volume":"38","author":"Karlicki","year":"2022","journal-title":"Bioinformatics"},{"key":"B17","doi-asserted-by":"publisher","first-page":"796","DOI":"10.1038\/35048692","article-title":"Analysis of the genome sequence of the flowering plant arabidopsis thaliana","volume":"408","author":"Kaul","year":"2000","journal-title":"Nat. 2000"},{"key":"B18","doi-asserted-by":"publisher","first-page":"540","DOI":"10.1038\/s41587-019-0072-8","article-title":"Assembly of long, error-prone reads using repeat graphs","volume":"37","author":"Kolmogorov","year":"2019","journal-title":"Nat. Biotechnol."},{"key":"B19","doi-asserted-by":"publisher","first-page":"909","DOI":"10.1111\/j.1365-313X.2009.04086.x","article-title":"The development of arabidopsis as a model plant","volume":"61","author":"Koornneef","year":"2010","journal-title":"Plant J."},{"key":"B20","doi-asserted-by":"publisher","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k -mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res."},{"key":"B21","doi-asserted-by":"publisher","first-page":"e2115639118","DOI":"10.1073\/pnas.2115639118","article-title":"Standards recommendations for the earth biogenome project","volume":"119","author":"Lawniczak","year":"2022","journal-title":"Proc. Natl. Acad. Sci."},{"key":"B22","doi-asserted-by":"publisher","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"B23","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1126\/science.1181369","article-title":"Comprehensive mapping of long-range interactions reveals folding principles of the human genome","volume":"33292","author":"Lieberman-aiden","year":"2009","journal-title":"Science"},{"key":"B24","doi-asserted-by":"publisher","first-page":"bbab033","DOI":"10.1093\/bib\/bbab033","article-title":"A comprehensive review of scaffolding methods in genome assembly","volume":"22","author":"Luo","year":"2021","journal-title":"Briefings Bioinforma."},{"key":"B25","doi-asserted-by":"publisher","first-page":"4647","DOI":"10.1093\/MOLBEV\/MSAB199","article-title":"Busco update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes","volume":"38","author":"Manni","year":"2021","journal-title":"Mol. Biol. Evol."},{"key":"B26","doi-asserted-by":"publisher","first-page":"10059444","DOI":"10.1371\/journal.pcbi.1005944","article-title":"Mummer4: a fast and versatile genome alignment system","volume":"14","author":"Mar\u00e7ais","year":"2018","journal-title":"PLoS Comput. Biol."},{"key":"B27","doi-asserted-by":"publisher","first-page":"5695","DOI":"10.1038\/ncomms6695","article-title":"High-quality genome (re)assembly using chromosomal contact data","volume":"5","author":"Marie-Nelly","year":"2014","journal-title":"Nat. Commun."},{"key":"B28","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1016\/j.pbi.2019.12.009","article-title":"Building near-complete plant genomes","volume":"54","author":"Michael","year":"2020","journal-title":"Curr. Opin. Plant Biol."},{"key":"B29","doi-asserted-by":"publisher","first-page":"i142","DOI":"10.1093\/bioinformatics\/bty266","article-title":"Versatile genome assembly evaluation with quast-lg","volume":"34","author":"Mikheenko","year":"2018","journal-title":"Bioinformatics"},{"key":"B30","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1186\/s13059-020-02134-9","article-title":"Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies","volume":"21","author":"Rhie","year":"2020","journal-title":"Genome Biol."},{"key":"B31","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1270\/jsbbs.20146","article-title":"Chromosome-level de novo genome assemblies of over 100 plant species","volume":"71","author":"Shirasawa","year":"2021","journal-title":"Breed. Sci."},{"key":"B32","doi-asserted-by":"publisher","first-page":"1639","DOI":"10.1093\/BIOINFORMATICS\/BTAA1016","article-title":"Liftoff: accurate mapping of gene annotations","volume":"37","author":"Shumate","year":"2021","journal-title":"Bioinformatics"},{"key":"B33","doi-asserted-by":"publisher","DOI":"10.1101\/2022.04.20.488415","article-title":"A benchmark of hi-c scaffolders using reference genomes and de novo assemblies","volume":"2022","author":"Sur","year":"2022","journal-title":"bioRxiv"},{"key":"B34","doi-asserted-by":"publisher","first-page":"737","DOI":"10.1101\/GR.214270.116","article-title":"Fast and accurate de novo genome assembly from long uncorrected reads","volume":"27","author":"Vaser","year":"2017","journal-title":"Genome Res."},{"key":"B35","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1016\/j.gpb.2021.08.003","article-title":"High-quality arabidopsis thaliana genome assembly with nanopore and hifi long reads","volume":"20","author":"Wang","year":"2022","journal-title":"Genomics, Proteomics & Bioinforma."},{"key":"B36","doi-asserted-by":"publisher","first-page":"5923","DOI":"10.1111\/mec.16146","article-title":"Technical considerations in hi-c scaffolding and evaluation of chromosome-scale genome assemblies","volume":"30","author":"Yamaguchi","year":"2021","journal-title":"Mol. Ecol."},{"key":"B37","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1038\/s41477-019-0487-8","article-title":"Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on hi-c data","volume":"5","author":"Zhang","year":"2019","journal-title":"Nat. Plants"},{"key":"B38","doi-asserted-by":"publisher","first-page":"btac808","DOI":"10.1093\/bioinformatics\/btac808","article-title":"YaHS: yet another Hi-C scaffolding tool","volume":"39","author":"Zhou","year":"2023","journal-title":"J. Bioinform."},{"key":"B39","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1016\/S1871-0069(06)02009-X","article-title":"Chapter 9 a single molecule system for whole genome analysis","volume":"2","author":"Zhou","year":"2007","journal-title":"Perspect. Bioanal."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1462923\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,15]],"date-time":"2024-11-15T06:15:34Z","timestamp":1731651334000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2024.1462923\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,15]]},"references-count":39,"alternative-id":["10.3389\/fbinf.2024.1462923"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2024.1462923","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,15]]},"article-number":"1462923"}}