{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,20]],"date-time":"2026-06-20T01:25:27Z","timestamp":1781918727013,"version":"3.54.5"},"reference-count":44,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T00:00:00Z","timestamp":1701907200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>Ancient DNA is highly degraded, resulting in very short sequences. Reads generated with modern high-throughput sequencing machines are generally longer than ancient DNA molecules, therefore the reads often contain some portion of the sequencing adaptors. It is crucial to remove those adaptors, as they can interfere with downstream analysis. Furthermore, overlapping portions when DNA has been read forward and backward (paired-end) can be merged to correct sequencing errors and improve read quality. Several tools have been developed for adapter trimming and read merging, however, no one has attempted to evaluate their accuracy and evaluate their potential impact on downstream analyses. Through the simulation of sequencing data, seven commonly used tools were analyzed in their ability to reconstruct ancient DNA sequences through read merging. The analyzed tools exhibit notable differences in their abilities to correct sequence errors and identify the correct read overlap, but the most substantial difference is observed in their ability to calculate quality scores for merged bases. Selecting the most appropriate tool for a given project depends on several factors, although some tools such as fastp have some shortcomings, whereas others like leeHom outperform the other tools in most aspects. While the choice of tool did not result in a measurable difference when analyzing population genetics using principal component analysis, it is important to note that downstream analyses that are sensitive to wrongly merged reads or that rely on quality scores can be significantly impacted by the choice of tool.<\/jats:p>","DOI":"10.3389\/fbinf.2023.1260486","type":"journal-article","created":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T08:17:27Z","timestamp":1701937047000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Benchmarking software tools for trimming adapters and merging next-generation sequencing data for ancient DNA"],"prefix":"10.3389","volume":"3","author":[{"given":"Annette","family":"Lien","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Leonardo Pestana","family":"Legori","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Louis","family":"Kraft","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peter Wad","family":"Sackett","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Gabriel","family":"Renaud","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1965","published-online":{"date-parts":[[2023,12,7]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"e0185056","DOI":"10.1371\/journal.pone.0185056","article-title":"BBMerge\u2013accurate paired shotgun read merging via overlap","volume":"12","author":"Bushnell","year":"2017","journal-title":"PloS ONE"},{"key":"B2","doi-asserted-by":"publisher","first-page":"i884","DOI":"10.1093\/bioinformatics\/bty560","article-title":"fastp: an ultra-fast all-in-one FASTQ preprocessor","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"B3","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1093\/biomet\/26.4.404","article-title":"The use of confidence or fiducial limits illustrated in the case of the binomial","volume":"26","author":"Clopper","year":"1934","journal-title":"Biometrika"},{"key":"B4","doi-asserted-by":"publisher","first-page":"e2213563120","DOI":"10.1073\/pnas.2213563120","article-title":"Ancient DNA from a lost Negev Highlands desert grape reveals a Late Antiquity wine lineage","volume":"120","author":"Cohen","year":"2023","journal-title":"Proc. Natl. Acad. Sci."},{"key":"B5","doi-asserted-by":"publisher","first-page":"giab008","DOI":"10.1093\/gigascience\/giab008","article-title":"Twelve years of SAMtools and BCFtools","volume":"10","author":"Danecek","year":"2021","journal-title":"GigaScience"},{"key":"B6","doi-asserted-by":"publisher","first-page":"37","DOI":"10.3389\/fevo.2020.00037","article-title":"Unveiling the ecological applications of ancient DNA from mollusk shells","volume":"8","author":"Der Sarkissian","year":"2020","journal-title":"Front. Ecol. Evol."},{"key":"B7","volume-title":"EIG: eigen tools by nick patterson and alkes price lab","author":"Galinsky","year":"2022"},{"key":"B8","doi-asserted-by":"publisher","first-page":"4743","DOI":"10.1016\/j.cub.2022.09.023","article-title":"The population genomic legacy of the second plague pandemic","volume":"32","author":"Gopalakrishnan","year":"2022","journal-title":"Curr. Biol."},{"key":"B9","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1038\/nature26151","article-title":"Reconstructing the genetic history of late Neanderthals","volume":"555","author":"Hajdinjak","year":"2018","journal-title":"Nature"},{"key":"B10","doi-asserted-by":"publisher","first-page":"3336","DOI":"10.1038\/s41467-018-05649-9","article-title":"Ancient DNA from Chalcolithic Israel reveals the role of population mixture in cultural transformation","volume":"9","author":"Harney","year":"2018","journal-title":"Nat. Commun."},{"key":"B11","doi-asserted-by":"publisher","first-page":"593","DOI":"10.1093\/bioinformatics\/btr708","article-title":"ART: a next-generation sequencing read simulator","volume":"28","author":"Huang","year":"2012","journal-title":"Bioinformatics"},{"key":"B12","volume-title":"SeqPrep: tool for stripping adaptors and\/or merging paired reads with overlap into single reads","author":"John","year":"2016"},{"key":"B13","doi-asserted-by":"publisher","first-page":"20162235","DOI":"10.1098\/rspb.2016.2235","article-title":"Tropical ancient DNA reveals relationships of the extinct Bahamian giant tortoise Chelonoidis alburyorum","volume":"284","author":"Kehlmaier","year":"2017","journal-title":"Proc. R. Soc. B Biol. Sci."},{"key":"B14","doi-asserted-by":"publisher","first-page":"eabq2574","DOI":"10.1126\/sciadv.abq2574","article-title":"Ancient DNA elucidates the lost world of western Indian Ocean giant tortoises and reveals a new extinct species from Madagascar","volume":"9","author":"Kehlmaier","year":"2023","journal-title":"Sci. Adv."},{"key":"B15","doi-asserted-by":"publisher","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","article-title":"Snakemake-a scalable bioinformatics workflow engine","volume":"28","author":"K\u00f6ster","year":"2012","journal-title":"Bioinformatics"},{"key":"B16","doi-asserted-by":"publisher","first-page":"1569","DOI":"10.1038\/s41467-018-03857-x","article-title":"Ancient DNA study reveals HLA susceptibility locus for leprosy in medieval Europeans","volume":"9","author":"Krause-Kyora","year":"","journal-title":"Nat. Commun."},{"key":"B17","doi-asserted-by":"publisher","first-page":"e36666","DOI":"10.7554\/elife.36666","article-title":"Neolithic and medieval virus genomes reveal complex evolution of hepatitis B","volume":"7","author":"Krause-Kyora","year":"","journal-title":"eLife"},{"key":"B18","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1038\/nature13673","article-title":"Ancient human genomes suggest three ancestral populations for present-day Europeans","volume":"513","author":"Lazaridis","year":"2014","journal-title":"Nature"},{"key":"B19","volume-title":"Seqtk: toolkit for processing sequences in FASTA\/Q formats","author":"Li","year":"2018"},{"key":"B20","volume-title":"adna: processing WGS aDNA data using the ReichLab protocol","author":"Li","year":"2019"},{"key":"B21","doi-asserted-by":"publisher","first-page":"589","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long-read alignment with Burrows\u2013Wheeler transform","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"B22","doi-asserted-by":"publisher","first-page":"290","DOI":"10.1038\/s41586-022-04430-9","article-title":"Ancient DNA and deep population structure in sub-Saharan African foragers","volume":"603","author":"Lipson","year":"2022","journal-title":"Nature"},{"key":"B23","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2012\/251364","article-title":"Comparison of next-generation sequencing systems","volume":"2012","author":"Liu","year":"2012","journal-title":"J. Biomed. Biotechnol."},{"key":"B24","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1016\/j.tig.2007.12.007","article-title":"The impact of next-generation sequencing technology on genetics","volume":"24","author":"Mardis","year":"2008","journal-title":"Trends Genet."},{"key":"B25","doi-asserted-by":"publisher","first-page":"390","DOI":"10.1038\/s41586-020-2688-8","article-title":"Population genomics of the Viking world","volume":"585","author":"Margaryan","year":"2020","journal-title":"Nature"},{"key":"B26","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1126\/science.abj6987","article-title":"The complete sequence of a human genome","volume":"376","author":"Nurk","year":"2022","journal-title":"Science"},{"key":"B27","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1038\/s43586-020-00011-0","article-title":"Ancient DNA analysis","volume":"1","author":"Orlando","year":"2021","journal-title":"Nat. Rev. Methods Prim."},{"key":"B28","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1002\/ajpa.22960","article-title":"Successful enrichment and recovery of whole mitochondrial genomes from ancient human dental calculus","volume":"160","author":"Ozga","year":"2016","journal-title":"Am. J. Phys. Anthropol."},{"key":"B29","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1186\/s13059-016-0918-z","article-title":"EAGER: efficient ancient genome reconstruction","volume":"17","author":"Peltzer","year":"2016","journal-title":"Genome Biol."},{"key":"B30","doi-asserted-by":"publisher","first-page":"1623","DOI":"10.3390\/microorganisms10081623","article-title":"A case study for the recovery of authentic microbial ancient DNA from soil samples","volume":"10","author":"P\u00e9rez","year":"2022","journal-title":"Microorganisms"},{"key":"B31","doi-asserted-by":"publisher","first-page":"577","DOI":"10.1093\/bioinformatics\/btw670","article-title":"gargammel: a sequence simulator for ancient DNA","volume":"33","author":"Renaud","year":"2017","journal-title":"Bioinformatics"},{"key":"B32","doi-asserted-by":"publisher","first-page":"e141","DOI":"10.1093\/nar\/gku699","article-title":"leeHom: adaptor trimming and merging for Illumina sequencing reads","volume":"42","author":"Renaud","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"B33","volume-title":"sequenceTools","author":"Schiffels","year":"2022"},{"key":"B34","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1186\/s13104-016-1900-2","article-title":"AdapterRemoval v2: rapid adapter trimming, identification, and read merging","volume":"9","author":"Schubert","year":"2016","journal-title":"BMC Res. notes"},{"key":"B35","doi-asserted-by":"publisher","first-page":"427","DOI":"10.1101\/gr.260141.119","article-title":"Human auditory ossicles as an alternative optimal source of ancient DNA","volume":"30","author":"Sirak","year":"2020","journal-title":"Genome Res."},{"key":"B36","doi-asserted-by":"publisher","first-page":"14528","DOI":"10.1038\/s41598-022-17399-2","article-title":"Extended longevity of DNA preservation in levantine paleolithic sediments, sefunim cave, Israel","volume":"12","author":"Slon","year":"2022","journal-title":"Sci. Rep."},{"key":"B37","doi-asserted-by":"publisher","first-page":"2234","DOI":"10.1038\/s41467-018-04550-9","article-title":"Analysis of 3800-year-old Yersinia pestis genomes suggests Bronze Age origin for bubonic plague","volume":"9","author":"Spyrou","year":"2018","journal-title":"Nat. Commun."},{"key":"B38","doi-asserted-by":"publisher","first-page":"650","DOI":"10.1038\/s42003-020-01372-8","article-title":"Ancient DNA reveals monozygotic newborn twins from the Upper Palaeolithic","volume":"3","author":"Teschler-Nicola","year":"2020","journal-title":"Commun. Biol."},{"key":"B39","doi-asserted-by":"publisher","first-page":"2419","DOI":"10.1111\/2041-210x.13990","article-title":"Iteratively mapping ancient DNA to reconstruct highly divergent mitochondrial genomes: an evaluation of software, parameters and bait reference","volume":"13","author":"Westbury","year":"2022","journal-title":"Methods Ecol. Evol."},{"key":"B40","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/nature21674","article-title":"Neanderthal behaviour, diet, and disease inferred from ancient DNA in dental calculus","volume":"544","author":"Weyrich","year":"2017","journal-title":"Nature"},{"key":"B41","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1038\/s41586-021-03532-0","article-title":"Reconstruction of ancient microbial genomes from the human gut","volume":"594","author":"Wibowo","year":"2021","journal-title":"Nature"},{"key":"B42","doi-asserted-by":"publisher","first-page":"129","DOI":"10.3390\/genes13010129","article-title":"Ancient DNA methods improve forensic DNA profiling of Korean War and World War II unknowns","volume":"13","author":"Zavala","year":"2022","journal-title":"Genes."},{"key":"B43","doi-asserted-by":"publisher","first-page":"1650","DOI":"10.1038\/s41467-023-36845-x","article-title":"Marine ecosystem shifts with deglacial sea-ice loss inferred from ancient DNA shotgun sequencing","volume":"14","author":"Zimmermann","year":"2023","journal-title":"Nat. Commun."},{"key":"B44","doi-asserted-by":"publisher","first-page":"160025","DOI":"10.1038\/sdata.2016.25","article-title":"Extensive sequencing of seven human genomes to characterize benchmark reference materials","volume":"3","author":"Zook","year":"2016","journal-title":"Sci. Data"}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2023.1260486\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,7]],"date-time":"2023-12-07T08:17:31Z","timestamp":1701937051000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2023.1260486\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,7]]},"references-count":44,"alternative-id":["10.3389\/fbinf.2023.1260486"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2023.1260486","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.07.17.549303","asserted-by":"object"}]},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,7]]},"article-number":"1260486"}}