{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"institution":[{"name":"bioRxiv"}],"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T10:00:25Z","timestamp":1768471225272,"version":"3.49.0"},"posted":{"date-parts":[[2018,2,18]]},"group-title":"Bioinformatics","reference-count":25,"publisher":"openRxiv","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"accepted":{"date-parts":[[2018,3,5]]},"abstract":"<jats:p>\n                  The general approaches to detect and quantify metagenomic sample composition are based on the alignment of the reads, according to an existing database containing reference microbial sequences. However, without proper parameterization, these methods are not suitable for ancient DNA. Quantifying somewhat dissimilar sequences by alignment methods is problematic, due to the need of fine-tuned thresholds, considering relaxed edit distances and the consequent increase of computational cost. Additionally, the choice of the thresholds poses the problem of how to quantify similarity without producing overestimated measures. We propose FALCON-meta, a compression-based method to infer metagenomic composition of next-generation sequencing samples. This unsupervised alignment-free method runs efficiently on FASTQ samples. FALCON-meta quickly learns how to give importance to the models that cooperate to predict similarity, incorporating parallelism and flexibility for multiple hardware characteristics. It shows substantial identification capabilities in ancient DNA without overestimation. In one of the examples, we found and authenticated an ancient\n                  <jats:italic>Pseudomonas<\/jats:italic>\n                  bacteria in a Mammoth mitogenome.\n                <\/jats:p>\n                <jats:p>\n                  FALCON-meta can be accessed at\n                  <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/pratas\/falcon\">https:\/\/github.com\/pratas\/falcon<\/jats:ext-link>\n                  .\n                <\/jats:p>","DOI":"10.1101\/267179","type":"posted-content","created":{"date-parts":[[2018,2,19]],"date-time":"2018-02-19T01:10:13Z","timestamp":1519002613000},"source":"Crossref","is-referenced-by-count":7,"title":["FALCON-meta: a method to infer metagenomic composition of ancient DNA"],"prefix":"10.64898","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1176-552X","authenticated-orcid":false,"given":"Diogo","family":"Pratas","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9164-0016","authenticated-orcid":false,"given":"Armando J.","family":"Pinho","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5926-8042","authenticated-orcid":false,"given":"Raquel M.","family":"Silva","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9187-8094","authenticated-orcid":false,"given":"Jo\u00e3o M. O. S.","family":"Rodrigues","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8962-8985","authenticated-orcid":false,"given":"Morteza","family":"Hosseini","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9157-8761","authenticated-orcid":false,"given":"T\u00e2nia","family":"Caetano","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5147-0948","authenticated-orcid":false,"given":"Paulo J. S. G.","family":"Ferreira","sequence":"additional","affiliation":[]}],"member":"54368","reference":[{"key":"2019071819361973000_267179v3.1","doi-asserted-by":"publisher","DOI":"10.1126\/science.aad2545"},{"key":"2019071819361973000_267179v3.2","doi-asserted-by":"publisher","DOI":"10.1038\/nbt.2676"},{"key":"2019071819361973000_267179v3.3","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0060307"},{"key":"2019071819361973000_267179v3.4","doi-asserted-by":"publisher","DOI":"10.1038\/nbt.3329"},{"key":"2019071819361973000_267179v3.5","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/29.23.4793"},{"key":"2019071819361973000_267179v3.6","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0704665104"},{"key":"2019071819361973000_267179v3.7","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1318934111"},{"key":"2019071819361973000_267179v3.8","doi-asserted-by":"publisher","DOI":"10.1038\/nmeth.f.303"},{"key":"2019071819361973000_267179v3.9","doi-asserted-by":"crossref","unstructured":"Weyrich, L. S. et al. Neanderthal behaviour, diet, and disease inferred from ancient DNA in dental calculus. Nature (2017).","DOI":"10.1038\/nature21674"},{"key":"2019071819361973000_267179v3.10","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bts665"},{"key":"2019071819361973000_267179v3.11","doi-asserted-by":"publisher","DOI":"10.1101\/gr.171934.113"},{"key":"2019071819361973000_267179v3.12","doi-asserted-by":"crossref","unstructured":"Li, Y. et al. VIP: an integrated pipeline for metagenomics of virus identification and discovery. Scientific reports 6 (2016).","DOI":"10.1038\/srep23774"},{"key":"2019071819361973000_267179v3.13","doi-asserted-by":"publisher","DOI":"10.1186\/gb-2004-5-2-r12"},{"key":"2019071819361973000_267179v3.14","doi-asserted-by":"publisher","DOI":"10.1101\/gr.5969107"},{"key":"2019071819361973000_267179v3.15","doi-asserted-by":"publisher","DOI":"10.1186\/gb-2009-10-3-r25"},{"key":"2019071819361973000_267179v3.16","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btp324"},{"key":"2019071819361973000_267179v3.17","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2164-13-178"},{"key":"2019071819361973000_267179v3.18","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1038\/nprot.2014.063","article-title":"Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX","volume":"9","year":"2014","journal-title":"Nature protocols"},{"key":"2019071819361973000_267179v3.19","doi-asserted-by":"crossref","unstructured":"Herbig, A. et al. MALT: Fast alignment and analysis of metagenomic DNA sequence data applied to the Tyrolean Iceman. bioRxiv preprint (2017).","DOI":"10.1101\/050559"},{"key":"2019071819361973000_267179v3.20","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkr1124"},{"key":"2019071819361973000_267179v3.21","doi-asserted-by":"crossref","first-page":"10203","DOI":"10.1038\/srep10203","article-title":"An alignment-free method to find and visualise rearrangements between pairs of DNA sequences","volume":"5","year":"2015","journal-title":"Scientific Reports"},{"key":"2019071819361973000_267179v3.22","doi-asserted-by":"publisher","DOI":"10.1128\/JVI.76.21.10608-10616.2002"},{"key":"2019071819361973000_267179v3.23","doi-asserted-by":"publisher","DOI":"10.1038\/nature09710"},{"key":"2019071819361973000_267179v3.24","doi-asserted-by":"publisher","DOI":"10.1186\/gb-2011-12-1-r1"},{"key":"2019071819361973000_267179v3.25","doi-asserted-by":"publisher","DOI":"10.1016\/j.cub.2015.07.055"}],"container-title":[],"original-title":[],"link":[{"URL":"https:\/\/syndication.highwire.org\/content\/doi\/10.1101\/267179","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T21:54:46Z","timestamp":1768427686000},"score":1,"resource":{"primary":{"URL":"http:\/\/biorxiv.org\/lookup\/doi\/10.1101\/267179"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,18]]},"references-count":25,"URL":"https:\/\/doi.org\/10.1101\/267179","relation":{},"subject":[],"published":{"date-parts":[[2018,2,18]]},"subtype":"preprint"}}