{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T08:01:29Z","timestamp":1781078489238,"version":"3.54.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2018,10,15]],"date-time":"2018-10-15T00:00:00Z","timestamp":1539561600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001866","name":"National Research Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001866","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001665","name":"ANR","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001665","id-type":"DOI","asserted-by":"publisher"}]},{"name":"European Union program ERA-Net BiodivERsA","award":["ANR-16-CE02-0015-01"],"award-info":[{"award-number":["ANR-16-CE02-0015-01"]}]},{"DOI":"10.13039\/501100006488","name":"INRA","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100006488","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Labex NUMEV"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,5,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Approximate Bayesian computation (ABC) has grown into a standard methodology that manages Bayesian inference for models associated with intractable likelihood functions. Most ABC implementations require the preliminary selection of a vector of informative statistics summarizing raw data. Furthermore, in almost all existing implementations, the tolerance level that separates acceptance from rejection of simulated parameter values needs to be calibrated.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose to conduct likelihood-free Bayesian inferences about parameters with no prior selection of the relevant components of the summary statistics and bypassing the derivation of the associated tolerance level. The approach relies on the random forest (RF) methodology of Breiman (2001) applied in a (non-parametric) regression setting. We advocate the derivation of a new RF for each component of the parameter vector of interest. When compared with earlier ABC solutions, this method offers significant gains in terms of robustness to the choice of the summary statistics, does not depend on any type of tolerance level, and is a good trade-off in term of quality of point estimator precision and credible interval estimations for a given computing time. We illustrate the performance of our methodological proposal and compare it with earlier ABC methods on a Normal toy example and a population genetics example dealing with human population evolution.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>All methods designed here have been incorporated in the R package abcrf (version 1.7.1) available on CRAN.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty867","type":"journal-article","created":{"date-parts":[[2018,10,12]],"date-time":"2018-10-12T19:16:07Z","timestamp":1539371767000},"page":"1720-1728","source":"Crossref","is-referenced-by-count":178,"title":["ABC random forests for Bayesian parameter inference"],"prefix":"10.1093","volume":"35","author":[{"given":"Louis","family":"Raynal","sequence":"first","affiliation":[{"name":"IMAG, Univ Montpellier, CNRS, Montpellier, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jean-Michel","family":"Marin","sequence":"additional","affiliation":[{"name":"IMAG, Univ Montpellier, CNRS, Montpellier, France"},{"name":"IBC, Univ Montpellier, CNRS, Montpellier, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Pierre","family":"Pudlo","sequence":"additional","affiliation":[{"name":"Institut de Math\u00e9matiques de Marseille, Aix-Marseille Universit\u00e9, Marseille, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mathieu","family":"Ribatet","sequence":"additional","affiliation":[{"name":"IMAG, Univ Montpellier, CNRS, Montpellier, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Christian P","family":"Robert","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Dauphine, PSL Research University, Paris, France"},{"name":"Department of Statistics, University of Warwick, Coventry, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Arnaud","family":"Estoup","sequence":"additional","affiliation":[{"name":"IBC, Univ Montpellier, CNRS, Montpellier, France"},{"name":"CBGP, INRA, CIRAD, IRD, Montpellier SupAgro, Univ Montpellier, Montpellier, France"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2018,10,15]]},"reference":[{"key":"2023013107481444200_bty867-B1","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1093\/molbev\/msu411","article-title":"CodABC: a computational framework to coestimate recombination, substitution, and molecular adaptation rates by approximate Bayesian computation","volume":"32","author":"Arenas","year":"2015","journal-title":"Mol. Biol. Evol."},{"key":"2023013107481444200_bty867-B2","doi-asserted-by":"crossref","first-page":"e1006059","DOI":"10.1371\/journal.pgen.1006059","article-title":"The great migration and African-American genomic diversity","volume":"12","author":"Baharian","year":"2016","journal-title":"PLoS Genet."},{"key":"2023013107481444200_bty867-B3","doi-asserted-by":"crossref","first-page":"2025","DOI":"10.1093\/genetics\/162.4.2025","article-title":"Approximate Bayesian computation in population genetics","volume":"162","author":"Beaumont","year":"2002","journal-title":"Genetics"},{"key":"2023013107481444200_bty867-B4","first-page":"134","article-title":"Joint determination of topology, divergence time and immigration in population trees","volume-title":"Simulations, Genetics and Human Prehistory","author":"Beaumont","year":"2008"},{"key":"2023013107481444200_bty867-B5","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1093\/biomet\/asp052","article-title":"Adaptive approximate Bayesian computation","volume":"96","author":"Beaumont","year":"2009","journal-title":"Biometrika"},{"key":"2023013107481444200_bty867-B6","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1146\/annurev-ecolsys-102209-144621","article-title":"Approximate Bayesian computation in evolution and ecology","volume":"41","author":"Beaumont","year":"2010","journal-title":"Annu. Rev. Ecol. Evol. Syst."},{"key":"2023013107481444200_bty867-B8","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1214\/13-AIHP590","article-title":"New insights into approximate Bayesian computation","volume":"51","author":"Biau","year":"2015","journal-title":"Ann. Inst. H. Poincar\u00e9 Probab. Statist."},{"key":"2023013107481444200_bty867-B9","article-title":"Mixture density networks","author":"Bishop","year":"1994"},{"key":"2023013107481444200_bty867-B10","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/s11222-009-9116-0","article-title":"Non-linear regression models for approximate Bayesian computation","volume":"20","author":"Blum","year":"2010","journal-title":"Stat. Comput."},{"key":"2023013107481444200_bty867-B11","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1214\/12-STS406","article-title":"A comparative review of dimension reduction methods in approximate Bayesian computation","volume":"28","author":"Blum","year":"2013","journal-title":"Stat. Sci."},{"key":"2023013107481444200_bty867-B12","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"2023013107481444200_bty867-B13","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.ajhg.2014.11.010","article-title":"The genetic ancestry of African Americans, Latinos, and European Americans across the United States","volume":"96","author":"Bryc","year":"2015","journal-title":"Am. J. Hum. Genet."},{"key":"2023013107481444200_bty867-B14","doi-asserted-by":"crossref","first-page":"2501","DOI":"10.1093\/molbev\/msu187","article-title":"Detecting concerted demographic response across community assemblages using hierarchical approximate Bayesian computation","volume":"31","author":"Chan","year":"2014","journal-title":"Mol. Biol. Evol."},{"key":"2023013107481444200_bty867-B15","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1016\/j.tree.2010.04.001","article-title":"Approximate Bayesian computation (ABC) in practice","volume":"25","author":"Csill\u00e9ry","year":"2010","journal-title":"Trends Ecol. Evol."},{"key":"2023013107481444200_bty867-B18","doi-asserted-by":"crossref","first-page":"2713","DOI":"10.1093\/bioinformatics\/btn514","article-title":"Inferring population history with DIY ABC: a user-friendly approach to approximate Bayesian computation","volume":"24","author":"Cornuet","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013107481444200_bty867-B19","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1093\/bioinformatics\/btt763","article-title":"DIYABC v2.0: a software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data","volume":"30","author":"Cornuet","year":"2014","journal-title":"Bioinformatics"},{"key":"2023013107481444200_bty867-B20","doi-asserted-by":"crossref","first-page":"1009","DOI":"10.1007\/s11222-011-9271-y","article-title":"An adaptive sequential Monte Carlo method for approximate Bayesian computation","volume":"22","author":"Del Moral","year":"2012","journal-title":"Stat. Comput."},{"key":"2023013107481444200_bty867-B21","doi-asserted-by":"crossref","first-page":"e1003905","DOI":"10.1371\/journal.pgen.1003905","article-title":"Robust demographic inference from genomic and SNP data","volume":"9","author":"Excoffier","year":"2013","journal-title":"PLoS Genet."},{"key":"2023013107481444200_bty867-B22","doi-asserted-by":"crossref","first-page":"419","DOI":"10.1111\/j.1467-9868.2011.01010.x","article-title":"Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation","volume":"74","author":"Fearnhead","year":"2012","journal-title":"J. R. Stat. Soc. Series B Stat. Methodol."},{"key":"2023013107481444200_bty867-B23","article-title":"Asymptotic properties of approximate Bayesian computation","author":"Frazier","year":"2017","journal-title":"arXiv"},{"key":"2023013107481444200_bty867-B24","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1002\/jae.3950080507","article-title":"Indirect inference","volume":"8","author":"Gourieroux","year":"1993","journal-title":"J. Appl. Econom."},{"key":"2023013107481444200_bty867-B25","doi-asserted-by":"crossref","first-page":"e1000695","DOI":"10.1371\/journal.pgen.1000695","article-title":"Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data","volume":"5","author":"Gutenkunst","year":"2009","journal-title":"PLoS Genet."},{"key":"2023013107481444200_bty867-B26","doi-asserted-by":"crossref","first-page":"17758","DOI":"10.1073\/pnas.1212380109","article-title":"The great human expansion","volume":"109","author":"Henn","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013107481444200_bty867-B27","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1093\/bioinformatics\/18.2.337","article-title":"Generating samples under a Wright-Fisher neutral model of genetic variation","volume":"18","author":"Hudson","year":"2002","journal-title":"Bioinformatics"},{"key":"2023013107481444200_bty867-B28","doi-asserted-by":"crossref","first-page":"Article 26","DOI":"10.2202\/1544-6115.1389","article-title":"Approximately sufficient statistics and Bayesian computation","volume":"7","author":"Joyce","year":"2008","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023013107481444200_bty867-B29","doi-asserted-by":"crossref","first-page":"45","DOI":"10.2202\/1544-6115.1586","article-title":"Choice of summary statistics weights in approximate Bayesian computation","volume":"10","author":"Jung","year":"2011","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023013107481444200_bty867-B30","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-67471-1_8","article-title":"A scheme for adaptive selection of population sizes in approximate Bayesian computation\u2014sequential Monte Carlo","author":"Klinger","year":"2017"},{"key":"2023013107481444200_bty867-B31","doi-asserted-by":"crossref","first-page":"3591","DOI":"10.1093\/bioinformatics\/bty361","article-title":"pyABC: distributed, likelihood-free inference","volume":"34","author":"Klinger","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013107481444200_bty867-B32","doi-asserted-by":"crossref","first-page":"624","DOI":"10.1007\/978-3-540-74958-5_61","article-title":"Ensembles of multi-objective decision trees","volume-title":"Machine Learning: ECML 2007. Lecture Notes in Computer Science","author":"Kocev","year":"2007"},{"key":"2023013107481444200_bty867-B33","article-title":"On the asymptotic efficiency of ABC estimators","author":"Li","year":"2015","journal-title":"arXiv"},{"key":"2023013107481444200_bty867-B34","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1090\/conm\/080\/999014","article-title":"Composite likelihood methods","volume":"80","author":"Lindsay","year":"1988","journal-title":"Contemp. Math."},{"key":"2023013107481444200_bty867-B35","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1016\/j.ajhg.2013.06.020","article-title":"RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference","volume":"93","author":"Maples","year":"2013","journal-title":"Am. J. Hum. Genet."},{"key":"2023013107481444200_bty867-B36","doi-asserted-by":"crossref","first-page":"1167","DOI":"10.1007\/s11222-011-9288-2","article-title":"Approximate Bayesian computational methods","volume":"22","author":"Marin","year":"2012","journal-title":"Stat. Comput."},{"key":"2023013107481444200_bty867-B37","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4614-8687-9","volume-title":"Bayesian Essentials with R","author":"Marin","year":"2014"},{"key":"2023013107481444200_bty867-B38","doi-asserted-by":"crossref","DOI":"10.1201\/9781315117195-6","article-title":"Likelihood-free model choice","volume-title":"Handbook of Approximate Bayesian Computation","author":"Marin","year":"2018"},{"key":"2023013107481444200_bty867-B39","first-page":"983","article-title":"Quantile regression forests","volume":"7","author":"Meinshausen","year":"2006","journal-title":"J. Mach. Learn. Res."},{"key":"2023013107481444200_bty867-B40","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1576","article-title":"On optimal selection of summary statistics for approximate Bayesian computation","volume":"9","author":"Nunes","year":"2010","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023013107481444200_bty867-B16","doi-asserted-by":"crossref","first-page":"189","DOI":"10.32614\/RJ-2015-030","article-title":"abctools: An R Package for Tuning Approximate Bayesian Computation Analyses","volume":"7","author":"Nunes","year":"2015","journal-title":"The R Journal"},{"key":"2023013107481444200_bty867-B41","first-page":"1028","article-title":"Fast \u03b5-free inference of simulation models with Bayesian conditional density estimation","volume-title":"Advances in Neural Information Processing Systems 29","author":"Papamakarios","year":"2016"},{"key":"2023013107481444200_bty867-B42","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1214\/16-BA1002","article-title":"Adapting the ABC distance function","volume":"12","author":"Prangle","year":"2017","journal-title":"Bayesian Anal."},{"key":"2023013107481444200_bty867-B43","doi-asserted-by":"crossref","first-page":"1791","DOI":"10.1093\/oxfordjournals.molbev.a026091","article-title":"Population growth of human Y chromosomes: a study of Y chromosome microsatellites","volume":"16","author":"Pritchard","year":"1999","journal-title":"Mol. Biol. Evol."},{"key":"2023013107481444200_bty867-B44","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1093\/bioinformatics\/btv684","article-title":"Reliable ABC model choice via random forests","volume":"32","author":"Pudlo","year":"2016","journal-title":"Bioinformatics"},{"key":"2023013107481444200_bty867-B45","doi-asserted-by":"crossref","first-page":"e1005416","DOI":"10.1371\/journal.pcbi.1005416","article-title":"Inferring epidemiological parameters from phylogenies using regression-ABC: a comparative study","volume":"13","author":"Saulnier","year":"2017","journal-title":"PLoS Comput. Biol."},{"key":"2023013107481444200_bty867-B46","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1038\/ng.3015","article-title":"Inferring human population size and separation history from multiple genome sequences","volume":"46","author":"Schiffels","year":"2014","journal-title":"Nat. Genet."},{"key":"2023013107481444200_bty867-B47","doi-asserted-by":"crossref","first-page":"1760","DOI":"10.1073\/pnas.0607208104","article-title":"Sequential Monte Carlo without likelihoods","volume":"104","author":"Sisson","year":"2007","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013107481444200_bty867-B48","first-page":"1760","article-title":"Correction: sequential Monte Carlo without likelihoods","volume":"106","author":"Sisson","year":"2009","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013107481444200_bty867-B49","doi-asserted-by":"crossref","DOI":"10.1201\/9781315117195","volume-title":"Handbook of Approximate Bayesian Computation","author":"Sisson","year":"2018"},{"key":"2023013107481444200_bty867-B50","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1093\/genetics\/145.2.505","article-title":"Inferring coalescence times from DNA sequence data","volume":"145","author":"Tavar\u00e9","year":"1997","journal-title":"Genetics"},{"key":"2023013107481444200_bty867-B51","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1, 092 human genomes","volume":"491","year":"2012","journal-title":"Nature"},{"key":"2023013107481444200_bty867-B52","doi-asserted-by":"crossref","first-page":"3653","DOI":"10.1093\/molbev\/mss175","article-title":"Inferring the history of population size change from genome-wide SNP Data","volume":"29","author":"Theunert","year":"2012","journal-title":"Mol. Biol. Evol."},{"key":"2023013107481444200_bty867-B53","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1098\/rsif.2008.0172","article-title":"Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems","volume":"6","author":"Toni","year":"2009","journal-title":"J. Royal Soc. Interface"},{"key":"2023013107481444200_bty867-B54","first-page":"5","article-title":"An overview of composite likelihood methods","volume":"21","author":"Varin","year":"2011","journal-title":"Stat. Sin."},{"key":"2023013107481444200_bty867-B55","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v077.i01","article-title":"ranger: a fast implementation of random forests for high dimensional data in C++ and R","volume":"77","author":"Wright","year":"2017","journal-title":"J. Stat. Softw."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/10\/1720\/48969808\/bioinformatics_35_10_1720.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/10\/1720\/48969808\/bioinformatics_35_10_1720.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T10:48:33Z","timestamp":1675162113000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/10\/1720\/5132692"}},"subtitle":[],"editor":[{"given":"Oliver","family":"Stegle","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2018,10,15]]},"references-count":53,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2019,5,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty867","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,5,15]]},"published":{"date-parts":[[2018,10,15]]}}}