{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T13:42:15Z","timestamp":1774359735372,"version":"3.50.1"},"reference-count":36,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T00:00:00Z","timestamp":1680220800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Appl. Math. Stat."],"abstract":"<jats:p>In this paper, we tackle innovation diffusion from the perspective of an institution which aims to encourage the adoption of a new product (i.e., an innovation) with mostly social rather than individual benefits. Designing such innovation adoption policies is a very challenging task because of the difficulty to quantify and predict its effect on the behaviors of non-adopters and the exponential size of the space of possible policies. To solve these issues, we propose an approach that uses agent-based modeling to simulate in a credible way the behaviors of possible adopters and (deep) reinforcement learning to efficiently explore the policy search space. An application of our approach is presented for the question of the use of digital technologies in agriculture. Empirical results on this case study validate our scheme and show the potential of our approach to learn effective innovation diffusion policies.<\/jats:p>","DOI":"10.3389\/fams.2023.1000785","type":"journal-article","created":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T08:36:08Z","timestamp":1680251768000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Toward AI-designed innovation diffusion policies using agent-based simulations and reinforcement learning: The case of digital tool adoption in agriculture"],"prefix":"10.3389","volume":"9","author":[{"given":"Meritxell","family":"Vinyals","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Regis","family":"Sabbadin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"St\u00e9phane","family":"Couture","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lo\u00efc","family":"Sadou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rallou","family":"Thomopoulos","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kevin","family":"Chapuis","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Baptiste","family":"Lesquoy","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Patrick","family":"Taillandier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2023,3,31]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1007\/s10100-011-0210-y","article-title":"Agent-based simulation of innovation diffusion: a review","volume":"20","author":"Kiesling","year":"2012","journal-title":"Central Eur J Operat Res"},{"key":"B2","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1287\/mnsc.15.5.215","article-title":"A new product growth for model consumer durables","volume":"15","author":"Bass","year":"1969","journal-title":"Manag Sci"},{"key":"B3","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1016\/0749-5978(91)90020-T","article-title":"The theory of planned behavior","volume":"50","author":"Ajzen","year":"1991","journal-title":"Organ Behav Hum Decis Process"},{"key":"B4","doi-asserted-by":"publisher","first-page":"238","DOI":"10.5751\/ES-12440-260238","article-title":"Governance in social-ecological agent-based models: a review","volume":"26","author":"Bourceret","year":"2021","journal-title":"Ecol Soc"},{"key":"B5","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2004.13332","article-title":"The AI economist: improving equality and productivity with AI-driven tax policies","author":"Zheng","year":"2020","journal-title":"arXiv preprint"},{"key":"B6","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.3900237","article-title":"Building a foundation for data-driven, interpretable, and robust policy design using the ai economist","author":"Trott","year":"2021","journal-title":"arXiv preprint"},{"key":"B7","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2106.06060","article-title":"Achieving diverse objectives with AI-driven prices in deep reinforcement learning multi-agent markets","author":"Danassis","year":"2021","journal-title":"arXiv preprint"},{"key":"B8","unstructured":"Policy learning with constraints in model-free reinforcement learning: a survey450815\n            LiuY\n            HalevA\n            LiuX\n          \n            ZhouZ\n          Montreal, QCProceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021 Virtual Event\/Montreal, Canada, 19-27 August 20212021"},{"key":"B9","doi-asserted-by":"publisher","first-page":"1041","DOI":"10.1086\/430220","article-title":"An individual-based model of innovation diffusion mixing social value and individual benefit","volume":"110","author":"Deffuant","year":"2005","journal-title":"Am J Sociol"},{"key":"B10","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1142\/S0219525900000078","article-title":"Mixing beliefs among interacting agents","volume":"3","author":"Deffuant","year":"2000","journal-title":"Adv Complex Syst"},{"key":"B11","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1007\/978-3-030-92843-8_32","article-title":"Better representing the diffusion of innovation through the theory of planned behavior and formal argumentation","volume-title":"Advances in Social Simulation: Proceedings of the 16th Social Simulation Conference","author":"Sadau","year":"2022"},{"key":"B12","volume-title":"Diffusion of Innovations","author":"Rogers","year":"1962"},{"key":"B13","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1007\/s10462-017-9577-z","article-title":"Empirically grounded agent-based models of innovation diffusion: A critical review","volume":"52","author":"Zhang","year":"2019","journal-title":"Artif Intell Rev"},{"key":"B14","doi-asserted-by":"publisher","first-page":"107338","DOI":"10.1016\/j.ecolecon.2021.107338","article-title":"Adapting the governance of social-ecological systems to behavioural dynamics: an agent-based model for water quality management using the theory of planned behaviour","volume":"194","author":"Bourceret","year":"2022","journal-title":"Ecol Econ"},{"key":"B15","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1016\/S0743-0167(99)00043-1","article-title":"Using social-psychology models to understand farmers' conservation behaviour","volume":"16","author":"Beedell","year":"2000","journal-title":"J Rural Stud"},{"key":"B16","first-page":"1934","article-title":"Reinforcement learning with parameterized actions","volume-title":"Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016","author":"Masson","year":"2016"},{"key":"B17","article-title":"Deep reinforcement learning in parameterized action space","volume-title":"4th International Conference on Learning Representations, ICLR 2016. San Juan, Puerto Rico, May 2-4, 2016 Conference Track Proceedings","author":"Hausknecht","year":"2016"},{"key":"B18","first-page":"1838","article-title":"Deep reinforcement learning with a combinatorial action space for predicting popular reddit threads","volume-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016 Austin, Texas, USA, November 1-4, 2016","author":"He","year":"2016"},{"key":"B19","unstructured":"Reinforcement learning with combinatorial actions: an application to vehicle routing\n            DelarueA\n            AndersonR\n            TjandraatmadjaC\n          \n            LarochelleH\n            RanzatoM\n            HadsellR\n            BalcanM\n            LinH\n          Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 NeurIPS 2020 December 6-12, 2020.2020"},{"key":"B20","doi-asserted-by":"crossref","DOI":"10.32473\/flairs.v35i.130584","article-title":"A closer look at invalid action masking in policy gradient algorithms","volume-title":"Proceedings of the Thirty-Fifth International Florida Artificial Intelligence Research Society Conference, FLAIRS 2022 Hutchinson Island, Jensen Beach, Florida, USA, May 15-18, 2022","author":"Huang","year":"2022"},{"key":"B21","unstructured":"Improving stochastic policy gradients in continuous control with deep reinforcement learning using the beta distribution83443\n            ChouP-W\n            MaturanaD\n            SchererSA\n          \n            PrecupD\n            TheYW\n          Sydney, NSWPMLRProceedings of the 34th International Conference on Machine Learning, Vol.702017"},{"key":"B22","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1801.08757","article-title":"Safe exploration in continuous action spaces","author":"Dalal","year":"2018","journal-title":"CoRR"},{"key":"B23","first-page":"610","article-title":"Resource constrained deep reinforcement learning","volume-title":"Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, ICAPS 2018","author":"Bhatia","year":"2019"},{"key":"B24","unstructured":"A lyapunov-based approach to safe reinforcement learning810312\n            ChowY\n            NachumO\n            Duenez-GuzmanEA\n            GhavamzadehM\n          \n            BengioS\n            WallachHM\n            LarochelleH\n            Grauman\n            Cesa-BianchiN\n            GarnettR\n          Montreal, QCAdvances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018.2018"},{"key":"B25","unstructured":"IPO: Interior-point policy optimization under constraints49407\n            LiuY\n            DingJ\n            LiuX\n          Proc AAAI Conf Artif Intell342020"},{"key":"B26","article-title":"Projection-based constrained policy optimization","volume-title":"8th International Conference on Learning Representations, ICLR 2020 Addis Ababa, Ethiopia, April 26-30, 2020","author":"Yang","year":"2020"},{"key":"B27","doi-asserted-by":"publisher","first-page":"7","DOI":"10.18564\/jasss.4259","article-title":"The ODD protocol for describing agent-based and other simulation models: a second update to improve clarity, replication, and structural realism","volume":"23","author":"Grimm","year":"2020","journal-title":"J Artif Soc Soc Simulat"},{"key":"B28","doi-asserted-by":"publisher","first-page":"918","DOI":"10.2307\/2234987","article-title":"Technology diffusion and public policy","volume":"104","author":"Stoneman","year":"1994","journal-title":"Econ J"},{"key":"B29","first-page":"10","article-title":"Un compteur guillemotleft intelligent guillemotright pour mesurer les usages de l'eau: l'entree en scene d'une nouvelle connaissance","author":"Collard","year":"2019","journal-title":"Developpement durable et territoires, Economie, geographie, politique, droit, sociologie"},{"key":"B30","doi-asserted-by":"publisher","first-page":"65","DOI":"10.5802\/roia.10","article-title":"Simuler la diffusion d'une innovation agricole \u00e0 l'aide de mod\u00e8les \u00e0 base d'agents et de l'argumentation formelle","volume":"2","author":"Sadou","year":"2021","journal-title":"Revue Ouverte d'Intelligence Artificielle"},{"key":"B31","unstructured":"Time limits in reinforcement learning404251\n            PardoF\n            TavakoliA\n            LevdikV\n            KormushevP\n          StockholmPMLRProceedings of the 35th International Conference on Machine Learning, Vol. 802018"},{"key":"B32","doi-asserted-by":"publisher","first-page":"108529","DOI":"10.1016\/j.ress.2022.108529","article-title":"A prescriptive Dirichlet power allocation policy with deep reinforcement learning","volume":"224","author":"Tian","year":"2022","journal-title":"Reliabil Eng Syst Safety"},{"key":"B33","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1007\/s10707-018-00339-6","article-title":"Building, composing and experimenting complex spatial models with the GAMA platform","volume":"23","author":"Taillandier","year":"2019","journal-title":"GeoInformatica"},{"key":"B34","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18564\/jasss.4531","article-title":"Introducing the argumentation framework within agent-based models to better simulate agents' cognition in opinion dynamics: application to vegetarian diet diffusion","volume":"24","author":"Tailandier","year":"2021","journal-title":"J. Artif. Soc. Soc. Simul"},{"key":"B35","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1707.06347","article-title":"Proximal policy optimization algorithms","author":"Schulman","year":"2017","journal-title":"arXiv preprint"},{"key":"B36","unstructured":"What matters for on-policy deep actor-critic methods? A large-scale study\n            AndrychwiczM\n            RaichukA\n            StanczykP\n            OrsinM\n            GirginS\n            MarinierR\n          OpenReview.net9th International Conference on Learning Representation2021"}],"container-title":["Frontiers in Applied Mathematics and Statistics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fams.2023.1000785\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T08:36:25Z","timestamp":1680251785000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fams.2023.1000785\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,31]]},"references-count":36,"alternative-id":["10.3389\/fams.2023.1000785"],"URL":"https:\/\/doi.org\/10.3389\/fams.2023.1000785","relation":{},"ISSN":["2297-4687"],"issn-type":[{"value":"2297-4687","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,31]]},"article-number":"1000785"}}