{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,18]],"date-time":"2026-05-18T10:06:10Z","timestamp":1779098770606,"version":"3.51.4"},"reference-count":80,"publisher":"Association for Computing Machinery (ACM)","issue":"13s","license":[{"start":{"date-parts":[[2023,7,13]],"date-time":"2023-07-13T00:00:00Z","timestamp":1689206400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2023,12,31]]},"abstract":"<jats:p>\n            In traditional decision-theoretic planning, information gathering is a means to a goal. The agent receives information about its environment (state or observation) and uses it as a way to optimize a state-based reward function. Recent works, however, have focused on application domains in which information gathering is not only the mean but the goal itself. The agent must optimize its knowledge of the environment. However, traditional Markov-based decision-theoretic models cannot account for rewarding the agent based on its knowledge, which leads to the development of many approaches to overcome this limitation. We survey recent approaches for using decision-theoretic models in information-gathering scenarios, highlighting common practices and existing generic models, and show that existing methods can be categorized into three classes:\n            <jats:italic>reactive sensing<\/jats:italic>\n            ,\n            <jats:italic>single-agent active sensing<\/jats:italic>\n            , and\n            <jats:italic>multi-agent active sensing<\/jats:italic>\n            . Finally, we highlight potential research gaps and suggest directions for future research.\n          <\/jats:p>","DOI":"10.1145\/3583068","type":"journal-article","created":{"date-parts":[[2023,2,10]],"date-time":"2023-02-10T11:59:35Z","timestamp":1676030375000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["From Reactive to Active Sensing: A Survey on Information Gathering in Decision-theoretic Planning"],"prefix":"10.1145","volume":"55","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5844-9383","authenticated-orcid":false,"given":"Tiago","family":"Veiga","sequence":"first","affiliation":[{"name":"Department of Computer Science, Norwegian University of Science and Technology, Norway"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2385-9470","authenticated-orcid":false,"given":"Jennifer","family":"Renoux","sequence":"additional","affiliation":[{"name":"Center for Applied Autonomous Sensor Systems, \u00d6rebro University, Sweden"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,7,13]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2020.2996587"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-68711-7_14"},{"key":"e_1_3_2_4_2","first-page":"64","volume-title":"Proceedings of the 24th Annual Conference on Neural Information Processing Systems (NeurIPS\u201910)","author":"Araya-L\u00f3pez Mauricio","year":"2010","unstructured":"Mauricio Araya-L\u00f3pez, Olivier Buffet, Vincent Thomas, and Fran\u00e7ois Charpillet. 2010. A POMDP extension with belief-dependent rewards. In Proceedings of the 24th Annual Conference on Neural Information Processing Systems (NeurIPS\u201910), John D. Lafferty, Christopher K. I. Williams, John Shawe-Taylor, Richard S. Zemel, and Aron Culotta (Eds.). Curran Associates, Inc., 64\u201372."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2007.363691"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8206230"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-019-09836-5"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2011.2160055"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/5.5968"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-017-9615-3"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.4230\/DagRep.7.6.1"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1287\/moor.27.4.819.297"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1177\/0278364918755924"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460617"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.3166\/jancl.21.9-34"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.575"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.3233\/978-1-61499-098-7-955"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10626-009-0071-x"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1609\/icaps.v27i1.13832"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8794243"},{"key":"e_1_3_2_21_2","first-page":"1221","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201912)","author":"Eck Adam","year":"2012","unstructured":"Adam Eck and Leen-Kiat Soh. 2012. Evaluating POMDP rewards for active perception. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201912), Wiebe van der Hoek, Lin Padgham, Vincent Conitzer, and Michael Winikoff (Eds.). IFAAMAS, 1221\u20131222."},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-011-9189-y"},{"key":"e_1_3_2_23_2","first-page":"367","volume-title":"Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201915)","author":"Eck Adam","year":"2015","unstructured":"Adam Eck and Leen-Kiat Soh. 2015. To ask, sense, or share: Ad Hoc information gathering. In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201915), Gerhard Weiss, Pinar Yolum, Rafael H. Bordini, and Edith Elkind (Eds.). ACM, 367\u2013376."},{"key":"e_1_3_2_24_2","first-page":"6933","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS\u201918)","author":"Fehr Mathieu","year":"2018","unstructured":"Mathieu Fehr, Olivier Buffet, Vincent Thomas, and Jilles Steeve Dibangoye. 2018. rho-POMDPs have Lipschitz-continuous epsilon-optimal value functions. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS\u201918), Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicol\u00f2 Cesa-Bianchi, and Roman Garnett (Eds.). 6933\u20136943."},{"key":"e_1_3_2_25_2","first-page":"3177","volume-title":"Proceedings of the 37th International Conference on Machine Learning (ICML\u201920)","volume":"119","author":"Fischer Johannes","year":"2020","unstructured":"Johannes Fischer and \u00d6mer Sahin Tas. 2020. Information particle filter tree: An online algorithm for POMDPs with belief-based rewards on continuous domains. In Proceedings of the 37th International Conference on Machine Learning (ICML\u201920), Vol. 119. PMLR, 3177\u20133187."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/CDC40024.2019.9029762"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/329"},{"key":"e_1_3_2_28_2","first-page":"21","volume-title":"Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201910)","author":"Glinton Robin","year":"2010","unstructured":"Robin Glinton, Paul Scerri, and Katia P. Sycara. 2010. Exploiting scale invariant dynamics for efficient information propagation in large teams. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS\u201910), Wiebe van der Hoek, Gal A. Kaminka, Yves Lesp\u00e9rance, Michael Luck, and Sandip Sen (Eds.). IFAAMAS, 21\u201330. https:\/\/dl.acm.org\/citation.cfm?id=1838210."},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1579"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2941706"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICTAI.2016.0111"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/860575.860766"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACC.2010.5531634"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/CDC.2011.6160670"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2008.4543611"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2006.11.008"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2007.893747"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(98)00023-X"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2015.7139357"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA40945.2020.9197201"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.5555\/2815660"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1609\/icaps.v28i1.13928"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989104"},{"key":"e_1_3_2_44_2","first-page":"13651","volume-title":"Advances in Neural Information Processing Systems","author":"Lauri Mikko","year":"2020","unstructured":"Mikko Lauri and Frans Oliehoek. 2020. Multi-agent active perception with prediction rewards. In Advances in Neural Information Processing Systems, Vol. 33. 13651\u201313661."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.5555\/3306127.3331815"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-020-09467-6"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10472-013-9361-y"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2015.7139867"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2016.06.008"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10472-016-9527-5"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1.12087"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460215"},{"key":"e_1_3_2_53_2","first-page":"316","volume-title":"Proceedings of the 5th International Conference on Numerical Methods and Applications","author":"Mihaylova L.","year":"2002","unstructured":"L. Mihaylova, T. Lefebvre, H. Bruyninckx, K. Gadeyne, and J. De Schutter. 2002. Active sensing for robotics - a survey. In Proceedings of the 5th International Conference on Numerical Methods and Applications. 316\u2013324."},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/3410992.3411001"},{"key":"e_1_3_2_55_2","first-page":"7227","volume-title":"Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020), the 32nd Innovative Applications of Artificial Intelligence Conference (IAAI \u201920), the 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920)","author":"Nguyen Hoa Van","year":"2020","unstructured":"Hoa Van Nguyen, Hamid Rezatofighi, Ba-Ngu Vo, and Damith Chinthana Ranasinghe. 2020. Multi-objective multi-agent planning for jointly discovering and tracking mobile objects. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020), the 32nd Innovative Applications of Artificial Intelligence Conference (IAAI \u201920), the 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201920). AAAI Press, 7227\u20137235."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1109\/MRS.2019.8901060"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.5555\/1104420"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1024"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2015.7354123"},{"key":"e_1_3_2_60_2","volume-title":"Workshop on Multiagent Sequential Decision Making Under Uncertainty","author":"Renoux Jennifer","year":"2014","unstructured":"Jennifer Renoux, Abdel-Illah Mouaddib, and Simon Le Gloannec. 2014. A distributed decision-theoretic model for multiagent active information gathering. In Workshop on Multiagent Sequential Decision Making Under Uncertainty. Paris, France."},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/RO-MAN47096.2020.9223597"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/ROMAN.2011.6005272"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.2567"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.5555\/3398761.3398902"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-017-9666-5"},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICIF.2005.1591903"},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-012-9200-2"},{"key":"e_1_3_2_68_2","first-page":"330","volume-title":"Proceedings of the 18th International Conference on Automated Planning and Scheduling (ICAPS\u201908)","author":"Shani Guy","year":"2008","unstructured":"Guy Shani, Pascal Poupart, Ronen I. Brafman, and Solomon Eyal Shimony. 2008. Efficient ADD operations for point-based algorithms. In Proceedings of the 18th International Conference on Automated Planning and Scheduling (ICAPS\u201908), Jussi Rintanen, Bernhard Nebel, J. Christopher Beck, and Eric A. Hansen (Eds.). AAAI, 330\u2013337."},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8794389"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature16961"},{"key":"e_1_3_2_71_2","volume-title":"AAAI\u201908 Workshop on Advancements in POMDP Solvers","author":"Spaan Matthijs T. J.","year":"2008","unstructured":"Matthijs T. J. Spaan. 2008. Cooperative active perception using POMDPs. In AAAI\u201908 Workshop on Advancements in POMDP Solvers."},{"key":"e_1_3_2_72_2","volume-title":"Reinforcement Learning: State of the Art","author":"Spaan Matthijs T. J.","year":"2012","unstructured":"Matthijs T. J. Spaan. 2012. Partially observable Markov decision processes. In Reinforcement Learning: State of the Art, Marco Wiering and Martijn van Otterlo (Eds.). Springer Verlag."},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1609\/icaps.v19i1.13381"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2010.5648856"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-014-9279-8"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.1659"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.3233\/FAIA200368"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2016.7487276"},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2016.7759426"},{"key":"e_1_3_2_80_2","first-page":"773","volume-title":"Proceedings of the 29th International Conference on Automated Planning and Scheduling (ICAPS\u201918)","author":"Veiga Tiago S.","year":"2019","unstructured":"Tiago S. Veiga, Miguel Silva, Rodrigo Ventura, and Pedro U. Lima. 2019. A hierarchical approach to active semantic mapping using probabilistic logic and information reward POMDPs. In Proceedings of the 29th International Conference on Automated Planning and Scheduling (ICAPS\u201918), J. Benton, Nir Lipovetzky, Eva Onaindia, David E. Smith, and Siddharth Srivastava (Eds.). AAAI Press, 773\u2013781."},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1109\/Allerton.2013.6736513"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583068","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3583068","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:47Z","timestamp":1750182527000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3583068"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,13]]},"references-count":80,"journal-issue":{"issue":"13s","published-print":{"date-parts":[[2023,12,31]]}},"alternative-id":["10.1145\/3583068"],"URL":"https:\/\/doi.org\/10.1145\/3583068","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,13]]},"assertion":[{"value":"2022-04-21","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-30","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}