{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:27:22Z","timestamp":1750220842478,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":20,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,10,13]],"date-time":"2019-10-13T00:00:00Z","timestamp":1570924800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100012659","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61603368"],"award-info":[{"award-number":["61603368"]}],"id":[{"id":"10.13039\/501100012659","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,10,13]]},"DOI":"10.1145\/3356464.3357699","type":"proceedings-article","created":{"date-parts":[[2019,10,31]],"date-time":"2019-10-31T12:20:52Z","timestamp":1572524452000},"page":"1-8","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Stochastic multi-agent planning with partial state models"],"prefix":"10.1145","author":[{"given":"Feng","family":"Wu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, Hefei, Anhui, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shlomo","family":"Zilberstein","sequence":"additional","affiliation":[{"name":"University of Massachusetts Amherst"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicholas R.","family":"Jennings","sequence":"additional","affiliation":[{"name":"Imperial College London, London, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,10,13]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10458-009-9103-z"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1609\/icaps.v19i1.13355"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1287\/moor.27.4.819.297"},{"volume-title":"On-line algorithms in machine learning","author":"Blum Avrim","key":"e_1_3_2_1_4_1","unstructured":"Avrim Blum . 1998. On-line algorithms in machine learning . Springer . Avrim Blum. 1998. On-line algorithms in machine learning. Springer."},{"key":"e_1_3_2_1_5_1","volume-title":"Proc. of the 15th Int'l Conf. on Automated Planning and Scheduling. 40--49","author":"Bresina John L","year":"2005","unstructured":"John L Bresina , Ari K J\u00f3nsson , Paul H Morris , and Kanna Rajan . 2005 . Activity Planning for the Mars Exploration Rovers . In Proc. of the 15th Int'l Conf. on Automated Planning and Scheduling. 40--49 . John L Bresina, Ari K J\u00f3nsson, Paul H Morris, and Kanna Rajan. 2005. Activity Planning for the Mars Exploration Rovers. In Proc. of the 15th Int'l Conf. on Automated Planning and Scheduling. 40--49."},{"key":"e_1_3_2_1_6_1","volume-title":"Proc. of the 23d Int'l Joint Conf. on Artificial Intelligence.","author":"Dibangoye Jilles Steeve","year":"2013","unstructured":"Jilles Steeve Dibangoye , Christopher Amato , Olivier Buffet , and Fran\u00e7ois Charpillet . 2013 . Optimally Solving Dec-POMDPs as Continuous-State MDPs . In Proc. of the 23d Int'l Joint Conf. on Artificial Intelligence. Jilles Steeve Dibangoye, Christopher Amato, Olivier Buffet, and Fran\u00e7ois Charpillet. 2013. Optimally Solving Dec-POMDPs as Continuous-State MDPs. In Proc. of the 23d Int'l Joint Conf. on Artificial Intelligence."},{"key":"e_1_3_2_1_7_1","volume-title":"On human-agent collectives. Commun. ACM","author":"Jennings Nicholas R","year":"2014","unstructured":"Nicholas R Jennings , Luc Moreau , David Nicholson , Sarvapali D Ramchurn , Stephen J Roberts , T Rodden , and Alex Rogers . 2014. On human-agent collectives. Commun. ACM ( 2014 ). Nicholas R Jennings, Luc Moreau, David Nicholson, Sarvapali D Ramchurn, Stephen J Roberts, T Rodden, and Alex Rogers. 2014. On human-agent collectives. Commun. ACM (2014)."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1597735.1597738"},{"key":"e_1_3_2_1_9_1","volume-title":"Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 5--12","author":"Bradley Knox W","year":"2010","unstructured":"W Bradley Knox and Peter Stone . 2010 . Combining manual feedback with subsequent MDP reward signals for reinforcement learning . In Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 5--12 . W Bradley Knox and Peter Stone. 2010. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. In Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 5--12."},{"key":"e_1_3_2_1_10_1","volume-title":"Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 1315--1322","author":"Kumar Akshat","year":"2010","unstructured":"Akshat Kumar and Shlomo Zilberstein . 2010 . Point-Based Backup for Decentralized POMDPs: Complexity and New Algorithms . In Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 1315--1322 . Akshat Kumar and Shlomo Zilberstein. 2010. Point-Based Backup for Decentralized POMDPs: Complexity and New Algorithms. In Proc. of the 9th Int'l Conf. on Autonomous Agents and Multiagent Systems. 1315--1322."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/SSRR.2011.6106794"},{"key":"e_1_3_2_1_12_1","volume-title":"Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence. 705--711","author":"Nair Ranjit","year":"2003","unstructured":"Ranjit Nair , Milind Tambe , Makoto Yokoo , David V. Pynadath , and Stacy Marsella . 2003 . Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings . In Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence. 705--711 . Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. Pynadath, and Stacy Marsella. 2003. Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings. In Proc. of the 18th Int'l Joint Conf. on Artificial Intelligence. 705--711."},{"key":"e_1_3_2_1_13_1","volume-title":"Proc. of the 17th Int'l Conf. on Machine Learning. 663--670","author":"Ng Andrew Y","year":"2000","unstructured":"Andrew Y Ng and Stuart J Russell . 2000 . Algorithms for inverse reinforcement learning . In Proc. of the 17th Int'l Conf. on Machine Learning. 663--670 . Andrew Y Ng and Stuart J Russell. 2000. Algorithms for inverse reinforcement learning. In Proc. of the 17th Int'l Conf. on Machine Learning. 663--670."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/2512538.2512550"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/1402383.1402457"},{"key":"e_1_3_2_1_16_1","volume-title":"Proc. of the 25th Conf. on Neural Information Processing Systems. 2636--2644","author":"Pajarinen Joni","year":"2011","unstructured":"Joni Pajarinen and Jaakko Peltonen . 2011 . Periodic Finite State Controllers for Efficient POMDP and DEC-POMDP Planning . In Proc. of the 25th Conf. on Neural Information Processing Systems. 2636--2644 . Joni Pajarinen and Jaakko Peltonen. 2011. Periodic Finite State Controllers for Efficient POMDP and DEC-POMDP Planning. In Proc. of the 25th Conf. on Neural Information Processing Systems. 2636--2644."},{"key":"e_1_3_2_1_17_1","volume-title":"Proc. of the 20th Int'l Joint Conf. on Artificial Intelligence. 2009--2015","author":"Seuken Sven","year":"2007","unstructured":"Sven Seuken and Shlomo Zilberstein . 2007 . Memory-Bounded Dynamic Programming for DEC-POMDPs . In Proc. of the 20th Int'l Joint Conf. on Artificial Intelligence. 2009--2015 . Sven Seuken and Shlomo Zilberstein. 2007. Memory-Bounded Dynamic Programming for DEC-POMDPs. In Proc. of the 20th Int'l Joint Conf. on Artificial Intelligence. 2009--2015."},{"key":"e_1_3_2_1_18_1","volume-title":"Proc. of the 22nd Int'l Joint Conf. on Artificial Intelligence. 2027--2032","author":"Spaan Matthijs T.","year":"2011","unstructured":"Matthijs T. Spaan , Frans A. Oliehoek , and Christopher Amato . 2011 . Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion . In Proc. of the 22nd Int'l Joint Conf. on Artificial Intelligence. 2027--2032 . Matthijs T. Spaan, Frans A. Oliehoek, and Christopher Amato. 2011. Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion. In Proc. of the 22nd Int'l Joint Conf. on Artificial Intelligence. 2027--2032."},{"key":"e_1_3_2_1_19_1","volume-title":"Proc. of the 10th Int'l Conf. on Autonomous Agents and Multiagent Systems. 29--36","author":"Witwicki Stefan J","year":"2011","unstructured":"Stefan J Witwicki and Edmund H Durfee . 2011 . Towards a unifying characterization for quantifying weak coupling in Dec-POMDPs . In Proc. of the 10th Int'l Conf. on Autonomous Agents and Multiagent Systems. 29--36 . Stefan J Witwicki and Edmund H Durfee. 2011. Towards a unifying characterization for quantifying weak coupling in Dec-POMDPs. In Proc. of the 10th Int'l Conf. on Autonomous Agents and Multiagent Systems. 29--36."},{"key":"e_1_3_2_1_20_1","volume-title":"Proc. of the 26th Conf. on Uncertainty in Artificial Intelligence. 666--673","author":"Wu Feng","year":"2010","unstructured":"Feng Wu , Shlomo Zilberstein , and Xiaoping Chen . 2010 . Rollout sampling policy iteration for decentralized POMDPs . In Proc. of the 26th Conf. on Uncertainty in Artificial Intelligence. 666--673 . Feng Wu, Shlomo Zilberstein, and Xiaoping Chen. 2010. Rollout sampling policy iteration for decentralized POMDPs. In Proc. of the 26th Conf. on Uncertainty in Artificial Intelligence. 666--673."}],"event":{"name":"DAI '19: First International Conference on Distributed Artificial Intelligence","acronym":"DAI '19","location":"Beijing China"},"container-title":["Proceedings of the First International Conference on Distributed Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3356464.3357699","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3356464.3357699","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:22:54Z","timestamp":1750202574000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3356464.3357699"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,13]]},"references-count":20,"alternative-id":["10.1145\/3356464.3357699","10.1145\/3356464"],"URL":"https:\/\/doi.org\/10.1145\/3356464.3357699","relation":{},"subject":[],"published":{"date-parts":[[2019,10,13]]},"assertion":[{"value":"2019-10-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}