{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T05:43:45Z","timestamp":1771652625614,"version":"3.50.1"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2009,8,25]],"date-time":"2009-08-25T00:00:00Z","timestamp":1251158400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Auton Agent Multi-Agent Syst"],"published-print":{"date-parts":[[2010,11]]},"DOI":"10.1007\/s10458-009-9103-z","type":"journal-article","created":{"date-parts":[[2009,8,24]],"date-time":"2009-08-24T07:55:03Z","timestamp":1251100503000},"page":"293-320","source":"Crossref","is-referenced-by-count":66,"title":["Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs"],"prefix":"10.1007","volume":"21","author":[{"given":"Christopher","family":"Amato","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel S.","family":"Bernstein","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shlomo","family":"Zilberstein","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2009,8,25]]},"reference":[{"key":"9103_CR1","unstructured":"Amato, C., Bernstein, D. S., & Zilberstein, S. (2007). Solving POMDPs using quadratically constrained linear programs. In: Proceedings of the twentieth international joint conference on artificial intelligence. (pp. 2418\u20132424.) Hyderabad, India."},{"key":"9103_CR2","first-page":"423","volume":"22","author":"R. Becker","year":"2004","unstructured":"Becker R., Zilberstein S., Lesser V., Goldman C.V. (2004) Solving transition-independent decentralized Markov decision processes. Journal of AI Research 22: 423\u2013455","journal-title":"Journal of AI Research"},{"key":"9103_CR3","unstructured":"Bernstein, D. S., Hansen, E., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In: Proceedings of the nineteenth international joint conference on artificial intelligence. (pp. 1287\u20131292). Edinburgh, Scotland."},{"key":"9103_CR4","first-page":"89","volume":"34","author":"D.S. Bernstein","year":"2009","unstructured":"Bernstein D.S., Amato C., Hansen E.A., Zilberstein S. (2009) Policy iteration for decentralized control of Markov decision processes. Journal of AI Research 34: 89\u2013132","journal-title":"Journal of AI Research"},{"key":"9103_CR5","volume-title":"Nonlinear programming","author":"D.P. Bertsekas","year":"2004","unstructured":"Bertsekas D.P. (2004) Nonlinear programming. Belmont, MA, Athena Scientific"},{"key":"9103_CR6","unstructured":"Cassandra, A. R. (1998a). A survey of POMDP applications. In: AAAI fall symposium: Planning with POMDPs. Orlando, FL."},{"key":"9103_CR7","unstructured":"Cassandra, A. R. (1998b). Exact and approximate algorithms for partially observable Markov decision processes. PhD thesis. Brown University Providence, RI."},{"key":"9103_CR8","doi-asserted-by":"crossref","first-page":"1058","DOI":"10.1287\/opre.16.5.1058","volume":"16","author":"J.E. Eckles","year":"1968","unstructured":"Eckles J.E. (1968) Optimum maintenance with incomplete information. Operations Research 16: 1058\u20131067","journal-title":"Operations Research"},{"key":"9103_CR9","unstructured":"Emery-Montemerlo, R., Gordon, G., Schneider, J., & Thrun, S. (2004). Approximate solutions for partially observable stochastic games with common payoffs. In: Proceedings of the third international joint conference on autonomous agents and multiagent systems (pp. 136\u2013143). New York, NY."},{"key":"9103_CR10","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1137\/S0036144504446096","volume":"47","author":"P. E. Gill","year":"2005","unstructured":"Gill P. E., Murray W., Saunders M. (2005) Snopt: An SQP algorithm for large-scale constrained optimization. SIAM Review 47: 99\u2013131","journal-title":"SIAM Review"},{"key":"9103_CR11","unstructured":"Hansen, E. A. (1998). Solving POMDPs by searching in policy space. In: Proceedings of the fourteenth conference on uncertainty in artificial intelligence. (pp. 211\u2013219). Madison, WI."},{"key":"9103_CR12","unstructured":"Hansen, E. A., Bernstein, D. S., & Zilberstein, S. (2004). Dynamic programming for partially observable stochastic games. In: Proceedings of the nineteenth national conference on artificial intelligence. (pp. 709\u2013715). San Jose, CA."},{"key":"9103_CR13","unstructured":"Hauskrecht, M., & Fraser, H. (1998). Modeling treatment of ischemic heart disease with partially observable Markov decision processes. In: Proceedings of American medical informatics association annual symposium on computer applications in health care. (pp. 538\u2013542). Orlando, Florida."},{"key":"9103_CR14","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-662-03199-5","volume-title":"Global optimization: Deterministic approaches","author":"R. Horst","year":"1996","unstructured":"Horst R., Tuy H. (1996) Global optimization: Deterministic approaches. Springer, New York"},{"key":"9103_CR15","unstructured":"Ji, S., Parr, R., Li, H., Liao, X., & Carin, L. (2007). Point-based policy iteration. In: Proceedings of the twenty-second national conference on artificial intelligence. (pp. 1243\u20131249). Vancouver, Canada."},{"key":"9103_CR16","volume-title":"Learning policies for partially observable environments: Scaling up. Technical report CS-95-11","author":"M.L. Littman","year":"1995","unstructured":"Littman M.L., Cassandra A.R., Kaelbling L.P. (1995) Learning policies for partially observable environments: Scaling up. Technical report CS-95-11. Brown University, Department of Computer Science, Providence, RI"},{"key":"9103_CR17","unstructured":"Marecki, J., Gupta, T., Varakantham, P., Tambe, M., & Yokoo, M. (2008). Not all agents are equal: Scaling up distributed POMDPs for agent networks. In: Proceedings of the seventh international joint conference on autonomous agents and multiagent systems. (pp. 485\u2013492). Estoril, Portugal."},{"key":"9103_CR18","unstructured":"Meuleau, N., Kim, K. E., Kaelbling, L. P., & Cassandra, A. R. (1999). Solving POMDPs by searching the space of finite policies. In: Proceedings of the fifteenth conference on uncertainty in artificial intelligence. (pp. 417\u2013426). Stockholm, Sweden."},{"key":"9103_CR19","unstructured":"Nair, R., Pynadath, D., Yokoo, M., Tambe, M., & Marsella, S. (2003). Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings. In: Proceedings of the nineteenth international joint conference on artificial intelligence. (pp. 705\u2013711). Acapulco, Mexico."},{"key":"9103_CR20","unstructured":"Petrik, M., & Zilberstein, S. (2007). Average-reward decentralized Markov decision processes. In Proceedings of the twentieth international joint conference on artificial intelligence (pp. 1997\u20132002). Hyderabad, India."},{"key":"9103_CR21","unstructured":"Pineau, J., Gordon, G., & Thrun, S. (2003). Point-based value iteration: An anytime algorithm for POMDPs. In: Proceedings of the eighteenth international joint conference on artificial intelligence. (pp. 1025\u20131032). Acapulco, Mexico."},{"key":"9103_CR22","unstructured":"Poupart, P. (2005). Exploiting structure to efficiently solve large scale partially observable Markov decision processes. PhD thesis. University of Toronto."},{"key":"9103_CR23","volume-title":"Advances in neural information processing systems 16","author":"P. Poupart","year":"2004","unstructured":"Poupart P., Boutilier C. (2004) Bounded finite state controllers. In: Thrun S., Saul L., Sch\u00f6lkopf B. (eds) Advances in neural information processing systems 16. MIT Press, Cambridge, MA"},{"key":"9103_CR24","unstructured":"Seuken, S., & Zilberstein, S. (2007a). Memory-bounded dynamic programming for DEC-POMDPs. In: Proceedings of the twentieth international joint conference on artificial intelligence. (pp. 2009\u20132015). Hyderabad, India."},{"key":"9103_CR25","unstructured":"Seuken, S., & Zilberstein, S. (2007b). Improved memory-bounded dynamic programming for decentralized POMDPs. In: Proceedings of the twenty-third conference on uncertainty in artificial intelligence. Vancouver, Canada."},{"key":"9103_CR26","unstructured":"Simmons, R., & Koenig, S. (1995). Probabilistic navigation in partially observable environments. In: Proceedings of the fourteenth international joint conference on artificial intelligence. (pp. 1080\u20131087). Montral, Canada."},{"key":"9103_CR27","doi-asserted-by":"crossref","unstructured":"Singh, S., Jaakkola, T., & Jordan, M. (1994). Learning without state-estimation in partially observable Markovian decision processes. In: Proceedings of the eleventh international conference on machine learning. (pp. 284\u2013292). New Brunswick, NJ.","DOI":"10.1016\/B978-1-55860-335-6.50042-8"},{"key":"9103_CR28","unstructured":"Smith, T., & Simmons, R. (2004). Heuristic search value iteration for POMDPs. In: Proceedings of the twentieth conference on uncertainty in artificial intelligence. (pp. 520\u2013527). Banff, Canada."},{"key":"9103_CR29","unstructured":"Smith, T., & Simmons, R. (2005). Point-based POMDP algorithms: Improved analysis and implementation. In: Proceedings of the twenty-first conference on uncertainty in artificial intelligence. Edinburgh, Scotland."},{"key":"9103_CR30","unstructured":"Sondik, E. J. (1971). The optimal control of partially observable Markov processes. PhD thesis. Stanford University."},{"key":"9103_CR31","first-page":"195","volume":"24","author":"M. T. J. Spaan","year":"2005","unstructured":"Spaan M. T. J., Vlassis N. (2005) Perseus: Randomized point-based value iteration for POMDPs. Journal of AI Research 24: 195\u2013220","journal-title":"Journal of AI Research"},{"key":"9103_CR32","volume-title":"Reinforcement learning: An introduction","author":"R. S. Sutton","year":"1995","unstructured":"Sutton R. S., Barto A. G. (1995) Reinforcement learning: An introduction. MIT Press, Cambridge, MA"},{"key":"9103_CR33","doi-asserted-by":"crossref","unstructured":"Szer, D., & Charpillet, F. (2005). An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs. In: Proceedings of the sixteenth European conference on machine learning. (pp. 389\u2013399). Porto, Portugal.","DOI":"10.1007\/11564096_38"},{"key":"9103_CR34","unstructured":"Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In: Proceedings of the twenty-first conference on uncertainty in artificial intelligence. (pp. 576\u2013583). Edinburgh, Scotland."},{"key":"9103_CR35","doi-asserted-by":"crossref","unstructured":"Wah, B. W., & Chen, Y. (2005). Solving large-scale nonlinear programming problems by constraint partitioning. In: Proceedings of the eleventh international conference on principles and practice of constraint programming. (pp. 697\u2013711).","DOI":"10.1007\/11564751_51"}],"container-title":["Autonomous Agents and Multi-Agent Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-009-9103-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10458-009-9103-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-009-9103-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,5,29]],"date-time":"2019-05-29T17:28:25Z","timestamp":1559150905000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10458-009-9103-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,8,25]]},"references-count":35,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2010,11]]}},"alternative-id":["9103"],"URL":"https:\/\/doi.org\/10.1007\/s10458-009-9103-z","relation":{},"ISSN":["1387-2532","1573-7454"],"issn-type":[{"value":"1387-2532","type":"print"},{"value":"1573-7454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,8,25]]}}}