{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T18:21:01Z","timestamp":1761934861883,"version":"build-2065373602"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[1995,12,1]],"date-time":"1995-12-01T00:00:00Z","timestamp":817776000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[1995,12]]},"DOI":"10.1007\/bf00993591","type":"journal-article","created":{"date-parts":[[2005,1,14]],"date-time":"2005-01-14T18:04:23Z","timestamp":1105725863000},"page":"199-233","source":"Crossref","is-referenced-by-count":90,"title":["The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces"],"prefix":"10.1007","volume":"21","author":[{"given":"Andrew W.","family":"Moore","sequence":"first","affiliation":[]},{"given":"Christopher G.","family":"Atkeson","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"CR1","doi-asserted-by":"crossref","unstructured":"Akian, M., Chancelier, J.P. & Quadrat, J.P., (1988). Dynamic Programming Complexity and Application. InProceedings of the 27th Conference on Decision and Control, Austin, Texas.","DOI":"10.1109\/CDC.1988.194590"},{"key":"CR2","unstructured":"Arcilla, A.S., Hauser, J., Eiseman, P.R. & Thompson, J.F., (1991).Numerical Grid Generation in Computational Fluid Dynamics and Related Fields. North-Holland."},{"key":"CR3","unstructured":"Barto, A.G., Bradtke, S.J. & Singh, S.P., (1994). Real-time Learning and Control using Asynchronous Dynamic Programming.AI Journal, to appear (also published as UMass Amherst Technical Report 91-57 in 1991)."},{"issue":"5","key":"CR4","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1109\/TSMC.1983.6313077","volume":"13","author":"A.G. Barto","year":"1983","unstructured":"Barto, A.G., Sutton, R.S. & Anderson, C.W., (1983). Neuronlike Adaptive elements that that can learn difficult Control Problems.IEEE Trans. on Systems Man and Cybernetics, 13(5):835?846.","journal-title":"IEEE Trans. on Systems Man and Cybernetics"},{"key":"CR5","volume-title":"Dynamic Programming","author":"R.E. Bellman","year":"1957","unstructured":"Bellman, R.E., (1957).Dynamic Programming. Princeton University Press, Princeton, NJ."},{"key":"CR6","unstructured":"Bertsekas, D.P. & Tsitsiklis, J.N., (1989).Parallel and Distributed Computation. Prentice Hall."},{"key":"CR7","unstructured":"Brooks, R.A. & Lozano-Perez, T., (1983). A Subdivision Algorithm in Configuration Space for Findpath with rotation. InProceedings of the 8th International Conference on Artifical Intelligence."},{"key":"CR8","unstructured":"Chapman, D. & Kaelbling, L.P., (1991). Learning from Delayed Reinforcement In a Complex Domain. Technical Report, Teleos Research."},{"key":"CR9","unstructured":"Chow, C.S., (1990). Multigrid algorithms and complexity results for discrete-time stochastic control and related fixed-point problems. Technical report, M.I.T. Laboratory for Information and Decision Sciences."},{"key":"CR10","unstructured":"Dayan, P. & Hinton, G.E., (1993). Feudal Reinforcement Learning. In S. J. Hanson, J. D Cowan, and C. L. Giles, editors,Advances in Neural Information Processing Systems 5. Morgan Kaufmann."},{"key":"CR11","doi-asserted-by":"crossref","unstructured":"Hoppe, R. H. W., (1986). Multi-Grid Methods for Hamilton-Jacobi-Bellman Equations.Numerical Mathematics, 49.","DOI":"10.1007\/BF01389627"},{"key":"CR12","unstructured":"Kaelbling, L. (1993). Hierarchicial Learning in Stochastic Domains: Preliminary Results. InMachine Learning: Proceedings of the Tenth International Workshop. Morgan Kaufmann."},{"key":"CR13","unstructured":"Kaelbling, L.P., (1990). Learning in Embedded Systems. PhD. Thesis; Technical Report No. TR-90-04, Stanford University, Department of Computer Science, June 1990."},{"key":"CR14","doi-asserted-by":"crossref","unstructured":"Kambhampati, Subbarao & Davis, Larry S., (1986). Multiresolution Path Planning for Mobile Robots.IEEE Journal of Robotics and Automation, Vol. RA-2, No. 3, 2(3).","DOI":"10.1109\/JRA.1986.1087051"},{"key":"CR15","unstructured":"Knuth, D.E., (1973).Sorting and Searching. Addison Wesley."},{"key":"CR16","unstructured":"Koenig, S. & Simmons, R.G. (1993). Complexity Analysis of Reinforcement Learning. InProceedings of the Eleventh International Conference on Artificial Intelligence (AAAI-93). MIT Press."},{"key":"CR17","doi-asserted-by":"crossref","unstructured":"Latombe, J. (1991).Robot Motion Planning. Kluwer.","DOI":"10.1007\/978-1-4615-4022-9"},{"key":"CR18","doi-asserted-by":"crossref","unstructured":"McCormick, S.F., (1989).Multilevel Adaptive Methods for Partial Differential Equations. SIAM.","DOI":"10.1137\/1.9781611971026"},{"key":"CR19","unstructured":"Michie, D. & Chambers, R.A., (1968). BOXES: An Experiment in Adaptive Control. In E. Dale and D. Michie, editors,Machine Intelligence 2. Oliver and Boyd."},{"key":"CR20","doi-asserted-by":"crossref","unstructured":"Moore, A.W., (1991). Variable Resolution Dynamic Programming: Efficiently Learning Action Maps in Multivariate Real-valued State-spaces. In L. Birnbaum and G. Collins, editors,Machine Learning: Proceedings of the Eighth International Workshop. Morgan Kaufmann.","DOI":"10.1016\/B978-1-55860-200-7.50069-6"},{"key":"CR21","doi-asserted-by":"crossref","unstructured":"Moore, A.W. & Atkeson, C.G., (1993). Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time.Machine Learning, 13.","DOI":"10.1007\/BF00993104"},{"key":"CR22","unstructured":"Nilsson, N.J., (1971).Problem-solving Methods in Artificial Intelligence. McGraw Hill."},{"key":"CR23","doi-asserted-by":"crossref","unstructured":"Peng, J. & Williams, R.J., (1993). Efficient Learning and Planning Within the Dyna Framework. InProceedings of the Second International Conference on Simulation of Adaptive Behavior. MIT Press.","DOI":"10.1109\/ICNN.1993.298551"},{"key":"CR24","unstructured":"Sage, A.P. & White, C.C., (1977).Optimum Systems Control. Prentice Hall."},{"key":"CR25","unstructured":"Schaal, S. & Atkeson, C.G., (1994). Assessing the Quality of Local Linear Models. InAdvances in Neural Information Processing Systems 6. Morgan Kaufmann."},{"issue":"5","key":"CR26","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1109\/TAC.1982.1103060","volume":"27","author":"J. Simons","year":"1982","unstructured":"Simons, J., Van Brussel, H., De Schutter, J. & Verhaert, J. (1982). A Self-Learning Automaton with Variable Resolution for High Precision Assembly by Industrial Robots.IEEE Trans. on Automatic Control, 27(5):1109?1113.","journal-title":"IEEE Trans. on Automatic Control"},{"key":"CR27","volume-title":"Temporal Credit Assignment in Reinforcement Learning","author":"R.S. Sutton","year":"1984","unstructured":"Sutton, R.S., (1984). Temporal Credit Assignment in Reinforcement Learning. Phd. thesis, University of Massachusetts, Amherst."},{"key":"CR28","doi-asserted-by":"crossref","unstructured":"Sutton, R.S., (1990). Integrated Architecture for Learning, Planning, and Reacting Based on Approximating Dynamic Programming. InProceedings of the 7th International Conference on Machine Learning. Morgan Kaufmann.","DOI":"10.1016\/B978-1-55860-141-3.50030-4"},{"key":"CR29","unstructured":"Watkins, C. J. C. H. (1989). Learning from Delayed Rewards. PhD. Thesis, King's College, University of Cambridge."}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BF00993591.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/BF00993591\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BF00993591","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,4,5]],"date-time":"2020-04-05T09:51:56Z","timestamp":1586080316000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/BF00993591"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1995,12]]},"references-count":29,"journal-issue":{"issue":"3","published-print":{"date-parts":[[1995,12]]}},"alternative-id":["BF00993591"],"URL":"https:\/\/doi.org\/10.1007\/bf00993591","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"type":"print","value":"0885-6125"},{"type":"electronic","value":"1573-0565"}],"subject":[],"published":{"date-parts":[[1995,12]]}}}