{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,25]],"date-time":"2026-01-25T14:44:27Z","timestamp":1769352267844,"version":"3.49.0"},"publisher-location":"Berlin, Heidelberg","reference-count":22,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"value":"9783540644170","type":"print"},{"value":"9783540697817","type":"electronic"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[1998]]},"DOI":"10.1007\/bfb0026709","type":"book-chapter","created":{"date-parts":[[2005,11,19]],"date-time":"2005-11-19T08:27:35Z","timestamp":1132388855000},"page":"382-393","source":"Crossref","is-referenced-by-count":19,"title":["Theoretical results on reinforcement learning with temporally abstract options"],"prefix":"10.1007","author":[{"given":"Doina","family":"Precup","sequence":"first","affiliation":[]},{"given":"Richard S.","family":"Sutton","sequence":"additional","affiliation":[]},{"given":"Satinder","family":"Singh","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2005,6,16]]},"reference":[{"key":"45_CR1","volume-title":"Dynamic Programming: Deterministic and Stochastic Models","author":"D. P. Bertsekas","year":"1987","unstructured":"Dimitri P. Bertsekas. Dynamic Programming: Deterministic and Stochastic Models. Prentice Hall, Englewood Cliffs, NJ, 1987."},{"key":"45_CR2","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1162\/neco.1993.5.4.613","volume":"5","author":"P. Dayan","year":"1993","unstructured":"Peter Dayan. Improving generalization for temporal difference learning: The successor representation. Neural Computation, 5:613\u2013624, 1993.","journal-title":"Neural Computation"},{"key":"45_CR3","first-page":"271","volume-title":"Advances in Neural Information Processing Systems, volume 5","author":"P. Dayan","year":"1993","unstructured":"Peter Dayan and Geoff E. Hinton. 
Feudal reinforcement learning. In Advances in Neural Information Processing Systems, volume 5, pages 271\u2013278, Cambridge, MA, 1993. MIT Press."},{"key":"45_CR4","unstructured":"Thomas G. Dietterich. Hierarchical reinforcement learning with maxq value function decomposition. Technical report, Computer Science Department, Oregon State University, 1997."},{"key":"45_CR5","volume-title":"Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, IJCAI-97","author":"M. Huber","year":"1997","unstructured":"Manfred Huber and Roderic A. Grupen. Learning to coordinate controllers \u2014 reinforcement learning on a control basis. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, IJCAI-97, San Francisco, CA, 1997. Morgan Kaufmann."},{"key":"45_CR6","first-page":"167","volume-title":"Proceedings of the Tenth International Conference on Machine Learning ICML'93","author":"L. P. Kaelbling","year":"1993","unstructured":"Leslie P. Kaelbling. Hierarchical learning in stochastic domains: Preliminary results. In Proceedings of the Tenth International Conference on Machine Learning ICML'93, pages 167\u2013173, San Mateo, CA, 1993. Morgan Kaufmann."},{"key":"45_CR7","volume-title":"Learning to Solve Problems by Searching for Macro-Operators","author":"R. E. Korf","year":"1985","unstructured":"Richard E. Korf. Learning to Solve Problems by Searching for Macro-Operators. Pitman Publishing Ltd, London, 1985."},{"key":"45_CR8","first-page":"11","volume":"1","author":"J. E. Laird","year":"1986","unstructured":"John E. Laird, Paul S. Rosenbloom, and Allen Newell. Chunking in SOAR: The anatomy of a general learning mechanism. Machine Learning, 1:11\u201346, 1986.","journal-title":"Machine Learning"},{"issue":"2\u20133","key":"45_CR9","doi-asserted-by":"publisher","first-page":"311","DOI":"10.1016\/0004-3702(92)90058-6","volume":"55","author":"S. 
Mahadevan","year":"1992","unstructured":"Sridhar Mahadevan and Jonathan Connell. Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55(2\u20133):311\u2013365, 1992","journal-title":"Artificial Intelligence"},{"key":"45_CR10","unstructured":"Amy McGovern, Richard S. Sutton, and Andrew H. Fagg. Roles of macro-actions in accelerating reinforcement learning. In Grace Hopper Celebration of Women in Computing, pages 13\u201318, 1997."},{"key":"45_CR11","first-page":"103","volume":"13","author":"A. W. Moore","year":"1993","unstructured":"Andrew W. Moore and Chris G. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13:103\u2013130, 1993","journal-title":"Machine Learning"},{"key":"45_CR12","volume-title":"Advances in Neural Information Processing Systems, volume 10","author":"R. Parr","year":"1998","unstructured":"Ronald Parr and Stuart Russell. Reinforcement learning with hierarchies of machines. In Advances in Neural Information Processing Systems, volume 10, Cambridge, MA, 1998. MIT Press."},{"key":"45_CR13","first-page":"323","volume":"4","author":"J. Peng","year":"1993","unstructured":"Jing Peng and John Williams. Efficient learning and planning within the Dyna framework. Adaptive Behavior, 4:323\u2013334, 1993.","journal-title":"Adaptive Behavior"},{"key":"45_CR14","volume-title":"Advances in Neural Information Processing Systems, volume 10","author":"D. Precup","year":"1998","unstructured":"Doina Precup and Richard S. Sutton. Multi-Time models for temporally abstract planning. In Advances in Neural Information Processing Systems, volume 10, Cambridge, MA, 1998. MIT Press."},{"key":"45_CR15","doi-asserted-by":"crossref","DOI":"10.1002\/9780470316887","volume-title":"Markov Decision Processes","author":"M. L. Puterman","year":"1994","unstructured":"Martin L. Puterman. Markov Decision Processes. 
Wiley-Interscience, New York, NY, 1994."},{"key":"45_CR16","volume-title":"A Structure for Plans and Behavior","author":"E. D. Sacerdoti","year":"1977","unstructured":"Earl D. Sacerdoti. A Structure for Plans and Behavior. Elsevier, North-Holland, NY, 1977."},{"key":"45_CR17","first-page":"202","volume-title":"Proceedings of the Ninth International Conference on Machine Learning ICML'92","author":"S. P. Singh","year":"1992","unstructured":"Satinder P. Singh. Scaling reinforcement learning by learning variable temporal resolution models. In Proceedings of the Ninth International Conference on Machine Learning ICML'92, pages 202\u2013207, San Mateo, CA, 1992. Morgan Kaufmann."},{"key":"45_CR18","first-page":"216","volume-title":"Proceedings of the Seventh International Conference on Machine Learning ICML'90","author":"R. S. Sutton","year":"1990","unstructured":"Richard S. Sutton. Integrating architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the Seventh International Conference on Machine Learning ICML'90, pages 216\u2013224, San Mateo, CA, 1990. Morgan Kaufmann."},{"key":"45_CR19","first-page":"531","volume-title":"Proceedings of the Twelfth International Conference on Machine Learning ICML'95","author":"R. S. Sutton","year":"1995","unstructured":"Richard S. Sutton. TD models: Modeling the world as a mixture of time scales. In Proceedings of the Twelfth International Conference on Machine Learning ICML'95, pages 531\u2013539, San Mateo, CA, 1995. Morgan Kaufmann."},{"key":"45_CR20","volume-title":"Reinforcement Learning. An Introduction","author":"R. S. Sutton","year":"1998","unstructured":"Richard S. Sutton and Andrew G. Barto. Reinforcement Learning. An Introduction. MIT Press, Cambridge, MA, 1998."},{"key":"45_CR21","unstructured":"Richard S. Sutton and Brian Pinette. The learning of world models by connectionist networks. 
In Proceedings of the Seventh Annual Conference of the Cognitive Science Society, pages 54\u201364, 1985."},{"key":"45_CR22","unstructured":"Christopher J. C. H. Watkins. Learning with Delayed Rewards. PhD thesis, Cambridge University, 1989."}],"container-title":["Lecture Notes in Computer Science","Machine Learning: ECML-98"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/BFb0026709","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,2,4]],"date-time":"2019-02-04T23:13:45Z","timestamp":1549322025000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/BFb0026709"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1998]]},"ISBN":["9783540644170","9783540697817"],"references-count":22,"URL":"https:\/\/doi.org\/10.1007\/bfb0026709","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"value":"0302-9743","type":"print"},{"value":"1611-3349","type":"electronic"}],"subject":[],"published":{"date-parts":[[1998]]}}}