{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T08:50:57Z","timestamp":1760345457961},"reference-count":27,"publisher":"Elsevier BV","issue":"2-3","license":[{"start":{"date-parts":[[1999,11,1]],"date-time":"1999-11-01T00:00:00Z","timestamp":941414400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.elsevier.com\/tdm\/userlicense\/1.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Robotics and Autonomous Systems"],"published-print":{"date-parts":[[1999,11]]},"DOI":"10.1016\/s0921-8890(99)00051-2","type":"journal-article","created":{"date-parts":[[2002,10,31]],"date-time":"2002-10-31T21:12:04Z","timestamp":1036098724000},"page":"187-200","source":"Crossref","is-referenced-by-count":11,"title":["Representation of behavioral history for learning in nonstationary conditions"],"prefix":"10.1016","volume":"29","author":[{"given":"Fran\u00e7ois","family":"Michaud","sequence":"first","affiliation":[]},{"given":"Maja J.","family":"Matari\u0107","sequence":"additional","affiliation":[]}],"member":"78","reference":[{"key":"10.1016\/S0921-8890(99)00051-2_BIB1","unstructured":"P.E. Agre, The dynamic structure of everyday life, Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, 1988."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB2","doi-asserted-by":"crossref","unstructured":"M. Asada, E. Uchibe, S. Noda, S. Tawaratsumida, K. Hosoda, Coordination of multiple behaviors acquired by a vision-based reinforcement learning, in: Proceedings of the IEEE\/RSJ\/GI International Conference on Intelligent Robots and Systems, Munich, Germany, 1994.","DOI":"10.1109\/IROS.1994.407484"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB3","doi-asserted-by":"crossref","unstructured":"R.A. Brooks, A robust layered control system for a mobile robot, IEEE Journal of Robotics and Automation RA-2 (1) (1986) 14\u201323.","DOI":"10.1109\/JRA.1986.1087032"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB4","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/0004-3702(91)90053-M","article-title":"Intelligence without representation","volume":"47","author":"Brooks","year":"1991","journal-title":"Artificial Intelligence"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB5","unstructured":"R.A. Brooks, MARS: Multiple Agency Reactivity System, Technical Report, IS Robotics, 1996."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB6","doi-asserted-by":"crossref","unstructured":"A. Cassandra, L.P. Kaelbling, J.A. Kurien, Acting under uncertainty: Discrete bayesian models for mobile-robot navigation, in: Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems, Osaka, Japan, 1996.","DOI":"10.1109\/IROS.1996.571080"},{"issue":"2","key":"10.1016\/S0921-8890(99)00051-2_BIB7","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1016\/0004-3702(94)90047-7","article-title":"Robot shaping: Developing autonomous agents through learning","volume":"71","author":"Dorigo","year":"1994","journal-title":"Artificial Intelligence"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB8","doi-asserted-by":"crossref","unstructured":"D. Floreano, F. Mondada, Evolution of homing navigation in a real mobile robot, in: IEEE Transactions on Systems, Man, and Cybernetics 26 (3) (1996) 396-407.","DOI":"10.1109\/3477.499791"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB9","unstructured":"D. Goldberg, M.J. Matari\u0107, Interference as a tool for designing and evaluating multi-robot controllers, in: Proceedings of the National Conference on Artificial Intelligence (AAAI-97), Providence, RI, 1997, pp. 637\u2013642."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB10","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1613\/jair.301","article-title":"Reinforcement learning: A survey","volume":"4","author":"Kaelbling","year":"1996","journal-title":"Journal of Artificial Intelligence Research"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB11","doi-asserted-by":"crossref","unstructured":"S. Koenig, R.G. Simmons, Unsupervised learning of probabilistic models for robot navigation, in: Proceedings of the IEEE International Conference on Robotics and Automation, Minneapolis, MN, 1996.","DOI":"10.1109\/ROBOT.1996.506507"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB12","unstructured":"P. Maes, The dynamics of action selection, in: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-89), Detroit, MI, 1989, pp. 991\u2013997."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB13","unstructured":"P. Maes, R.A. Brooks, Learning to coordinate behaviors, in: Proceedings of the National Conference on Artificial Intelligence (AAAI-90), Boston, MA, vol. 2, 1990, pp. 796-802."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB14","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/0004-3702(92)90058-6","article-title":"Automatic programming of behavior-based robots using reinforcement learning","volume":"55","author":"Mahadevan","year":"1992","journal-title":"Artificial Intelligence"},{"issue":"4","key":"10.1016\/S0921-8890(99)00051-2_BIB15","first-page":"89","article-title":"The National Science Foundation Workshop on Reinforcement Learning: Summary and observations","volume":"17","author":"Mahadevan","year":"1996","journal-title":"AI Magazine"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB16","unstructured":"M.J. Matari\u0107, Behavior-based control: Examples from navigation, learning, and group behavior, Journal of Experimental and Theoretical Artificial Intelligence 9 (2\u20133) (1997)."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB17","doi-asserted-by":"crossref","unstructured":"M.J. Matari\u0107, Reinforcement learning in the multi-robot domain, Autonomous Robots 4 (1) (1997).","DOI":"10.1023\/A:1008819414322"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB18","doi-asserted-by":"crossref","unstructured":"A.K. McCallum, Learning to use selective attention and short-term memory in sequential tasks, in: From Animals to Animats: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, MIT Press, Cape Cod, 1996, pp. 315\u2013324.","DOI":"10.7551\/mitpress\/3118.003.0039"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB19","doi-asserted-by":"crossref","unstructured":"A.K. McCallum, Reinforcement learning with selective perception and hidden state, Ph.D. Thesis, Department of Computer Science, University of Rochester, 1996.","DOI":"10.1109\/3477.499796"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB20","doi-asserted-by":"crossref","unstructured":"F. Michaud, G. Lachiver, C.T. Le Dinh, A new control architecture combining reactivity, deliberation and motivation for situated autonomous agent, in: From Animals to Animats: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, MIT Press, Cape Cod, 1996, pp. 245\u2013254.","DOI":"10.7551\/mitpress\/3118.003.0031"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB21","doi-asserted-by":"crossref","unstructured":"F. Michaud, M.J. Matari\u0107, Learning from history for behavior-based mobile robots in non-stationary conditions, Machine Learning 31 (1998) 141\u2013167; Autonomous Robots 5 (1998) 335\u2013354; Joint Special Issue on Learning in Autonomous Robots.","DOI":"10.1023\/A:1008814507256"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB22","unstructured":"R. Parr, S. Russell, Approximating optimal policies for partially observable stochastic domains, in: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-95), Montr\u00e9al, Qu\u00e9bec, 1995, pp. 1088\u20131094."},{"issue":"4","key":"10.1016\/S0921-8890(99)00051-2_BIB23","first-page":"347","article-title":"Multistrategy learning in reactive control systems for autonomous robotic navigation","volume":"17","author":"Ram","year":"1993","journal-title":"Informatica"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB24","doi-asserted-by":"crossref","unstructured":"S. Russell, Machine learning, in: M.A. Boden (Ed.), Handbook of Perception and Cognition, Academic Press, vol. 14, Academic Press, New York, 1996, Chapter 4.","DOI":"10.1016\/B978-012161964-0\/50006-6"},{"key":"10.1016\/S0921-8890(99)00051-2_BIB25","unstructured":"S. Russell, P. Norvig, Artificial Intelligence: A Modern Approach, Prentice-Hall, Englewood Cliffs, NJ, 1995."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB26","unstructured":"S.P. Singh, T. Jaakkola, M.I. Jordan, Learning without state-estimation in partially observable Markovian decision process, in: Proceedings of the 13th International Conference on Machine Learning, 1996."},{"key":"10.1016\/S0921-8890(99)00051-2_BIB27","doi-asserted-by":"crossref","unstructured":"R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction, MIT Press\/Bradford Books, Cambridge, MA, 1998.","DOI":"10.1109\/TNN.1998.712192"}],"container-title":["Robotics and Autonomous Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0921889099000512?httpAccept=text\/xml","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0921889099000512?httpAccept=text\/plain","content-type":"text\/plain","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2024,1,2]],"date-time":"2024-01-02T04:38:42Z","timestamp":1704170322000},"score":1,"resource":{"primary":{"URL":"https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0921889099000512"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1999,11]]},"references-count":27,"journal-issue":{"issue":"2-3","published-print":{"date-parts":[[1999,11]]}},"alternative-id":["S0921889099000512"],"URL":"https:\/\/doi.org\/10.1016\/s0921-8890(99)00051-2","relation":{},"ISSN":["0921-8890"],"issn-type":[{"value":"0921-8890","type":"print"}],"subject":[],"published":{"date-parts":[[1999,11]]}}}