{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T04:02:43Z","timestamp":1777521763658,"version":"3.51.4"},"reference-count":31,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2003,9,1]],"date-time":"2003-09-01T00:00:00Z","timestamp":1062374400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Adaptive Behavior"],"published-print":{"date-parts":[[2003,9]]},"abstract":"<jats:p>Natural intelligence and autonomous agents face difficulties when acting in information-dense environments. Assailed by a multitude of stimuli they have to make sense of the inflow of information, filtering and processing what is necessary, but discarding that which is unimportant. This paper aims at investigating the interactions between evolution of the sensorial channel extracting the information from the environment and the simultaneous individual adaptation of agent-control. Our particular goal is to study the influence of learning on the evolution of sensors, with learning duration being the tunable parameter. A genetic algorithm governs the evolution of sensors appropriate for the agent solving a simple grid world task. The performance of the agent is taken as fitness; \u2018sensors\u2019 are conceived as a map from environmental states to agent observations, and individual adaptation is modeled by Q-learning. Our experimental results show that due to the principles of cognitive economy learning and varying the degree thereof actually transforms the fitness landscape. In particular we identify a trade-off between learning speed (load) and sensor accuracy (error). These results are further reinforced by theoretical analysis: we derive an analytical measure for the quality of sensors based on the mutual entropy between the system of states and the selection of an optimal action, a concept recently proposed by Polani, Martinetz, and Kim.<\/jats:p>","DOI":"10.1177\/1059712303113002","type":"journal-article","created":{"date-parts":[[2004,4,21]],"date-time":"2004-04-21T20:12:39Z","timestamp":1082578359000},"page":"159-177","source":"Crossref","is-referenced-by-count":2,"title":["Evolution and Learning: Evolving Sensors in a Simple MDP Environment"],"prefix":"10.1177","volume":"11","author":[{"given":"Tobias","family":"Jung","sequence":"first","affiliation":[{"name":"Institut f\u00fcr Informatik, Gutenberg-Universit\u00e4t Mainz,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Dauscher","sequence":"additional","affiliation":[{"name":"Institut f\u00fcr Informatik, Gutenberg-Universit\u00e4t Mainz,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Uthmann","sequence":"additional","affiliation":[{"name":"Institut f\u00fcr Informatik, Gutenberg-Universit\u00e4t Mainz,"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2003,9,1]]},"reference":[{"key":"atypb1","unstructured":"Ackley, D. & Littman, M. (1991). Interactions between learning and evolution. In C. G. Langton, C. Taylor, J. D. Farmer, & S. Rasmussen (Eds.), Artificial life II, SFI studies in the sciences of complexity (Vol. X, pp. 487-509). Reading, MA: Addison-Wesley ."},{"key":"atypb2","doi-asserted-by":"crossref","unstructured":"Adami, C. (1998). Introduction to artificial life. New York Springer Verlag .","DOI":"10.1007\/978-1-4612-1650-6"},{"key":"atypb3","unstructured":"Barlow, H. B. (1959). Sensory mechanisms, the reduction of redundancy, and intelligence . In Proceedings of the Symposium on the Mechanisation of Thought Processes, (pp. 535-539 )."},{"key":"atypb4","unstructured":"Bellman, R. (1957). Dynamic programming. NJ Princeton University Press ."},{"key":"atypb5","doi-asserted-by":"crossref","unstructured":"Bruner, J. S., Goodnow, J. J. & Austin, G. A. (1956). A study of thinking. NY Wiley and Sons .","DOI":"10.2307\/1292061"},{"key":"atypb6","unstructured":"Dautenhahn, K., Polani, D. & Uthmann, T. (Eds.). (2001). Special issue on sensor evolution. Cambridge, MA: MIT Press ."},{"key":"atypb7","doi-asserted-by":"crossref","unstructured":"Harvey, I., Husbands, P. & Cliff, D. (1994). Seeing the light: artificial evolution, real vision . In D. Cliff, P. Husbands, J.A. Meyer, & S.Wilson (Eds.), From animals to animats 3, Proceedings of the 3rd International Conference on Simulation of Adaptive Behavior, SAB94 (pp. 392-401 ). Boston, MA: MIT Press\/Bradford Books.","DOI":"10.7551\/mitpress\/3117.003.0058"},{"key":"atypb8","unstructured":"Hinton, G. E. & Nowlan, S. J. (1987). How learning can guide evolution . Complex Systems, 1(1), 495-502 ."},{"key":"atypb9","doi-asserted-by":"publisher","DOI":"10.1613\/jair.301"},{"key":"atypb10","doi-asserted-by":"crossref","unstructured":"Kortmann, R. & Herik, E. P. J. van den. (2001). Evolution of visual resolution constrained by a trade-off . Artificial Life (Special Issue on Sensor Evolution), 7(2), 125-145 .","DOI":"10.1162\/106454601753138970"},{"key":"atypb11","unstructured":"Lee, W. P., Hallam, J. & Lund, H. (1996). A hybrid GP\/GA aproach for co-evolving controllers and robot bodies to achieve fitness-specified tasks . In Proceedings IEEE 3rd International Conference on Evolutionary Computation. NJ: IEEE Press."},{"key":"atypb12","doi-asserted-by":"crossref","unstructured":"Liese, A., Polani, D. & Uthmann, T. (2001). Study of the simulated evolution of the spectral sensitivity of visual agent receptors . Artificial Life (Special Issue on Sensor Evolution), 7(2), 99-124 .","DOI":"10.1162\/106454601753138961"},{"key":"atypb13","unstructured":"Mark, A., Polani, D. & Uthmann, T. (1998). A framework for Sensor Evolution in a population of Braitenberg vehicle-like agents . In C. Adami, R. B. H. Kitano, & C. Taylor (Eds.), Proceedings of Artificial Life IV. Cambridge, MA: MIT Press."},{"key":"atypb14","doi-asserted-by":"crossref","unstructured":"Maylay, G. (1996). Landscapes, learning costs, and genetic assimilation . Evolution, Learning and Instinct: 100 Years of the Baldwin Effect. A Special Edition of Evolutionary Computation, 4(3).","DOI":"10.1162\/evco.1996.4.3.iv"},{"key":"atypb15","doi-asserted-by":"crossref","unstructured":"Menczer, F. & Belew, R. (1994). Evolving sensors in environments of controlled complexity. In R. Brooks & P. Maes (Eds.), Artificial life IV. Cambridge, MA: MIT Press .","DOI":"10.7551\/mitpress\/1428.003.0025"},{"key":"atypb16","doi-asserted-by":"publisher","DOI":"10.1037\/h0045942"},{"key":"atypb17","doi-asserted-by":"crossref","unstructured":"Nehaniv, C. L. (1999). Meaning for observers and agents . In Proceedings IEEE International Symposium on Intelligent Control\/Intelligent Systems and semiotics.","DOI":"10.1109\/ISIC.1999.796694"},{"key":"atypb18","doi-asserted-by":"publisher","DOI":"10.1177\/105971239400300102"},{"key":"atypb19","doi-asserted-by":"crossref","unstructured":"Nolfi, S. & Floreano, D. (1999). Learning and evolution . Autonomous Robots, 7(1).","DOI":"10.1023\/A:1008973931182"},{"key":"atypb20","doi-asserted-by":"crossref","unstructured":"Nolfi, S. & Floreano, D. (2000). Evolutionary robotics\u2014the biology, intelligence, and technology of self-organizing machines. Cambridge, MA: MIT Press .","DOI":"10.7551\/mitpress\/2889.001.0001"},{"key":"atypb21","doi-asserted-by":"crossref","unstructured":"Polani, D., Martinetz, T. & Kim, J. (2001). An information-theoretic approach for the quantification of relevance . In J. Kelemen & P. Sosik (Eds.), Proceedings 6th European Conference on Artificial Life. Berlin: Springer Verlag.","DOI":"10.1007\/3-540-44811-X_82"},{"key":"atypb22","unstructured":"Rosch, E. (1978). Principles of categorization. In E. Rosch & B. B. Lloyd (Eds.), Cognition and categorization. Hillsdale, New Jersey: Lawrence Erlbaum ."},{"key":"atypb23","doi-asserted-by":"crossref","unstructured":"Singh, S., Jaakkola, T. & Jordan, M. I. (1994). Learning without state-estimation in partially observable markovian decision processes . Proceedings of the 11th Machine Learning Conference.","DOI":"10.1016\/B978-1-55860-335-6.50042-8"},{"key":"atypb24","doi-asserted-by":"crossref","unstructured":"Sutton, R. & Barto, A. (1998). Reinforcement learning: An introduction. MIT Press .","DOI":"10.1109\/TNN.1998.712192"},{"key":"atypb25","unstructured":"Tishby, N., Pereira, F. & Bialek., W. (1999). The information bottleneck method . In Proceedings of the 37th Annual Allerton Conference on Communication, Control, and Computing. Illinois."},{"key":"atypb26","doi-asserted-by":"crossref","unstructured":"Todd, P. M. & Miller, G. F. (1991). Exploring adaptive agency II: Simulating the evolution of associative learning . In J. A. Meyer & S. W. Wilson (Eds.), From animals to animats. Proceedings of the First International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.","DOI":"10.7551\/mitpress\/3115.003.0042"},{"key":"atypb27","doi-asserted-by":"crossref","unstructured":"Turney, P., Whitley, D. & Anderson, R. (1996). Evolution, learning, and instinct: 100 years of the baldwin effect . Spec. Issue of Evol. Comp. on the Baldwin Effect, 4.","DOI":"10.1162\/evco.1996.4.3.iv"},{"key":"atypb28","unstructured":"Watkins, C. J. C. H. (1989).\n                      Learning from delayed rewards.\n                      Unpublished doctoral dissertation, Cambridge, England."},{"key":"atypb29","doi-asserted-by":"crossref","unstructured":"Watkins, C. J. C. H. & Dayan, P. (1992). Q-learning . Machine Learning, 8(3), 279-292 .","DOI":"10.1023\/A:1022676722315"},{"key":"atypb30","doi-asserted-by":"crossref","unstructured":"Whitehead, S. D. & Ballard, D. H. (1991). Learning to perceive and act by trial and error . Machine Learning, 7, 45-83 .","DOI":"10.1007\/BF00058926"},{"key":"atypb31","unstructured":"Wittgenstein, L. (1953). Philosophical investigations. Macmillan, New York ."}],"container-title":["Adaptive Behavior"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1059712303113002","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1059712303113002","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T16:15:21Z","timestamp":1777392921000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1059712303113002"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003,9]]},"references-count":31,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2003,9]]}},"alternative-id":["10.1177\/1059712303113002"],"URL":"https:\/\/doi.org\/10.1177\/1059712303113002","relation":{},"ISSN":["1059-7123","1741-2633"],"issn-type":[{"value":"1059-7123","type":"print"},{"value":"1741-2633","type":"electronic"}],"subject":[],"published":{"date-parts":[[2003,9]]}}}