{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,24]],"date-time":"2026-06-24T18:21:01Z","timestamp":1782325261423,"version":"3.54.5"},"reference-count":275,"publisher":"Springer Science and Business Media LLC","issue":"11","license":[{"start":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T00:00:00Z","timestamp":1755820800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T00:00:00Z","timestamp":1755820800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Artif Intell Rev"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The concept of rationality is central to the field of artificial intelligence (AI). Whether we are seeking to simulate human reasoning, or trying to achieve bounded optimality, our goal is generally to make artificial agents as rational as possible. Despite the centrality of the concept within AI, there is no unified definition of what constitutes a rational agent. This article provides a survey of rationality and irrationality in AI, and sets out the open questions in this area. We consider how the understanding of rationality in other fields has influenced its conception within AI, in particular work in economics, philosophy and psychology. Focusing on the behaviour of artificial agents, we examine irrational behaviours that can prove to be optimal in certain scenarios. Some methods have been developed to deal with irrational agents, both in terms of identification and interaction, however work in this area remains limited. Methods that have up to now been developed for other purposes, namely adversarial scenarios, may be adapted to suit interactions with artificial agents. We further discuss the interplay between human and artificial agents, and the role that rationality plays within this interaction; many questions remain in this area, relating to potentially irrational behaviour of both humans and artificial agents.<\/jats:p>","DOI":"10.1007\/s10462-025-11341-4","type":"journal-article","created":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T06:44:01Z","timestamp":1755845041000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["(Ir)rationality in AI: state of the art, research challenges and open questions"],"prefix":"10.1007","volume":"58","author":[{"given":"Olivia","family":"Macmillan-Scott","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mirco","family":"Musolesi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,8,22]]},"reference":[{"key":"11341_CR1","doi-asserted-by":"crossref","unstructured":"Abadie A, Carillo K, Fosso\u00a0Wamba S, Badot O (2019) Is Waze joking? Perceived irrationality dynamics in user-robot interactions. In: Proceedings of HICSS-19","DOI":"10.24251\/HICSS.2019.601"},{"issue":"9","key":"11341_CR2","doi-asserted-by":"publisher","first-page":"6569","DOI":"10.1007\/s10489-021-02658-y","volume":"51","author":"A Abate","year":"2021","unstructured":"Abate A, Gutierrez J, Hammond L, Harrenstein P, Kwiatkowska M, Najib M et al (2021) Rational verification: game-theoretic verification of multi-agent systems. Appl Intell 51(9):6569\u20136584","journal-title":"Appl Intell"},{"key":"11341_CR3","doi-asserted-by":"crossref","unstructured":"Abdul-Rahman A, Hailes S (2000) Supporting trust in virtual communities. In: Proceedings of HICSS-00. pp 6007\u20136016","DOI":"10.1109\/HICSS.2000.926814"},{"issue":"1","key":"11341_CR4","doi-asserted-by":"publisher","first-page":"67","DOI":"10.2307\/2525382","volume":"8","author":"SN Afriat","year":"1967","unstructured":"Afriat SN (1967) The construction of utility functions from expenditure data. Int Econ Rev 8(1):67\u201377","journal-title":"Int Econ Rev"},{"key":"11341_CR5","unstructured":"Albrecht SV, Stone P (2017) Reasoning about hypothetical agent behaviours and their parameters. In: Proceedings of AAMAS-17. pp 547\u2013555"},{"key":"11341_CR6","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1016\/j.artint.2018.01.002","volume":"258","author":"SV Albrecht","year":"2018","unstructured":"Albrecht SV, Stone P (2018) Autonomous agents modelling other agents: a comprehensive survey and open problems. Artif Intell 258:66\u201395","journal-title":"Artif Intell"},{"key":"11341_CR7","doi-asserted-by":"crossref","unstructured":"Amershi S, Weld D, Vorvoreanu M, Fourney A, Nushi B, Collisson P et al (2019) Guidelines for human-AI interaction. In: Proceedings of CHI-19, New York, USA. pp 1\u201313","DOI":"10.1145\/3290605.3300233"},{"key":"11341_CR8","unstructured":"Amodei D, Olah C, Steinhardt J, Christiano P, Schulman J, Man\u00e9 D (2016) Concrete problems in AI safety. arXiv: 1606.06565"},{"key":"11341_CR9","unstructured":"Armstrong S, Mindermann S (2018) Occam\u2019s razor is insufficient to infer the preferences of irrational agents. In: Proceedings of NeurIPS-18"},{"issue":"102","key":"11341_CR10","doi-asserted-by":"publisher","first-page":"121","DOI":"10.2307\/2550390","volume":"26","author":"KJ Arrow","year":"1959","unstructured":"Arrow KJ (1959) Rational choice functions and orderings. Economica 26(102):121\u2013127","journal-title":"Economica"},{"issue":"2","key":"11341_CR11","first-page":"406","volume":"84","author":"WB Arthur","year":"1994","unstructured":"Arthur WB (1994) Inductive reasoning and bounded rationality. Am Econ Rev 84(2):406\u2013411","journal-title":"Am Econ Rev"},{"key":"11341_CR12","doi-asserted-by":"publisher","DOI":"10.1093\/acprof:oso\/9780195158427.001.0001","volume-title":"The architecture of reason: the structure and substance of rationality","author":"R Audi","year":"2002","unstructured":"Audi R (2002) The architecture of reason: the structure and substance of rationality. Oxford University Press, New York"},{"key":"11341_CR13","doi-asserted-by":"crossref","unstructured":"Avrahami-Zilberbrand D, Kaminka GA (2014) Keyhole adversarial plan recognition for recognition of suspicious and anomalous behavior. In: Sukthankar G, Geib C, Bui HH, Pynadath DV, Goldman RP (eds) Plan, activity, and intent recognition: theory and practice. Morgan Kaufmann, Boston, pp 87\u2013119","DOI":"10.1016\/B978-0-12-398532-3.00004-X"},{"key":"11341_CR14","doi-asserted-by":"crossref","unstructured":"Azaria A (2022) Irrational, but adaptive and goal oriented: humans interacting with autonomous agents. In: Raedt LD (ed) Proceedings of IJCAI-22. pp 5798\u20135802","DOI":"10.24963\/ijcai.2022\/813"},{"key":"11341_CR15","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1002\/bdm.676","volume":"23","author":"G Barron","year":"2010","unstructured":"Barron G, Leider S (2010) The role of experience in the gambler\u2019s fallacy. J Behav Decis Mak 23:117\u2013129","journal-title":"J Behav Decis Mak"},{"key":"11341_CR16","unstructured":"Battaly H, Slote M (2015) Virtue epistemology and virtue ethics. In: Besser-Jones L, Slote M (eds) The Routledge companion to virtue ethics (chapter 19). Routledge"},{"issue":"5","key":"11341_CR17","doi-asserted-by":"publisher","first-page":"1289","DOI":"10.1287\/opre.1070.0485","volume":"56","author":"M Baucells","year":"2008","unstructured":"Baucells M, Carrasco JA, Hogarth RM (2008) Cumulative dominance and heuristic performance in binary multiattribute choice. Oper Res 56(5):1289\u20131304","journal-title":"Oper Res"},{"key":"11341_CR18","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/14207.001.0001","volume-title":"Distributional reinforcement learning","author":"MG Bellemare","year":"2023","unstructured":"Bellemare MG, Dabney W, Rowland M (2023) Distributional reinforcement learning. The MIT Press, Cambridge"},{"key":"11341_CR19","unstructured":"Bengio Y (2019) The consciousness prior. arXiv: 1709.08568"},{"issue":"9","key":"11341_CR20","doi-asserted-by":"publisher","first-page":"537","DOI":"10.1080\/01691864.2021.1894233","volume":"35","author":"C Benn","year":"2021","unstructured":"Benn C, Grastien A (2021) Reducing moral ambiguity in partially observed human-robot interactions. Adv Robot 35(9):537\u2013552","journal-title":"Adv Robot"},{"key":"11341_CR21","doi-asserted-by":"crossref","unstructured":"Bertolero MA, Bassett DS (2020) Deep neural networks carve the brain at its joints. arXiv: 2002.08891","DOI":"10.1101\/2020.02.20.958082"},{"key":"11341_CR22","doi-asserted-by":"crossref","unstructured":"Besold TR (2013) Rationality in|for|through AI. In: Kelemen J, Romportl J, Zackova E (eds) Beyond artificial intelligence: contemplations, expectations, applications. Springer, Berlin\/Heidelberg, pp 49\u201362","DOI":"10.1007\/978-3-642-34422-0_3"},{"issue":"2","key":"11341_CR23","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1080\/0952813X.2018.1430860","volume":"30","author":"TR Besold","year":"2018","unstructured":"Besold TR, Uckelman SL (2018) Normative and descriptive rationality: from nature to artifice and back. J Exp Theor Artif Intell 30(2):331\u2013344","journal-title":"J Exp Theor Artif Intell"},{"key":"11341_CR24","doi-asserted-by":"crossref","unstructured":"Besold TR, d\u2019Avila Garcez A, Bader S, Bowman H, Domingos P, Hitzler P et al (2022) Neural-symbolic learning and reasoning: a survey and interpretation. In: Hitzler P, Sarker MK (eds) Neuro-symbolic artificial intelligence: the state of the art (chapter\u00a01). IOS Press","DOI":"10.3233\/FAIA210348"},{"key":"11341_CR25","doi-asserted-by":"crossref","unstructured":"Binns R, Van\u00a0Kleek M, Veale M, Lyngs U, Zhao J, Shadbolt N (2018) \u2019It\u2019s Reducing a Human Being to a Percentage\u2019: perceptions of justice in algorithmic decisions. In: Proceedings of CHI-18. pp 1\u201314","DOI":"10.1145\/3173574.3173951"},{"issue":"6","key":"11341_CR26","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.2218523120","volume":"120","author":"M Binz","year":"2023","unstructured":"Binz M, Schulz E (2023) Using cognitive psychology to understand GPT-3. Proc Natl Acad Sci 120(6):e2218523120","journal-title":"Proc Natl Acad Sci"},{"key":"11341_CR27","doi-asserted-by":"publisher","first-page":"659","DOI":"10.1613\/jair.4818","volume":"53","author":"D Bloembergen","year":"2015","unstructured":"Bloembergen D, Tuyls K, Hennes D, Kaisers M (2015) Evolutionary dynamics of multi-agent learning: a survey. J Artif Intell Res 53:659\u2013697","journal-title":"J Artif Intell Res"},{"key":"11341_CR28","unstructured":"Bolukbasi T, Chang K-W, Zou J, Saligrama V, Kalai A (2016) Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In: Proceedings of NeurIPS-16. pp 4356\u20134364"},{"key":"11341_CR29","volume-title":"An investigation of the laws of thought: on which are founded the mathematical theories of logic and probabilities","author":"G Boole","year":"1854","unstructured":"Boole G (1854) An investigation of the laws of thought: on which are founded the mathematical theories of logic and probabilities. Walton and Maberly, London"},{"key":"11341_CR30","doi-asserted-by":"crossref","unstructured":"Bortolotti L (2013) Rationality and sanity: the role of rationality judgments in understanding psychiatric disorders. In: Fulford KWM, Davies M, Gipps R, Graham G, Sadler JZ,\u00a0Stanghellini G, Thornton T (eds) The Oxford handbook of philosophy and psychiatry. Oxford University Press","DOI":"10.1093\/oxfordhb\/9780199579563.013.0030"},{"issue":"2","key":"11341_CR31","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1016\/S0004-3702(02)00121-2","volume":"136","author":"M Bowling","year":"2002","unstructured":"Bowling M, Veloso M (2002) Multiagent learning using a variable learning rate. Artif Intell 136(2):215\u2013250","journal-title":"Artif Intell"},{"key":"11341_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/BF01199986","volume":"3","author":"W Brian Arthur","year":"1993","unstructured":"Brian Arthur W (1993) On designing economic agents that behave like human agents. J Evol Econ 3:1\u201322","journal-title":"J Evol Econ"},{"key":"11341_CR33","unstructured":"Briggs RA (2023) Normative theories of rational choice: expected utility. In: Zalta EN, Nodelman U (eds) The Stanford encyclopedia of philosophy, Fall 2023 edn. Metaphysics Research Lab, Stanford University"},{"key":"11341_CR34","unstructured":"Brighton H (2006) Robust inference with simple cognitive models. In: Proceedings of the 2006 AAAI Spring symposium. pp 17\u201322"},{"issue":"1","key":"11341_CR35","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1016\/0004-3702(91)90053-M","volume":"47","author":"RA Brooks","year":"1991","unstructured":"Brooks RA (1991) Intelligence without representation. Artif Intell 47(1):139\u2013159","journal-title":"Artif Intell"},{"key":"11341_CR36","unstructured":"Brown GW (1951) Iterative solution of games by fictitious play. In: Koopmans TC (ed) Activity analysis of production and allocation. Wiley, pp 374\u2013376"},{"issue":"1","key":"11341_CR37","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1353\/hph.1988.0021","volume":"26","author":"C Brown","year":"1988","unstructured":"Brown C (1988) Is hume an internalist? J Hist Philos 26(1):69\u201387","journal-title":"J Hist Philos"},{"key":"11341_CR38","unstructured":"Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P et al (2020) Language models are few-shot learners. In: Larochelle H, Ranzato M, Hadsell R, Balcan M, Lin H (eds) Proceedings of NeurIPS-20. Curran Associates, Inc, pp 1877\u20131901"},{"key":"11341_CR39","unstructured":"Bubeck S, Chandrasekaran V, Eldan R, Gehrke J, Horvitz E, Kamar E et al (2023) Sparks of artificial general intelligence: early experiments with GPT-4. arXiv: 2303.12712"},{"key":"11341_CR40","unstructured":"Buckmann M, \u015eimsek \u00d6 (2017) Decision heuristics for comparison: how good are they? In: Guy TV, K\u00e1rn\u00fd M, Rios-Insua D, Wolpert DH (eds) Proceedings of the NeurIPS-16 workshop on imperfect decision makers. pp 1\u201311"},{"key":"11341_CR41","unstructured":"Buolamwini J, Gebru T (2018) Gender shades: intersectional accuracy disparities in commercial gender classification. In: Friedler SA, Wilson C (eds) Proceedings of FAccT-18. pp 77\u201391"},{"issue":"6334","key":"11341_CR42","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1126\/science.aal4230","volume":"356","author":"A Caliskan","year":"2017","unstructured":"Caliskan A, Bryson JJ, Narayanan A (2017) Semantics derived automatically from language corpora contain human-like biases. Science 356(6334):183\u2013186","journal-title":"Science"},{"issue":"5","key":"11341_CR43","doi-asserted-by":"publisher","first-page":"1159","DOI":"10.1007\/s11225-021-09945-2","volume":"109","author":"CS Calude","year":"2021","unstructured":"Calude CS (2021) Incompleteness and the halting problem. Stud Log 109(5):1159\u20131169","journal-title":"Stud Log"},{"issue":"2","key":"11341_CR44","doi-asserted-by":"publisher","first-page":"13","DOI":"10.31820\/ejap.16.2.1","volume":"16","author":"V Cardella","year":"2020","unstructured":"Cardella V (2020) Rationality in mental disorders: too little or too much? Eur J Anal Philos 16(2):13\u201336","journal-title":"Eur J Anal Philos"},{"key":"11341_CR45","unstructured":"Carroll M, Foote D, Siththaranjan A, Russell S, Dragan A (2024) AI alignment with changing and influenceable reward functions. In: Proceedings of icml\u201924. JMLR.org"},{"key":"11341_CR46","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1007\/s10790-006-7525-2","volume":"39","author":"EM Cave","year":"2005","unstructured":"Cave EM (2005) A normative interpretation of expected utility theory. J Value Inquiry 39:431","journal-title":"J Value Inquiry"},{"issue":"2","key":"11341_CR47","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1007\/s10458-013-9222-4","volume":"28","author":"D Chakraborty","year":"2014","unstructured":"Chakraborty D, Stone P (2014) Multiagent learning in the presence of memory-bounded agents. Auton Agent Multi-Agent Syst 28(2):182\u2013213","journal-title":"Auton Agent Multi-Agent Syst"},{"key":"11341_CR48","unstructured":"Chan L, Critch A, Dragan A (2021) Human irrationality: both bad and good for reward inference. arXiv: 2111.06956"},{"key":"11341_CR49","unstructured":"Charpentier A, Elie R, Remlinger C (2020) Reinforcement learning in economics and finance. arXiv: 2003.10014"},{"issue":"6","key":"11341_CR50","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1016\/S1364-6613(98)01179-6","volume":"2","author":"VM Chase","year":"1998","unstructured":"Chase VM, Hertwig R, Gigerenzer G (1998) Visions of rationality. Trends Cogn Sci 2(6):206\u2013214","journal-title":"Trends Cogn Sci"},{"key":"11341_CR51","unstructured":"Chen H, Chang HJ, Howes A (2020) Implications of human irrationality for reinforcement learning. arXiv: 2006.04072"},{"key":"11341_CR52","doi-asserted-by":"crossref","unstructured":"Chen H, Chang HJ, Howes A (2021) Apparently irrational choice as optimal sequential decision making. In: Proceedings of AAAI-21. pp 792\u2013800","DOI":"10.1609\/aaai.v35i1.16161"},{"key":"11341_CR53","doi-asserted-by":"publisher","first-page":"27755","DOI":"10.1038\/srep27755","volume":"6","author":"R Cichy","year":"2016","unstructured":"Cichy R, Khosla A, Pantazis D, Torralba A, Oliva A (2016) Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence. Sci Rep 6:27755","journal-title":"Sci Rep"},{"issue":"12","key":"11341_CR54","doi-asserted-by":"publisher","first-page":"1521","DOI":"10.1287\/mnsc.39.12.1521","volume":"39","author":"CT Clotfelter","year":"1993","unstructured":"Clotfelter CT, Cook PJ (1993) Notes: the \u201cGambler\u2019s Fallacy\u2019\u2019 in lottery play. Manag Sci 39(12):1521\u20131525","journal-title":"Manag Sci"},{"issue":"3","key":"11341_CR55","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1017\/S0140525X00009092","volume":"4","author":"LJ Cohen","year":"1981","unstructured":"Cohen LJ (1981) Can human irrationality be experimentally demonstrated? Behav Brain Sci 4(3):317\u2013331","journal-title":"Behav Brain Sci"},{"issue":"2","key":"11341_CR56","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1353\/hms.2011.0421","volume":"18","author":"D Coleman","year":"1992","unstructured":"Coleman D (1992) Hume\u2019s internalism. Hume Stud 18(2):331\u2013347","journal-title":"Hume Stud"},{"issue":"1","key":"11341_CR57","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1109\/TBIOM.2019.2897801","volume":"1","author":"CM Cook","year":"2019","unstructured":"Cook CM, Howard JJ, Sirotin YB, Tipton JL, Vemury AR (2019) Demographic effects in facial recognition and their dependence on image acquisition: an evaluation of eleven commercial systems. IEEE Trans Biom Behav Identity Sci 1(1):32\u201341","journal-title":"IEEE Trans Biom Behav Identity Sci"},{"key":"11341_CR58","unstructured":"Dafoe A, Hughes E, Bachrach Y, Collins T, McKee KR, Leibo JZ et al (2020) Open problems in cooperative AI. arXiv: 2012.08630"},{"issue":"2","key":"11341_CR59","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1145\/1461928.1461951","volume":"52","author":"C Daskalakis","year":"2009","unstructured":"Daskalakis C, Goldberg PW, Papadimitriou CH (2009) The complexity of computing a Nash equilibrium. Commun ACM 52(2):89\u201397","journal-title":"Commun ACM"},{"issue":"7","key":"11341_CR60","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1037\/0003-066X.34.7.571","volume":"34","author":"RM Dawes","year":"1979","unstructured":"Dawes RM (1979) The robust beauty of improper linear models in decision making. Am Psychol 34(7):571\u2013582","journal-title":"Am Psychol"},{"issue":"2","key":"11341_CR61","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1037\/h0037613","volume":"81","author":"RM Dawes","year":"1974","unstructured":"Dawes RM, Corrigan B (1974) Linear models in decision making. Psychol Bull 81(2):95\u2013106","journal-title":"Psychol Bull"},{"key":"11341_CR62","doi-asserted-by":"crossref","unstructured":"Dawkins H (2021) Marked attribute bias in natural language inference. In: Proceedings of ACL-IJCNLP-21","DOI":"10.18653\/v1\/2021.findings-acl.369"},{"key":"11341_CR63","doi-asserted-by":"publisher","first-page":"250","DOI":"10.1007\/s10458-015-9317-1","volume":"31","author":"H de Weerd","year":"2017","unstructured":"de Weerd H, Verbrugge R, Verheij B (2017) Negotiating with other minds: the role of recursive theory of mind in negotiation with incomplete information. Auton Agent Multi-Agent Syst 31:250\u2013287","journal-title":"Auton Agent Multi-Agent Syst"},{"key":"11341_CR64","unstructured":"Dennett DC (1981) True believers: the intentional strategy and why it works. In: Haugeland J (ed) Mind design II: philosophy, psychology, and artificial intelligence. The MIT Press"},{"issue":"1","key":"11341_CR65","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1037\/xge0000033","volume":"144","author":"BJ Dietvorst","year":"2015","unstructured":"Dietvorst BJ, Simmons JP, Massey C (2015) Algorithm aversion: people erroneously avoid algorithms after seeing them err. J Exp Psychol 144(1):114\u2013126","journal-title":"J Exp Psychol"},{"issue":"1","key":"11341_CR66","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1006\/jesp.1996.1309","volume":"33","author":"S Donovan","year":"1997","unstructured":"Donovan S, Epstein S (1997) The difficulty of the Linda conjunction problem can be attributed to its simultaneous concrete and unnatural representation, and not to conversational implicature. J Exp Soc Psychol 33(1):1\u201320","journal-title":"J Exp Soc Psychol"},{"key":"11341_CR67","doi-asserted-by":"crossref","unstructured":"Eichhorn C, Kern-Isberner G, Ragni M (2018) Rational inference patterns based on conditional logic. In: Proceedings of the AAAI conference on artificial intelligence, vol 32, no 1","DOI":"10.1609\/aaai.v32i1.11558"},{"issue":"2","key":"11341_CR68","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1016\/0030-5073(75)90044-6","volume":"13","author":"HJ Einhorn","year":"1975","unstructured":"Einhorn HJ, Hogarth RM (1975) Unit weighting schemes for decision making. Organ Behav Hum Perform 13(2):171\u2013192","journal-title":"Organ Behav Hum Perform"},{"key":"11341_CR69","unstructured":"Eloundou T, Manning S, Mishkin P, Rock D (2023) GPTs are GPTs: an early look at the labor market impact potential of large language models. arXiv: 2303.10130"},{"issue":"4","key":"11341_CR70","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1162\/rest_a_01093","volume":"105","author":"B Enke","year":"2023","unstructured":"Enke B, Gneezy U, Hall B, Martin D, Nelidov V, Offerman T, van de Ven J (2023) Cognitive biases: mistakes or missing stakes? Rev Econ Stat 105(4):1\u201315","journal-title":"Rev Econ Stat"},{"key":"11341_CR71","volume-title":"Rationality and reasoning","author":"JSBT Evans","year":"1996","unstructured":"Evans JSBT, Over DE (1996) Rationality and reasoning. Taylor & Francis, Oxford"},{"key":"11341_CR72","unstructured":"Evans O, Stuhlmueller A, Goodman N (2015a) Learning the preferences of bounded agents. In: Proceedings of the NeurIPS-15 workshop on bounded optimality"},{"key":"11341_CR73","doi-asserted-by":"crossref","unstructured":"Evans O, Stuhlmueller A, Goodman N (2015b) Learning the preferences of ignorant, inconsistent agents. In: Proceedings of AAAI-15","DOI":"10.1609\/aaai.v30i1.10010"},{"issue":"438","key":"11341_CR74","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1093\/mind\/110.438.335","volume":"110","author":"SJ Evnine","year":"2001","unstructured":"Evnine SJ (2001) The universality of logic: on the connection between rationality and logical ability. Mind 110(438):335\u2013367","journal-title":"Mind"},{"key":"11341_CR75","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1023\/A:1023002625641","volume":"112","author":"K Farkas","year":"2003","unstructured":"Farkas K (2003) What is externalism? Philos Stud 112:187\u2013208","journal-title":"Philos Stud"},{"key":"11341_CR76","doi-asserted-by":"crossref","unstructured":"Fern\u00e0ndez F, Veloso M (2006) Probabilistic policy reuse in a reinforcement learning agent. In: Proceedings of AAMAS-06, New York, NY, USA. pp 720\u2013727","DOI":"10.1145\/1160633.1160762"},{"key":"11341_CR77","doi-asserted-by":"publisher","first-page":"0158","DOI":"10.1038\/s41562-017-0158","volume":"1","author":"CD Fiorillo","year":"2017","unstructured":"Fiorillo CD (2017) Neuroscience: rationality, uncertainty, dopamine. Nat Hum Behav 1:0158","journal-title":"Nat Hum Behav"},{"issue":"43","key":"11341_CR78","doi-asserted-by":"publisher","first-page":"26562","DOI":"10.1073\/pnas.1905334117","volume":"117","author":"C Firestone","year":"2020","unstructured":"Firestone C (2020) Performance vs. competence in human-machine comparisons. Proc Natl Acad Sci 117(43):26562\u201326571","journal-title":"Proc Natl Acad Sci"},{"key":"11341_CR79","unstructured":"Foerster JN, Chen RY, Al-Shedivat M, Whiteson S, Abbeel P, Mordatch I (2018) Learning with opponent-learning awareness. In: Proceedings of AAMAS-18"},{"key":"11341_CR80","doi-asserted-by":"publisher","DOI":"10.4159\/harvard.9780674334236","volume-title":"The theory of epistemic rationality","author":"R Foley","year":"1987","unstructured":"Foley R (1987) The theory of epistemic rationality. Harvard University Press, Cambridge"},{"key":"11341_CR81","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1038\/nrn2787","volume":"11","author":"K Friston","year":"2010","unstructured":"Friston K (2010) The free-energy principle: a unified brain theory? Nat Rev Neurosci 11:127\u2013138","journal-title":"Nat Rev Neurosci"},{"key":"11341_CR82","unstructured":"F\u00fcrnkranz J (2001) Machine learning in games: a survey. In: F\u00fcrnkranz J, Kubat M (eds) Machines that learn to play games. Nova Science Publishers, pp 11\u201359"},{"key":"11341_CR83","unstructured":"Gan J, Guo Q, Tran-Thanh L, An B, Wooldridge M (2019) Manipulating a learning defender and ways to counteract. In: Wallach H, Larochelle H, Beygelzimer A, d\u2019 Alch\u00e9-Buc F, Fox E, Garnett R (eds) Proceedings of NeurIPS-19. Curran Associates, Inc"},{"key":"11341_CR84","doi-asserted-by":"crossref","unstructured":"Ganzfried S (2023) Safe equilibrium. In: Proceedings of the 62nd IEEE conference on decision and control (CDC), pp 5230\u20135236","DOI":"10.1109\/CDC49753.2023.10383525"},{"key":"11341_CR85","unstructured":"Ganzfried S, Sandholm T (2011) Game theory-based opponent modeling in large imperfect-information games. In: Proceedings of AAMAS-11"},{"issue":"6245","key":"11341_CR86","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1126\/science.aac6076","volume":"349","author":"SJ Gershman","year":"2015","unstructured":"Gershman SJ, Horvitz EJ, Tenenbaum JB (2015) Computational rationality: a converging paradigm for intelligence in brains, minds, and machines. Science 349(6245):273\u2013278","journal-title":"Science"},{"key":"11341_CR87","doi-asserted-by":"crossref","unstructured":"Ghosal GR, Zurek M, Brown DS, Dragan AD (2023) The effect of modeling human rationality level on learning rewards from multiple feedback types. In: Proceedings of AAAI-23","DOI":"10.1609\/aaai.v37i5.25740"},{"key":"11341_CR88","unstructured":"Gigerenzer G (1993) The bounded rationality of probabilistic mental models. In: Manktelow KI, Over DE (eds) Rationality: psychological and philosophical perspectives. Taylor & Frances\/Routledge, pp 284\u2013313"},{"key":"11341_CR89","doi-asserted-by":"crossref","unstructured":"Gigerenzer G (2020) What is bounded rationality? In: Viale R (ed) Routledge handbook of bounded rationality (chapter\u00a02). Routledge","DOI":"10.4324\/9781315658353-2"},{"key":"11341_CR90","first-page":"3304","volume":"5","author":"G Gigerenzer","year":"2001","unstructured":"Gigerenzer G (2001) Decision making: nonrational theories. Int Encycl Soc Behav Sci 5:3304\u20133309","journal-title":"Int Encycl Soc Behav Sci"},{"issue":"1","key":"11341_CR91","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1111\/j.1756-8765.2008.01006.x","volume":"1","author":"G Gigerenzer","year":"2009","unstructured":"Gigerenzer G, Brighton H (2009) Homo heuristicus: why biased minds make better inferences. Top Cogn Sci 1(1):107\u2013143","journal-title":"Top Cogn Sci"},{"issue":"4","key":"11341_CR92","doi-asserted-by":"publisher","first-page":"650","DOI":"10.1037\/0033-295X.103.4.650","volume":"103","author":"G Gigerenzer","year":"1996","unstructured":"Gigerenzer G, Goldstein D (1996) Reasoning the fast and frugal way: models of bounded rationality. Psychol Rev 103(4):650\u2013669","journal-title":"Psychol Rev"},{"key":"11341_CR93","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/1654.001.0001","volume-title":"Bounded rationality: the adaptive toolbox","author":"G Gigerenzer","year":"2002","unstructured":"Gigerenzer G, Selten R (2002) Bounded rationality: the adaptive toolbox. The MIT Press, Cambridge"},{"key":"11341_CR94","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511808098","volume-title":"Heuristics and biases: the psychology of intuitive judgment","author":"T Gilovich","year":"2002","unstructured":"Gilovich T, Griffin D, Kahneman D (2002) Heuristics and biases: the psychology of intuitive judgment. Cambridge University Press, Cambridge"},{"issue":"3","key":"11341_CR95","doi-asserted-by":"publisher","first-page":"311","DOI":"10.1016\/S0921-8009(00)00216-0","volume":"35","author":"H Gintis","year":"2000","unstructured":"Gintis H (2000) Beyond homo economicus: evidence from experimental economics. Ecol Econ 35(3):311\u2013322","journal-title":"Ecol Econ"},{"key":"11341_CR96","unstructured":"Glickman M, Sharot T (2022) Biased AI systems produce biased humans. OSF Preprints"},{"key":"11341_CR97","unstructured":"Goldstein S (2024) LLMs can never be ideally rational. PhilPapers"},{"key":"11341_CR98","unstructured":"Gorman R, Armstrong S (2022) The dangers in algorithms learning humans\u2019 values and irrationalities. arXiv: 2202.13985"},{"key":"11341_CR99","doi-asserted-by":"crossref","unstructured":"Goto T, Hatanaka T, Fujita M (2012) Payoff-based inhomogeneous partially irrational play for potential game theoretic cooperative control: convergence analysis. In: Proceedings of ACC-12, pp 2380\u20132387","DOI":"10.1109\/ACC.2012.6314613"},{"issue":"4","key":"11341_CR100","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1177\/1043463110383972","volume":"22","author":"A Grandori","year":"2010","unstructured":"Grandori A (2010) A rational heuristic model of economic decision making. Ration Soc 22(4):477\u2013504","journal-title":"Ration Soc"},{"key":"11341_CR101","unstructured":"Gulati A, Lozano MA, Lepri B, Oliver N (2022) BIASeD: bringing irrationality into automated system design. arXiv: 2210.01122"},{"key":"11341_CR102","doi-asserted-by":"publisher","DOI":"10.1201\/9781315273570","volume-title":"An introduction to neural networks","author":"K Gurney","year":"2018","unstructured":"Gurney K (2018) An introduction to neural networks. CRC Press, London"},{"key":"11341_CR103","doi-asserted-by":"crossref","unstructured":"Gutierrez J, Hammond L, Lin AW, Najib M, Wooldridge M (2021) Rational verification for probabilistic systems. arXiv: 2107.09119","DOI":"10.24963\/kr.2021\/30"},{"issue":"10","key":"11341_CR104","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1038\/s43588-023-00527-x","volume":"3","author":"T Hagendorff","year":"2023","unstructured":"Hagendorff T, Fabi S, Kosinski M (2023) Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT. Nat Comput Sci 3(10):833\u2013838","journal-title":"Nat Comput Sci"},{"issue":"3","key":"11341_CR105","first-page":"247","volume":"105","author":"PJ Hammond","year":"1997","unstructured":"Hammond PJ (1997) Rationality in economics. Rivista Internazionale di Scienze Sociali 105(3):247\u2013288","journal-title":"Rivista Internazionale di Scienze Sociali"},{"key":"11341_CR106","unstructured":"Hammond L, Abate A, Gutierrez J, Wooldridge M (2021) Multi-agent reinforcement learning with temporal logic specifications. In: Proceedings of AAMAS-21. pp 583\u2013592"},{"key":"11341_CR107","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/5758.001.0001","volume-title":"Rationality and logic","author":"R Hanna","year":"2006","unstructured":"Hanna R (2006) Rationality and logic. The MIT Press, Cambridge"},{"key":"11341_CR108","unstructured":"He H, Boyd-Graber J, Kwok K, Daum\u00e9 H III (2016) Opponent modeling in deep reinforcement learning. In: Proceedings of ICML-16, New York, USA. pp 1804\u20131813"},{"key":"11341_CR109","unstructured":"Hendrycks D, Carlini N, Schulman J, Steinhardt J (2022) Unsolved problems in ML safety. arXiv: 2109.13916"},{"issue":"4","key":"11341_CR110","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1023\/A:1008367325700","volume":"9","author":"J Hernandez-Orallo","year":"2000","unstructured":"Hernandez-Orallo J (2000) Beyond the turing test. J Logic Lang Inform 9(4):447\u2013466","journal-title":"J Logic Lang Inform"},{"issue":"1","key":"11341_CR111","doi-asserted-by":"publisher","first-page":"52","DOI":"10.2307\/2548574","volume":"1","author":"JR Hicks","year":"1934","unstructured":"Hicks JR, Allen RGD (1934) A reconsideration of the theory of value. Part I. Economica 1(1):52\u201376","journal-title":"Economica"},{"key":"11341_CR112","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/13373.001.0001","volume-title":"How humans judge machines","author":"CA Hidalgo","year":"2021","unstructured":"Hidalgo CA (2021) How humans judge machines. MIT Press, Cambridge"},{"key":"11341_CR113","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1007\/s11238-006-9000-8","volume":"61","author":"R Hogarth","year":"2006","unstructured":"Hogarth R, Karelaia N (2006) \u201cTake-the-Best\u2019\u2019 and other simple strategies: why and when they work \u201cWell\u2019\u2019 with binary cues. Theor Decis 61:205\u2013249","journal-title":"Theor Decis"},{"key":"11341_CR114","unstructured":"Horvitz EJ (1988) Reasoning about beliefs and actions under computational resource constraints. In: Proceedings of UAI-87, Arlington, Virginia, USA. pp 429\u2013447"},{"issue":"66","key":"11341_CR115","doi-asserted-by":"publisher","first-page":"159","DOI":"10.2307\/2549382","volume":"17","author":"HS Houthakker","year":"1950","unstructured":"Houthakker HS (1950) Revealed preference and the utility function. Economica 17(66):159\u2013174","journal-title":"Economica"},{"issue":"8","key":"11341_CR116","doi-asserted-by":"publisher","DOI":"10.1111\/lnc3.12432","volume":"15","author":"D Hovy","year":"2021","unstructured":"Hovy D, Prabhumoye S (2021) Five sources of bias in natural language processing. Lang Linguist Compass 15(8):e12432","journal-title":"Lang Linguist Compass"},{"issue":"4","key":"11341_CR117","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1037\/a0039996","volume":"123","author":"A Howes","year":"2016","unstructured":"Howes A, Warren PA, Farmer GD, El-Deredy W, Lewis RL (2016) Why contextual preference reversals maximize expected value. Psychol Rev 123(4):368\u2013391","journal-title":"Psychol Rev"},{"key":"11341_CR118","unstructured":"H\u00fcllermeier E, Mohr F, Tornede A, Wever M (2021) Automated machine learning, bounded rationality, and rational metareasoning. arXiv:2109.04744"},{"key":"11341_CR119","doi-asserted-by":"crossref","unstructured":"Hunicke R (2005) The case for dynamic difficulty adjustment in games. In: Proceedings of ACE-05. pp 429\u2013433","DOI":"10.1145\/1178477.1178573"},{"issue":"517","key":"11341_CR120","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1093\/mind\/fzz065","volume":"130","author":"T Icard","year":"2021","unstructured":"Icard T (2021) Why be random? Mind 130(517):111\u2013139","journal-title":"Mind"},{"key":"11341_CR121","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1007\/s12530-010-9008-8","volume":"1","author":"J Iglesias","year":"2010","unstructured":"Iglesias J, Angelov P, Ledezma Espino A, Sanchis de Miguel A (2010) Evolving classification of agents\u2019 behaviors: a general approach. Evol Syst 1:161\u2013171","journal-title":"Evol Syst"},{"key":"11341_CR122","doi-asserted-by":"crossref","unstructured":"Jiang B, Xie Y, Wang X, Yuan Y, Hao Z, Bai X et al (2025) Towards rationality in language and multimodal agents: a survey. arXiv: 2406.00252","DOI":"10.18653\/v1\/2025.naacl-long.186"},{"key":"11341_CR123","doi-asserted-by":"publisher","first-page":"1726","DOI":"10.3389\/fpsyg.2017.01726","volume":"8","author":"KM Jozwik","year":"2017","unstructured":"Jozwik KM, Kriegeskorte N, Storrs KR, Mur M (2017) Deep convolutional neural networks outperform feature-based but not categorical models in explaining object similarity judgments. Front Psychol 8:1726","journal-title":"Front Psychol"},{"issue":"1","key":"11341_CR124","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1613\/jair.301","volume":"4","author":"LP Kaelbling","year":"1996","unstructured":"Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4(1):237\u2013285","journal-title":"J Artif Intell Res"},{"issue":"1","key":"11341_CR125","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1038\/scientificamerican0182-160","volume":"246","author":"D Kahneman","year":"1982","unstructured":"Kahneman D, Tversky A (1982) The psychology of preferences. Sci Am 246(1):160\u2013173","journal-title":"Sci Am"},{"issue":"1","key":"11341_CR126","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1257\/jep.5.1.193","volume":"5","author":"D Kahneman","year":"1991","unstructured":"Kahneman D, Knetsch JL, Thaler RH (1991) Anomalies: the endowment effect, loss aversion, and status quo bias. J Econ Perspect 5(1):193\u2013206","journal-title":"J Econ Perspect"},{"key":"11341_CR127","doi-asserted-by":"publisher","DOI":"10.1016\/j.lindif.2023.102274","volume":"103","author":"E Kasneci","year":"2023","unstructured":"Kasneci E, Sessler K, K\u00fcchemann S, Bannert M, Dementieva D, Fischer F et al (2023) ChatGPT for good? On opportunities and challenges of large language models for education. Learn Individ Differ 103:102274","journal-title":"Learn Individ Differ"},{"issue":"1","key":"11341_CR128","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1287\/deca.1100.0191","volume":"8","author":"KV Katsikopoulos","year":"2011","unstructured":"Katsikopoulos KV (2011) Psychological heuristics for making inferences: definition, performance, and the emerging theory and practice. Decis Anal 8(1):10\u201329","journal-title":"Decis Anal"},{"issue":"5","key":"11341_CR129","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1016\/j.jmp.2006.06.001","volume":"50","author":"KV Katsikopoulos","year":"2006","unstructured":"Katsikopoulos KV, Martignon L (2006) Na\u00efve heuristics for paired comparisons: some results on their relative accuracy. J Math Psychol 50(5):488\u2013494","journal-title":"J Math Psychol"},{"key":"11341_CR130","unstructured":"Kim DK, Liu M, Riemer MD, Sun C, Abdulhai M, Habibi G et al (2021) A policy gradient algorithm for learning to learn in multiagent reinforcement learning. In: Proceedings of ICML-21. pp 5541\u20135550"},{"issue":"1","key":"11341_CR131","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1287\/opre.33.1.38","volume":"33","author":"CW Kirkwood","year":"1985","unstructured":"Kirkwood CW, Sarin RK (1985) Ranking with partial information: a method and an application. Oper Res 33(1):38\u201348","journal-title":"Oper Res"},{"key":"11341_CR132","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1098\/rstb.2009.0194","volume":"365","author":"A Kirman","year":"2010","unstructured":"Kirman A, Livet P, Teschl M (2010) Rationality and emotions. Philos Trans R Soc 365:215\u2013219","journal-title":"Philos Trans R Soc"},{"key":"11341_CR133","doi-asserted-by":"crossref","unstructured":"Knauff M, Spohn W (2021a) The handbook of rationality. The MIT Press, Cambridge","DOI":"10.7551\/mitpress\/11252.001.0001"},{"key":"11341_CR134","doi-asserted-by":"crossref","unstructured":"Knauff M, Spohn W (2021b) Psychological and philosophical frameworks of rationality\u2014a systematic introduction. The handbook of rationality. The MIT Press, Cambridge","DOI":"10.7551\/mitpress\/11252.003.0004"},{"issue":"4","key":"11341_CR135","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1016\/0004-3702(75)90019-3","volume":"6","author":"DE Knuth","year":"1975","unstructured":"Knuth DE, Moore RW (1975) An analysis of alpha-beta pruning. Artif Intell 6(4):293\u2013326","journal-title":"Artif Intell"},{"key":"11341_CR136","unstructured":"Kobayashi S, Hashimoto T (2007) Analysis of random agents for improving market liquidity using artificial stock market. In: Proceedings of ESSA-07"},{"issue":"4","key":"11341_CR137","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pcbi.1004896","volume":"12","author":"J Kubilius","year":"2016","unstructured":"Kubilius J, Bracci S, Op de Beeck HP (2016) Deep neural networks as a computational model for human shape sensitivity. PLoS Comput Biol 12(4):1\u201326","journal-title":"PLoS Comput Biol"},{"key":"11341_CR138","doi-asserted-by":"crossref","unstructured":"Kwon M, Biyik E, Talati A, Bhasin K, Losey DP, Sadigh D (2020) When humans aren\u2019t optimal: robots that collaborate with risk-aware humans. In: Proceedings of HRI-20","DOI":"10.1145\/3319502.3374832"},{"key":"11341_CR139","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.inffus.2022.03.003","volume":"85","author":"P Ladosz","year":"2022","unstructured":"Ladosz P, Weng L, Kim M, Oh H (2022) Exploration in deep reinforcement learning: a survey. Inf Fusion 85:1\u201322","journal-title":"Inf Fusion"},{"key":"11341_CR140","unstructured":"Laidlaw C, Dragan A (2022) The Boltzmann policy distribution: accounting for systematic suboptimality in human models. In: Proceedings of ICLR-22"},{"key":"11341_CR141","unstructured":"Lampinen AK (2023) Can language models handle recursively nested grammatical structures? A case study on comparing models and humans. arXiv: 2210.15303"},{"key":"11341_CR142","unstructured":"Lee C (2021) The game of go: bounded rationality and artificial intelligence. In: Kiel L, Elliott E (eds) Complex systems in the social and behavioral sciences: theory, method and application. University of Michigan Press, pp 157\u2013180"},{"key":"11341_CR143","unstructured":"Leech J (2015) Logic and the laws of thought. Philosophers\u2019 Imprint, 15"},{"key":"11341_CR144","unstructured":"Letcher A, Foerster J, Balduzzi D, Rockt\u00e4schel T, Whiteson S (2019) Stable opponent shaping in differentiable games. In: Proceedings of ICLR-19"},{"issue":"2","key":"11341_CR145","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1111\/tops.12086","volume":"6","author":"RL Lewis","year":"2014","unstructured":"Lewis RL, Howes A, Singh S (2014) Computational rationality: linking mechanism and behavior through bounded utility maximization. Top Cogn Sci 6(2):279\u2013311","journal-title":"Top Cogn Sci"},{"key":"11341_CR146","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-01545-8","volume-title":"Essentials of game theory: a concise multidisciplinary introduction","author":"K Leyton-Brown","year":"2008","unstructured":"Leyton-Brown K, Shoham Y (2008) Essentials of game theory: a concise multidisciplinary introduction, vol 2. Morgan & Claypool Publishers, Williston"},{"key":"11341_CR147","unstructured":"Li L, Faisal AA (2023) Adversarial distributional reinforcement learning against extrapolated generalization. In: Proceedings of EWRL-23"},{"key":"11341_CR148","doi-asserted-by":"crossref","unstructured":"Li S, Wu Y, Cui X, Dong H, Fang F, Russell S (2019) Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient. In: Proceedings of AAAI-19. pp 4213\u20134220","DOI":"10.1609\/aaai.v33i01.33014213"},{"issue":"6","key":"11341_CR149","doi-asserted-by":"publisher","first-page":"e333","DOI":"10.1016\/S2589-7500(23)00083-3","volume":"5","author":"H Li","year":"2023","unstructured":"Li H, Moon JT, Purkayastha S, Celi LA, Trivedi H, Gichoya JW (2023) Ethics of large language models in medicine and medical research. Lancet Digit Health 5(6):e333\u2013e335","journal-title":"Lancet Digit Health"},{"key":"11341_CR150","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1038\/nature14540","volume":"521","author":"M Littman","year":"2015","unstructured":"Littman M (2015) Reinforcement learning improves behaviour from evaluative feedback. Nature 521:445\u201351","journal-title":"Nature"},{"key":"11341_CR151","unstructured":"Lu C, Willi T, de Witt CS, Foerster J (2022) Model-free opponent shaping. In: Proceedings of ICML-22"},{"issue":"137","key":"11341_CR152","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1017\/S0031819100057983","volume":"36","author":"JR Lucas","year":"1961","unstructured":"Lucas JR (1961) Minds, machines and g\u00f6del. Philosophy 36(137):112\u2013127","journal-title":"Philosophy"},{"key":"11341_CR153","volume-title":"Individual choice behavior: a theoretical analysis","author":"RD Luce","year":"1959","unstructured":"Luce RD (1959) Individual choice behavior: a theoretical analysis. Wiley, New Jersey"},{"key":"11341_CR154","doi-asserted-by":"publisher","DOI":"10.1098\/rsos.240255","volume":"11","author":"O Macmillan-Scott","year":"2024","unstructured":"Macmillan-Scott O, Musolesi M (2024) (Ir)rationality and cognitive biases in large language models. R Soc Open Sci 11:240255","journal-title":"R Soc Open Sci"},{"key":"11341_CR155","doi-asserted-by":"publisher","first-page":"3910","DOI":"10.1038\/s41598-021-83182-4","volume":"11","author":"N Manome","year":"2021","unstructured":"Manome N, Shinohara S, Takahashi T, Chen Y, Chung U-I (2021) Self-incremental learning vector quantization with human cognitive biases. Sci Rep 11:3910","journal-title":"Sci Rep"},{"key":"11341_CR156","unstructured":"Mao W, Gratch J (2004) A utility-based approach to intention recognition. In: Proceedings of AAMAS-04"},{"key":"11341_CR157","volume-title":"Vision: a computational investigation into the human representation and processing of visual information","author":"D Marr","year":"1982","unstructured":"Marr D (1982) Vision: a computational investigation into the human representation and processing of visual information. W.H Freeman, San Francisco"},{"issue":"1","key":"11341_CR158","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1023\/A:1015516217425","volume":"52","author":"L Martignon","year":"2002","unstructured":"Martignon L, Hoffrage U (2002) Fast, frugal, and fit: simple heuristics for paired comparison. Theor Decis 52(1):29\u201371","journal-title":"Theor Decis"},{"issue":"2","key":"11341_CR159","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1093\/cje\/beq017","volume":"35","author":"N Martins","year":"2010","unstructured":"Martins N (2010) Can neuroscience inform economics? Rationality, emotions and preference formation. Camb J Econ 35(2):251\u2013267","journal-title":"Camb J Econ"},{"key":"11341_CR160","doi-asserted-by":"crossref","unstructured":"Maruyama Y (2020) Rationality, cognitive bias, and artificial intelligence: a structural perspective on quantum cognitive science. In: Proceedings of EPCE-20. pp 172\u2013188","DOI":"10.1007\/978-3-030-49183-3_14"},{"key":"11341_CR161","volume-title":"Microeconomic theory","author":"A Mas-Colell","year":"1995","unstructured":"Mas-Colell A, Whinston M, Green J (1995) Microeconomic theory. Oxford University Press, Oxford"},{"key":"11341_CR162","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2021.103490","volume":"297","author":"P Masters","year":"2021","unstructured":"Masters P, Sardina S (2021) Expecting the unexpected: goal recognition for rational and irrational agents. Artif Intell 297:103490","journal-title":"Artif Intell"},{"issue":"4","key":"11341_CR163","first-page":"12","volume":"27","author":"J McCarthy","year":"2006","unstructured":"McCarthy J, Minsky ML, Rochester N, Shannon CE (2006) A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955. AI Mag 27(4):12","journal-title":"AI Mag"},{"key":"11341_CR164","doi-asserted-by":"crossref","unstructured":"McElfresh DC, Chan L, Doyle K, Sinnott-Armstrong W, Conitzer V, Borg JS, Dickerson JP (2021) Indecision modeling. In: Proceedings of AAAI-21. pp 5975\u20135983","DOI":"10.1609\/aaai.v35i7.16746"},{"issue":"3","key":"11341_CR165","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1016\/S0167-4870(02)00081-8","volume":"23","author":"E McIntosh","year":"2002","unstructured":"McIntosh E, Ryan M (2002) Using discrete choice experiments to derive welfare estimates for the provision of elective surgery: implications of discontinuous preferences. J Econ Psychol 23(3):367\u2013382","journal-title":"J Econ Psychol"},{"issue":"1","key":"11341_CR166","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1109\/TCIAIG.2015.2491611","volume":"9","author":"R Mealing","year":"2017","unstructured":"Mealing R, Shapiro JL (2017) Opponent modelling by expectation-maximisation and sequence prediction in simplified poker. IEEE Trans Comput Intell AI Games 9(1):11\u201324","journal-title":"IEEE Trans Comput Intell AI Games"},{"issue":"1","key":"11341_CR167","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1017\/S0012217300049088","volume":"40","author":"K Meeker","year":"2001","unstructured":"Meeker K (2001) Is Hume\u2019s epistemology internalist or externalist? Dialogue Can Philos Rev 40(1):125\u2013146","journal-title":"Dialogue Can Philos Rev"},{"issue":"7540","key":"11341_CR168","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1038\/nature14236","volume":"518","author":"V Mnih","year":"2015","unstructured":"Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529\u2013533","journal-title":"Nature"},{"key":"11341_CR169","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1613\/jair.1.12889","volume":"73","author":"SB Nashed","year":"2022","unstructured":"Nashed SB, Zilberstein S (2022) A survey of opponent modeling in adversarial domains. J Artif Intell Res 73:277\u2013327","journal-title":"J Artif Intell Res"},{"key":"11341_CR170","unstructured":"Newell A, Simon HA (1961) GPS, a program that simulates human thought. In: Billings H (ed) Lernende automaten. pp 109\u2013124"},{"issue":"3","key":"11341_CR171","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1037\/h0048495","volume":"65","author":"A Newell","year":"1958","unstructured":"Newell A, Shaw JC, Simon HA (1958) Elements of a theory of human problem solving. Psychol Rev 65(3):151\u2013166","journal-title":"Psychol Rev"},{"key":"11341_CR172","unstructured":"Ng AY, Russell S (2000) Algorithms for inverse reinforcement learning. In: Proceedings of ICML-00, San Francisco, CA, USA. pp 663\u2013670"},{"key":"11341_CR173","doi-asserted-by":"publisher","first-page":"932","DOI":"10.1037\/0022-3514.32.5.932","volume":"32","author":"RE Nisbett","year":"1975","unstructured":"Nisbett RE, Borgida E (1975) Attribution and the psychology of prediction. J Pers Soc Psychol 32:932\u2013943","journal-title":"J Pers Soc Psychol"},{"key":"11341_CR174","volume-title":"Human inference: Strategies and shortcomings of social judgment","author":"Richard E. Nisbett","year":"1980","unstructured":"Nisbett RE, Ross L (1980) Human inference: strategies and shortcomings of social judgment. Englewood Cliffs"},{"key":"11341_CR175","doi-asserted-by":"crossref","unstructured":"Noiret S, Lumetzberger J, Kampel M (2021) Bias and fairness in computer vision applications of the criminal justice system. In: Proceedings of SSCI-21. pp 1\u20138","DOI":"10.1109\/SSCI50451.2021.9660177"},{"key":"11341_CR176","doi-asserted-by":"publisher","DOI":"10.1515\/9781400820832","volume-title":"The nature of rationality","author":"R Nozick","year":"1993","unstructured":"Nozick R (1993) The nature of rationality. Princeton University Press, Princeton"},{"key":"11341_CR177","first-page":"110","volume-title":"Rationality: psychological and philosophical perspectives","author":"DP O\u2019Brien","year":"1993","unstructured":"O\u2019Brien DP (1993) Mental logic and irrationality: we can put a man on the moon, so why can\u2019t we solve those logical reasoning problems. In: Manktelow KI, Over DE (eds) Rationality: psychological and philosophical perspectives. Routledge, London, pp 110\u2013135"},{"key":"11341_CR178","doi-asserted-by":"publisher","first-page":"5159","DOI":"10.1038\/s41467-018-07471-9","volume":"9","author":"T O\u2019Connell","year":"2018","unstructured":"O\u2019Connell T, Chun M (2018) Predicting eye movement patterns from fMRI responses to natural scenes. Nat Commun 9:5159","journal-title":"Nat Commun"},{"key":"11341_CR179","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1017\/9781107295490.009","volume-title":"How biology shapes philosophy: new foundations for naturalism","author":"S Okasha","year":"2016","unstructured":"Okasha S (2016) Biology and the theory of rationality. In: Smith DL (ed) How biology shapes philosophy: new foundations for naturalism. Cambridge University Press, Cambridge, pp 161\u2013183"},{"key":"11341_CR180","unstructured":"Olorunleke O, McCalla G (2005) A condensed roadmap of agents-modelling-agents research. In: Proceedings of the IJCAI-05 workshop on modeling other agents from observation"},{"key":"11341_CR181","volume-title":"Weapons of math destruction: how big data increases inequality and threatens democracy","author":"C O\u2019Neil","year":"2016","unstructured":"O\u2019Neil C (2016) Weapons of math destruction: how big data increases inequality and threatens democracy. Penguin Books Limited, New York"},{"key":"11341_CR182","volume-title":"An introduction to game theory","author":"MJ Osborne","year":"2004","unstructured":"Osborne MJ (2004) An introduction to game theory. Oxford University Press, Oxford"},{"issue":"4","key":"11341_CR183","doi-asserted-by":"publisher","first-page":"1100","DOI":"10.1177\/08944393211073169","volume":"41","author":"O Papakyriakopoulos","year":"2023","unstructured":"Papakyriakopoulos O, Mboya AM (2023) Beyond algorithmic bias: a socio-computational interrogation of the google search by image algorithm. Soc Sci Comput Rev 41(4):1100\u20131125","journal-title":"Soc Sci Comput Rev"},{"issue":"6245","key":"11341_CR184","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1126\/science.aaa8403","volume":"349","author":"DC Parkes","year":"2015","unstructured":"Parkes DC, Wellman MP (2015) Economic reasoning and artificial intelligence. Science 349(6245):267\u2013272","journal-title":"Science"},{"issue":"4","key":"11341_CR185","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3337772","volume":"52","author":"J Pawlick","year":"2019","unstructured":"Pawlick J, Colbert E, Zhu Q (2019) A game-theoretic taxonomy and survey of defensive deception for cybersecurity and privacy. ACM Comput Surv 52(4):1\u201328","journal-title":"ACM Comput Surv"},{"key":"11341_CR186","unstructured":"Peng P, Wen Y, Yang Y, Yuan Q, Tang Z, Long H, Wang J (2017) Multiagent bidirectionally-coordinated nets: emergence of human-level coordination in learning to play StarCraft combat games. arXiv: 1703.10069"},{"key":"11341_CR187","doi-asserted-by":"publisher","DOI":"10.1093\/oso\/9780198519737.001.0001","volume-title":"Truth, proof, and insight. The emperor\u2019s new mind: concerning computers, minds, and the laws of physics","author":"R Penrose","year":"1989","unstructured":"Penrose R (1989) Truth, proof, and insight. The emperor\u2019s new mind: concerning computers, minds, and the laws of physics. Oxford University Press, Oxford"},{"key":"11341_CR188","doi-asserted-by":"publisher","first-page":"878","DOI":"10.1016\/j.jbusres.2020.11.006","volume":"129","author":"G Pizzi","year":"2020","unstructured":"Pizzi G, Scarpi D, Pantano E (2020) Artificial intelligence and the new forms of interaction: who has the control when interacting with a chatbot? J Bus Res 129:878\u2013890","journal-title":"J Bus Res"},{"key":"11341_CR189","unstructured":"Polvara R, Patacchiola M, Sharma S, Wan J, Manning A, Sutton R, Cangelosi A (2018) Autonomous quadrotor landing using deep reinforcement learning. arXiv: 1709.03339"},{"key":"11341_CR190","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1038\/s41586-019-1138-y","volume":"568","author":"I Rahwan","year":"2019","unstructured":"Rahwan I, Cebrian M, Obradovich N, Bongard J, Bonnefon J-F, Breazeal C et al (2019) Machine behaviour. Nature 568:477\u2013486","journal-title":"Nature"},{"key":"11341_CR191","unstructured":"Ramachandran D, Amir E (2007) Bayesian inverse reinforcement learning. In: Proceedings of IJCAI-07, San Francisco, CA, USA, pp 2586\u20132591"},{"key":"11341_CR192","unstructured":"Ram\u00edrez M, Geffner H (2011) Goal recognition over POMDPs: inferring the intention of a POMDP agent. In: Proceedings of IJCAI-11. pp 2009\u20132014"},{"key":"11341_CR193","doi-asserted-by":"crossref","unstructured":"Ren Y, Duan J, Li SE, Guan Y, Sun Q (2020) Improving generalization of reinforcement learning with minimax distributional soft actor-critic. arXiv: 2002.05502","DOI":"10.1109\/ITSC45102.2020.9294300"},{"issue":"4","key":"11341_CR194","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0231189","volume":"15","author":"D Rozado","year":"2020","unstructured":"Rozado D (2020) Wide range screening of algorithmic bias in word embedding models using large sentiment lexicons reveals underreported bias types. PLoS ONE 15(4):1\u201326","journal-title":"PLoS ONE"},{"key":"11341_CR195","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/4702.001.0001","volume-title":"Modeling bounded rationality","author":"A Rubinstein","year":"1998","unstructured":"Rubinstein A (1998) Modeling bounded rationality. The MIT Press, Cambridge"},{"issue":"1","key":"11341_CR196","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1016\/S0004-3702(97)00026-X","volume":"94","author":"S Russell","year":"1997","unstructured":"Russell S (1997) Rationality and intelligence. Artif Intell 94(1):57\u201377","journal-title":"Artif Intell"},{"key":"11341_CR197","volume-title":"Fundamental issues of artificial intelligence","author":"S Russell","year":"2016","unstructured":"Russell S (2016) Rationality and intelligence: a brief update. In: M\u00fcller V (ed) Fundamental issues of artificial intelligence. Springer, Cham"},{"key":"11341_CR198","volume-title":"Artificial intelligence: a modern approach","author":"S Russell","year":"2022","unstructured":"Russell S, Norvig P (2022) Artificial intelligence: a modern approach. Pearson Education, London"},{"key":"11341_CR199","unstructured":"Russell S, Subramanian D, Parr R (1993) Provably bounded optimal agents. In: Proceedings of IJCAI-93"},{"issue":"1","key":"11341_CR200","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1080\/135048501750041312","volume":"8","author":"M Ryan","year":"2001","unstructured":"Ryan M, Bate A (2001) Testing the assumptions of rationality, continuity and symmetry when applying discrete choice experiments in health care. Appl Econ Lett 8(1):59\u201363","journal-title":"Appl Econ Lett"},{"issue":"6","key":"11341_CR201","doi-asserted-by":"publisher","first-page":"1153","DOI":"10.1111\/j.1747-9991.2008.00178.x","volume":"3","author":"P Rysiew","year":"2008","unstructured":"Rysiew P (2008) Rationality disputes\u2014psychology and epistemology. Philos Compass 3(6):1153\u20131176","journal-title":"Philos Compass"},{"key":"11341_CR202","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1093\/0195147669.003.0011","volume-title":"Common sense, reasoning and rationality","author":"R Samuels","year":"2002","unstructured":"Samuels R, Stich S, Bishop M (2002) Ending the rationality wars: how to make disputes about human rationality disappear. In: Elio R (ed) Common sense, reasoning and rationality. Oxford University Press, Oxford, pp 236\u2013268"},{"issue":"17","key":"11341_CR203","doi-asserted-by":"publisher","first-page":"61","DOI":"10.2307\/2548836","volume":"5","author":"PA Samuelson","year":"1938","unstructured":"Samuelson PA (1938) A note on the pure theory of consumer\u2019s behaviour. Economica 5(17):61\u201371","journal-title":"Economica"},{"key":"11341_CR204","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1093\/acprof:oso\/9780199230167.003.0014","volume-title":"In two minds: dual processes and beyond","author":"C Saunders","year":"2009","unstructured":"Saunders C, Over D (2009) In two minds about rationality? In: Evans JSBT, Frankish K (eds) In two minds: dual processes and beyond. Oxford University Press, Oxford, pp 317\u2013334"},{"key":"11341_CR205","unstructured":"Schadd FC, Bakkes SCJ, Spronck P (2007) Opponent modeling in real-time strategy games. In: Proceedings of GAME-ON-07. pp 61\u201370"},{"key":"11341_CR206","first-page":"101","volume":"3","author":"D Schilir\u00f2","year":"2012","unstructured":"Schilir\u00f2 D (2012) Bounded rationality and perfect rationality: psychology into economics. Theor Prac Res Econ Fields 3:101\u2013111","journal-title":"Theor Prac Res Econ Fields"},{"key":"11341_CR207","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1016\/j.neuroimage.2017.12.077","volume":"180","author":"H Scholte","year":"2018","unstructured":"Scholte H (2018) Fantastic animals and where to find them. Neuroimage 180:112\u2013113","journal-title":"Neuroimage"},{"issue":"5","key":"11341_CR208","doi-asserted-by":"publisher","first-page":"902","DOI":"10.1111\/puar.13540","volume":"82","author":"G Schwarz","year":"2022","unstructured":"Schwarz G, Christensen T, Zhu X (2022) Bounded rationality, satisficing, artificial intelligence, and decision-making in public organizations: the contributions of Herbert Simon. Public Adm Rev 82(5):902\u2013904","journal-title":"Public Adm Rev"},{"key":"11341_CR209","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1007\/BF01766400","volume":"4","author":"R Selten","year":"1975","unstructured":"Selten R (1975) Reexamination of the perfectness concept for equilibrium points in extensive games. Int J Game Theory 4:25\u201355","journal-title":"Int J Game Theory"},{"issue":"2","key":"11341_CR210","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1145\/3624724","volume":"67","author":"M Shanahan","year":"2024","unstructured":"Shanahan M (2024) Talking about large language models. Commun ACM 67(2):68\u201379","journal-title":"Commun ACM"},{"key":"11341_CR211","unstructured":"Sharma M, Tong M, Korbak T, Duvenaud D, Askell A, Bowman SR et al (2024) Towards understanding sycophancy in language models. In: Proceedings of ICLR-24"},{"issue":"6","key":"11341_CR212","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MC.2023.3262909","volume":"56","author":"D Shin","year":"2023","unstructured":"Shin D, Shin EY (2023) Data\u2019s impact on algorithmic bias. Computer 56(6):90\u201394","journal-title":"Computer"},{"key":"11341_CR213","unstructured":"Shinn N, Cassano F, Labash B, Gopinath A, Narasimhan K, Yao S (2023) Reflexion: language agents with verbal reinforcement learning. arXiv: 2303.11366"},{"key":"11341_CR214","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1038\/nature22332","volume":"545","author":"H Shirado","year":"2017","unstructured":"Shirado H, Christakis N (2017) Locally noisy autonomous agents improve global human coordination in network experiments. Nature 545:370\u2013374","journal-title":"Nature"},{"key":"11341_CR215","unstructured":"Shoham Y, Powers R, Grenager T (2003) Multi-agent reinforcement learning: a critical survey. Unpublished survey"},{"issue":"7587","key":"11341_CR216","doi-asserted-by":"publisher","first-page":"484","DOI":"10.1038\/nature16961","volume":"529","author":"D Silver","year":"2016","unstructured":"Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G et al (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529(7587):484\u2013489","journal-title":"Nature"},{"key":"11341_CR217","volume-title":"Models of man: social and rational","author":"HA Simon","year":"1957","unstructured":"Simon HA (1957) Models of man: social and rational. Wiley, Hoboken"},{"key":"11341_CR218","volume-title":"Models of bounded rationality. Volume 1: economic analysis and public policy","author":"HA Simon","year":"1982","unstructured":"Simon HA (1982) Models of bounded rationality. Volume 1: economic analysis and public policy. The MIT Press, Cambridge"},{"key":"11341_CR219","unstructured":"Simsek O (2013) Linear decision rule as aspiration for simple decision heuristics. In: Burges C, Bottou L, Welling M, Ghahramani Z, Weinberger K (eds) Proceedings of NeurIPS-13"},{"key":"11341_CR220","volume-title":"Routledge handbook of bounded rationality (chapter 15)","author":"O Simsek","year":"2020","unstructured":"Simsek O (2020) Bounded rationality for artificial intelligence. In: Viale R (ed) Routledge handbook of bounded rationality (chapter 15). Routledge, London"},{"key":"11341_CR221","unstructured":"Simsek O, Buckmann M (2015) Learning from small samples: an analysis of simple decision heuristics. In: Cortes C, Lawrence ND, Lee DD, Sugiyama M,Garnett R (eds) Proceedings of NeurIPS-15. pp 3159\u20133167"},{"key":"11341_CR222","unstructured":"Simsek O, Algorta S, Kothiyal A (2016) Why most decisions are easy in tetris\u2013and perhaps in other sequential decision problems, as well. In: Proceedings of ICML-16. pp 1757\u20131765"},{"key":"11341_CR223","doi-asserted-by":"crossref","unstructured":"Skalse J, Abate A (2023) Misspecification in inverse reinforcement learning. In: Proceedings of AAAI-23. pp 15136\u201315143","DOI":"10.1609\/aaai.v37i12.26766"},{"key":"11341_CR224","first-page":"69","volume-title":"Royal institute of philosophy supplement","author":"A Sloman","year":"1993","unstructured":"Sloman A (1993) The mind as a control system. In: Hookway C, Peterson DM (eds) Royal institute of philosophy supplement. Cambridge University Press, Cambridge, pp 69\u2013110"},{"issue":"1","key":"11341_CR225","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1037\/0033-2909.119.1.3","volume":"119","author":"SA Sloman","year":"1996","unstructured":"Sloman SA (1996) The empirical case for two systems of reasoning. Psychol Bull 119(1):3\u201322","journal-title":"Psychol Bull"},{"issue":"4","key":"11341_CR226","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1016\/S1053-5357(02)00174-9","volume":"31","author":"P Slovic","year":"2002","unstructured":"Slovic P, Finucane M, Peters E, MacGregor DG (2002) Rational actors or rational fools: implications of the affect heuristic for behavioral economics. J Socio-Econ 31(4):329\u2013342","journal-title":"J Socio-Econ"},{"issue":"3","key":"11341_CR227","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1007\/s10458-014-9261-5","volume":"29","author":"E Sonu","year":"2015","unstructured":"Sonu E, Doshi P (2015) Scalable solutions of interactive POMDPs using generalized and bounded policy iteration. Auton Agent Multi-Agent Syst 29(3):455\u2013494","journal-title":"Auton Agent Multi-Agent Syst"},{"key":"11341_CR228","doi-asserted-by":"crossref","unstructured":"Spohn W (1988) Ordinal conditional functions: a dynamic theory of epistemic states. In: Harper WL, Skyrms B (eds) Causation in decision, belief change, and statistics: proceedings of the Irvine conference on probability and causation. Springer Netherlands, Dordrecht, pp 105\u2013134","DOI":"10.1007\/978-94-009-2865-7_6"},{"key":"11341_CR229","unstructured":"Stanczak K, Augenstein I (2021) A survey on gender bias in natural language processing. arXiv:2112.14168"},{"key":"11341_CR230","doi-asserted-by":"publisher","DOI":"10.4324\/9781410603432","volume-title":"Who is rational?: studies of individual differences in reasoning","author":"KE Stanovich","year":"1999","unstructured":"Stanovich KE (1999) Who is rational?: Studies of individual differences in reasoning. Psychology Press, Hillsdale"},{"key":"11341_CR231","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/00461520.2015.1125787","volume":"51","author":"KE Stanovich","year":"2016","unstructured":"Stanovich KE (2016) The comprehensive assessment of rational thinking. Educ Psychol 51:1\u201312","journal-title":"Educ Psychol"},{"key":"11341_CR232","doi-asserted-by":"publisher","first-page":"784","DOI":"10.1017\/CBO9780511977244.040","volume-title":"The Cambridge handbook of intelligence","author":"KE Stanovich","year":"2011","unstructured":"Stanovich KE, West RF, Toplak ME (2011) Intelligence and rationality. In: Sternberg RJ, Kaufman SB (eds) The Cambridge handbook of intelligence. Cambridge University Press, Cambridge, pp 784\u2013826"},{"key":"11341_CR233","volume-title":"Without good reason: the rationality debate in philosophy and cognitive science","author":"E Stein","year":"1996","unstructured":"Stein E (1996) Without good reason: the rationality debate in philosophy and cognitive science. Clarendon Press, Oxford"},{"issue":"9","key":"11341_CR234","doi-asserted-by":"publisher","first-page":"5145","DOI":"10.1002\/rnc.5935","volume":"33","author":"L Stella","year":"2023","unstructured":"Stella L, Bauso D (2023) The impact of irrational behaviors in the optional prisoner\u2019s dilemma with game-environment feedback. Int J Robust Nonlinear Control 33(9):5145\u20135158","journal-title":"Int J Robust Nonlinear Control"},{"issue":"3","key":"11341_CR235","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1007\/s12064-011-0142-z","volume":"131","author":"S Still","year":"2012","unstructured":"Still S, Precup D (2012) An information-theoretic approach to curiosity-driven reinforcement learning. Theory Biosci 131(3):139\u2013148","journal-title":"Theory Biosci"},{"issue":"1","key":"11341_CR236","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1080\/0020174X.2012.643628","volume":"55","author":"T Sturm","year":"2012","unstructured":"Sturm T (2012) The \u201cRationality Wars\u2019\u2019 in psychology: where they are and where they could go. Inquiry 55(1):66\u201381","journal-title":"Inquiry"},{"issue":"407","key":"11341_CR237","doi-asserted-by":"publisher","first-page":"751","DOI":"10.2307\/2233854","volume":"101","author":"R Sugden","year":"1991","unstructured":"Sugden R (1991) Rational choice: a survey of contributions from economics and philosophy. Econ J 101(407):751\u2013785","journal-title":"Econ J"},{"key":"11341_CR238","doi-asserted-by":"crossref","unstructured":"Sukthankar G, Sycara K (2005) A cost minimization approach to human behavior recognition. In: Proceedings of AAMAS-05. pp 1067\u20131074","DOI":"10.1145\/1082473.1082635"},{"key":"11341_CR239","volume-title":"Irrationality: the enemy within","author":"S Sutherland","year":"1992","unstructured":"Sutherland S (1992) Irrationality: the enemy within. Constable and Company, London"},{"issue":"3","key":"11341_CR240","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1111\/j.1467-8640.1996.tb00273.x","volume":"12","author":"M Tambe","year":"1995","unstructured":"Tambe M, Rosenbloom PS (1995) Event tracking in a dynamic multi-agent environment. Comput Intell 12(3):499\u2013522","journal-title":"Comput Intell"},{"key":"11341_CR241","doi-asserted-by":"crossref","unstructured":"Taniguchi H, Sato H, Shirakawa T (2017) Application of human cognitive mechanisms to Na\u00efve Bayes text classifier. In: Proceedings of AIP-17. p 360016","DOI":"10.1063\/1.4992545"},{"issue":"2","key":"11341_CR242","doi-asserted-by":"publisher","first-page":"56","DOI":"10.9746\/jcmsi.12.56","volume":"12","author":"H Taniguchi","year":"2019","unstructured":"Taniguchi H, Sato H, Shirakawa T (2019) Implementation of human cognitive bias on neural network and its application to breast cancer diagnosis. SICE J Control Meas Syst Integr 12(2):56\u201364","journal-title":"SICE J Control Meas Syst Integr"},{"key":"11341_CR243","doi-asserted-by":"publisher","first-page":"467","DOI":"10.1207\/s15516709cog2803_8","volume":"28","author":"K Tentori","year":"2004","unstructured":"Tentori K, Bonini N, Osherson D (2004) The conjunction fallacy: a misunderstanding about conjunction? Cogn Sci 28:467\u2013477","journal-title":"Cogn Sci"},{"issue":"10","key":"11341_CR244","doi-asserted-by":"publisher","first-page":"1192","DOI":"10.1016\/j.jval.2018.04.1822","volume":"21","author":"T Tervonen","year":"2018","unstructured":"Tervonen T, Schmidt-Ott T, Marsh K, Bridges JF, Quaife M, Janssen E (2018) Assessing rationality in discrete choice experiments in health: an investigation into the use of dominance tests. Value Health 21(10):1192\u20131197","journal-title":"Value Health"},{"issue":"1","key":"11341_CR245","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1016\/0167-2681(80)90051-7","volume":"1","author":"R Thaler","year":"1980","unstructured":"Thaler R (1980) Toward a positive theory of consumer choice. J Econ Behav Organ 1(1):39\u201360","journal-title":"J Econ Behav Organ"},{"key":"11341_CR246","first-page":"245","volume-title":"Aggregation and revelation of preferences","author":"W Thomson","year":"1979","unstructured":"Thomson W (1979) Maximin strategies and elicitation of preferences. In: Laffon J-J (ed) Aggregation and revelation of preferences. North-Holland, Amsterdam, pp 245\u2013268"},{"key":"11341_CR247","unstructured":"Tian X, Zhuo HH, Kambhampati S (2016) Discovering underlying plans based on distributed representations of actions. In: Proceedings of AAMAS-16. pp 1135\u20131143"},{"key":"11341_CR248","doi-asserted-by":"crossref","unstructured":"Tourlakis G (2022) G\u00f6del\u2019s first incompleteness theorem via the halting problem. In: Computability. Springer","DOI":"10.1007\/978-3-030-83202-5_8"},{"issue":"11","key":"11341_CR249","doi-asserted-by":"publisher","first-page":"3102","DOI":"10.1073\/pnas.1519157113","volume":"113","author":"K Tsetsos","year":"2016","unstructured":"Tsetsos K, Moran R, Moreland J, Chater N, Usher M, Summerfield C (2016) Economic irrationality is optimal during noisy decision making. Proc Natl Acad Sci 113(11):3102\u20133107","journal-title":"Proc Natl Acad Sci"},{"key":"11341_CR250","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1093\/mind\/LIX.236.433","volume":"59","author":"AM Turing","year":"1950","unstructured":"Turing AM (1950) Computing machinery and intelligence. Mind 59:433\u2013460","journal-title":"Mind"},{"issue":"2","key":"11341_CR251","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1037\/h0031322","volume":"76","author":"A Tversky","year":"1971","unstructured":"Tversky A, Kahneman D (1971) Belief in the law of small numbers. Psychol Bull 76(2):105\u2013110","journal-title":"Psychol Bull"},{"issue":"4","key":"11341_CR252","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1037\/0033-295X.90.4.293","volume":"90","author":"A Tversky","year":"1983","unstructured":"Tversky A, Kahneman D (1983) Extensional versus intuitive reasoning: the conjunction fallacy in probability judgment. Psychol Rev 90(4):293\u2013315","journal-title":"Psychol Rev"},{"issue":"4","key":"11341_CR253","doi-asserted-by":"publisher","first-page":"S251","DOI":"10.1086\/296365","volume":"59","author":"A Tversky","year":"1986","unstructured":"Tversky A, Kahneman D (1986) Rational choice and the framing of decisions. J Bus 59(4):S251\u2013S278","journal-title":"J Bus"},{"issue":"2","key":"11341_CR254","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1257\/jep.4.2.201","volume":"4","author":"A Tversky","year":"1990","unstructured":"Tversky A, Thaler RH (1990) Anomalies: preference reversals. J Econ Perspect 4(2):201\u2013211","journal-title":"J Econ Perspect"},{"key":"11341_CR255","unstructured":"Uprety S, Song D (2018) Reconciling irrational human behavior with AI based decision making: a quantum probabilistic approach. arXiv:1808.04600"},{"key":"11341_CR256","volume-title":"Mathematical methods in the social sciences (chapter 9)","author":"H Uzawa","year":"1960","unstructured":"Uzawa H (1960) Preference and rational choice in the theory of consumption. In: Arrow K, Karlin S, Suppes P (eds) Mathematical methods in the social sciences (chapter 9). Stanford University Press, Stanford"},{"key":"11341_CR257","unstructured":"Van Den\u00a0Herik H, Donkers H, Spronck P (2005) Opponent modelling and commercial games. In: Proceedings of CIG-05. pp 15\u201325"},{"key":"11341_CR258","doi-asserted-by":"crossref","unstructured":"van\u00a0der Hoek W, Wooldridge M (2002) Tractable multiagent planning for epistemic goals. In: Proceedings of AAMAS-02, New York, USA. pp 1167\u20131174","DOI":"10.1145\/545056.545095"},{"issue":"2","key":"11341_CR259","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1093\/jigpal\/11.2.135","volume":"11","author":"W van der Hoek","year":"2003","unstructured":"van der Hoek W, Wooldridge M (2003) Towards a logic of rational agency. Log J IGPL 11(2):135\u2013159","journal-title":"Log J IGPL"},{"key":"11341_CR260","doi-asserted-by":"crossref","unstructured":"Vered M, Kaminka GA (2017) Heuristic online goal recognition in continuous domains. In: Proceedings of IJCAI-17. pp 4447\u20134454","DOI":"10.24963\/ijcai.2017\/621"},{"issue":"3","key":"11341_CR261","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1080\/14640746808400161","volume":"20","author":"PC Wason","year":"1968","unstructured":"Wason PC (1968) Reasoning about a rule. Q J Exp Psychol 20(3):273\u2013281","journal-title":"Q J Exp Psychol"},{"key":"11341_CR262","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/j.jesp.2014.01.005","volume":"52","author":"A Waytz","year":"2014","unstructured":"Waytz A, Heafner J, Epley N (2014) The mind in the machine: anthropomorphism increases trust in an autonomous vehicle. J Exp Soc Psychol 52:113\u2013117","journal-title":"J Exp Soc Psychol"},{"key":"11341_CR263","volume-title":"The handbook of rationality","author":"R Wedgwood","year":"2021","unstructured":"Wedgwood R (2021) Practical and theoretical rationality. In: Knauff M, Spohn W (eds) The handbook of rationality. The MIT Press, Cambridge"},{"key":"11341_CR264","doi-asserted-by":"crossref","unstructured":"Wen Y, Yang Y, Wang J (2020) Modelling bounded rationality in multi-agent interactions by generalized recursive reasoning. In: Proceedings of IJCAI-20","DOI":"10.24963\/ijcai.2020\/58"},{"key":"11341_CR265","unstructured":"Wheeler G (2020) Bounded rationality. In: Zalta EN (ed) The Stanford encyclopedia of philosophy, Fall 2020 edn. Metaphysics Research Lab, Stanford University"},{"key":"11341_CR266","unstructured":"Williams M, Carroll M, Narang A, Weisser C, Murphy B, Dragan A (2025) On targeted manipulation and deception when optimizing LLMs for user feedback. arXiv: 2411.02306"},{"key":"11341_CR267","unstructured":"Wilson B, Hoffman J, Morgenstern J (2019) Predictive inequity in object detection. arXiv: 1902.11097"},{"key":"11341_CR268","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/5804.001.0001","volume-title":"Reasoning about rational agents","author":"M Wooldridge","year":"2000","unstructured":"Wooldridge M (2000) Reasoning about rational agents. The MIT Press, Cambridge\/London"},{"key":"11341_CR269","unstructured":"Yang Y, Wen Y, Yu L, Zhang W, Bai Y, Wang J (2018) A study of AI population dynamics with million-agent reinforcement learning. In: Proceedings of AAMAS-18. pp 2133\u20132135"},{"key":"11341_CR270","unstructured":"Yu C, Liu J, Nemati S (2020) Reinforcement learning in healthcare: a survey. arXiv: 1908.08796"},{"key":"11341_CR271","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1007\/978-3-030-60990-0_12","volume-title":"Handbook of reinforcement learning and control","author":"K Zhang","year":"2021","unstructured":"Zhang K, Yang Z, Ba\u015far T (2021) Multi-agent reinforcement learning: a selective overview of theories and algorithms. In: Vamvoudakis KG, Wan Y, Lewis FL, Cansever D (eds) Handbook of reinforcement learning and control. Springer, Cham, pp 321\u2013384"},{"key":"11341_CR272","unstructured":"Ziebart BD (2010) Modeling purposeful adaptive behavior with the principle of maximum causal entropy (Unpublished Doctoral Dissertation). Carnegie Mellon University"},{"key":"11341_CR273","unstructured":"Ziebart BD, Bagnell JA, Dey AK (2010) modeling interaction via the principle of maximum causal entropy. In: Proceedings of ICML-10. pp 1255\u20131262"},{"key":"11341_CR274","volume-title":"Metareasoning and bounded rationality. Metareasoning: thinking about thinking","author":"S Zilberstein","year":"2011","unstructured":"Zilberstein S (2011) Metareasoning and bounded rationality. Metareasoning: thinking about thinking. The MIT Press, Cambridge"},{"issue":"3","key":"11341_CR275","first-page":"11","volume":"14","author":"Zindel M\u00e1rcia Longen","year":"2014","unstructured":"Zindel ML, Zindel T, Quirino MG (2014) Cognitive bias and their implications on the financial market. Int J Mech Mechatron Eng","journal-title":"International Journal of Engineering &amp; Technology"}],"container-title":["Artificial Intelligence Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11341-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10462-025-11341-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11341-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,25]],"date-time":"2025-10-25T03:32:43Z","timestamp":1761363163000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10462-025-11341-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,22]]},"references-count":275,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2025,11]]}},"alternative-id":["11341"],"URL":"https:\/\/doi.org\/10.1007\/s10462-025-11341-4","relation":{},"ISSN":["1573-7462"],"issn-type":[{"value":"1573-7462","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,22]]},"assertion":[{"value":"29 July 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 August 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"352"}}