{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T10:25:20Z","timestamp":1782210320417,"version":"3.54.5"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1012383","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2024,11,4]],"date-time":"2024-11-04T00:00:00Z","timestamp":1730678400000}}],"reference-count":96,"publisher":"Public Library of Science (PLoS)","issue":"10","license":[{"start":{"date-parts":[[2024,10,18]],"date-time":"2024-10-18T00:00:00Z","timestamp":1729209600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Dual-process theories play a central role in both psychology and neuroscience, figuring prominently in domains ranging from executive control to reward-based learning to judgment and decision making. In each of these domains, two mechanisms appear to operate concurrently, one relatively high in computational complexity, the other relatively simple. Why is neural information processing organized in this way? We propose an answer to this question based on the notion of compression. The key insight is that dual-process structure can enhance adaptive behavior by allowing an agent to minimize the description length of its own behavior. We apply a single model based on this observation to findings from research on executive control, reward-based learning, and judgment and decision making, showing that seemingly diverse dual-process phenomena can be understood as domain-specific consequences of a single underlying set of computational principles.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1012383","type":"journal-article","created":{"date-parts":[[2024,10,18]],"date-time":"2024-10-18T17:34:25Z","timestamp":1729272865000},"page":"e1012383","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":10,"title":["Understanding dual process cognition via the minimum description length principle"],"prefix":"10.1371","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5778-2635","authenticated-orcid":true,"given":"Ted","family":"Moskovitz","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kevin J.","family":"Miller","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5560-3341","authenticated-orcid":true,"given":"Maneesh","family":"Sahani","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Matthew M.","family":"Botvinick","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"340","published-online":{"date-parts":[[2024,10,18]]},"reference":[{"key":"pcbi.1012383.ref001","volume-title":"The principles of psychology","author":"W James","year":"1890"},{"key":"pcbi.1012383.ref002","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1146\/annurev-psych-113011-143750","article-title":"Executive functions","volume":"64","author":"A Diamond","year":"2013","journal-title":"Annu Rev Psychol"},{"issue":"6","key":"pcbi.1012383.ref003","doi-asserted-by":"crossref","first-page":"1249","DOI":"10.1111\/cogs.12126","article-title":"The computational and neural basis of cognitive control: charted territory and new frontiers","volume":"38","author":"MM Botvinick","year":"2014","journal-title":"Cogn Sci"},{"issue":"2","key":"pcbi.1012383.ref004","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1016\/j.neuron.2013.09.007","article-title":"Goals and habits in the brain","volume":"80","author":"RJ Dolan","year":"2013","journal-title":"Neuron"},{"issue":"6","key":"pcbi.1012383.ref005","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1037\/rev0000201","article-title":"A theory of actions and habits: The interaction of rate correlation and contiguity systems in free-operant behavior","volume":"127","author":"OD Perez","year":"2020","journal-title":"Psychol Rev"},{"key":"pcbi.1012383.ref006","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1146\/annurev.psych.59.103006.093629","article-title":"Dual-processing accounts of reasoning, judgment, and social cognition","volume":"59","author":"JST Evans","year":"2008","journal-title":"Annu Rev Psychol"},{"key":"pcbi.1012383.ref007","volume-title":"Thinking, fast and slow","author":"D Kahneman","year":"2011"},{"issue":"20","key":"pcbi.1012383.ref008","doi-asserted-by":"crossref","first-page":"7338","DOI":"10.1073\/pnas.0502455102","article-title":"Prefrontal cortex and flexible cognitive control: Rules without symbols","volume":"102","author":"NP Rougier","year":"2005","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"12","key":"pcbi.1012383.ref009","doi-asserted-by":"crossref","first-page":"1704","DOI":"10.1038\/nn1560","article-title":"Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control","volume":"8","author":"ND Daw","year":"2005","journal-title":"Nat Neurosci"},{"issue":"2","key":"pcbi.1012383.ref010","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/j.neuron.2013.07.007","article-title":"The expected value of control: an integrative theory of anterior cingulate cortex function","volume":"79","author":"A Shenhav","year":"2013","journal-title":"Neuron"},{"issue":"5","key":"pcbi.1012383.ref011","doi-asserted-by":"crossref","first-page":"e1002055","DOI":"10.1371\/journal.pcbi.1002055","article-title":"Speed\/accuracy trade-off between the habitual and the goal-directed processes","volume":"7","author":"M Keramati","year":"2011","journal-title":"PLoS Comput Biol"},{"issue":"11","key":"pcbi.1012383.ref012","doi-asserted-by":"crossref","first-page":"700","DOI":"10.1016\/j.tics.2015.08.013","article-title":"Deciding how to decide: Self-control and meta-decision making","volume":"19","author":"YL Boureau","year":"2015","journal-title":"Trends Cogn Sci"},{"issue":"6","key":"pcbi.1012383.ref013","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1037\/rev0000075","article-title":"Strategy selection as rational metareasoning","volume":"124","author":"F Lieder","year":"2017","journal-title":"Psychol Rev"},{"issue":"2","key":"pcbi.1012383.ref014","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1037\/rev0000120","article-title":"Habits without values","volume":"126","author":"KJ Miller","year":"2019","journal-title":"Psychol Rev"},{"key":"pcbi.1012383.ref015","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/j.cortex.2019.03.004","article-title":"Developmental frontal brain activation differences in overcoming heuristic bias","volume":"117","author":"K Mevel","year":"2019","journal-title":"Cortex"},{"key":"pcbi.1012383.ref016","first-page":"137","volume-title":"Neuroscience of decision making","author":"W De Neys","year":"2011"},{"issue":"1","key":"pcbi.1012383.ref017","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1146\/annurev.neuro.24.1.167","article-title":"An integrative theory of prefrontal cortex function","volume":"24","author":"EK Miller","year":"2001","journal-title":"Annu Rev Neurosci"},{"issue":"5","key":"pcbi.1012383.ref018","doi-asserted-by":"crossref","first-page":"244","DOI":"10.1016\/j.tics.2015.03.003","article-title":"Degree of automaticity and the prefrontal cortex","volume":"19","author":"HA Jeon","year":"2015","journal-title":"Trends Cogn Sci"},{"key":"pcbi.1012383.ref019","doi-asserted-by":"crossref","first-page":"380","DOI":"10.3389\/fpsyg.2020.00380","article-title":"How sequential interactive processing within frontostriatal loops supports a continuum of habitual to controlled processing","volume":"11","author":"RC O\u2019Reilly","year":"2020","journal-title":"Front Psychol"},{"key":"pcbi.1012383.ref020","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1146\/annurev.psych.58.110405.085654","article-title":"Social cognitive neuroscience: a review of core processes","volume":"58","author":"MD Lieberman","year":"2007","journal-title":"Annu Rev Psychol"},{"key":"pcbi.1012383.ref021","article-title":"Habit formation","author":"KS Smith","year":"2022","journal-title":"Dialogues Clin Neurosci"},{"issue":"9","key":"pcbi.1012383.ref022","doi-asserted-by":"crossref","first-page":"757","DOI":"10.1016\/j.tics.2021.06.001","article-title":"Rationalizing constraints on the capacity for cognitive control","volume":"25","author":"S Musslick","year":"2021","journal-title":"Trends Cogn Sci"},{"key":"pcbi.1012383.ref023","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/bs.plm.2021.02.004","volume-title":"Psychology of Learning and Motivation","author":"L Lai","year":"2021"},{"key":"pcbi.1012383.ref024","article-title":"Heuristics from bounded meta-learned inference","author":"M Binz","year":"2022","journal-title":"Psychol Rev"},{"issue":"1","key":"pcbi.1012383.ref025","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-021-25123-3","article-title":"Linear reinforcement learning in planning, grid fields, and cognitive control","volume":"12","author":"P Piray","year":"2021","journal-title":"Nat Commun"},{"key":"pcbi.1012383.ref026","doi-asserted-by":"crossref","DOI":"10.1017\/S0140525X1900061X","article-title":"Resource-rational analysis: Understanding human cognition as the optimal use of limited computational resources","volume":"43","author":"F Lieder","year":"2020","journal-title":"Behav Brain Sci"},{"key":"pcbi.1012383.ref027","article-title":"Building machines that learn and think like people","volume":"40","author":"BM Lake","year":"2017","journal-title":"Behav Brain Sci"},{"key":"pcbi.1012383.ref028","volume-title":"Information Theory, Inference, and Learning Algorithms","author":"DJC MacKay","year":"2003"},{"key":"pcbi.1012383.ref029","volume-title":"Universal artificial intelligence: Sequential decisions based on algorithmic probability","author":"M Hutter","year":"2004"},{"key":"pcbi.1012383.ref030","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/4643.001.0001","volume-title":"The minimum description length principle","author":"PD Gr\u00fcnwald","year":"2007"},{"issue":"5","key":"pcbi.1012383.ref031","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1002\/wcs.1406","article-title":"The simplicity principle in perception and cognition","volume":"7","author":"J Feldman","year":"2016","journal-title":"Wiley Interdiscip Rev Cogn Sci"},{"key":"pcbi.1012383.ref032","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-49820-1","volume-title":"An Introduction to Kolmogorov Complexity and Its Applications","author":"M Li","year":"2008"},{"key":"pcbi.1012383.ref033","doi-asserted-by":"crossref","unstructured":"Hinton GE, Van Camp D. Keeping the neural networks simple by minimizing the description length of the weights. In: Proceedings of the sixth annual conference on Computational learning theory; 1993. p. 5\u201313.","DOI":"10.1145\/168304.168306"},{"issue":"4","key":"pcbi.1012383.ref034","doi-asserted-by":"crossref","first-page":"800","DOI":"10.1109\/TNN.2004.828762","article-title":"Variational learning and bits-back coding: an information-theoretic view to Bayesian learning","volume":"15","author":"A Honkela","year":"2004","journal-title":"IEEE Trans Neural Netw"},{"key":"pcbi.1012383.ref035","article-title":"The description length of deep learning models","volume":"31","author":"L Blier","year":"2018","journal-title":"Adv Neural Inf Process Syst"},{"issue":"6","key":"pcbi.1012383.ref036","doi-asserted-by":"crossref","first-page":"2913","DOI":"10.1109\/TSP.2012.2187203","article-title":"An MDL framework for sparse coding and dictionary learning","volume":"60","author":"I Ramirez","year":"2012","journal-title":"IEEE Trans Signal Process"},{"key":"pcbi.1012383.ref037","unstructured":"Grunwald P. A tutorial introduction to the minimum description length principle. arXiv preprint math\/0406077. 2004."},{"key":"pcbi.1012383.ref038","unstructured":"Moskovitz T, Kao TC, Sahani M, Botvinick M. Minimum Description Length Control. In: The Eleventh International Conference on Learning Representations; 2023."},{"key":"pcbi.1012383.ref039","first-page":"12","volume-title":"ICML","author":"CG Atkeson","year":"1997"},{"key":"pcbi.1012383.ref040","volume-title":"Reinforcement learning: An introduction","author":"RS Sutton","year":"2018"},{"key":"pcbi.1012383.ref041","volume-title":"Advances in Neural Information Processing Systems","author":"DP Kingma","year":"2015"},{"issue":"1007","key":"pcbi.1012383.ref042","first-page":"453","article-title":"An invariant form for the prior probability in estimation problems","volume":"186","author":"H Jeffreys","year":"1946","journal-title":"Proc R Soc Lond A Math Phys Sci"},{"issue":"01","key":"pcbi.1012383.ref043","doi-asserted-by":"crossref","DOI":"10.1142\/S2661335219300018","article-title":"Minimum description length revisited","volume":"11","author":"P Gr\u00fcnwald","year":"2019","journal-title":"Int J Math Ind"},{"issue":"7","key":"pcbi.1012383.ref044","doi-asserted-by":"crossref","first-page":"2099","DOI":"10.1016\/j.neuropsychologia.2007.11.029","article-title":"The role of ventromedial prefrontal cortex in navigation: a case of impaired wayfinding and rehabilitation","volume":"46","author":"E Ciaramelli","year":"2008","journal-title":"Neuropsychologia"},{"issue":"6","key":"pcbi.1012383.ref045","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1037\/h0054651","article-title":"Studies of interference in serial verbal reactions","volume":"18","author":"JR Stroop","year":"1935","journal-title":"J Exp Psychol"},{"issue":"1","key":"pcbi.1012383.ref046","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1162\/089892906775250012","article-title":"Neural mechanisms of cognitive control: An integrative model of Stroop task performance and fMRI data","volume":"18","author":"SA Herd","year":"2006","journal-title":"J Cogn Neurosci"},{"issue":"12","key":"pcbi.1012383.ref047","doi-asserted-by":"crossref","first-page":"899","DOI":"10.1038\/s41562-018-0401-9","article-title":"Mental labour","volume":"2","author":"W Kool","year":"2018","journal-title":"Nat Hum Behav"},{"key":"pcbi.1012383.ref048","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.actpsy.2013.11.010","article-title":"Context-specific control and context selection in conflict tasks","volume":"146","author":"N Schouppe","year":"2014","journal-title":"Acta Psychol (Amst)"},{"key":"pcbi.1012383.ref049","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1016\/j.neuropsychologia.2018.09.013","article-title":"An information-theoretic perspective on the costs of cognition","volume":"123","author":"A Zenon","year":"2019","journal-title":"Neuropsychologia"},{"issue":"6","key":"pcbi.1012383.ref050","doi-asserted-by":"crossref","first-page":"1204","DOI":"10.1016\/j.neuron.2011.02.027","article-title":"Model-based influences on humans\u2019 choices and striatal prediction errors","volume":"69","author":"ND Daw","year":"2011","journal-title":"Neuron"},{"key":"pcbi.1012383.ref051","article-title":"Identifying Model-Based and Model-Free Patterns in Behavior on Multi-Step Tasks","author":"KJ Miller","year":"2016","journal-title":"bioRxiv"},{"issue":"4","key":"pcbi.1012383.ref052","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1016\/j.neuron.2010.04.016","article-title":"States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning","volume":"66","author":"J Gl\u00e4scher","year":"2010","journal-title":"Neuron"},{"issue":"3","key":"pcbi.1012383.ref053","doi-asserted-by":"crossref","first-page":"955","DOI":"10.1016\/j.neuroimage.2011.06.071","article-title":"Separate encoding of model-based and model-free valuations in the human brain","volume":"58","author":"UR Beierholm","year":"2011","journal-title":"Neuroimage"},{"issue":"1","key":"pcbi.1012383.ref054","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1038\/s41386-021-01108-0","article-title":"Reinforcement-learning in fronto-striatal circuits","volume":"47","author":"B Averbeck","year":"2022","journal-title":"Neuropsychopharmacology"},{"issue":"15","key":"pcbi.1012383.ref055","doi-asserted-by":"crossref","first-page":"R860","DOI":"10.1016\/j.cub.2020.06.051","article-title":"Model-based decision making and model-free learning","volume":"30","author":"N Drummond","year":"2020","journal-title":"Curr Biol"},{"issue":"1135","key":"pcbi.1012383.ref056","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1098\/rstb.1985.0010","article-title":"Actions and habits: the development of behavioural autonomy","volume":"308","author":"A Dickinson","year":"1985","journal-title":"Philos Trans R Soc Lond B Biol Sci"},{"key":"pcbi.1012383.ref057","article-title":"From predictive models to cognitive models: Separable behavioral processes underlying reward learning in the rat","author":"KJ Miller","year":"2018","journal-title":"Europe PMC"},{"issue":"12","key":"pcbi.1012383.ref058","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1004648","article-title":"Simple Plans or Sophisticated Habits? State, Transition and Learning Interactions in the Two-Step Task","volume":"11","author":"T Akam","year":"2015","journal-title":"PLoS Comput Biol"},{"issue":"10","key":"pcbi.1012383.ref059","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/s41562-020-0905-y","article-title":"Humans primarily use model-based inference in the two-stage task","volume":"4","author":"C Feher da Silva","year":"2020","journal-title":"Nat Hum Behav"},{"issue":"4","key":"pcbi.1012383.ref060","doi-asserted-by":"crossref","first-page":"914","DOI":"10.1016\/j.neuron.2013.08.009","article-title":"Disruption of dorsolateral prefrontal cortex decreases model-based in favor of model-free control in humans","volume":"80","author":"P Smittenaar","year":"2013","journal-title":"Neuron"},{"issue":"5","key":"pcbi.1012383.ref061","doi-asserted-by":"crossref","first-page":"751","DOI":"10.1177\/0956797612463080","article-title":"The curse of planning: dissecting multiple reinforcement-learning systems by taxing the central executive","volume":"24","author":"AR Otto","year":"2013","journal-title":"Psychol Sci"},{"issue":"10","key":"pcbi.1012383.ref062","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1038\/s41583-020-0355-6","article-title":"Beyond dichotomies in reinforcement learning","volume":"21","author":"AGE Collins","year":"2020","journal-title":"Nat Rev Neurosci"},{"issue":"1","key":"pcbi.1012383.ref063","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/j.neuron.2020.10.013","article-title":"The anterior cingulate cortex predicts future states to mediate model-based action selection","volume":"109","author":"T Akam","year":"2021","journal-title":"Neuron"},{"issue":"9","key":"pcbi.1012383.ref064","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1038\/nn.4613","article-title":"Dorsal hippocampus contributes to model-based planning","volume":"20","author":"KJ Miller","year":"2017","journal-title":"Nat Neurosci"},{"key":"pcbi.1012383.ref065","doi-asserted-by":"crossref","first-page":"523","DOI":"10.3758\/s13415-015-0347-6","article-title":"Model-based learning protects against forming habits","volume":"15","author":"CM Gillan","year":"2015","journal-title":"Cogn Affect Behav Neurosci"},{"issue":"3","key":"pcbi.1012383.ref066","first-page":"271","article-title":"Omission learning after instrumental pretraining","volume":"51","author":"A Dickinson","year":"1998","journal-title":"Q J Exp Psychol B"},{"issue":"2","key":"pcbi.1012383.ref067","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.bbr.2005.07.012","article-title":"Inactivation of dorsolateral striatum enhances sensitivity to changes in the action\u2013outcome contingency in instrumental conditioning","volume":"166","author":"HH Yin","year":"2006","journal-title":"Behav Brain Res"},{"issue":"4","key":"pcbi.1012383.ref068","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/j.tics.2018.01.009","article-title":"Hierarchical active inference: a theory of motivated control","volume":"22","author":"G Pezzulo","year":"2018","journal-title":"Trends Cogn Sci"},{"issue":"1","key":"pcbi.1012383.ref069","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1037\/0278-7393.29.1.53","article-title":"Take the best or look at the rest? Factors influencing \u201cone-reason\u201d decision making","volume":"29","author":"BR Newell","year":"2003","journal-title":"J Exp Psychol Learn Mem Cogn"},{"issue":"3","key":"pcbi.1012383.ref070","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1037\/0033-295X.97.3.332","article-title":"On the control of automatic processes: a parallel distributed processing account of the Stroop effect","volume":"97","author":"JD Cohen","year":"1990","journal-title":"Psychol Rev"},{"issue":"1","key":"pcbi.1012383.ref071","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1037\/0278-7393.14.1.126","article-title":"Training and Stroop-like interference: evidence for a continuum of automaticity","volume":"14","author":"CM MacLeod","year":"1988","journal-title":"J Exp Psychol Learn Mem Cogn"},{"key":"pcbi.1012383.ref072","article-title":"A neural model of task compositionality with natural language instructions","author":"R Riveland","year":"2022","journal-title":"bioRxiv"},{"issue":"2","key":"pcbi.1012383.ref073","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.conb.2010.01.008","article-title":"Computational models of cognitive control","volume":"20","author":"RC O\u2019Reilly","year":"2010","journal-title":"Curr Opin Neurobiol"},{"issue":"4","key":"pcbi.1012383.ref074","doi-asserted-by":"crossref","first-page":"e1006043","DOI":"10.1371\/journal.pcbi.1006043","article-title":"Rational metareasoning and the plasticity of cognitive control","volume":"14","author":"F Lieder","year":"2018","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1012383.ref075","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1016\/j.neuropsychologia.2014.04.014","article-title":"A neural network model of individual differences in task switching abilities","volume":"62","author":"SA Herd","year":"2014","journal-title":"Neuropsychologia"},{"issue":"3","key":"pcbi.1012383.ref076","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1006\/cogp.2001.0770","article-title":"Task switching: A PDP model","volume":"44","author":"SJ Gilbert","year":"2002","journal-title":"Cogn Psychol"},{"issue":"10-12","key":"pcbi.1012383.ref077","doi-asserted-by":"crossref","first-page":"1332","DOI":"10.1016\/j.neucom.2005.12.102","article-title":"Computational and neural mechanisms of task switching","volume":"69","author":"JR Reynolds","year":"2006","journal-title":"Neurocomputing"},{"issue":"6","key":"pcbi.1012383.ref078","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1038\/s41593-018-0147-8","article-title":"Prefrontal cortex as a meta-reinforcement learning system","volume":"21","author":"JX Wang","year":"2018","journal-title":"Nat Neurosci"},{"key":"pcbi.1012383.ref079","doi-asserted-by":"crossref","first-page":"e53262","DOI":"10.7554\/eLife.53262","article-title":"Dopamine role in learning and action inference","volume":"9","author":"R Bogacz","year":"2020","journal-title":"Elife"},{"issue":"3","key":"pcbi.1012383.ref080","doi-asserted-by":"crossref","first-page":"632","DOI":"10.1037\/0033-295X.114.3.632","article-title":"A neurobiological theory of automaticity in perceptual categorization","volume":"114","author":"FG Ashby","year":"2007","journal-title":"Psychol Rev"},{"key":"pcbi.1012383.ref081","article-title":"Action prediction error: a value-free dopaminergic teaching signal that drives stable learning","author":"F Greenstreet","year":"2022","journal-title":"bioRxiv"},{"key":"pcbi.1012383.ref082","doi-asserted-by":"crossref","first-page":"283","DOI":"10.3758\/CABN.2.4.283","article-title":"Mechanisms underlying dependencies of performance on stimulus history in a two-alternative forced-choice task","volume":"2","author":"RY Cho","year":"2002","journal-title":"Cogn Affect Behav Neurosci"},{"issue":"1","key":"pcbi.1012383.ref083","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/j.neuron.2013.10.018","article-title":"Autonomous mechanism of internal choice estimate underlies decision inertia","volume":"81","author":"R Akaishi","year":"2014","journal-title":"Neuron"},{"issue":"2","key":"pcbi.1012383.ref084","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1162\/jocn_a_00894","article-title":"Attentional selection can be predicted by reinforcement learning of task-relevant stimulus features weighted by value-independent stickiness","volume":"28","author":"M Balcarras","year":"2016","journal-title":"J Cogn Neurosci"},{"issue":"3","key":"pcbi.1012383.ref085","doi-asserted-by":"crossref","first-page":"687","DOI":"10.1016\/j.neuron.2013.11.028","article-title":"Neural computations underlying arbitration between model-based and model-free learning","volume":"81","author":"SW Lee","year":"2014","journal-title":"Neuron"},{"key":"pcbi.1012383.ref086","first-page":"81","article-title":"Representativeness revisited: Attribute substitution in intuitive judgment","volume":"49","author":"D Kahneman","year":"2002","journal-title":"Heuristics and biases: The psychology of intuitive judgment"},{"issue":"12","key":"pcbi.1012383.ref087","doi-asserted-by":"crossref","first-page":"e1008497","DOI":"10.1371\/journal.pcbi.1008497","article-title":"Value-complexity tradeoff explains mouse navigational learning","volume":"16","author":"N Amir","year":"2020","journal-title":"PLoS Comput Biol"},{"issue":"8","key":"pcbi.1012383.ref088","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1038\/s41562-022-01357-z","article-title":"Human inference reflects a normative balance of complexity and accuracy","volume":"6","author":"G Tavoni","year":"2022","journal-title":"Nat Hum Behav"},{"key":"pcbi.1012383.ref089","author":"RA Lerch","year":"2018","journal-title":"Policy generalization in capacity-limited reinforcement learning"},{"issue":"1","key":"pcbi.1012383.ref090","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1016\/S1364-6613(02)00005-0","article-title":"Simplicity: a unifying principle in cognitive science?","volume":"7","author":"N Chater","year":"2003","journal-title":"Trends Cogn Sci"},{"issue":"2","key":"pcbi.1012383.ref091","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1016\/j.tics.2017.11.005","article-title":"Frontal cortex and the hierarchical control of behavior","volume":"22","author":"D Badre","year":"2018","journal-title":"Trends Cogn Sci"},{"key":"pcbi.1012383.ref092","unstructured":"Tirumala D, Galashov A, Noh H, Hasenclever L, Pascanu R, Schwarz J, Desjardins G, Czarnecki WM, Ahuja A, Teh YW, et al. Behavior priors for efficient reinforcement learning. arXiv preprint arXiv:2010.14274. 2020."},{"key":"pcbi.1012383.ref093","unstructured":"Galashov A, Jayakumar SM, Hasenclever L, Tirumala D, Schwarz J, Desjardins G, et al. Information asymmetry in KL-regularized RL. arXiv preprint arXiv:1905.01240. 2019."},{"key":"pcbi.1012383.ref094","unstructured":"Goyal A, Islam R, Strouse DJ, Ahmed Z, Larochelle H, Botvinick M, et al. InfoBot: Transfer and Exploration via the Information Bottleneck. In: International Conference on Learning Representations; 2018."},{"key":"pcbi.1012383.ref095","article-title":"Distral: Robust multitask reinforcement learning","volume":"30","author":"Y Teh","year":"2017","journal-title":"Adv Neural Inf Process Syst"},{"key":"pcbi.1012383.ref096","unstructured":"Moskovitz T, Arbel M, Parker-Holder J, Pacchiano A. Towards an Understanding of Default Policies in Multitask Policy Optimization. In: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics. PMLR; 2022. p. 10661\u201310686."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1012383","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2024,11,4]],"date-time":"2024-11-04T00:00:00Z","timestamp":1730678400000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1012383","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,4]],"date-time":"2024-11-04T18:42:50Z","timestamp":1730745770000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1012383"}},"subtitle":[],"editor":[{"given":"Christoph","family":"Mathys","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2024,10,18]]},"references-count":96,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2024,10,18]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1012383","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1012383","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,18]]}}}