{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:24:12Z","timestamp":1772173452011,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1013226","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000}}],"reference-count":71,"publisher":"Public Library of Science (PLoS)","issue":"7","license":[{"start":{"date-parts":[[2025,7,2]],"date-time":"2025-07-02T00:00:00Z","timestamp":1751414400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000181","name":"Air Force Office of Scientific Research","doi-asserted-by":"publisher","award":["FA9550-20-1-0413"],"award-info":[{"award-number":["FA9550-20-1-0413"]}],"id":[{"id":"10.13039\/100000181","id-type":"DOI","asserted-by":"publisher"}]},{"name":"NIH","award":["U19NS113201"],"award-info":[{"award-number":["U19NS113201"]}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Dopamine release in the nucleus accumbens has been hypothesized to signal the difference between observed and predicted reward, known as reward prediction error, suggesting a biological implementation for reinforcement learning. Rigorous tests of this hypothesis require assumptions about how the brain maps sensory signals to reward predictions, yet this mapping is still poorly understood. In particular, the mapping is non-trivial when sensory signals provide ambiguous information about the hidden state of the environment. Previous work using classical conditioning tasks has suggested that reward predictions are generated conditional on probabilistic beliefs about the hidden state, such that dopamine implicitly reflects these beliefs. Here we test this hypothesis in the context of an instrumental task (a two-armed bandit), where the hidden state switches stochastically. We measured choice behavior and recorded dLight signals that reflect dopamine release in the nucleus accumbens core. Model comparison among a wide set of cognitive models based on the behavioral data favored models that used Bayesian updating of probabilistic beliefs. These same models also quantitatively matched mesolimbic dLight measurements better than non-Bayesian alternatives. We conclude that probabilistic belief computation contributes to instrumental task performance in mice and is reflected in mesolimbic dopamine signaling.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1013226","type":"journal-article","created":{"date-parts":[[2025,7,2]],"date-time":"2025-07-02T14:01:08Z","timestamp":1751464868000},"page":"e1013226","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":5,"title":["Nucleus accumbens dopamine release reflects Bayesian inference during instrumental learning"],"prefix":"10.1371","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2548-2750","authenticated-orcid":true,"given":"Albert J.","family":"Q\u00fc","sequence":"first","affiliation":[]},{"given":"Lung-Hao","family":"Tai","sequence":"additional","affiliation":[]},{"given":"Christopher D.","family":"Hall","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0001-6991-6805","authenticated-orcid":true,"given":"Emilie M.","family":"Tu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0330-9367","authenticated-orcid":true,"given":"Maria K.","family":"Eckstein","sequence":"additional","affiliation":[]},{"given":"Karyna","family":"Mishchanchuk","sequence":"additional","affiliation":[]},{"given":"Wan Chen","family":"Lin","sequence":"additional","affiliation":[]},{"given":"Juliana B.","family":"Chase","sequence":"additional","affiliation":[]},{"given":"Andrew F.","family":"MacAskill","sequence":"additional","affiliation":[]},{"given":"Anne G. E.","family":"Collins","sequence":"additional","affiliation":[]},{"given":"Samuel J.","family":"Gershman","sequence":"additional","affiliation":[]},{"given":"Linda","family":"Wilbrecht","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2025,7,2]]},"reference":[{"key":"pcbi.1013226.ref001","volume-title":"Reinforcement learning: an introduction","author":"RS Sutton","year":"2018"},{"issue":"2","key":"pcbi.1013226.ref002","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1093\/cercor\/bhn098","article-title":"Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making","volume":"19","author":"J Gl\u00e4scher","year":"2009","journal-title":"Cereb Cortex"},{"key":"pcbi.1013226.ref003","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1016\/j.neuroimage.2014.09.018","article-title":"Cognitive flexibility in adolescence: neural and behavioral mechanisms of reward prediction error processing in adaptive decision making during development","volume":"104","author":"TU Hauser","year":"2015","journal-title":"Neuroimage"},{"issue":"21","key":"pcbi.1013226.ref004","doi-asserted-by":"crossref","DOI":"10.1016\/j.cub.2017.09.051","article-title":"Caudate microstimulation increases value of specific choices","volume":"27","author":"SR Santacruz","year":"2017","journal-title":"Curr Biol"},{"issue":"5","key":"pcbi.1013226.ref005","doi-asserted-by":"crossref","first-page":"1936","DOI":"10.1523\/JNEUROSCI.16-05-01936.1996","article-title":"A framework for mesencephalic dopamine systems based on predictive Hebbian learning","volume":"16","author":"PR Montague","year":"1996","journal-title":"J Neurosci"},{"issue":"5306","key":"pcbi.1013226.ref006","doi-asserted-by":"crossref","first-page":"1593","DOI":"10.1126\/science.275.5306.1593","article-title":"A neural substrate of prediction and reward","volume":"275","author":"W Schultz","year":"1997","journal-title":"Science"},{"issue":"3","key":"pcbi.1013226.ref007","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1038\/nn.4239","article-title":"Dopamine neurons share common response function for reward prediction error","volume":"19","author":"N Eshel","year":"2016","journal-title":"Nat Neurosci"},{"issue":"5","key":"pcbi.1013226.ref008","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1038\/s41593-023-01310-x","article-title":"Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model","volume":"26","author":"YK Takahashi","year":"2023","journal-title":"Nat Neurosci"},{"issue":"2","key":"pcbi.1013226.ref009","doi-asserted-by":"crossref","first-page":"1068","DOI":"10.1152\/jn.00158.2010","article-title":"A pallidus-habenula-dopamine pathway signals inferred stimulus values","volume":"104","author":"ES Bromberg-Martin","year":"2010","journal-title":"J Neurophysiol"},{"issue":"2","key":"pcbi.1013226.ref010","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1038\/s41593-023-01542-x","article-title":"Dopamine-independent effect of rewards on choices through hidden-state inference","volume":"27","author":"M Blanco-Pozo","year":"2024","journal-title":"Nat Neurosci"},{"issue":"1","key":"pcbi.1013226.ref011","doi-asserted-by":"crossref","first-page":"1891","DOI":"10.1038\/s41467-018-04397-0","article-title":"Belief state representation in the dopamine system","volume":"9","author":"BM Babayan","year":"2018","journal-title":"Nat Commun"},{"issue":"4","key":"pcbi.1013226.ref012","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1038\/nn.4520","article-title":"Dopamine reward prediction errors reflect hidden-state inference across time","volume":"20","author":"CK Starkweather","year":"2017","journal-title":"Nat Neurosci"},{"issue":"11","key":"pcbi.1013226.ref013","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1038\/s41583-019-0220-7","article-title":"Believing in dopamine","volume":"20","author":"SJ Gershman","year":"2019","journal-title":"Nat Rev Neurosci"},{"issue":"6","key":"pcbi.1013226.ref014","doi-asserted-by":"crossref","first-page":"2407","DOI":"10.1523\/JNEUROSCI.1989-14.2015","article-title":"Reversal learning and dopamine: a bayesian perspective","volume":"35","author":"VD Costa","year":"2015","journal-title":"J Neurosci"},{"issue":"6","key":"pcbi.1013226.ref015","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1016\/j.cub.2017.02.026","article-title":"Midbrain dopamine neurons signal belief in choice accuracy during a perceptual decision","volume":"27","author":"A Lak","year":"2017","journal-title":"Curr Biol"},{"issue":"9","key":"pcbi.1013226.ref016","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1011067","article-title":"Emergence of belief-like representations through reinforcement learning","volume":"19","author":"JA Hennig","year":"2023","journal-title":"PLoS Comput Biol"},{"issue":"9","key":"pcbi.1013226.ref017","doi-asserted-by":"crossref","first-page":"1281","DOI":"10.1038\/nn.3188","article-title":"Transient stimulation of distinct subpopulations of striatal neurons mimics changes in action value","volume":"15","author":"L-H Tai","year":"2012","journal-title":"Nat Neurosci"},{"key":"pcbi.1013226.ref018","doi-asserted-by":"crossref","first-page":"101106","DOI":"10.1016\/j.dcn.2022.101106","article-title":"Reinforcement learning and Bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal","volume":"55","author":"MK Eckstein","year":"2022","journal-title":"Dev Cogn Neurosci"},{"issue":"15","key":"pcbi.1013226.ref019","doi-asserted-by":"crossref","DOI":"10.1073\/pnas.2113961119","article-title":"Mice exhibit stochastic and efficient action switching during probabilistic decision making","volume":"119","author":"CC Beron","year":"2022","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"3","key":"pcbi.1013226.ref020","doi-asserted-by":"crossref","DOI":"10.1016\/j.cub.2021.12.006","article-title":"Serotonin neurons modulate learning rate through uncertainty","volume":"32","author":"CD Grossman","year":"2022","journal-title":"Curr Biol"},{"key":"pcbi.1013226.ref021","doi-asserted-by":"crossref","DOI":"10.1126\/science.adq5874","volume-title":"Hidden state inference requires abstract contextual representations in ventral hippocampus","author":"K Mishchanchuk","year":"2024"},{"issue":"6","key":"pcbi.1013226.ref022","doi-asserted-by":"crossref","first-page":"845","DOI":"10.1038\/nn.4287","article-title":"Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target","volume":"19","author":"NF Parker","year":"2016","journal-title":"Nat Neurosci"},{"issue":"2","key":"pcbi.1013226.ref023","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1159\/000342000","article-title":"A trade-off between feedback-based learning and episodic memory for feedback events: evidence from Parkinson\u2019s disease","volume":"11","author":"K Foerde","year":"2013","journal-title":"Neurodegener Dis"},{"issue":"3","key":"pcbi.1013226.ref024","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pbio.1001293","article-title":"Reasoning, learning, and creativity: frontal lobe function and human decision-making","volume":"10","author":"A Collins","year":"2012","journal-title":"PLoS Biol"},{"key":"pcbi.1013226.ref025","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1016\/j.cobeha.2020.10.003","article-title":"The role of executive function in shaping reinforcement learning","volume":"38","author":"M Rmus","year":"2021","journal-title":"Curr Opin Behav Sci"},{"issue":"6","key":"pcbi.1013226.ref026","article-title":"A unified framework for dopamine signals across timescales","volume":"183","author":"HR Kim","year":"2020","journal-title":"Cell"},{"issue":"3","key":"pcbi.1013226.ref027","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1016\/j.cell.2015.07.015","article-title":"Circuit architecture of VTA dopamine neurons revealed by systematic input-output mapping","volume":"162","author":"KT Beier","year":"2015","journal-title":"Cell"},{"issue":"2","key":"pcbi.1013226.ref028","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.neuron.2015.08.037","article-title":"Dopamine prediction errors in reward learning and addiction: from theory to neural circuitry","volume":"88","author":"R Keiflin","year":"2015","journal-title":"Neuron"},{"issue":"3","key":"pcbi.1013226.ref029","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1016\/j.conb.2011.02.011","article-title":"Ventral striatum: a critical look at models of learning and evaluation","volume":"21","author":"MAA van der Meer","year":"2011","journal-title":"Curr Opin Neurobiol"},{"issue":"6396","key":"pcbi.1013226.ref030","doi-asserted-by":"crossref","DOI":"10.1126\/science.aat4422","article-title":"Ultrafast neuronal imaging of dopamine dynamics with designed genetically encoded sensors","volume":"360","author":"T Patriarchi","year":"2018","journal-title":"Science"},{"issue":"17","key":"pcbi.1013226.ref031","article-title":"Transient food insecurity during the juvenile-adolescent period affects adult weight, cognitive flexibility, and dopamine neurobiology","volume":"32","author":"WC Lin","year":"2022","journal-title":"Curr Biol"},{"issue":"1","key":"pcbi.1013226.ref032","doi-asserted-by":"crossref","DOI":"10.1016\/j.neuron.2020.01.017","article-title":"Inference-based decisions in a hidden state foraging task: differential contributions of prefrontal cortical areas","volume":"106","author":"P Vertechi","year":"2020","journal-title":"Neuron"},{"issue":"6","key":"pcbi.1013226.ref033","doi-asserted-by":"crossref","first-page":"532","DOI":"10.1037\/0033-295X.87.6.532","article-title":"A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli.","volume":"87","author":"JM Pearce","year":"1980","journal-title":"Psychol Rev"},{"issue":"31","key":"pcbi.1013226.ref034","doi-asserted-by":"crossref","first-page":"9861","DOI":"10.1523\/JNEUROSCI.6157-08.2009","article-title":"Validation of decision-making models and analysis of decision variables in the rat basal ganglia","volume":"29","author":"M Ito","year":"2009","journal-title":"J Neurosci"},{"issue":"47","key":"pcbi.1013226.ref035","doi-asserted-by":"crossref","first-page":"29381","DOI":"10.1073\/pnas.1912330117","article-title":"Computational evidence for hierarchically structured reinforcement learning in humans","volume":"117","author":"MK Eckstein","year":"2020","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"12","key":"pcbi.1013226.ref036","doi-asserted-by":"crossref","first-page":"973","DOI":"10.1111\/j.1467-9280.2005.01646.x","article-title":"Using cognitive models to map relations between neuropsychological disorders and human decision-making deficits","volume":"16","author":"E Yechiam","year":"2005","journal-title":"Psychol Sci"},{"issue":"41","key":"pcbi.1013226.ref037","doi-asserted-by":"crossref","first-page":"16311","DOI":"10.1073\/pnas.0706111104","article-title":"Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning","volume":"104","author":"MJ Frank","year":"2007","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"2","key":"pcbi.1013226.ref038","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1523\/JNEUROSCI.5498-10.2012","article-title":"Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain","volume":"32","author":"Y Niv","year":"2012","journal-title":"J Neurosci"},{"issue":"5","key":"pcbi.1013226.ref039","doi-asserted-by":"crossref","first-page":"1320","DOI":"10.3758\/s13423-014-0790-3","article-title":"Do learning rates adapt to the distribution of rewards?","volume":"22","author":"SJ Gershman","year":"2015","journal-title":"Psychon Bull Rev"},{"issue":"1","key":"pcbi.1013226.ref040","doi-asserted-by":"crossref","first-page":"3574","DOI":"10.1038\/s41598-020-80593-7","article-title":"Dissociation between asymmetric value updating and perseverance in human reinforcement learning","volume":"11","author":"M Sugawara","year":"2021","journal-title":"Sci Rep"},{"issue":"6","key":"pcbi.1013226.ref041","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pbio.1001093","article-title":"Counterfactual choice and learning in a neural network centered on human lateral frontopolar cortex","volume":"9","author":"ED Boorman","year":"2011","journal-title":"PLoS Biol"},{"issue":"6","key":"pcbi.1013226.ref042","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1004953","article-title":"The computational development of reinforcement learning during adolescence","volume":"12","author":"S Palminteri","year":"2016","journal-title":"PLoS Comput Biol"},{"issue":"8","key":"pcbi.1013226.ref043","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005684","article-title":"Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing","volume":"13","author":"S Palminteri","year":"2017","journal-title":"PLoS Comput Biol"},{"issue":"32","key":"pcbi.1013226.ref044","doi-asserted-by":"crossref","first-page":"8360","DOI":"10.1523\/JNEUROSCI.1010-06.2006","article-title":"The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans","volume":"26","author":"AN Hampton","year":"2006","journal-title":"J Neurosci"},{"issue":"1","key":"pcbi.1013226.ref045","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1016\/j.neuron.2005.05.020","article-title":"Midbrain dopamine neurons encode a quantitative reward prediction error signal","volume":"47","author":"HM Bayer","year":"2005","journal-title":"Neuron"},{"key":"pcbi.1013226.ref046","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/j.conb.2020.08.014","article-title":"Dopamine signals as temporal difference errors: recent advances","volume":"67","author":"CK Starkweather","year":"2021","journal-title":"Curr Opin Neurobiol"},{"issue":"6","key":"pcbi.1013226.ref047","doi-asserted-by":"crossref","first-page":"1204","DOI":"10.1016\/j.neuron.2011.02.027","article-title":"Model-based influences on humans\u2019 choices and striatal prediction errors","volume":"69","author":"ND Daw","year":"2011","journal-title":"Neuron"},{"issue":"16","key":"pcbi.1013226.ref048","doi-asserted-by":"crossref","first-page":"4332","DOI":"10.1523\/JNEUROSCI.2700-16.2017","article-title":"Working memory load strengthens reward prediction errors","volume":"37","author":"AGE Collins","year":"2017","journal-title":"J Neurosci"},{"key":"pcbi.1013226.ref049","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/B978-0-12-812098-9.00005-X","article-title":"Learning structures through reinforcement","volume-title":"Goal-directed decision making","author":"AGE Collins","year":"2018"},{"issue":"5","key":"pcbi.1013226.ref050","doi-asserted-by":"crossref","first-page":"1346","DOI":"10.3758\/s13415-023-01104-5","article-title":"Lowered inter-stimulus discriminability hurts incremental contributions to learning","volume":"23","author":"AH Yoo","year":"2023","journal-title":"Cogn Affect Behav Neurosci"},{"issue":"6191","key":"pcbi.1013226.ref051","doi-asserted-by":"crossref","first-page":"1481","DOI":"10.1126\/science.1252254","article-title":"Human cognition. Foundations of human reasoning in the prefrontal cortex","volume":"344","author":"M Donoso","year":"2014","journal-title":"Science"},{"issue":"7889","key":"pcbi.1013226.ref052","doi-asserted-by":"crossref","first-page":"489","DOI":"10.1038\/s41586-021-04129-3","article-title":"Contextual inference underlies the learning of sensorimotor repertoires","volume":"600","author":"JB Heald","year":"2021","journal-title":"Nature"},{"issue":"1","key":"pcbi.1013226.ref053","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1037\/a0030852","article-title":"Cognitive control over learning: creating, clustering, and generalizing task-set structure","volume":"120","author":"AGE Collins","year":"2013","journal-title":"Psychol Rev"},{"issue":"10","key":"pcbi.1013226.ref054","doi-asserted-by":"crossref","first-page":"2502","DOI":"10.1073\/pnas.1720963115","article-title":"Within- and across-trial dynamics of human EEG reveal cooperative interplay between reinforcement learning and working memory","volume":"115","author":"AGE Collins","year":"2018","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"7464","key":"pcbi.1013226.ref055","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/nature12475","article-title":"Prolonged dopamine signalling in striatum signals proximity and value of distant rewards","volume":"500","author":"MW Howe","year":"2013","journal-title":"Nature"},{"issue":"2","key":"pcbi.1013226.ref056","doi-asserted-by":"crossref","first-page":"111470","DOI":"10.1016\/j.celrep.2022.111470","article-title":"Midbrain dopamine neurons signal phasic and ramping reward prediction error during goal-directed navigation","volume":"41","author":"K Farrell","year":"2022","journal-title":"Cell Rep"},{"issue":"49","key":"pcbi.1013226.ref057","doi-asserted-by":"crossref","first-page":"16585","DOI":"10.1523\/JNEUROSCI.3958-10.2010","article-title":"The flexible approach hypothesis: unification of effort and cue-responding hypotheses for the role of nucleus accumbens dopamine in the activation of reward-seeking behavior","volume":"30","author":"SM Nicola","year":"2010","journal-title":"J Neurosci"},{"issue":"5","key":"pcbi.1013226.ref058","doi-asserted-by":"crossref","first-page":"2549","DOI":"10.1152\/jn.00465.2017","article-title":"Limbic-motor integration by neural excitations and inhibitions in the nucleus accumbens","volume":"118","author":"SE Morrison","year":"2017","journal-title":"J Neurophysiol"},{"key":"pcbi.1013226.ref059","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.62583","article-title":"Slowly evolving dopaminergic activity modulates the moment-to-moment probability of reward-related self-timed movements","volume":"10","author":"AE Hamilos","year":"2021","journal-title":"Elife"},{"issue":"10","key":"pcbi.1013226.ref060","doi-asserted-by":"crossref","first-page":"1762","DOI":"10.1038\/s41593-023-01401-9","article-title":"Unique functional responses differentially map onto genetic subtypes of dopamine neurons","volume":"26","author":"M Azcorra","year":"2023","journal-title":"Nat Neurosci"},{"issue":"21","key":"pcbi.1013226.ref061","doi-asserted-by":"crossref","DOI":"10.1016\/j.cub.2021.08.052","article-title":"Dopamine release in the nucleus accumbens core signals perceived saliency","volume":"31","author":"MG Kutlu","year":"2021","journal-title":"Curr Biol"},{"issue":"8","key":"pcbi.1013226.ref062","doi-asserted-by":"crossref","first-page":"1071","DOI":"10.1038\/s41593-022-01126-1","article-title":"Dopamine signaling in the nucleus accumbens core mediates latent inhibition","volume":"25","author":"MG Kutlu","year":"2022","journal-title":"Nat Neurosci"},{"issue":"6","key":"pcbi.1013226.ref063","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1002\/jnr.24587","article-title":"Heterogeneity in striatal dopamine circuits: Form and function in dynamic reward seeking","volume":"98","author":"AL Collins","year":"2020","journal-title":"J Neurosci Res"},{"issue":"35","key":"pcbi.1013226.ref064","doi-asserted-by":"crossref","DOI":"10.1523\/JNEUROSCI.0120-24.2024","article-title":"Dopamine release in the nucleus accumbens core encodes the general excitatory components of learning","volume":"44","author":"M Taira","year":"2024","journal-title":"J Neurosci"},{"issue":"4","key":"pcbi.1013226.ref065","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1011516","article-title":"Dopamine encoding of novelty facilitates efficient uncertainty-driven exploration","volume":"20","author":"Y Wang","year":"2024","journal-title":"PLoS Comput Biol"},{"issue":"7947","key":"pcbi.1013226.ref066","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1038\/s41586-022-05614-z","article-title":"Mesolimbic dopamine adapts the rate of learning from action","volume":"614","author":"LT Coddington","year":"2023","journal-title":"Nature"},{"issue":"6626","key":"pcbi.1013226.ref067","doi-asserted-by":"crossref","DOI":"10.1126\/science.abq6740","article-title":"Mesolimbic dopamine release conveys causal associations","volume":"378","author":"H Jeong","year":"2022","journal-title":"Science"},{"issue":"12","key":"pcbi.1013226.ref068","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1004648","article-title":"Simple plans or sophisticated habits? State, transition and learning interactions in the two-step task","volume":"11","author":"T Akam","year":"2015","journal-title":"PLoS Comput Biol"},{"issue":"9","key":"pcbi.1013226.ref069","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1038\/nn.4613","article-title":"Dorsal hippocampus contributes to model-based planning","volume":"20","author":"KJ Miller","year":"2017","journal-title":"Nat Neurosci"},{"issue":"152","key":"pcbi.1013226.ref070","doi-asserted-by":"crossref","DOI":"10.3791\/60278","article-title":"Multi-fiber photometry to record neural activity in freely-moving animals","author":"E Martianova","year":"2019","journal-title":"J Vis Exp"},{"issue":"6","key":"pcbi.1013226.ref071","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1162\/089976602753712972","article-title":"Multiple model-based reinforcement learning","volume":"14","author":"K Doya","year":"2002","journal-title":"Neural Comput"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1013226","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013226","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T15:24:02Z","timestamp":1751901842000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013226"}},"subtitle":[],"editor":[{"given":"Tianming","family":"Yang","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,7,2]]},"references-count":71,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7,2]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1013226","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.11.10.566306","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,2]]}}}