{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"institution":[{"name":"bioRxiv"}],"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T11:31:16Z","timestamp":1768476676902,"version":"3.49.0"},"posted":{"date-parts":[[2018,1,7]]},"group-title":"Neuroscience","reference-count":41,"publisher":"openRxiv","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"accepted":{"date-parts":[[2019,9,18]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                <jats:p>Diffusion decision models (DDMs) are immensely successful models for decision-making under uncertainty and time pressure. In the context of perceptual decision making, these models typically start with two input units, organized in a neuron-antineuron pair. In contrast, in the brain, sensory inputs are encoded through the activity of large neuronal populations. Moreover, while DDMs are wired by hand, the nervous system must learn the weights of the network through trial and error. There is currently no normative theory of learning in DDMs and therefore no theory of how decision makers could learn to make optimal decisions in this context. Here, we derive the first such rule for learning a near-optimal linear combination of DDM inputs based on trial-by-trial feedback. The rule is Bayesian in the sense that it learns not only the mean of the weights but also the uncertainty around this mean in the form of a covariance matrix. In this rule, the rate of learning is proportional (resp. inversely proportional) to confidence for incorrect (resp. correct) decisions. Furthermore, we show that, in volatile environments, the rule predicts a bias towards repeating the same choice after correct decisions, with a bias strength that is modulated by the previous choice\u2019s difficulty. Finally, we extend our learning rule to cases for which one of the choices is more likely a priori, which provides new insights into how such biases modulate the mechanisms leading to optimal decisions in diffusion models.<\/jats:p>\n                <jats:sec>\n                  <jats:title>Significance Statement<\/jats:title>\n                  <jats:p>Popular models for the tradeoff between speed and accuracy of everyday decisions usually assume fixed, low-dimensional sensory inputs. In contrast, in the brain, these inputs are distributed across larger populations of neurons, and their interpretation needs to be learned from feedback. We ask how such learning could occur and demonstrate that efficient learning is significantly modulated by decision confidence. This modulation predicts a particular dependency pattern between consecutive choices, and provides new insight into how a priori biases for particular choices modulate the mechanisms leading to efficient decisions in these models.<\/jats:p>\n                <\/jats:sec>","DOI":"10.1101\/244269","type":"posted-content","created":{"date-parts":[[2018,1,8]],"date-time":"2018-01-08T01:10:15Z","timestamp":1515373815000},"source":"Crossref","is-referenced-by-count":5,"title":["Learning optimal decisions with confidence"],"prefix":"10.64898","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7846-0408","authenticated-orcid":false,"given":"Jan","family":"Drugowitsch","sequence":"first","affiliation":[]},{"given":"Andr\u00e9 G.","family":"Mendon\u00e7a","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7913-9109","authenticated-orcid":false,"given":"Zachary F.","family":"Mainen","sequence":"additional","affiliation":[]},{"given":"Alexandre","family":"Pouget","sequence":"additional","affiliation":[]}],"member":"54368","reference":[{"key":"2019092012334641000_244269v3.1","doi-asserted-by":"crossref","unstructured":"Doya K , Ishii S , Pouget A , Rao RPN (2006) Bayesian Brain: Probabilistic Approaches to Neural Coding (MIT Press).","DOI":"10.7551\/mitpress\/9780262042383.001.0001"},{"key":"2019092012334641000_244269v3.2","doi-asserted-by":"publisher","DOI":"10.1037\/\/0033-295X.85.2.59"},{"key":"2019092012334641000_244269v3.3","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2008.12-06-420"},{"key":"2019092012334641000_244269v3.4","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.113.4.700"},{"key":"2019092012334641000_244269v3.5","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.111.2.333"},{"key":"2019092012334641000_244269v3.6","unstructured":"Frazier PI , Yu AJ (2008) Sequential hypothesis testing under stochastic deadlines. Adv Neural Inf Process Syst:1\u20138."},{"key":"2019092012334641000_244269v3.7","doi-asserted-by":"crossref","first-page":"12400","DOI":"10.1038\/ncomms12400","article-title":"Optimal policy for value-based decision-making","volume":"7","year":"2016","journal-title":"Nat Commun"},{"key":"2019092012334641000_244269v3.8","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.4010-11.2012"},{"key":"2019092012334641000_244269v3.9","doi-asserted-by":"publisher","DOI":"10.1016\/S0896-6273(02)00971-6"},{"key":"2019092012334641000_244269v3.10","doi-asserted-by":"publisher","DOI":"10.7554\/elife.02224"},{"key":"2019092012334641000_244269v3.11","doi-asserted-by":"publisher","DOI":"10.1038\/81504"},{"key":"2019092012334641000_244269v3.12","doi-asserted-by":"publisher","DOI":"10.1080\/03772063.2003.11416335"},{"key":"2019092012334641000_244269v3.13","doi-asserted-by":"publisher","DOI":"10.1038\/nature02169"},{"key":"2019092012334641000_244269v3.14","doi-asserted-by":"publisher","DOI":"10.1016\/j.tics.2006.05.004"},{"key":"2019092012334641000_244269v3.15","doi-asserted-by":"publisher","DOI":"10.1037\/\/0033-295X.88.6.552"},{"key":"2019092012334641000_244269v3.16","doi-asserted-by":"publisher","DOI":"10.1037\/a0012667"},{"key":"2019092012334641000_244269v3.17","doi-asserted-by":"publisher","DOI":"10.1037\/a0033152"},{"key":"2019092012334641000_244269v3.18","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.5613-10.2011"},{"key":"2019092012334641000_244269v3.19","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2010.12-08-930"},{"key":"2019092012334641000_244269v3.20","doi-asserted-by":"publisher","DOI":"10.1167\/5.8.376"},{"key":"2019092012334641000_244269v3.21","unstructured":"Berger JO (1993) Statistical Decision Theory and Bayesian Analysis (Springer). 2nd Editio."},{"key":"2019092012334641000_244269v3.22","doi-asserted-by":"publisher","DOI":"10.1038\/nn.4240"},{"key":"2019092012334641000_244269v3.23","doi-asserted-by":"publisher","DOI":"10.1038\/nrn1888"},{"key":"2019092012334641000_244269v3.24","doi-asserted-by":"publisher","DOI":"10.1038\/nn.3807"},{"key":"2019092012334641000_244269v3.25","doi-asserted-by":"publisher","DOI":"10.1126\/science.1169405"},{"key":"2019092012334641000_244269v3.26","unstructured":"Bishop C (2006) Pattern Recognition and Machine Learning (Springer)."},{"key":"2019092012334641000_244269v3.27","doi-asserted-by":"publisher","DOI":"10.7554\/eLife.06633"},{"key":"2019092012334641000_244269v3.28","doi-asserted-by":"crossref","unstructured":"Cover TM , Thomas JA (2006) Elements of Information Theory (Wiley). 2nd Editio.","DOI":"10.1002\/047174882X"},{"key":"2019092012334641000_244269v3.29","unstructured":"Murphy KP (2012) Machine Learning: a Probabilistic Perspective (MIT Press)."},{"key":"2019092012334641000_244269v3.30","unstructured":"Graepel T , Qui\u00f1onero-Candela J , Borchert T , Herbrich R (2010) Web-Scale Bayesian Click-Through Rate Prediction for Sponsored Search Advertising in Microsoft\u2019s Bing Search Engine. Proceedings of the 27th International Conference on Machine Learning (ICML-10) Available at: http:\/\/citeseerx.ist.psu.edu\/viewdoc\/download?doi=10.1.1.165.5644&rep=rep1&type=pdf http:\/\/machinelearning.wustl.edu\/mlpapers\/paper_files\/icml2010_GraepelCBH10.pdf."},{"key":"2019092012334641000_244269v3.31","doi-asserted-by":"crossref","unstructured":"Chu W , Zinkevich M , Li L , Thomas A , Tseng B (2011) Unbiased online active learning in data streams. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD \u201911 (ACM Press, New York, New York, USA), p 195.","DOI":"10.1145\/2020408.2020444"},{"key":"2019092012334641000_244269v3.32","unstructured":"Sutton RS , Barto AG (2018) Reinforcement learing: an introduction (MIT Press). 2nd editio."},{"issue":"5","key":"2019092012334641000_244269v3.33","doi-asserted-by":"crossref","first-page":"1083","DOI":"10.1016\/j.neuron.2018.07.035","article-title":"Counterfactual Reasoning Underlies the Learning of Priors in Decision Making","volume":"99","year":"2018","journal-title":"Neuron"},{"key":"2019092012334641000_244269v3.34","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1615773114"},{"key":"2019092012334641000_244269v3.35","doi-asserted-by":"publisher","DOI":"10.1038\/nn1790"},{"key":"2019092012334641000_244269v3.36","doi-asserted-by":"publisher","DOI":"10.1016\/j.neuron.2008.09.021"},{"key":"2019092012334641000_244269v3.37","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.6689-10.2011"},{"key":"2019092012334641000_244269v3.38","first-page":"1873","article-title":"Sequential effects: Superstition or rational behavior?","volume":"21","year":"2009","journal-title":"Adv Neural Inf Process Syst"},{"key":"2019092012334641000_244269v3.39","unstructured":"Mendon\u00e7a AG , et al. (2019) The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs. bioRxiv:1\u201364."},{"key":"2019092012334641000_244269v3.40","doi-asserted-by":"publisher","DOI":"10.1038\/ncomms14637"},{"key":"2019092012334641000_244269v3.41","doi-asserted-by":"publisher","DOI":"10.1523\/JNEUROSCI.5948-11.2012"}],"container-title":[],"original-title":[],"link":[{"URL":"https:\/\/syndication.highwire.org\/content\/doi\/10.1101\/244269","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T22:35:55Z","timestamp":1768430155000},"score":1,"resource":{"primary":{"URL":"http:\/\/biorxiv.org\/lookup\/doi\/10.1101\/244269"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,1,7]]},"references-count":41,"URL":"https:\/\/doi.org\/10.1101\/244269","relation":{"is-preprint-of":[{"id-type":"doi","id":"10.1073\/pnas.1906787116","asserted-by":"subject"}]},"subject":[],"published":{"date-parts":[[2018,1,7]]},"subtype":"preprint"}}