Dopamine-independent effect of rewards on choices through hidden-state inference.

Saved in:
Bibliographic Details
Title: Dopamine-independent effect of rewards on choices through hidden-state inference.
Authors: Blanco-Pozo, Marta (AUTHOR), Akam, Thomas (AUTHOR), Walton, Mark E. (AUTHOR)
Source: Nature Neuroscience. Feb2024, Vol. 27 Issue 2, p286-297. 12p.
Abstract: Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce. Reward can both update values and convey information about the state of the world. Dopamine recordings and manipulations in highly trained mice making decisions, alongside modeling, show that dopamine supports the former but not the latter process. [ABSTRACT FROM AUTHOR]
Copyright of Nature Neuroscience is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Psychology and Behavioral Sciences Collection
Description
Abstract:Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce. Reward can both update values and convey information about the state of the world. Dopamine recordings and manipulations in highly trained mice making decisions, alongside modeling, show that dopamine supports the former but not the latter process. [ABSTRACT FROM AUTHOR]
ISSN:10976256
DOI:10.1038/s41593-023-01542-x