Author
Listed:
- Sean R Maulhardt
- Alec Solway
- Caroline J Charpentier
Abstract
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequentially delayed rewards, intervening events, and varying uncertainty via the amount of information presented during feedback. Using computational modeling, two learning strategies were developed: an eligibility trace, whereby previously selected actions are updated as a function of the temporal sequence, and a tabular update, whereby only systematically related past actions (rather than unrelated intervening events) are updated. We hypothesized that reduced information uncertainty would correlate with increased use of the tabular strategy, given the model’s capacity to incorporate additional feedback information. Both models effectively learned the task, and predicted choices made by participants (N = 142) as well as specific behavioral signatures of credit assignment. Consistent with our hypothesis, the tabular model outperformed the eligibility model under low information uncertainty, as evidenced by more accurate predictions of participants’ behavior and an increase in tabular weight. These findings provide new insights into the mechanisms implemented by humans to solve temporal credit assignment and adapt their strategy in varying environments.Author summary: People routinely experience uncertain and temporally extended environments that might be encountered again in the future. To overcome the burden of relearning these environments, people detect patterns that will facilitate future decision-making. Although pattern detection is complex, one such mechanism – known as credit assignment – is particularly important when identifying decisions to repeat and those to avoid. Credit assignment allows people to assign value to past experiences and then utilize these values for quick and efficient decision-making. However, this process becomes substantially more complex when the delay between a decision and its associated outcome increases, and when the available information decreases, making the environment more uncertain. Our research experimentally isolates the effects of delay to understand the strategies people use to solve credit assignment problems in environments with low and high uncertainty. We discover that information uncertainty influences how credit assignment strategies are deployed. Our methods, analyses, and results pave the way for more complex real-world experimentation that seeks to understand the interaction between reward, time, and uncertainty.
Suggested Citation
Sean R Maulhardt & Alec Solway & Caroline J Charpentier, 2026.
"Information uncertainty influences learning strategy from sequentially delayed rewards,"
PLOS Computational Biology, Public Library of Science, vol. 22(2), pages 1-23, February.
Handle:
RePEc:plo:pcbi00:1013879
DOI: 10.1371/journal.pcbi.1013879
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013879. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.