Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum

Author

Makoto Ito
Kenji Doya

Abstract

Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy between the value-based strategy, which uses action value functions to predict rewards, and the model-based strategy, which uses internal models to predict environmental states. However, animals and humans often adopt simple procedural behaviors, such as the “win-stay, lose-switch” strategy, without explicit prediction of rewards or states. Here we consider another strategy, the finite state-based strategy, in which a subject selects an action depending on its discrete internal state and updates the state depending on the action chosen and the reward outcome. By analyzing the choice behavior of rats in a free-choice task, we found that the finite state-based strategy fitted their behavioral choices more accurately than the value-based and model-based strategies did. When the fitted models were run autonomously on the same task, only the finite state-based strategy could reproduce the key feature of the choice sequences. Analyses of neural activity recorded from the dorsolateral striatum (DLS), the dorsomedial striatum (DMS), and the ventral striatum (VS) identified significant fractions of neurons in all three subareas whose activity was correlated with individual states of the finite state-based strategy. Signals of individual internal states at the time of choice were found in DMS, and signals of clusters of states were found in VS. In addition, action values and state values of the value-based strategy were encoded in DMS and VS, respectively. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.

Author Summary

The neural mechanism of decision-making, the cognitive process of selecting one action among multiple possibilities, is a fundamental issue in neuroscience. Previous studies have revealed the roles of the cerebral cortex and the basal ganglia in decision-making by assuming that subjects use a value-based reinforcement learning strategy, in which the expected reward for each action candidate is updated. However, animals and humans often use simple procedural strategies, such as “win-stay, lose-switch.” In this study, we consider a finite state-based strategy, in which a subject acts depending on its discrete internal state and updates the state based on reward feedback. We found that the finite state-based strategy could reproduce the choice behavior of rats in a binary choice task with higher accuracy than the value-based strategy. Interestingly, neuronal activity in the striatum, a crucial brain region for reward-based learning, encoded information regarding both strategies. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.
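
As an illustration of the finite state-based strategy described above, the following minimal Python sketch implements an agent that chooses an action from its discrete internal state and then moves to a new state given the action and the reward outcome. It is not the authors' fitted model: the abstract does not give the number of states, the transition rules, or the task parameters, so the class name FiniteStateAgent, the two-state "win-stay, lose-switch" instance, the helper run_free_choice, and the reward probabilities are all assumptions made for this example.

```python
import random
from dataclasses import dataclass


@dataclass
class FiniteStateAgent:
    """Hypothetical finite state-based agent for a binary choice task."""
    policy: dict      # state -> probability of choosing action 1 (e.g. "right")
    transition: dict  # (state, action, reward) -> next state
    state: int = 0

    def act(self, rng):
        # The action depends only on the current discrete internal state.
        return 1 if rng.random() < self.policy[self.state] else 0

    def update(self, action, reward):
        # The next state depends on the current state, the action, and the reward.
        self.state = self.transition[(self.state, action, reward)]


def win_stay_lose_switch():
    """Two-state instance: state 0 always picks action 0, state 1 picks action 1."""
    policy = {0: 0.0, 1: 1.0}
    transition = {}
    for state in (0, 1):
        for action in (0, 1):
            transition[(state, action, 1)] = action      # win: stay
            transition[(state, action, 0)] = 1 - action  # lose: switch
    return FiniteStateAgent(policy, transition)


def run_free_choice(agent, reward_probs, n_trials, seed=0):
    """Run the agent autonomously; reward_probs are assumed, not from the paper."""
    rng = random.Random(seed)
    history = []
    for _ in range(n_trials):
        action = agent.act(rng)
        reward = 1 if rng.random() < reward_probs[action] else 0
        agent.update(action, reward)
        history.append((action, reward))
    return history


if __name__ == "__main__":
    trials = run_free_choice(win_stay_lose_switch(), (0.8, 0.2), n_trials=10)
    print(trials)
```

Richer strategies of the same family can be expressed by adding states and using choice probabilities strictly between 0 and 1 in the policy table. No action values are stored or updated here, which is the contrast with the value-based strategy drawn in the abstract.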

Suggested Citation

Makoto Ito & Kenji Doya, 2015. "Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum," PLOS Computational Biology, Public Library of Science, vol. 11(11), pages 1-25, November.

Handle: RePEc:plo:pcbi00:1004540
DOI: 10.1371/journal.pcbi.1004540

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Freund, Richard & Favara, Marta & Porter, Catherine & Behrman, Jere R., 2022. "Social Protection and Foundational Cognitive Skills during Adolescence: Evidence from a Large Public Works Programme," IZA Discussion Papers 15551, Institute of Labor Economics (IZA).
- Richard Freund & Marta Favara & Catherine Porter & Jere Behrman, 2022. "Social protection and foundational cognitive skills during adolescence: evidence from a large Public Works Programme," PIER Working Paper Archive 22-022, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Lisa Katharina Pendt & Iris Reuter & Hermann Müller, 2011. "Motor Skill Learning, Retention, and Control Deficits in Parkinson's Disease," PLOS ONE, Public Library of Science, vol. 6(7), pages 1-10, July.
Francesco Ceccarelli & Lorenzo Ferrucci & Fabrizio Londei & Surabhi Ramawat & Emiliano Brunamonti & Aldo Genovesio, 2023. "Static and dynamic coding in distinct cell types during associative learning in the prefrontal cortex," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
Johannes Algermissen & Jennifer C. Swart & René Scheeringa & Roshan Cools & Hanneke E. M. den Ouden, 2024. "Prefrontal signals precede striatal signals for biased credit assignment in motivational learning biases," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
Naveen Sendhilnathan & Anna Ipata & Michael E. Goldberg, 2021. "Mid-lateral cerebellar complex spikes encode multiple independent reward-related signals during reinforcement learning," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
