IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1007334.html

Learning the structure of the world: The adaptive nature of state-space and action representations in multi-stage decision-making

Author

Listed:
  • Amir Dezfouli
  • Bernard W Balleine

Abstract

State-space and action representations form the building blocks of decision-making processes in the brain; states map external cues to the current situation of the agent whereas actions provide the set of motor commands from which the agent can choose to achieve specific goals. Although these factors differ across environments, it is currently unknown whether or how accurately state and action representations are acquired by the agent because previous experiments have typically provided this information a priori through instruction or pre-training. Here we studied how state and action representations adapt to reflect the structure of the world when such a priori knowledge is not available. We used a sequential decision-making task in rats in which they were required to pass through multiple states before reaching the goal, and for which the number of states and how they map onto external cues were unknown a priori. We found that, early in training, animals selected actions as if the task was not sequential and outcomes were the immediate consequence of the most proximal action. During the course of training, however, rats recovered the true structure of the environment and made decisions based on the expanded state-space, reflecting the multiple stages of the task. Similarly, we found that the set of actions expanded with training, although the emergence of new action sequences was sensitive to the experimental parameters and specifics of the training procedure. We conclude that the profile of choices shows a gradual shift from simple representations to more complex structures compatible with the structure of the world.Author summary: Everyday decision-making tasks typically require taking multiple actions and passing through multiple states before reaching desired goals. Such states constitute the state-space of the task. Here we show that, contrary to current assumptions, the state-space is not static but rather expands during training as subjects discover new states that help them efficiently solve the task. Similarly, within the same task, we show that subjects initially only consider taking simple actions, but as training progresses the set of actions can expand to include useful action sequences that reach the goal directly by passing through multiple states. These results provide evidence that state-space and action representations are not static but are acquired and then adapted to reflect the structure of the world.

Suggested Citation

  • Amir Dezfouli & Bernard W Balleine, 2019. "Learning the structure of the world: The adaptive nature of state-space and action representations in multi-stage decision-making," PLOS Computational Biology, Public Library of Science, vol. 15(9), pages 1-22, September.
  • Handle: RePEc:plo:pcbi00:1007334
    DOI: 10.1371/journal.pcbi.1007334
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007334
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1007334&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1007334?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Amir Dezfouli & Bernard W Balleine, 2013. "Actions, Action Sequences and Habits: Evidence That Goal-Directed and Habitual Action Control Are Hierarchically Organized," PLOS Computational Biology, Public Library of Science, vol. 9(12), pages 1-14, December.
    2. Shiva Farashahi & Katherine Rowe & Zohra Aslami & Daeyeol Lee & Alireza Soltani, 2017. "Feature-based learning improves adaptability without compromising precision," Nature Communications, Nature, vol. 8(1), pages 1-16, December.
    3. Amir Dezfouli & Kristi Griffiths & Fabio Ramos & Peter Dayan & Bernard W Balleine, 2019. "Models that learn how humans learn: The case of decision-making and its disorders," PLOS Computational Biology, Public Library of Science, vol. 15(6), pages 1-33, June.
    4. Thomas Akam & Rui Costa & Peter Dayan, 2015. "Simple Plans or Sophisticated Habits? State, Transition and Learning Interactions in the Two-Step Task," PLOS Computational Biology, Public Library of Science, vol. 11(12), pages 1-25, December.
    5. Bates, Douglas & Mächler, Martin & Bolker, Ben & Walker, Steve, 2015. "Fitting Linear Mixed-Effects Models Using lme4," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i01).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wouter Kool & Fiery A Cushman & Samuel J Gershman, 2016. "When Does Model-Based Control Pay Off?," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-34, August.
    2. Shiva Farashahi & Alireza Soltani, 2021. "Computational mechanisms of distributed value representations and mixed learning strategies," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    3. Bruno Miranda & W M Nishantha Malalasekera & Timothy E Behrens & Peter Dayan & Steven W Kennerley, 2020. "Combined model-free and model-sensitive reinforcement learning in non-human primates," PLOS Computational Biology, Public Library of Science, vol. 16(6), pages 1-25, June.
    4. Timea Marton & Jinglong Wang & Amaury Vaysse & Wei Yu & Pierre-Henri Commere & Quentin Holleville & Tristan Espie-Caullet & Richard Frock & Ludovic Deriano, 2025. "Polymerase theta repairs persistent G1-induced DNA breaks in S-phase during class switch recombination," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    5. Chang Sik Kim & Jonathan Cairns & Valentina Quarantotti & Bogumil Kaczkowski & Yinhai Wang & Peter Konings & Xiang Zhang, 2024. "A statistical simulation model to guide the choices of analytical methods in arrayed CRISPR screen experiments," PLOS ONE, Public Library of Science, vol. 19(8), pages 1-16, August.
    6. Madeleine Seale & Annamaria Kiss & Simone Bovio & Ignazio Maria Viola & Enrico Mastropaolo & Arezki Boudaoud & Naomi Nakayama, 2022. "Dandelion pappus morphing is actuated by radially patterned material swelling," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    7. JANSSENS, Jochen & DE CORTE, Annelies & SÖRENSEN, Kenneth, 2016. "Water distribution network design optimisation with respect to reliability," Working Papers 2016007, University of Antwerp, Faculty of Business and Economics.
    8. Marion Chatelain, 2023. "Endogeic Earthworms Avoid Soil Mimicking Metal Pollution Levels in Urban Parks," Sustainability, MDPI, vol. 15(15), pages 1-17, July.
    9. Silva Larson & Anne (Giger)-Dray & Tina Cornioley & Manithaythip Thephavanh & Phomma Thammavong & Sisavan Vorlasan & John G. Connell & Magnus Moglia & Peter Case & Kim S. Alexander & Pascal Perez, 2020. "A Game-Based Approach to Exploring Gender Differences in Smallholder Decisions to Change Farming Practices: White Rice Production in Laos," Sustainability, MDPI, vol. 12(16), pages 1-22, August.
    10. Teruaki Kido & Yuko Yotsumoto & Masamichi J. Hayashi, 2025. "Hierarchical representations of relative numerical magnitudes in the human frontoparietal cortex," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    11. Sławomir Kujawski & Agnieszka Kujawska & Mariusz Kozakiewicz & Djordje G. Jakovljevic & Błażej Stankiewicz & Julia L. Newton & Kornelia Kędziora-Kornatowska & Paweł Zalewski, 2022. "Effects of Sitting Callisthenic Balance and Resistance Exercise Programs on Cognitive Function in Older Participants," IJERPH, MDPI, vol. 19(22), pages 1-18, November.
    12. Raymond Hernandez & Elizabeth A. Pyatak & Cheryl L. P. Vigen & Haomiao Jin & Stefan Schneider & Donna Spruijt-Metz & Shawn C. Roll, 2021. "Understanding Worker Well-Being Relative to High-Workload and Recovery Activities across a Whole Day: Pilot Testing an Ecological Momentary Assessment Technique," IJERPH, MDPI, vol. 18(19), pages 1-17, October.
    13. Christopher Hassall & Michael Nisbet & Evan Norcliffe & He Wang, 2024. "The Potential Health Benefits of Urban Tree Planting Suggested through Immersive Environments," Land, MDPI, vol. 13(3), pages 1-12, February.
    14. Jie Zhao & Ji Chen & Damien Beillouin & Hans Lambers & Yadong Yang & Pete Smith & Zhaohai Zeng & Jørgen E. Olesen & Huadong Zang, 2022. "Global systematic review with meta-analysis reveals yield advantage of legume-based rotations and its drivers," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    15. Elisabeth Beckmann & Lukas Olbrich & Joseph Sakshaug, 2024. "Multivariate assessment of interviewer-related errors in a cross-national economic survey (Lukas Olbrich, Elisabeth Beckmann, Joseph W. Sakshaug)," Working Papers 253, Oesterreichische Nationalbank (Austrian Central Bank).
    16. Migchelbrink, Koen & Raymaekers, Pieter, 2023. "Nudging people to pay their parking fines on time. Evidence from a cluster-randomized field experiment," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 105(C).
    17. repec:ags:aaea22:335993 is not listed on IDEAS
    18. F J Heather & D Z Childs & A M Darnaude & J L Blanchard, 2018. "Using an integral projection model to assess the effect of temperature on the growth of gilthead seabream Sparus aurata," PLOS ONE, Public Library of Science, vol. 13(5), pages 1-19, May.
    19. Francisco Ruiz-Raya & Jose C Noguera & Alberto Velando, 2022. "Light received by embryos promotes postnatal junior phenotypes in a seabird [The evolution of social behavior]," Behavioral Ecology, International Society for Behavioral Ecology, vol. 33(6), pages 1047-1057.
    20. Valentina Krenz & Arjen Alink & Tobias Sommer & Benno Roozendaal & Lars Schwabe, 2023. "Time-dependent memory transformation in hippocampus and neocortex is semantic in nature," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    21. Thomas A. Verschut & Renny Ng & Nicolas P. Doubovetzky & Guillaume Calvez & Jan L. Sneep & Adriaan J. Minnaard & Chih-Ying Su & Mikael A. Carlsson & Bregje Wertheim & Jean-Christophe Billeter, 2023. "Aggregation pheromones have a non-linear effect on oviposition behavior in Drosophila melanogaster," Nature Communications, Nature, vol. 14(1), pages 1-15, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1007334. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.