IDEAS home Printed from https://ideas.repec.org/a/nat/nathum/v7y2023i11d10.1038_s41562-023-01662-1.html
   My bibliography  Save this article

Developmental changes in exploration resemble stochastic optimization

Author

Listed:
  • Anna P. Giron

    (Human and Machine Cognition Lab, University of Tübingen
    Attention and Affect Lab, University of Tübingen)

  • Simon Ciranka

    (Center for Adaptive Rationality, Max Planck Institute for Human Development
    Max Planck UCL Centre for Computational Psychiatry and Ageing Research)

  • Eric Schulz

    (MPRG Computational Principles of Intelligence, Max Planck Institute for Biological Cybernetics)

  • Wouter Bos

    (University of Amsterdam
    Amsterdam Brain and Cognition, University of Amsterdam)

  • Azzurra Ruggeri

    (MPRG iSearch, Max Planck Institute for Human Development
    School of Social Sciences and Technology, Technical University Munich
    Central European University)

  • Björn Meder

    (MPRG iSearch, Max Planck Institute for Human Development
    Institute for Mind, Brain and Behavior, Health and Medical University)

  • Charley M. Wu

    (Human and Machine Cognition Lab, University of Tübingen
    Center for Adaptive Rationality, Max Planck Institute for Human Development)

Abstract

Human development is often described as a ‘cooling off’ process, analogous to stochastic optimization algorithms that implement a gradual reduction in randomness over time. Yet there is ambiguity in how to interpret this analogy, due to a lack of concrete empirical comparisons. Using data from n = 281 participants ages 5 to 55, we show that cooling off does not only apply to the single dimension of randomness. Rather, human development resembles an optimization process of multiple learning parameters, for example, reward generalization, uncertainty-directed exploration and random temperature. Rapid changes in parameters occur during childhood, but these changes plateau and converge to efficient values in adulthood. We show that while the developmental trajectory of human parameters is strikingly similar to several stochastic optimization algorithms, there are important differences in convergence. None of the optimization algorithms tested were able to discover reliably better regions of the strategy space than adult participants on this task.

Suggested Citation

  • Anna P. Giron & Simon Ciranka & Eric Schulz & Wouter Bos & Azzurra Ruggeri & Björn Meder & Charley M. Wu, 2023. "Developmental changes in exploration resemble stochastic optimization," Nature Human Behaviour, Nature, vol. 7(11), pages 1955-1967, November.
  • Handle: RePEc:nat:nathum:v:7:y:2023:i:11:d:10.1038_s41562-023-01662-1
    DOI: 10.1038/s41562-023-01662-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41562-023-01662-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41562-023-01662-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Magda Dubois & Tobias U. Hauser, 2022. "Value-free random exploration is linked to impulsivity," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    2. Simon Ciranka & Juan Linde-Domingo & Ivan Padezhki & Clara Wicharz & Charley M. Wu & Bernhard Spitzer, 2022. "Asymmetric reinforcement learning facilitates human inference of transitive relations," Nature Human Behaviour, Nature, vol. 6(4), pages 555-564, April.
    3. Marieke Jepma & Jessica V Schaaf & Ingmar Visser & Hilde M Huizenga, 2020. "Uncertainty-driven regulation of learning and exploration in adolescents: A computational account," PLOS Computational Biology, Public Library of Science, vol. 16(9), pages 1-29, September.
    4. Joseph Henrich & Steven J. Heine & Ara Norenzayan, 2010. "Most people are not WEIRD," Nature, Nature, vol. 466(7302), pages 29-29, July.
    5. Charley M Wu & Eric Schulz & Mona M Garvert & Björn Meder & Nicolas W Schuck, 2020. "Similarities and differences in spatial and non-spatial cognitive maps," PLOS Computational Biology, Public Library of Science, vol. 16(9), pages 1-28, September.
    6. Samuel J Gershman, 2015. "A Unifying Probabilistic View of Associative Learning," PLOS Computational Biology, Public Library of Science, vol. 11(11), pages 1-20, November.
    7. Ronald E. Dahl & Nicholas B. Allen & Linda Wilbrecht & Ahna Ballonoff Suleiman, 2018. "Importance of investing in adolescence from a developmental science perspective," Nature, Nature, vol. 554(7693), pages 441-450, February.
    8. Stefano Palminteri & Emma J Kilford & Giorgio Coricelli & Sarah-Jayne Blakemore, 2016. "The Computational Development of Reinforcement Learning during Adolescence," PLOS Computational Biology, Public Library of Science, vol. 12(6), pages 1-25, June.
    9. Nathaniel D. Daw & John P. O'Doherty & Peter Dayan & Ben Seymour & Raymond J. Dolan, 2006. "Cortical substrates for exploratory decisions in humans," Nature, Nature, vol. 441(7095), pages 876-879, June.
    10. Charley M. Wu & Eric Schulz & Maarten Speekenbrink & Jonathan D. Nelson & Björn Meder, 2018. "Generalization guides human exploration in vast decision spaces," Nature Human Behaviour, Nature, vol. 2(12), pages 915-924, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olschewski, Sebastian & Diao, Linan & Rieskamp, Jörg, 2021. "Reinforcement learning about asset variability and correlation in repeated portfolio decisions," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    2. Magda Dubois & Tobias U. Hauser, 2022. "Value-free random exploration is linked to impulsivity," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    3. Maël Lebreton & Karin Bacily & Stefano Palminteri & Jan B Engelmann, 2019. "Contextual influence on confidence judgments in human reinforcement learning," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-27, April.
    4. Ruth Pauli & Inti A. Brazil & Gregor Kohls & Miriam C. Klein-Flügge & Jack C. Rogers & Dimitris Dikeos & Roberta Dochnal & Graeme Fairchild & Aranzazu Fernández-Rivas & Beate Herpertz-Dahlmann & Amaia, 2023. "Action initiation and punishment learning differ from childhood to adolescence while reward learning remains stable," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    5. Yongping Bao & Ludwig Danwitz & Fabian Dvorak & Sebastian Fehrler & Lars Hornuf & Hsuan Yu Lin & Bettina von Helversen, 2022. "Similarity and Consistency in Algorithm-Guided Exploration," CESifo Working Paper Series 10188, CESifo.
    6. John A. List, 2024. "Optimally generate policy-based evidence before scaling," Nature, Nature, vol. 626(7999), pages 491-499, February.
    7. Bouma, J.A. & Nguyen, Binh & van der Heijden, Eline & Dijk, J.J., 2018. "Analysing Group Contract Design Using a Lab and a Lab-in-the-Field Threshold Public Good Experiment," Discussion Paper 2018-049, Tilburg University, Center for Economic Research.
    8. Peter S. Riefer & Bradley C. Love, 2015. "Unfazed by Both the Bull and Bear: Strategic Exploration in Dynamic Environments," Games, MDPI, vol. 6(3), pages 1-11, August.
    9. Makoto Naruse & Eiji Yamamoto & Takashi Nakao & Takuma Akimoto & Hayato Saigo & Kazuya Okamura & Izumi Ojima & Georg Northoff & Hirokazu Hori, 2018. "Why is the environment important for decision making? Local reservoir model for choice-based learning," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-17, October.
    10. repec:cup:judgdm:v:16:y:2021:i:6:p:1392-1412 is not listed on IDEAS
    11. Ran Xu & Navid Ghaffarzadegan, 2018. "Neuroscience bridging scientific disciplines in health: Who builds the bridge, who pays for it?," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 1183-1204, November.
    12. Holger Mohr & Katharina Zwosta & Dimitrije Markovic & Sebastian Bitzer & Uta Wolfensteller & Hannes Ruge, 2018. "Deterministic response strategies in a trial-and-error learning task," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-19, November.
    13. Cornand, Camille & Hubert, Paul, 2020. "On the external validity of experimental inflation forecasts: A comparison with five categories of field expectations," Journal of Economic Dynamics and Control, Elsevier, vol. 110(C).
    14. Michael Muthukrishna & Joseph Henrich & Wataru Toyokawa & Takeshi Hamamura & Tatsuya Kameda & Steven J Heine, 2018. "Overconfidence is universal? Elicitation of Genuine Overconfidence (EGO) procedure reveals systematic differences across domain, task knowledge, and incentives in four populations," PLOS ONE, Public Library of Science, vol. 13(8), pages 1-30, August.
    15. Christina Fang & Daniel Levinthal, 2009. "Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems," Organization Science, INFORMS, vol. 20(3), pages 538-551, June.
    16. Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
    17. James Derbyshire & Mandeep Dhami & Ian Belton & Dilek Önkal, 2023. "The value of experiments in futures and foresight science as illustrated by the case of scenario planning," Futures & Foresight Science, John Wiley & Sons, vol. 5(2), June.
    18. Ingo S. Seifert & Julia M. Rohrer & Boris Egloff & Stefan C. Schmukle, 2021. "The Development of the Rank-Order Stability of the Big Five across the Life Span," SOEPpapers on Multidisciplinary Panel Data Research 1156, DIW Berlin, The German Socio-Economic Panel (SOEP).
    19. Bria Long & Judith E. Fan & Holly Huey & Zixian Chai & Michael C. Frank, 2024. "Parallel developmental changes in children’s production and recognition of line drawings of visual concepts," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    20. Manpreet Blessin & Sophie Lehmann & Angela M. Kunzler & Rolf van Dick & Klaus Lieb, 2022. "Resilience Interventions Conducted in Western and Eastern Countries—A Systematic Review," IJERPH, MDPI, vol. 19(11), pages 1-25, June.
    21. Shawn Grover & John F. Helliwell, 2019. "How’s Life at Home? New Evidence on Marriage and the Set Point for Happiness," Journal of Happiness Studies, Springer, vol. 20(2), pages 373-390, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nathum:v:7:y:2023:i:11:d:10.1038_s41562-023-01662-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.