
Optimal structure of metaplasticity for adaptive learning

Author

Listed:
  • Peyman Khorsand
  • Alireza Soltani

Abstract

Learning from reward feedback in a changing environment requires a high degree of adaptability, yet the precise estimation of reward information demands slow updates. In the framework of estimating reward probability, here we investigated how this tradeoff between adaptability and precision can be mitigated via metaplasticity, i.e., synaptic changes that do not always alter synaptic efficacy. Using mean-field and Monte Carlo simulations, we identified ‘superior’ metaplastic models that can substantially overcome the adaptability-precision tradeoff. These models achieve both adaptability and precision by forming two separate sets of meta-states: reservoirs and buffers. Synapses in reservoir meta-states do not change their efficacy upon reward feedback, whereas those in buffer meta-states can change their efficacy. Rapid changes in efficacy are limited to synapses occupying buffers, creating a bottleneck that reduces noise without significantly decreasing adaptability. In contrast, more-populated reservoirs can generate a strong signal without manifesting any observable plasticity. By comparing the behavior of our model and a few competing models during a dynamic probability-estimation task, we found that superior metaplastic models perform close to optimally for a wider range of model parameters. Finally, we found that metaplastic models are robust to changes in model parameters and that metaplastic transitions are crucial for adaptive learning, since replacing them with graded plastic transitions (transitions that change synaptic efficacy) reduces the ability to overcome the adaptability-precision tradeoff. Overall, our results suggest that the ubiquitous unreliability of synaptic changes evinces metaplasticity, which can provide a robust mechanism for mitigating the tradeoff between adaptability and precision and thus for adaptive learning.

Author summary: Successful learning from our experience and from feedback from the environment requires that the reward value assigned to a given option or action be updated by a precise amount after each feedback. In the standard model for reward-based learning, known as reinforcement learning, the learning rate determines the strength of such updates. A large learning rate allows fast updating of values (large adaptability) but introduces noise (small precision), whereas a small learning rate does the opposite. Thus, learning seems to be bounded by a tradeoff between adaptability and precision. Here, we asked whether there are synaptic mechanisms that are capable of adjusting the brain’s level of plasticity according to reward statistics and, therefore, allow the learning process to be adaptive. We showed that metaplasticity, changes in the synaptic state that shape future synaptic modifications without any observable changes in the strength of synapses, could provide such a mechanism, and furthermore, we identified the optimal structure of such metaplasticity. We propose that metaplasticity, which sometimes causes no observable changes in behavior and thus could be perceived as a lack of learning, can provide a robust mechanism for adaptive learning.
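The adaptability-precision tradeoff described in the abstract and author summary can be made concrete with a short simulation. The sketch below contrasts a standard delta-rule estimator run with a large versus a small learning rate, and adds a schematic binary-synapse population with "buffer" and "reservoir" meta-states, loosely following the verbal description above. All names, transition probabilities, the number of meta-states, the population size, and the readout are illustrative assumptions made for this example; they are not the authors' actual model or fitted parameters.

```python
import random

def delta_rule_estimate(outcomes, learning_rate):
    """Track reward probability with a fixed learning rate (standard RL update).

    A large learning rate adapts quickly after a change point but yields a
    noisy estimate; a small learning rate is precise but slow to adapt.
    """
    estimate, trace = 0.5, []
    for r in outcomes:                      # r is 1 (reward) or 0 (no reward)
        estimate += learning_rate * (r - estimate)
        trace.append(estimate)
    return trace

# Schematic metaplastic population: each synapse is weak (0) or strong (1) and
# sits at some meta-state depth.  Only depth-0 ("buffer") synapses can change
# efficacy; deeper ("reservoir") synapses must first move back to the buffer.
N_DEPTHS = 3        # meta-states per efficacy level (assumed value)
P_SWITCH = 0.5      # chance a buffer synapse flips efficacy (assumed value)
P_DEEPEN = 0.3      # chance a "consistent" synapse sinks deeper (assumed value)

def update_population(synapses, reward):
    """One feedback step: reward pushes synapses toward strong, no reward toward weak."""
    target = 1 if reward else 0
    updated = []
    for eff, depth in synapses:
        if eff == target:
            # Consistent feedback: drift into a reservoir, harder to change later.
            if depth < N_DEPTHS - 1 and random.random() < P_DEEPEN:
                depth += 1
        elif depth > 0:
            # Inconsistent feedback on a reservoir synapse: move toward the buffer.
            depth -= 1
        elif random.random() < P_SWITCH:
            # Inconsistent feedback on a buffer synapse: efficacy can actually flip.
            eff, depth = target, 0
        updated.append((eff, depth))
    return updated

def readout(synapses):
    """Fraction of strong synapses, read out as the estimated reward probability."""
    return sum(eff for eff, _ in synapses) / len(synapses)

# Reward probability switches from 0.2 to 0.8 halfway through the session.
random.seed(1)
outcomes = [int(random.random() < 0.2) for _ in range(200)] + \
           [int(random.random() < 0.8) for _ in range(200)]

fast = delta_rule_estimate(outcomes, 0.3)    # adaptable but noisy
slow = delta_rule_estimate(outcomes, 0.03)   # precise but sluggish after the switch

synapses = [(random.randint(0, 1), 0) for _ in range(500)]
meta_trace = []
for r in outcomes:
    synapses = update_population(synapses, r)
    meta_trace.append(readout(synapses))
```

With these arbitrary settings the metaplastic readout is only a qualitative illustration: it tracks which outcome currently dominates, while the buffer bottleneck limits how many synapses can flip on any single trial and synapses accumulated in reservoirs keep the readout stable between change points.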

Suggested Citation

  • Peyman Khorsand & Alireza Soltani, 2017. "Optimal structure of metaplasticity for adaptive learning," PLOS Computational Biology, Public Library of Science, vol. 13(6), pages 1-22, June.
  • Handle: RePEc:plo:pcbi00:1005630
    DOI: 10.1371/journal.pcbi.1005630

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005630
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1005630&type=printable
    Download Restriction: no


    References listed on IDEAS

    1. Elise Payzan-LeNestour & Peter Bossaerts, 2011. "Risk, Unexpected Uncertainty, and Estimation Uncertainty: Bayesian Learning in Unstable Settings," PLOS Computational Biology, Public Library of Science, vol. 7(1), pages 1-14, January.
    2. Seneta, E., 1993. "Sensitivity of finite Markov chains under perturbation," Statistics & Probability Letters, Elsevier, vol. 17(2), pages 163-168, May.

    Citations

    Citations are extracted by the CitEc Project.


    Cited by:

    1. Payam Piray & Nathaniel D. Daw, 2021. "A model for learning based on the joint estimation of stochasticity and volatility," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    2. Micha Heilbron & Florent Meyniel, 2019. "Confidence resets reveal hierarchical adaptive learning in humans," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-24, April.
    3. Shiva Farashahi & Alireza Soltani, 2021. "Computational mechanisms of distributed value representations and mixed learning strategies," Nature Communications, Nature, vol. 12(1), pages 1-18, December.
    4. Payam Piray & Nathaniel D Daw, 2020. "A simple model for learning in volatile environments," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-26, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Micha Heilbron & Florent Meyniel, 2019. "Confidence resets reveal hierarchical adaptive learning in humans," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-24, April.
    2. Payam Piray & Nathaniel D Daw, 2020. "A simple model for learning in volatile environments," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-26, July.
    3. Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
    4. Mateus Joffily & Giorgio Coricelli, 2013. "Emotional Valence and the Free-Energy Principle," Post-Print halshs-00834063, HAL.
    5. Daniel S Kluger & Nico Broers & Marlen A Roehe & Moritz F Wurm & Niko A Busch & Ricarda I Schubotz, 2020. "Exploitation of local and global information in predictive processing," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-17, April.
    6. Dimitrije Marković & Andrea M F Reiter & Stefan J Kiebel, 2019. "Predicting change: Approximate inference under explicit representation of temporal structure in changing environments," PLOS Computational Biology, Public Library of Science, vol. 15(1), pages 1-31, January.
    7. Vahid Moosavi & Giulio Isacchini, 2016. "A Markovian Model of the Evolving World Input-Output Network," Papers 1612.06186, arXiv.org, revised Sep 2017.
    8. Cruz, Juan Alberto Rojas, 2020. "Sensitivity of the stationary distributions of denumerable Markov chains," Statistics & Probability Letters, Elsevier, vol. 166(C).
    9. VIEILLE, Nicolas & SOLAN, Eilon, 2002. "Perturbed Markov Chains," HEC Research Papers Series 757, HEC Paris.
    10. Sam Gijsen & Miro Grundei & Robert T Lange & Dirk Ostwald & Felix Blankenburg, 2021. "Neural surprise in somatosensory Bayesian learning," PLOS Computational Biology, Public Library of Science, vol. 17(2), pages 1-36, February.
    11. Philipp Schustek & Rubén Moreno-Bote, 2018. "Instance-based generalization for human judgments about uncertainty," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-27, June.
    12. Jill X O'Reilly & Saad Jbabdi & Matthew F S Rushworth & Timothy E J Behrens, 2013. "Brain Systems for Probabilistic and Dynamic Prediction: Computational Specificity and Integration," PLOS Biology, Public Library of Science, vol. 11(9), pages 1-14, September.
    13. Vahid Moosavi & Giulio Isacchini, 2017. "A Markovian model of evolving world input-output network," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-18, October.
    14. Maria Gamboa & Maria Jesus Lopez-Herrero, 2020. "The Effect of Setting a Warning Vaccination Level on a Stochastic SIVS Model with Imperfect Vaccine," Mathematics, MDPI, vol. 8(7), pages 1-23, July.
    15. Sang Wan Lee & John P O’Doherty & Shinsuke Shimojo, 2015. "Neural Computations Mediating One-Shot Learning in the Human Brain," PLOS Biology, Public Library of Science, vol. 13(4), pages 1-36, April.
    16. P.-C.G. Vassiliou, 2021. "Non-Homogeneous Markov Set Systems," Mathematics, MDPI, vol. 9(5), pages 1-25, February.
    17. Nazanin Mohammadi Sepahvand & Elisabeth Stöttinger & James Danckert & Britt Anderson, 2014. "Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-8, April.
    18. Fletcher, Cameron S. & Ganegodage, K. Renuka & Hildenbrand, Marian D. & Rambaldi, Alicia N., 2022. "The behaviour of property prices when affected by infrequent floods," Land Use Policy, Elsevier, vol. 122(C).
    19. Florent Meyniel & Daniel Schlunegger & Stanislas Dehaene, 2015. "The Sense of Confidence during Probabilistic Learning: A Normative Account," PLOS Computational Biology, Public Library of Science, vol. 11(6), pages 1-25, June.
    20. Bruno B Averbeck, 2015. "Theory of Choice in Bandit, Information Sampling and Foraging Tasks," PLOS Computational Biology, Public Library of Science, vol. 11(3), pages 1-28, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1005630. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows you to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form.

    If you know of missing items citing this one, you can help us create those links by adding the relevant references in the same way as above, for each referring item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.