IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0342305.html

Giving AI agents a sense of control facilitates reinforcement learning in multitasking scenarios

Author

Listed:
  • Annika Österdiekhoff
  • Nils Wendel Heinrich
  • Nele Russwinkel
  • Stefan Kopp

Abstract

Having to control multiple tasks in parallel poses challenges for humans and artificial agents alike. In artificial intelligence, specific forms of reinforcement learning (RL), most notably hierarchical and model-based RL, have shown promising results in scenarios where tasks or skills need to be switched adaptively. However, RL agents still encounter difficulties when faced with serial multitasking that involves switching control between continuously running subtasks, such as changing the radio station while driving in traffic. Inspired by human cognitive processes, we hypothesize that maintaining a sense of control is a key mechanism facilitating such task-switching decisions. We propose a mathematical formulation of a situational sense of control that consists of two components: an evaluative indicator of the predictability of action outcomes and a predictive indicator of a need for control in individual subtasks. We integrate this model of a sense of control into a hierarchical RL agent and evaluate its performance in a Collect Asteroids game environment, in which one must alternate between navigating two spaceships to collect as many asteroids as possible. Comparing RL agents with and without a sense of control, as well as with human participants, shows that equipping RL agents with a sense of control results in significant performance improvements. Our findings indicate that agents equipped with a sense of control prioritize more complex tasks, exhibit increased switching behavior, and make switches at strategically optimal times, leading to superior overall performance. The incorporation of cognitive mechanisms, inspired by human behavior, into RL agents thus appears to yield considerable enhancements in performance when acting in complex and dynamic environments.

Suggested Citation

  • Annika Österdiekhoff & Nils Wendel Heinrich & Nele Russwinkel & Stefan Kopp, 2026. "Giving AI agents a sense of control facilitates reinforcement learning in multitasking scenarios," PLOS ONE, Public Library of Science, vol. 21(2), pages 1-27, February.
  • Handle: RePEc:plo:pone00:0342305
    DOI: 10.1371/journal.pone.0342305
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0342305
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0342305&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0342305?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuchen Zhang & Wei Yang, 2022. "Breakthrough invention and problem complexity: Evidence from a quasi‐experiment," Strategic Management Journal, Wiley Blackwell, vol. 43(12), pages 2510-2544, December.
    2. Jiacheng Zhang & Haolan Zhang, 2025. "Towards Human-like Artificial Intelligence: A Review of Anthropomorphic Computing in AI and Future Trends," Mathematics, MDPI, vol. 13(13), pages 1-49, June.
    3. Daníelsson, Jón & Macrae, Robert & Uthemann, Andreas, 2022. "Artificial intelligence and systemic risk," Journal of Banking & Finance, Elsevier, vol. 140(C).
    4. Zhang, Xi & Wang, Qin & Bi, Xiaowen & Li, Donghong & Liu, Dong & Yu, Yuanjin & Tse, Chi Kong, 2024. "Mitigating cascading failure in power grids with deep reinforcement learning-based remedial actions," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
    5. Yucheng Yang & Chiyuan Wang & Andreas Schaab & Benjamin Moll, 2025. "Structural Reinforcement Learning for Heterogeneous Agent Macroeconomics," Papers 2512.18892, arXiv.org.
    6. Wang, Peixiang & Xu, Qihang & Li, Yufei & Chen, Qunlong & Tao, Jinghan & Qin, Wei & Huang, Heng & Zou, Ying, 2025. "Learning-based hybrid algorithms for container relocation problem with storage plan," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 197(C).
    7. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    8. Ostheimer, Julia & Chowdhury, Soumitra & Iqbal, Sarfraz, 2021. "An alliance of humans and machines for machine learning: Hybrid intelligent systems and their design principles," Technology in Society, Elsevier, vol. 66(C).
    9. Boute, Robert N. & Gijsbrechts, Joren & van Jaarsveld, Willem & Vanvuchelen, Nathalie, 2022. "Deep reinforcement learning for inventory control: A roadmap," European Journal of Operational Research, Elsevier, vol. 298(2), pages 401-412.
    10. Rui Wang & Ming Lyu & Jie Zhang, 2025. "A Multi-Robot Collaborative Exploration Method Based on Deep Reinforcement Learning and Knowledge Distillation," Mathematics, MDPI, vol. 13(1), pages 1-17, January.
    11. Zhou, Yuhao & Wang, Yanwei, 2022. "An integrated framework based on deep learning algorithm for optimizing thermochemical production in heavy oil reservoirs," Energy, Elsevier, vol. 253(C).
    12. Hongseok Namkoong & Yuanzhe Ma & Peter W. Glynn, 2026. "Minimax Optimal Estimation of Stability Under Distribution Shift," Operations Research, INFORMS, vol. 74(1), pages 464-483, January.
    13. Mandal, Ankit & Tiwari, Yash & Panigrahi, Prasanta K. & Pal, Mayukha, 2022. "Physics aware analytics for accurate state prediction of dynamical systems," Chaos, Solitons & Fractals, Elsevier, vol. 164(C).
    14. Adnan Jafar & Alessandra Kobayati & Michael A. Tsoukas & Ahmad Haidar, 2024. "Personalized insulin dosing using reinforcement learning for high-fat meals and aerobic exercises in type 1 diabetes: a proof-of-concept trial," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    15. Lan Luo, By & Shi, Chengchun & Wang, Jitao & Wu, Zhenke & Li, Lexin, 2025. "Multivariate dynamic mediation analysis under a reinforcement learning framework," LSE Research Online Documents on Economics 127112, London School of Economics and Political Science, LSE Library.
    16. Bossert, Leonie & Hagendorff, Thilo, 2021. "Animals and AI. The role of animals in AI research and application – An overview and ethical evaluation," Technology in Society, Elsevier, vol. 67(C).
    17. Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
    18. Derrik E Asher & Anjon Basak & Rolando Fernandez & Piyush K Sharma & Erin G Zaroukian & Christopher D Hsu & Michael R Dorothy & Thomas Mahre & Gerardo Galindo & Luke Frerichs & John Rogers & John Foss, 2023. "Strategic maneuver and disruption with reinforcement learning approaches for multi-agent coordination," The Journal of Defense Modeling and Simulation, , vol. 20(4), pages 509-526, October.
    19. Artur Kwasek & Maria Kocot & Izabela Gontarek & Igor Protasowicki & Bartosz Blaszczak, 2024. "Negative Faces of Artificial Intelligence in the Conditions of the Knowledge-Based Economy," European Research Studies Journal, European Research Studies Journal, vol. 0(2), pages 465-477.
    20. Chung-Yuan Chang & Yen-Wei Feng & Tejender Singh Rawat & Shih-Wei Chen & Albert Shihchun Lin, 2025. "Optimization of laser annealing parameters based on bayesian reinforcement learning," Journal of Intelligent Manufacturing, Springer, vol. 36(4), pages 2479-2492, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0342305. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.