IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v15y2023i24p16741-d1298114.html
   My bibliography  Save this article

Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning

Author

Listed:
  • Luona Song

    (School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China)

  • Zhigang Wen

    (School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China)

  • Junjie Teng

    (School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China)

  • Jian Zhang

    (School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China)

  • Merveille Nicolas

    (Department of Strategy and Social and Environmental Responsibility, Université du Québec à Montréal, Montréal, QC H3C 3P8, Canada)

Abstract

In the rapidly evolving domain of the Internet of Things (IoT), devices play an instrumental role in high-stakes scenarios like search and rescue (SAR) operations. Traditional decision-making processes within SAR missions often struggle to cope with the dynamic and unpredictable nature of such environments, leading to inefficiencies and delayed responses. This paper aims to explore the potential of multi-agent reinforcement learning (MARL) to improve the decision-making process within SAR operations underpinned by IoT. Functional, current methods are limited by their static decision frameworks and inability to adapt in real time to the chaotic variables present in SAR situations. We introduced a novel MARL framework and compared its performance against benchmark strategies, specifically the multi-agent deep deterministic policy gradient (MADDPG) approach. Uniquely enhanced by online knowledge transfer, the framework leverages the capabilities of the deep deterministic policy gradient (DDPG) method. The preliminary findings underscore the proposed framework’s superior efficiency and speed in SAR contexts. Our research highlights MARL’s transformative potential, positing it as a groundbreaking strategy for IoT-based decision making in high-pressure SAR environments with suggestions for further studies in varied real-world scenarios.

Suggested Citation

  • Luona Song & Zhigang Wen & Junjie Teng & Jian Zhang & Merveille Nicolas, 2023. "Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning," Sustainability, MDPI, vol. 15(24), pages 1-18, December.
  • Handle: RePEc:gam:jsusta:v:15:y:2023:i:24:p:16741-:d:1298114
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/15/24/16741/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/15/24/16741/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:15:y:2023:i:24:p:16741-:d:1298114. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.