IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v16y2024i10p4180-d1395855.html
   My bibliography  Save this article

An Improved Q-Learning Algorithm for Optimizing Sustainable Remanufacturing Systems

Author

Listed:
  • Shujin Qin

    (College of Economics and Management, Shangqiu Normal University, Shangqiu 476000, China)

  • Xiaofei Zhang

    (College of Information and Control Engineering, Liaoning Petrochemical University, Fushun 113001, China)

  • Jiacun Wang

    (Department of Computer Science and Software Engineering, Monmouth University, West Long Branch, NJ 07764, USA)

  • Xiwang Guo

    (College of Information and Control Engineering, Liaoning Petrochemical University, Fushun 113001, China)

  • Liang Qi

    (Department of Computer Science and Technology, Shandong University of Science and Technology, Qingdao 266590, China)

  • Jinrui Cao

    (Computer Science Department, New Jersey City University, Jersey City, NJ 07102, USA)

  • Yizhi Liu

    (College of Information and Control Engineering, Liaoning Petrochemical University, Fushun 113001, China)

Abstract

In our modern society, there has been a noticeable increase in pollution due to the trend of post-use handling of items. This necessitates the adoption of recycling and remanufacturing processes, advocating for sustainable resource management. This paper aims to address the issue of disassembly line balancing. Existing disassembly methods largely rely on manual labor, raising concerns regarding safety and sustainability. This paper proposes a human–machine collaborative disassembly approach to enhance safety and optimize resource utilization, aligning with sustainable development goals. A mixed-integer programming model is established, considering various disassembly techniques for hazardous and delicate parts, with the objective of minimizing the total disassembly time. The CPLEX solver is employed to enhance model accuracy. An improvement is made to the Q-learning algorithm in reinforcement learning to tackle the bilateral disassembly line balancing problem in human–machine collaboration. This approach outperforms CPLEX in both solution efficiency and quality, especially for large-scale problems. A comparative analysis with the original Q-learning algorithm and SARSA algorithm validates the superiority of the proposed algorithm in terms of convergence speed and solution quality.

Suggested Citation

  • Shujin Qin & Xiaofei Zhang & Jiacun Wang & Xiwang Guo & Liang Qi & Jinrui Cao & Yizhi Liu, 2024. "An Improved Q-Learning Algorithm for Optimizing Sustainable Remanufacturing Systems," Sustainability, MDPI, vol. 16(10), pages 1-18, May.
  • Handle: RePEc:gam:jsusta:v:16:y:2024:i:10:p:4180-:d:1395855
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/16/10/4180/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/16/10/4180/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:16:y:2024:i:10:p:4180-:d:1395855. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.