A Graph Pointer Network-Based Multi-Objective Deep Reinforcement Learning Algorithm for Solving the Traveling Salesman Problem

My bibliography Save this article

A Graph Pointer Network-Based Multi-Objective Deep Reinforcement Learning Algorithm for Solving the Traveling Salesman Problem

Author

Listed:

Jeewaka Perera
(Department of Computer Science, California State University, Fresno, Fresno, CA 93740, USA
Faculty of Computing, Sri Lanka Institute of Information Technology, Malabe 10115, Sri Lanka)
Shih-Hsi Liu
(Department of Computer Science, California State University, Fresno, Fresno, CA 93740, USA)
Marjan Mernik
(Faculty of Electrical Engineering and Computer Science, University of Maribor, Koroška cesta 46, 2000 Maribor, Slovenia)
Matej Črepinšek
(Faculty of Electrical Engineering and Computer Science, University of Maribor, Koroška cesta 46, 2000 Maribor, Slovenia)
Miha Ravber
(Faculty of Electrical Engineering and Computer Science, University of Maribor, Koroška cesta 46, 2000 Maribor, Slovenia)

Registered:

Abstract

Traveling Salesman Problems (TSPs) have been a long-lasting interesting challenge to researchers in different areas. The difficulty of such problems scales up further when multiple objectives are considered concurrently. Plenty of work in evolutionary algorithms has been introduced to solve multi-objective TSPs with promising results, and the work in deep learning and reinforcement learning has been surging. This paper introduces a multi-objective deep graph pointer network-based reinforcement learning (MODGRL) algorithm for multi-objective TSPs. The MODGRL improves an earlier multi-objective deep reinforcement learning algorithm, called DRL-MOA, by utilizing a graph pointer network to learn the graphical structures of TSPs. Such improvements allow MODGRL to be trained on a small-scale TSP, but can find optimal solutions for large scale TSPs. NSGA-II, MOEA/D and SPEA2 are selected to compare with MODGRL and DRL-MOA. Hypervolume, spread and coverage over Pareto front (CPF) quality indicators were selected to assess the algorithms’ performance. In terms of the hypervolume indicator that represents the convergence and diversity of Pareto-frontiers, MODGRL outperformed all the competitors on the three well-known benchmark problems. Such findings proved that MODGRL, with the improved graph pointer network, indeed performed better, measured by the hypervolume indicator, than DRL-MOA and the three other evolutionary algorithms. MODGRL and DRL-MOA were comparable in the leading group, measured by the spread indicator. Although MODGRL performed better than DRL-MOA, both of them were just average regarding the evenness and diversity measured by the CPF indicator. Such findings remind that different performance indicators measure Pareto-frontiers from different perspectives. Choosing a well-accepted and suitable performance indicator to one’s experimental design is very critical, and may affect the conclusions. Three evolutionary algorithms were also experimented on with extra iterations, to validate whether extra iterations affected the performance. The results show that NSGA-II and SPEA2 were greatly improved measured by the Spread and CPF indicators. Such findings raise fairness concerns on algorithm comparisons using different fixed stopping criteria for different algorithms, which appeared in the DRL-MOA work and many others. Through these lessons, we concluded that MODGRL indeed performed better than DRL-MOA in terms of hypervolumne, and we also urge researchers on fair experimental designs and comparisons, in order to derive scientifically sound conclusions.

Suggested Citation

Jeewaka Perera & Shih-Hsi Liu & Marjan Mernik & Matej Črepinšek & Miha Ravber, 2023. "A Graph Pointer Network-Based Multi-Objective Deep Reinforcement Learning Algorithm for Solving the Traveling Salesman Problem," Mathematics, MDPI, vol. 11(2), pages 1-21, January.

Handle: RePEc:gam:jmathe:v:11:y:2023:i:2:p:437-:d:1035310

Download full text from publisher

References listed on IDEAS

Audet, Charles & Bigeon, Jean & Cartier, Dominique & Le Digabel, Sébastien & Salomon, Ludovic, 2021. "Performance indicators in multiobjective optimization," European Journal of Operational Research, Elsevier, vol. 292(2), pages 397-422.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Yeh, Wei-Chang, 2024. "Time-reliability optimization for the stochastic traveling salesman problem," Reliability Engineering and System Safety, Elsevier, vol. 248(C).
Nour Elhouda Chalabi & Abdelouahab Attia & Abderraouf Bouziane & Mahmoud Hassaballah & Abed Alanazi & Adel Binbusayyis, 2023. "An Archive-Guided Equilibrium Optimizer Based on Epsilon Dominance for Multi-Objective Optimization Problems," Mathematics, MDPI, vol. 11(12), pages 1-30, June.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

de Freitas, Juliana Campos & Cantane, Daniela Renata & Rocha, Humberto & Dias, Joana, 2024. "A multiobjective beam angle optimization framework for intensity-modulated radiation therapy," European Journal of Operational Research, Elsevier, vol. 318(1), pages 286-296.
Zandieh, Fatemeh & Ghannadpour, Seyed Farid, 2023. "A comprehensive risk assessment view on interval type-2 fuzzy controller for a time-dependent HazMat routing problem," European Journal of Operational Research, Elsevier, vol. 305(2), pages 685-707.
Dalila B. M. M. Fontes & S. Mahdi Homayouni, 2023. "A bi-objective multi-population biased random key genetic algorithm for joint scheduling quay cranes and speed adjustable vehicles in container terminals," Flexible Services and Manufacturing Journal, Springer, vol. 35(1), pages 241-268, March.
Mesquita-Cunha, Mariana & Figueira, José Rui & Barbosa-Póvoa, Ana Paula, 2023. "New ϵ−constraint methods for multi-objective integer linear programming: A Pareto front representation approach," European Journal of Operational Research, Elsevier, vol. 306(1), pages 286-307.
Gaggero, Mauro & Paolucci, Massimo & Ronco, Roberto, 2023. "Exact and heuristic solution approaches for energy-efficient identical parallel machine scheduling with time-of-use costs," European Journal of Operational Research, Elsevier, vol. 311(3), pages 845-866.
Ducardo L. Molina & Juan Ricardo Vidal Medina & Alexis Sagastume Gutiérrez & Juan J. Cabello Eras & Jesús A. Lopez & Simón Hincapie & Enrique C. Quispe, 2023. "Multiobjective Optimization of the Energy Efficiency and the Steam Flow in a Bagasse Boiler," Sustainability, MDPI, vol. 15(14), pages 1-17, July.
Hamed Khosravi & Taofeeq Olajire & Ahmed Shoyeb Raihan & Imtiaz Ahmed, 2024. "A data driven sequential learning framework to accelerate and optimize multi-objective manufacturing decisions," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 4087-4112, December.
Jean Bigeon & Sébastien Le Digabel & Ludovic Salomon, 2021. "DMulti-MADS: mesh adaptive direct multisearch for bound-constrained blackbox multiobjective optimization," Computational Optimization and Applications, Springer, vol. 79(2), pages 301-338, June.
Francisco Jonatas Siqueira Coelho & Allan Rivalles Souza Feitosa & André Luís Michels Alcântara & Kaifeng Li & Ronaldo Ferreira Lima & Victor Rios Silva & Abel Guilhermino da Silva-Filho, 2023. "HyMOTree: Automatic Hyperparameters Tuning for Non-Technical Loss Detection Based on Multi-Objective and Tree-Based Algorithms," Energies, MDPI, vol. 16(13), pages 1-22, June.
Gholamreza Shojatalab & Seyed Hadi Nasseri & Iraj Mahdavi, 2023. "New multi-objective optimization model for tourism systems with fuzzy data and new approach developed epsilon constraint method," OPSEARCH, Springer;Operational Research Society of India, vol. 60(3), pages 1360-1385, September.
Raka Jovanovic & Antonio P. Sanfilippo & Stefan Voß, 2022. "Fixed set search applied to the multi-objective minimum weighted vertex cover problem," Journal of Heuristics, Springer, vol. 28(4), pages 481-508, August.
Gonzalo Sánchez-Contreras & Adrián Fernández-Rodríguez & Antonio Fernández-Cardador & Asunción P. Cucala, 2023. "A Two-Level Fuzzy Multi-Objective Design of ATO Driving Commands for Energy-Efficient Operation of Metropolitan Railway Lines," Sustainability, MDPI, vol. 15(12), pages 1-24, June.
Duro, João A. & Ozturk, Umud Esat & Oara, Daniel C. & Salomon, Shaul & Lygoe, Robert J. & Burke, Richard & Purshouse, Robin C., 2023. "Methods for constrained optimization of expensive mixed-integer multi-objective problems, with application to an internal combustion engine design problem," European Journal of Operational Research, Elsevier, vol. 307(1), pages 421-446.
Charles Audet & Frédéric Messine & Jordan Ninin, 2022. "Numerical certification of Pareto optimality for biobjective nonlinear problems," Journal of Global Optimization, Springer, vol. 83(4), pages 891-908, August.
Benjamin G. Thengvall & Shane N. Hall & Michael P. Deskevich, 2025. "Measuring the effectiveness and efficiency of simulation optimization metaheuristic algorithms," Journal of Heuristics, Springer, vol. 31(1), pages 1-21, March.
Abdulaziz Almalaq & Tawfik Guesmi & Saleh Albadran, 2023. "A Hybrid Chaotic-Based Multiobjective Differential Evolution Technique for Economic Emission Dispatch Problem," Energies, MDPI, vol. 16(12), pages 1-34, June.
Maleknia, Morteza & Soleimani-damaneh, Majid, 2024. "An effective subgradient algorithm via Mifflin’s line search for nonsmooth nonconvex multiobjective optimization," European Journal of Operational Research, Elsevier, vol. 319(2), pages 505-516.

More about this item

Keywords

multi-objective optimization; traveling salesman problems; deep reinforcement learning;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:2:p:437-:d:1035310. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

A Graph Pointer Network-Based Multi-Objective Deep Reinforcement Learning Algorithm for Solving the Traveling Salesman Problem

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data