Author
Listed:
- Zhao, Haoyang
- Huang, Lianzhong
- Ma, Ranqi
- Wang, Kai
- Cao, Jianlin
- Li, Daize
- Wang, Cong
- Zhang, Rui
- Lan, Tian
- Wang, Tiancheng
Abstract
Improving the energy efficiency of oceangoing vessels and reducing fuel consumption represent a significant engineering challenge. For hybrid-powered ships, the energy efficiency optimization involves complex decisions and multiple competing objectives, where conventional reinforcement learning algorithms often suffer from performance imbalance. To address this, this study proposes a Multi-objective Hierarchical Reinforcement Learning (MHRL) framework for sail-diesel hybrid vessels, aiming to achieve coordinated optimization of ship speed and sail control under dynamic sea conditions. A data-driven approach is first adopted to develop a deep neural network model for predicting fuel consumption and speed, establishing a baseline for optimization. The MHRL framework employs a hierarchical architecture to decouple objectives and the action space into two sub-tasks: a high-level agent optimizes main engine speed to balance voyage schedule and energy consumption, while a low-level agent adjusts sail angle and deployment to maximize wind energy utilization. Both agents utilize the Proximal Policy Optimization (PPO) algorithm, and the modular low-level agent design allows for extensibility. Validation was conducted on a sail-equipped Very Large Crude Carrier (VLCC) across three representative routes covering various time scales. Results demonstrate that the MHRL framework enables wind sail-assisted ships to achieve a fuel saving rate of 3.66%-5.02%, with the equivalent energy saving from sails increasing with voyage length. Compared to the baseline PPO algorithm, MHRL improves the optimization success rate by 55.55% and demonstrates significant enhancements in energy-saving ratio and convergence speed. Moreover, the framework exhibits strong robustness, effectively solving the multi-objective optimization problem for sail-assisted ships.
Suggested Citation
Zhao, Haoyang & Huang, Lianzhong & Ma, Ranqi & Wang, Kai & Cao, Jianlin & Li, Daize & Wang, Cong & Zhang, Rui & Lan, Tian & Wang, Tiancheng, 2026.
"A multi-objective hierarchical reinforcement learning framework for energy consumption optimization of wing sail-assisted ships,"
Energy, Elsevier, vol. 358(C).
Handle:
RePEc:eee:energy:v:358:y:2026:i:c:s0360544226014969
DOI: 10.1016/j.energy.2026.141390
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:358:y:2026:i:c:s0360544226014969. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.