Heterogeneous attention-driven deep reinforcement learning for solving EVRPs with pickup-delivery and time windows

Heterogeneous attention-driven deep reinforcement learning for solving EVRPs with pickup-delivery and time windows

Author

Listed:

Guan, Qingshu
Cao, Hui
Niu, Tiansen
Jia, Lixin
Yan, Dapeng
Chen, Badong

Abstract

Electric vehicle routing problems (EVRPs) have attracted growing interest in the pursuit of sustainable transportation, driven by the environmental benefits and energy efficiency of electric vehicles (EVs). Nevertheless, mainstream approaches predominantly focus on minimizing travel distance rather than propulsion energy and overlook key logistical constraints such as time windows and pickup-delivery demands, which are critical in modern express operations. To address these limitations, we investigate an energy-optimal EVRP with pickup-delivery and time windows (EVRP-PDTW) and develop a high-resolution energy consumption model that integrates time-dependent driving dynamics, detailed road information, and battery charging behavior. Building on this foundation, we cast the routing task as a Markov decision process and propose a Heterogeneous Attention-driven Deep Reinforcement Learning (HA-DRL) framework. The encoder leverages a heterogeneous attention mechanism to capture role-specific interactions among depots, customers, and charging stations, while the decoder incorporates a dynamic-aware context embedding to capture state transitions and temporal feasibility. We analyze how these design choices structure the decision space and align the learned policy with the underlying energy model, thereby explaining the observed energy savings. Experiments on synthetic and real-world datasets show that HA-DRL outperforms a suite of heuristic and DRL-based methods by a clear margin, reducing average energy consumption by 51.30 kWh (a 6.43% improvement in optimality gap) over the competitive NCS approach in large-scale scenarios involving 200 customers and 40 charging stations. These achievements underscore the promise of HA-DRL in advancing energy-aware routing solutions for real-world EV logistics systems.

Suggested Citation

Guan, Qingshu & Cao, Hui & Niu, Tiansen & Jia, Lixin & Yan, Dapeng & Chen, Badong, 2026. "Heterogeneous attention-driven deep reinforcement learning for solving EVRPs with pickup-delivery and time windows," Applied Energy, Elsevier, vol. 407(C).

Handle: RePEc:eee:appene:v:407:y:2026:i:c:s0306261926000073
DOI: 10.1016/j.apenergy.2026.127355

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Zhang, Xinfang & Zhang, Zhe & Liu, Yang & Xu, Zhigang & Qu, Xiaobo, 2024. "A review of machine learning approaches for electric vehicle energy consumption modelling in urban transportation," Renewable Energy, Elsevier, vol. 234(C).
Sumitkumar, Rathor & Al-Sumaiti, Ameena Saad, 2024. "Shared autonomous electric vehicle: Towards social economy of energy and mobility from power-transportation nexus perspective," Renewable and Sustainable Energy Reviews, Elsevier, vol. 197(C).
Jiang, Yupeng & Hu, Wei & Gu, Wenjuan & Yu, Yongguang & Xu, Meng, 2025. "A multi-mode hybrid electric vehicle routing problem with time windows," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 195(C).
Vincent F. Yu & Shin-Yu Lin, 2016. "Solving the location-routing problem with simultaneous pickup and delivery by simulated annealing," International Journal of Production Research, Taylor & Francis Journals, vol. 54(2), pages 526-549, January.
Tang, Mengcheng & Zhuang, Weichao & Li, Bingbing & Liu, Haoji & Song, Ziyou & Yin, Guodong, 2023. "Energy-optimal routing for electric vehicles using deep reinforcement learning with transformer," Applied Energy, Elsevier, vol. 350(C).
Liang, Yufu & Zhao, Wanzhong & Wu, Jinwei & Xu, Kunhao & Zhou, Xiaochuan & Luan, Zhongkai & Wang, Chunyan, 2025. "Energy-efficient driving for distributed electric vehicles considering wheel loss energy: A distributed strategy based on multi-agent architecture," Applied Energy, Elsevier, vol. 384(C).
Marius M. Solomon, 1987. "Algorithms for the Vehicle Routing and Scheduling Problems with Time Window Constraints," Operations Research, INFORMS, vol. 35(2), pages 254-265, April.
Li, Yuan & Chen, Haoxun & Prins, Christian, 2016. "Adaptive large neighborhood search for the pickup and delivery problem with time windows, profits, and reserved requests," European Journal of Operational Research, Elsevier, vol. 252(1), pages 27-38.
Kerr Ding & Michael Chin & Yunlong Zhao & Wei Huang & Binh Khanh Mai & Huanan Wang & Peng Liu & Yang Yang & Yunan Luo, 2024. "Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
Lera-Romero, Gonzalo & Miranda Bront, Juan José & Soulignac, Francisco J., 2024. "A branch-cut-and-price algorithm for the time-dependent electric vehicle routing problem with time windows," European Journal of Operational Research, Elsevier, vol. 312(3), pages 978-995.
Yang, Yanru & Liu, Yu & Zhang, Yihang & Shu, Shaolong & Zheng, Junsheng, 2025. "DEST-GNN: A double-explored spatio-temporal graph neural network for multi-site intra-hour PV power forecasting," Applied Energy, Elsevier, vol. 378(PA).
Chen, Xiao & Wang, Hao & Zheng, Zilong & Lu, Fei, 2025. "Electro-thermal analysis of inductively coupled power transfer in pavement for electric vehicle charging," Applied Energy, Elsevier, vol. 378(PA).
Felipe, Ángel & Ortuño, M. Teresa & Righini, Giovanni & Tirado, Gregorio, 2014. "A heuristic approach for the green vehicle routing problem with multiple technologies and partial recharges," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 71(C), pages 111-128.
Ho, G.T.S. & Tang, Yuk Ming & Leung, Eric K.H. & Tong, P.H., 2025. "Integrated reinforcement learning of automated guided vehicles dynamic path planning for smart logistics and operations," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 196(C).
Li, Yujing & Zhang, Zhisheng & Xing, Qiang, 2025. "Real-time online charging control of electric vehicle charging station based on a multi-agent deep reinforcement learning," Energy, Elsevier, vol. 319(C).
Xu, Liangcai & Gu, Xubo & Song, Ziyou, 2025. "Optimal charging for large-scale heterogeneous electric vehicles: A novel paradigm based on learning and backward clustering," Applied Energy, Elsevier, vol. 382(C).

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Wang, Yong & Wei, Zikai & Luo, Siyu & Zhou, Jingxin & Zhen, Lu, 2024. "Collaboration and resource sharing in the multidepot time-dependent vehicle routing problem with time windows," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 192(C).
Raeesi, Ramin & Zografos, Konstantinos G., 2020. "The electric vehicle routing problem with time windows and synchronised mobile battery swapping," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 101-129.
Cortés-Murcia, David L. & Prodhon, Caroline & Murat Afsar, H., 2019. "The electric vehicle routing problem with time windows, partial recharges and satellite customers," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 130(C), pages 184-206.
Guo, Jian & Hu, Zhaolin & Tian, Bin & Wei, Jinxiang, 2025. "Modeling and optimizing routing problems with customer satisfaction under stochastic travel times," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 204(C).
Raeesi, Ramin & Zografos, Konstantinos G., 2022. "Coordinated routing of electric commercial vehicles with intra-route recharging and en-route battery swapping," European Journal of Operational Research, Elsevier, vol. 301(1), pages 82-109.
Frey, Christian M.M. & Jungwirth, Alexander & Frey, Markus & Kolisch, Rainer, 2023. "The vehicle routing problem with time windows and flexible delivery locations," European Journal of Operational Research, Elsevier, vol. 308(3), pages 1142-1159.
Goeke, Dominik & Schneider, Michael, 2015. "Routing a mixed fleet of electric and conventional vehicles," European Journal of Operational Research, Elsevier, vol. 245(1), pages 81-99.
Sadati, Mir Ehsan Hesam & Çatay, Bülent, 2021. "A hybrid variable neighborhood search approach for the multi-depot green vehicle routing problem," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 149(C).
Liu, Yiming & Roberto, Baldacci & Zhou, Jianwen & Yu, Yang & Zhang, Yu & Sun, Wei, 2023. "Efficient feasibility checks and an adaptive large neighborhood search algorithm for the time-dependent green vehicle routing problem with time windows," European Journal of Operational Research, Elsevier, vol. 310(1), pages 133-155.
Jose Carlos Molina & Ignacio Eguia & Jesus Racero, 2019. "Reducing pollutant emissions in a waste collection vehicle routing problem using a variable neighborhood tabu search algorithm: a case study," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 253-287, July.
Schiffer, Maximilian & Walther, Grit, 2017. "The electric location routing problem with time windows and partial recharging," European Journal of Operational Research, Elsevier, vol. 260(3), pages 995-1013.
Goeke, D. & Schneider, M., 2015. "Routing a Mixed Fleet of Electric and Conventional Vehicles," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 65939, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
Schiffer, Maximilian & Walther, Grit, 2018. "Strategic planning of electric logistics fleet networks: A robust location-routing approach," Omega, Elsevier, vol. 80(C), pages 31-42.
Gaute Messel Nafstad & Guy Desaulniers & Magnus Stålhane, 2025. "Branch-Price-and-Cut for the Electric Vehicle Routing Problem with Heterogeneous Recharging Technologies and Nonlinear Recharging Functions," Transportation Science, INFORMS, vol. 59(3), pages 628-646, June.
Vidal, Thibaut & Laporte, Gilbert & Matl, Piotr, 2020. "A concise guide to existing and emerging vehicle routing problem variants," European Journal of Operational Research, Elsevier, vol. 286(2), pages 401-416.
Santos, Maria João & Curcio, Eduardo & Mulati, Mauro Henrique & Amorim, Pedro & Miyazawa, Flávio Keidi, 2020. "A robust optimization approach for the vehicle routing problem with selective backhauls," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 136(C).
Macrina, Giusy & Laporte, Gilbert & Guerriero, Francesca & Di Puglia Pugliese, Luigi, 2019. "An energy-efficient green-vehicle routing problem with mixed vehicle fleet, partial battery recharging and time windows," European Journal of Operational Research, Elsevier, vol. 276(3), pages 971-982.
Maximilian Schiffer & Michael Schneider & Grit Walther & Gilbert Laporte, 2019. "Vehicle Routing and Location Routing with Intermediate Stops: A Review," Transportation Science, INFORMS, vol. 53(2), pages 319-343, March.
Meyer, Anne & Gschwind, Timo & Amberg, Boris & Colling, Dominik, 2025. "Exact algorithms for routing electric autonomous mobile robots in intralogistics," European Journal of Operational Research, Elsevier, vol. 323(3), pages 830-851.
Dönmez, Sercan & Koç, Çağrı & Altıparmak, Fulya, 2022. "The mixed fleet vehicle routing problem with partial recharging by multiple chargers: Mathematical model and adaptive large neighborhood search," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 167(C).

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:407:y:2026:i:c:s0306261926000073. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Heterogeneous attention-driven deep reinforcement learning for solving EVRPs with pickup-delivery and time windows

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data