Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network

My bibliography Save this article

Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network

Author

Listed:

Parvez Farazi, Nahid
Zou, Bo
Tulabandhula, Theja

Registered:

Abstract

This paper proposes a deep reinforcement learning (DRL)-based approach to the dynamic on-demand crowdshipping problem in which requests constantly arrive in a crowdshipping system for pickup and delivery within limited time windows. The request pickup and delivery are performed by crowdsourcees, who are ordinary people dynamically arriving in and leaving the crowdshipping system, and dedicating their limited and heterogeneous available time and carrying capacity to crowdshipping. In return, crowdsourcees get paid by the delivery service provider who periodically assigns requests to crowdsourcees in the course of a day to minimize shipping cost. We adopt heuristics-embedded Deep Q-Network (DQN) algorithms that incorporate double and dueling structures, to train DRL agents. The idea of heuristics-embedded training is conceived by designing an elaborate action space where several refined local search heuristics are embedded to direct the specific action to take once an action type is chosen by DRL, with the purpose of preserving tractability of DRL training. To tackle the hard constraints pertaining to crowdsourcee and request time windows, we propose and integrate three new strategies (feasibility enforced local search, multiple schedules with different penalties, and exponential penalty) as part of the DRL training and testing. Extensive numerical analysis is conducted and shows that Double Dueling DQN with the exponential penalty strategy demonstrates the best performance. We compare the performance of the agent trained by Double Dueling DQN with conventional heuristic approaches, and find that the agent yields total shipping costs that are on average 24–37% lower than the conventional heuristic approaches. For problem instances that can be solved to optimality, the optimality gap using the trained agent is also quite small, in the range of 3–7%. Moreover, the trained agent is robust to stationary/non-stationary demand patterns. Lastly, our approach is further compared with a recent study that uses heuristics-embedded DQN, and shows superior performance (total shipping costs on average 19% lower) as a result of several differences.

Suggested Citation

Parvez Farazi, Nahid & Zou, Bo & Tulabandhula, Theja, 2022. "Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 166(C).

Handle: RePEc:eee:transe:v:166:y:2022:i:c:s1366554522002678
DOI: 10.1016/j.tre.2022.102890

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Margaretha Gansterer & Richard F. Hartl & Philipp E. H. Salzmann, 2018. "Exact solutions for the collaborative pickup and delivery problem," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 26(2), pages 357-371, June.
Michele D. Simoni & Edoardo Marcucci & Valerio Gatta & Christian G. Claudel, 2020. "Potential last-mile impacts of crowdshipping services: a simulation-based evaluation," Transportation, Springer, vol. 47(4), pages 1933-1954, August.
Quan Lu & Maged Dessouky, 2004. "An Exact Algorithm for the Multiple Vehicle Pickup and Delivery Problem," Transportation Science, INFORMS, vol. 38(4), pages 503-514, November.
Le, Tho V. & Ukkusuri, Satish V. & Xue, Jiawei & Van Woensel, Tom, 2021. "Designing pricing and compensation schemes by integrating matching and routing models for crowd-shipping systems," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 149(C).
Ahamed, Tanvir & Zou, Bo & Farazi, Nahid Parvez & Tulabandhula, Theja, 2021. "Deep Reinforcement Learning for Crowdsourced Urban Delivery," Transportation Research Part B: Methodological, Elsevier, vol. 152(C), pages 227-257.
Stefan Ropke & David Pisinger, 2006. "An Adaptive Large Neighborhood Search Heuristic for the Pickup and Delivery Problem with Time Windows," Transportation Science, INFORMS, vol. 40(4), pages 455-472, November.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Kafle, Nabin & Zou, Bo & Lin, Jane, 2017. "Design and modeling of a crowdsource-enabled system for urban parcel relay and delivery," Transportation Research Part B: Methodological, Elsevier, vol. 99(C), pages 62-82.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Du, Jianhui & Zhang, Zhiqin & Wang, Xu & Lau, Hoong Chuin, 2023. "A hierarchical optimization approach for dynamic pickup and delivery problem with LIFO constraints," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 175(C).
Perez, Daniel & Zou, Bo & Farazi, Nahid Parvez, 2025. "Package delivery by electric vertical takeoff and landing aircraft? An attractiveness assessment," Journal of Air Transport Management, Elsevier, vol. 124(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Patricija Bajec & Danijela Tuljak-Suban, 2022. "A Strategic Approach for Promoting Sustainable Crowdshipping in Last-Mile Deliveries," Sustainability, MDPI, vol. 14(20), pages 1-17, October.
Rossolov, Oleksandr & Susilo, Yusak O., 2024. "Are consumers ready to pay extra for crowd-shipping e-groceries and why? A hybrid choice analysis for developing economies," Transportation Research Part A: Policy and Practice, Elsevier, vol. 187(C).
Yang, Dingtong & Hyland, Michael F. & Jayakrishnan, R., 2024. "Tackling the crowdsourced shared-trip delivery problem at scale with a novel decomposition heuristic," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 188(C).
Martin Savelsbergh & Marlin W. Ulmer, 2024. "Challenges and opportunities in crowdsourced delivery planning and operations—an update," Annals of Operations Research, Springer, vol. 343(2), pages 639-661, December.
Mancini, Simona & Gansterer, Margaretha, 2022. "Bundle generation for last-mile delivery with occasional drivers," Omega, Elsevier, vol. 108(C).
Tapia, Rodrigo J. & Kourounioti, Ioanna & Thoen, Sebastian & de Bok, Michiel & Tavasszy, Lori, 2023. "A disaggregate model of passenger-freight matching in crowdshipping services," Transportation Research Part A: Policy and Practice, Elsevier, vol. 169(C).
He, Shan & Dai, Ying & Ma, Zu-Jun, 2023. "To offer or not to offer? The optimal value-insured strategy for crowdsourced delivery platforms," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 173(C).
Baals, Julian & Emde, Simon & Turkensteen, Marcel, 2023. "Minimizing earliness-tardiness costs in supplier networks—A just-in-time truck routing problem," European Journal of Operational Research, Elsevier, vol. 306(2), pages 707-741.
Simona Mancini & Margaretha Gansterer, 2024. "Bundle generation for the vehicle routing problem with occasional drivers and time windows," Flexible Services and Manufacturing Journal, Springer, vol. 36(4), pages 1189-1221, December.
Limon Barua & Bo Zou & Yan Zhou & Yulin Liu, 2023. "Modeling household online shopping demand in the U.S.: a machine learning approach and comparative investigation between 2009 and 2017," Transportation, Springer, vol. 50(2), pages 437-476, April.
Soriano, Adria & Gansterer, Margaretha & Hartl, Richard F., 2023. "The multi-depot vehicle routing problem with profit fairness," International Journal of Production Economics, Elsevier, vol. 255(C).
Wang, Xiuwen & Zhen, Lu & Wang, Shuaian, 2024. "Optimizing an express delivery mode based on high-speed railway and crowd-couriers," Transport Policy, Elsevier, vol. 159(C), pages 157-177.
Shejun Deng & Yingying Yuan & Yong Wang & Haizhong Wang & Charles Koll, 2020. "Collaborative multicenter logistics delivery network optimization with resource sharing," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-31, November.
Margaretha Gansterer & Richard F. Hartl & Sarah Wieser, 2021. "Assignment constraints in shared transportation services," Annals of Operations Research, Springer, vol. 305(1), pages 513-539, October.
Zhang, Zhenzhen & Liu, Mengyang & Lim, Andrew, 2015. "A memetic algorithm for the patient transportation problem," Omega, Elsevier, vol. 54(C), pages 60-71.
Yu, Vincent F. & Jodiawan, Panca & Redi, A.A.N. Perwira, 2022. "Crowd-shipping problem with time windows, transshipment nodes, and delivery options," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 157(C).
Hou, Liwen & Li, Dong & Zhang, Dali, 2018. "Ride-matching and routing optimisation: Models and a large neighbourhood search heuristic," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 118(C), pages 143-162.
Stokkink, Patrick & Cordeau, Jean-François & Geroliminis, Nikolas, 2024. "A column and row generation approach to the crowd-shipping problem with transfers," Omega, Elsevier, vol. 128(C).
Ghaderi, Hadi & Zhang, Lele & Tsai, Pei-Wei & Woo, Jihoon, 2022. "Crowdsourced last-mile delivery with parcel lockers," International Journal of Production Economics, Elsevier, vol. 251(C).
Hao, Peng & Liu, Haishan & Liao, Yejia & Boriboonsomsin, Kanok & Barth, Matthew J, 2022. "Developing Environmentally Friendly Solutions for On-Demand Food Delivery Service," Institute of Transportation Studies, Working Paper Series qt89c461pv, Institute of Transportation Studies, UC Davis.

More about this item

Keywords

Dynamic on-demand crowdshipping; Deep reinforcement learning; Double Dueling DQN; Action space design; Local search embedding; Hard constraint handling strategies;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transe:v:166:y:2022:i:c:s1366554522002678. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/600244/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data