IDEAS home Printed from https://ideas.repec.org/a/eee/transe/v166y2022ics1366554522002678.html
   My bibliography  Save this article

Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network

Author

Listed:
  • Parvez Farazi, Nahid
  • Zou, Bo
  • Tulabandhula, Theja

Abstract

This paper proposes a deep reinforcement learning (DRL)-based approach to the dynamic on-demand crowdshipping problem in which requests constantly arrive in a crowdshipping system for pickup and delivery within limited time windows. The request pickup and delivery are performed by crowdsourcees, who are ordinary people dynamically arriving in and leaving the crowdshipping system, and dedicating their limited and heterogeneous available time and carrying capacity to crowdshipping. In return, crowdsourcees get paid by the delivery service provider who periodically assigns requests to crowdsourcees in the course of a day to minimize shipping cost. We adopt heuristics-embedded Deep Q-Network (DQN) algorithms that incorporate double and dueling structures, to train DRL agents. The idea of heuristics-embedded training is conceived by designing an elaborate action space where several refined local search heuristics are embedded to direct the specific action to take once an action type is chosen by DRL, with the purpose of preserving tractability of DRL training. To tackle the hard constraints pertaining to crowdsourcee and request time windows, we propose and integrate three new strategies (feasibility enforced local search, multiple schedules with different penalties, and exponential penalty) as part of the DRL training and testing. Extensive numerical analysis is conducted and shows that Double Dueling DQN with the exponential penalty strategy demonstrates the best performance. We compare the performance of the agent trained by Double Dueling DQN with conventional heuristic approaches, and find that the agent yields total shipping costs that are on average 24–37% lower than the conventional heuristic approaches. For problem instances that can be solved to optimality, the optimality gap using the trained agent is also quite small, in the range of 3–7%. Moreover, the trained agent is robust to stationary/non-stationary demand patterns. Lastly, our approach is further compared with a recent study that uses heuristics-embedded DQN, and shows superior performance (total shipping costs on average 19% lower) as a result of several differences.

Suggested Citation

  • Parvez Farazi, Nahid & Zou, Bo & Tulabandhula, Theja, 2022. "Dynamic On-Demand Crowdshipping Using Constrained and Heuristics-Embedded Double Dueling Deep Q-Network," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 166(C).
  • Handle: RePEc:eee:transe:v:166:y:2022:i:c:s1366554522002678
    DOI: 10.1016/j.tre.2022.102890
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1366554522002678
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.tre.2022.102890?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stefan Ropke & David Pisinger, 2006. "An Adaptive Large Neighborhood Search Heuristic for the Pickup and Delivery Problem with Time Windows," Transportation Science, INFORMS, vol. 40(4), pages 455-472, November.
    2. Margaretha Gansterer & Richard F. Hartl & Philipp E. H. Salzmann, 2018. "Exact solutions for the collaborative pickup and delivery problem," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 26(2), pages 357-371, June.
    3. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    4. Michele D. Simoni & Edoardo Marcucci & Valerio Gatta & Christian G. Claudel, 2020. "Potential last-mile impacts of crowdshipping services: a simulation-based evaluation," Transportation, Springer, vol. 47(4), pages 1933-1954, August.
    5. Quan Lu & Maged Dessouky, 2004. "An Exact Algorithm for the Multiple Vehicle Pickup and Delivery Problem," Transportation Science, INFORMS, vol. 38(4), pages 503-514, November.
    6. Le, Tho V. & Ukkusuri, Satish V. & Xue, Jiawei & Van Woensel, Tom, 2021. "Designing pricing and compensation schemes by integrating matching and routing models for crowd-shipping systems," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 149(C).
    7. Ahamed, Tanvir & Zou, Bo & Farazi, Nahid Parvez & Tulabandhula, Theja, 2021. "Deep Reinforcement Learning for Crowdsourced Urban Delivery," Transportation Research Part B: Methodological, Elsevier, vol. 152(C), pages 227-257.
    8. Kafle, Nabin & Zou, Bo & Lin, Jane, 2017. "Design and modeling of a crowdsource-enabled system for urban parcel relay and delivery," Transportation Research Part B: Methodological, Elsevier, vol. 99(C), pages 62-82.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Du, Jianhui & Zhang, Zhiqin & Wang, Xu & Lau, Hoong Chuin, 2023. "A hierarchical optimization approach for dynamic pickup and delivery problem with LIFO constraints," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 175(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Patricija Bajec & Danijela Tuljak-Suban, 2022. "A Strategic Approach for Promoting Sustainable Crowdshipping in Last-Mile Deliveries," Sustainability, MDPI, vol. 14(20), pages 1-17, October.
    2. Mancini, Simona & Gansterer, Margaretha, 2022. "Bundle generation for last-mile delivery with occasional drivers," Omega, Elsevier, vol. 108(C).
    3. Tapia, Rodrigo J. & Kourounioti, Ioanna & Thoen, Sebastian & de Bok, Michiel & Tavasszy, Lori, 2023. "A disaggregate model of passenger-freight matching in crowdshipping services," Transportation Research Part A: Policy and Practice, Elsevier, vol. 169(C).
    4. He, Shan & Dai, Ying & Ma, Zu-Jun, 2023. "To offer or not to offer? The optimal value-insured strategy for crowdsourced delivery platforms," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 173(C).
    5. Baals, Julian & Emde, Simon & Turkensteen, Marcel, 2023. "Minimizing earliness-tardiness costs in supplier networks—A just-in-time truck routing problem," European Journal of Operational Research, Elsevier, vol. 306(2), pages 707-741.
    6. Limon Barua & Bo Zou & Yan Zhou & Yulin Liu, 2023. "Modeling household online shopping demand in the U.S.: a machine learning approach and comparative investigation between 2009 and 2017," Transportation, Springer, vol. 50(2), pages 437-476, April.
    7. Soriano, Adria & Gansterer, Margaretha & Hartl, Richard F., 2023. "The multi-depot vehicle routing problem with profit fairness," International Journal of Production Economics, Elsevier, vol. 255(C).
    8. Shejun Deng & Yingying Yuan & Yong Wang & Haizhong Wang & Charles Koll, 2020. "Collaborative multicenter logistics delivery network optimization with resource sharing," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-31, November.
    9. Margaretha Gansterer & Richard F. Hartl & Sarah Wieser, 2021. "Assignment constraints in shared transportation services," Annals of Operations Research, Springer, vol. 305(1), pages 513-539, October.
    10. Zhang, Zhenzhen & Liu, Mengyang & Lim, Andrew, 2015. "A memetic algorithm for the patient transportation problem," Omega, Elsevier, vol. 54(C), pages 60-71.
    11. Yu, Vincent F. & Jodiawan, Panca & Redi, A.A.N. Perwira, 2022. "Crowd-shipping problem with time windows, transshipment nodes, and delivery options," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 157(C).
    12. Hou, Liwen & Li, Dong & Zhang, Dali, 2018. "Ride-matching and routing optimisation: Models and a large neighbourhood search heuristic," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 118(C), pages 143-162.
    13. Ghaderi, Hadi & Zhang, Lele & Tsai, Pei-Wei & Woo, Jihoon, 2022. "Crowdsourced last-mile delivery with parcel lockers," International Journal of Production Economics, Elsevier, vol. 251(C).
    14. Hao, Peng & Liu, Haishan & Liao, Yejia & Boriboonsomsin, Kanok & Barth, Matthew J, 2022. "Developing Environmentally Friendly Solutions for On-Demand Food Delivery Service," Institute of Transportation Studies, Working Paper Series qt89c461pv, Institute of Transportation Studies, UC Davis.
    15. Hua, Shijia & Zeng, Wenjia & Liu, Xinglu & Qi, Mingyao, 2022. "Optimality-guaranteed algorithms on the dynamic shared-taxi problem," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 164(C).
    16. Yan, Yimo & Chow, Andy H.F. & Ho, Chin Pang & Kuo, Yong-Hong & Wu, Qihao & Ying, Chengshuo, 2022. "Reinforcement learning for logistics and supply chain management: Methodologies, state of the art, and future opportunities," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 162(C).
    17. Stefan Ropke & Jean-François Cordeau, 2009. "Branch and Cut and Price for the Pickup and Delivery Problem with Time Windows," Transportation Science, INFORMS, vol. 43(3), pages 267-286, August.
    18. Wang, Li & Xu, Min & Qin, Hu, 2023. "Joint optimization of parcel allocation and crowd routing for crowdsourced last-mile delivery," Transportation Research Part B: Methodological, Elsevier, vol. 171(C), pages 111-135.
    19. Mahmoudi, Monirehalsadat & Chen, Junhua & Shi, Tie & Zhang, Yongxiang & Zhou, Xuesong, 2019. "A cumulative service state representation for the pickup and delivery problem with transfers," Transportation Research Part B: Methodological, Elsevier, vol. 129(C), pages 351-380.
    20. Abdulkader, M.M.S. & Gajpal, Yuvraj & ElMekkawy, Tarek Y., 2018. "Vehicle routing problem in omni-channel retailing distribution systems," International Journal of Production Economics, Elsevier, vol. 196(C), pages 43-55.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transe:v:166:y:2022:i:c:s1366554522002678. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/600244/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.