IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v310y2023i3p1218-1233.html
   My bibliography  Save this article

An online reinforcement learning approach to charging and order-dispatching optimization for an e-hailing electric vehicle fleet

Author

Listed:
  • Yan, Pengyu
  • Yu, Kaize
  • Chao, Xiuli
  • Chen, Zhibin

Abstract

Given the uncertainty of orders and the dynamically changing workload of charging stations, how to dispatch and charge electric vehicle (EV) fleets becomes a significant challenge facing e-hailing platforms. The common practice is to dispatch EVs to serve orders by heuristic matching methods but enable EV drivers to independently make charging decisions based on their experiences, which may compromise the platform’s performance. This study proposes a Markov decision process to jointly optimize the charging and order-dispatching schemes for an e-hailing EV fleet, which provides pick-up services for passengers only from a designated transportation hub (i.e., no pick-up from different locations). The objective is to maximize the total revenue of the fleet throughout a finite horizon. The complete state transition equations of the EV fleet are formulated to track the state-of-charge of their batteries. To learn the charging and order-dispatching policy in a dynamic stochastic environment, an online approximation algorithm is developed, which integrates the model-based reinforcement learning (RL) framework with a novel SARSA(Δ)-sample average approximation (SAA) architecture. Compared with the model-free RL algorithm and approximation dynamic programming (ADP), our algorithm explores high-quality decisions by an SAA model with empirical state transitions and exploits the best decisions so far by an SARSA(Δ) sample-trajectory updating. Computational results based on a real case show that, compared with the existing heuristic method and the ADP in the literature, the proposed approach increases the daily revenue by an average of 31.76% and 14.22%, respectively.

Suggested Citation

  • Yan, Pengyu & Yu, Kaize & Chao, Xiuli & Chen, Zhibin, 2023. "An online reinforcement learning approach to charging and order-dispatching optimization for an e-hailing electric vehicle fleet," European Journal of Operational Research, Elsevier, vol. 310(3), pages 1218-1233.
  • Handle: RePEc:eee:ejores:v:310:y:2023:i:3:p:1218-1233
    DOI: 10.1016/j.ejor.2023.03.039
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221723002667
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2023.03.039?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Anil Yazici, M. & Kamga, Camille & Singhal, Abhishek, 2016. "Modeling taxi drivers’ decisions for improving airport ground access: John F. Kennedy airport case," Transportation Research Part A: Policy and Practice, Elsevier, vol. 91(C), pages 48-60.
    2. Johannes Royset, 2013. "On sample size control in sample average approximations for solving smooth stochastic programs," Computational Optimization and Applications, Springer, vol. 55(2), pages 265-309, June.
    3. Xingjian Liu, 2020. "Assessing airport ground access by public transport in Chinese cities," Urban Studies, Urban Studies Journal Limited, vol. 57(2), pages 267-285, February.
    4. Hung, Ying-Chao & PakHai Lok, Horace & Michailidis, George, 2022. "Optimal routing for electric vehicle charging systems with stochastic demand: A heavy traffic approximation approach," European Journal of Operational Research, Elsevier, vol. 299(2), pages 526-541.
    5. Hua, Yikang & Zhao, Dongfang & Wang, Xin & Li, Xiaopeng, 2019. "Joint infrastructure planning and fleet management for one-way electric car sharing under time-varying uncertain demand," Transportation Research Part B: Methodological, Elsevier, vol. 128(C), pages 185-206.
    6. Ons Sassi & Ammar Oulamara, 2017. "Electric vehicle scheduling and optimal charging problem: complexity, exact and heuristic approaches," International Journal of Production Research, Taylor & Francis Journals, vol. 55(2), pages 519-535, January.
    7. Al-Kanj, Lina & Nascimento, Juliana & Powell, Warren B., 2020. "Approximate dynamic programming for planning a ride-hailing system using autonomous fleets of electric vehicles," European Journal of Operational Research, Elsevier, vol. 284(3), pages 1088-1106.
    8. Bongiovanni, Claudia & Kaspi, Mor & Geroliminis, Nikolas, 2019. "The electric autonomous dial-a-ride problem," Transportation Research Part B: Methodological, Elsevier, vol. 122(C), pages 436-456.
    9. Gregory A. Godfrey & Warren B. Powell, 2002. "An Adaptive Dynamic Programming Algorithm for Dynamic Fleet Management, II: Multiperiod Travel Times," Transportation Science, INFORMS, vol. 36(1), pages 40-54, February.
    10. Quetschlich, Mathias & Moetz, André & Otto, Boris, 2021. "Optimisation model for multi-item multi-echelon supply chains with nested multi-level products," European Journal of Operational Research, Elsevier, vol. 290(1), pages 144-158.
    11. Boyacı, Burak & Zografos, Konstantinos G. & Geroliminis, Nikolas, 2015. "An optimization framework for the development of efficient one-way car-sharing systems," European Journal of Operational Research, Elsevier, vol. 240(3), pages 718-733.
    12. Emelogu, Adindu & Chowdhury, Sudipta & Marufuzzaman, Mohammad & Bian, Linkan & Eksioglu, Burak, 2016. "An enhanced sample average approximation method for stochastic optimization," International Journal of Production Economics, Elsevier, vol. 182(C), pages 230-252.
    13. Nolz, Pamela C. & Absi, Nabil & Feillet, Dominique & Seragiotto, Clóvis, 2022. "The consistent electric-Vehicle routing problem with backhauls and charging management," European Journal of Operational Research, Elsevier, vol. 302(2), pages 700-716.
    14. Long He & Guangrui Ma & Wei Qi & Xin Wang, 2021. "Charging an Electric Vehicle-Sharing Fleet," Manufacturing & Service Operations Management, INFORMS, vol. 23(2), pages 471-487, March.
    15. Gregory A. Godfrey & Warren B. Powell, 2002. "An Adaptive Dynamic Programming Algorithm for Dynamic Fleet Management, I: Single Period Travel Times," Transportation Science, INFORMS, vol. 36(1), pages 21-39, February.
    16. Crainic, Teodor Gabriel, 2000. "Service network design in freight transportation," European Journal of Operational Research, Elsevier, vol. 122(2), pages 272-288, April.
    17. Mourgaya, M. & Vanderbeck, F., 2007. "Column generation based heuristic for tactical planning in multi-period vehicle routing," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1028-1041, December.
    18. Zhang, Dong & Liu, Yang & He, Shuangchi, 2019. "Vehicle assignment and relays for one-way electric car-sharing systems," Transportation Research Part B: Methodological, Elsevier, vol. 120(C), pages 125-146.
    19. Florio, Alexandre M. & Gendreau, Michel & Hartl, Richard F. & Minner, Stefan & Vidal, Thibaut, 2023. "Recent advances in vehicle routing with stochastic demands: Bayesian learning for correlated demands and elementary branch-price-and-cut," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1081-1093.
    20. Paquay, Célia & Crama, Yves & Pironet, Thierry, 2020. "Recovery management for a dial-a-ride system with real-time disruptions," European Journal of Operational Research, Elsevier, vol. 280(3), pages 953-969.
    21. Zhang, Jian & Luo, Kelin & Florio, Alexandre M. & Van Woensel, Tom, 2023. "Solving large-scale dynamic vehicle routing problems with stochastic requests," European Journal of Operational Research, Elsevier, vol. 306(2), pages 596-614.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Yang & Xie, Jiaohong & Chen, Nan, 2022. "Stochastic one-way carsharing systems with dynamic relocation incentives through preference learning," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 166(C).
    2. Zolfagharinia, Hossein & Haughton, Michael A., 2017. "Operational flexibility in the truckload trucking industry," Transportation Research Part B: Methodological, Elsevier, vol. 104(C), pages 437-460.
    3. Bekli, Seyma & Boyacı, Burak & Zografos, Konstantinos G., 2021. "Enhancing the performance of one-way electric carsharing systems through the optimum deployment of fast chargers," Transportation Research Part B: Methodological, Elsevier, vol. 152(C), pages 118-139.
    4. Bansal, Vishal & Kumar, Deepak Prakash & Roy, Debjit & Subramanian, Shankar C., 2022. "Performance evaluation and optimization of design parameters for electric vehicle-sharing platforms by considering vehicle dynamics," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 166(C).
    5. Yanhong Yin & Han Wang & Jimin Xiong & Yufeng Zhu & Zhanfeng Tang, 2021. "Estimation of optimum supply of shared cars based on personal travel behaviors in condition of minimum energy consumption," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(9), pages 13324-13339, September.
    6. Heydar, Mojtaba & Mardaneh, Elham & Loxton, Ryan, 2022. "Approximate dynamic programming for an energy-efficient parallel machine scheduling problem," European Journal of Operational Research, Elsevier, vol. 302(1), pages 363-380.
    7. Dong‐Ping Song & Jonathan Carter, 2008. "Optimal empty vehicle redistribution for hub‐and‐spoke transportation systems," Naval Research Logistics (NRL), John Wiley & Sons, vol. 55(2), pages 156-171, March.
    8. Christine Fricker & Nicolas Gast, 2016. "Incentives and redistribution in homogeneous bike-sharing systems with stations of finite capacity," EURO Journal on Transportation and Logistics, Springer;EURO - The Association of European Operational Research Societies, vol. 5(3), pages 261-291, August.
    9. Luke Schenk & Diego Klabjan, 2008. "Intramarket Optimization for Express Package Carriers," Transportation Science, INFORMS, vol. 42(4), pages 530-545, November.
    10. Cui, Shaohua & Ma, Xiaolei & Zhang, Mingheng & Yu, Bin & Yao, Baozhen, 2022. "The parallel mobile charging service for free-floating shared electric vehicle clusters," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 160(C).
    11. George, David K. & Xia, Cathy H., 2011. "Fleet-sizing and service availability for a vehicle rental system via closed queueing networks," European Journal of Operational Research, Elsevier, vol. 211(1), pages 198-207, May.
    12. Qingyu Luo & Zhihao Ye & Hongfei Jia, 2023. "A Charging Planning Method for Shared Electric Vehicles with the Collaboration of Mobile and Fixed Facilities," Sustainability, MDPI, vol. 15(22), pages 1-16, November.
    13. Weimin Ma & Jiakai Chen & Hua Ke, 2021. "Electric Vehicle Assignment Considering Users’ Waiting Time," Sustainability, MDPI, vol. 13(23), pages 1-14, December.
    14. Zolfagharinia, Hossein & Haughton, Michael, 2016. "Effective truckload dispatch decision methods with incomplete advance load information," European Journal of Operational Research, Elsevier, vol. 252(1), pages 103-121.
    15. Baris Yildiz & Martin Savelsbergh, 2019. "Provably High-Quality Solutions for the Meal Delivery Routing Problem," Transportation Science, INFORMS, vol. 53(5), pages 1372-1388, September.
    16. Xu, Min & Meng, Qiang, 2019. "Fleet sizing for one-way electric carsharing services considering dynamic vehicle relocation and nonlinear charging profile," Transportation Research Part B: Methodological, Elsevier, vol. 128(C), pages 23-49.
    17. Cheung, Bernard K.-S. & Choy, K.L. & Li, Chung-Lun & Shi, Wenzhong & Tang, Jian, 2008. "Dynamic routing model and solution methods for fleet management with mobile technologies," International Journal of Production Economics, Elsevier, vol. 113(2), pages 694-705, June.
    18. Zolfagharinia, Hossein & Haughton, Michael, 2014. "The benefit of advance load information for truckload carriers," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 70(C), pages 34-54.
    19. Schmid, Verena, 2012. "Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming," European Journal of Operational Research, Elsevier, vol. 219(3), pages 611-621.
    20. Al-Kanj, Lina & Nascimento, Juliana & Powell, Warren B., 2020. "Approximate dynamic programming for planning a ride-hailing system using autonomous fleets of electric vehicles," European Journal of Operational Research, Elsevier, vol. 284(3), pages 1088-1106.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:310:y:2023:i:3:p:1218-1233. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.