IDEAS home Printed from https://ideas.repec.org/a/eee/transb/v140y2020icp210-235.html
   My bibliography  Save this article

An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand

Author

Listed:
  • Ying, Cheng-shuo
  • Chow, Andy H.F.
  • Chin, Kwai-Sang

Abstract

This paper presents a novel actor-critic deep reinforcement learning approach for metro train scheduling with circulation of limited rolling stock. The scheduling problem is modeled as a Markov decision process driven by stochastic passenger demand. As in most dynamic optimization problems, the complexity of the scheduling process grows exponentially with the amount of states, decisions, and uncertainties involved. This study aims to address this ‘curses of dimensionality’ issue by adopting an actor-critic deep reinforcement learning solution framework. The framework simplifies the evaluation and searching process for potential optimal solutions by parameterizing the original state and decision spaces with the use of artificial neural networks. A deep deterministic policy gradient algorithm is developed for training the artificial neural networks via simulated system transitions before the actor-critic agent can be applied for online schedule control. The proposed approach is tested with a real-world scenario configured with data collected from the Victoria Line of London Underground, UK. Experiment results illustrate the advantages of the proposed method over a range of established meta-heuristics in terms of computing time, system efficiency, and robustness under different stochastic environments. This study innovates urban transit operations with state-of-the-art computer science and dynamic optimization techniques.

Suggested Citation

  • Ying, Cheng-shuo & Chow, Andy H.F. & Chin, Kwai-Sang, 2020. "An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 210-235.
  • Handle: RePEc:eee:transb:v:140:y:2020:i:c:p:210-235
    DOI: 10.1016/j.trb.2020.08.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191261520303829
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.trb.2020.08.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Corman, Francesco & D’Ariano, Andrea & Marra, Alessio D. & Pacciarelli, Dario & Samà, Marcella, 2017. "Integrating train scheduling and delay management in real-time railway traffic control," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 105(C), pages 213-239.
    2. Cacchiani, Valentina & Furini, Fabio & Kidd, Martin Philip, 2016. "Approaches to a real-world Train Timetabling Problem in a railway node," Omega, Elsevier, vol. 58(C), pages 97-110.
    3. Scheepmaker, Gerben M. & Goverde, Rob M.P. & Kroon, Leo G., 2017. "Review of energy-efficient train control and timetabling," European Journal of Operational Research, Elsevier, vol. 257(2), pages 355-376.
    4. Chow, Andy H.F. & Pavlides, Aris, 2018. "Cost functions and multi-objective timetabling of mixed train services," Transportation Research Part A: Policy and Practice, Elsevier, vol. 113(C), pages 335-356.
    5. Yin, Jiateng & Tang, Tao & Yang, Lixing & Gao, Ziyou & Ran, Bin, 2016. "Energy-efficient metro train rescheduling with uncertain time-variant passenger demands: An approximate dynamic programming approach," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 178-210.
    6. Li, Xiang & Lo, Hong K., 2014. "Energy minimization in dynamic train scheduling and control for metro rail operations," Transportation Research Part B: Methodological, Elsevier, vol. 70(C), pages 269-284.
    7. D'Ariano, Andrea & Pacciarelli, Dario & Pranzo, Marco, 2007. "A branch and bound algorithm for scheduling trains in a railway network," European Journal of Operational Research, Elsevier, vol. 183(2), pages 643-657, December.
    8. Robenek, Tomáš & Maknoon, Yousef & Azadeh, Shadi Sharif & Chen, Jianghang & Bierlaire, Michel, 2016. "Passenger centric train timetabling problem," Transportation Research Part B: Methodological, Elsevier, vol. 89(C), pages 107-126.
    9. Shao, Hu & Lam, William H.K. & Sumalee, Agachai & Chen, Anthony & Hazelton, Martin L., 2014. "Estimation of mean and covariance of peak hour origin–destination demands from day-to-day traffic counts," Transportation Research Part B: Methodological, Elsevier, vol. 68(C), pages 52-75.
    10. Tampère, Chris M.J. & Corthout, Ruben & Cattrysse, Dirk & Immers, Lambertus H., 2011. "A generic class of first order node models for dynamic macroscopic simulation of traffic flows," Transportation Research Part B: Methodological, Elsevier, vol. 45(1), pages 289-309, January.
    11. Yang, Yudi & Fan, Yueyue & Wets, Roger J.B., 2018. "Stochastic travel demand estimation: Improving network identifiability using multi-day observation sets," Transportation Research Part B: Methodological, Elsevier, vol. 107(C), pages 192-211.
    12. Guo, Xin & Sun, Huijun & Wu, Jianjun & Jin, Jiangang & Zhou, Jin & Gao, Ziyou, 2017. "Multiperiod-based timetable optimization for metro transit networks," Transportation Research Part B: Methodological, Elsevier, vol. 96(C), pages 46-67.
    13. Yang, Lixing & Qi, Jianguo & Li, Shukai & Gao, Yuan, 2016. "Collaborative optimization for train scheduling and train stop planning on high-speed railways," Omega, Elsevier, vol. 64(C), pages 57-76.
    14. Li, Xiang & Lo, Hong K., 2014. "An energy-efficient scheduling and speed control approach for metro rail operations," Transportation Research Part B: Methodological, Elsevier, vol. 64(C), pages 73-89.
    15. Yang, Xin & Chen, Anthony & Ning, Bin & Tang, Tao, 2017. "Bi-objective programming approach for solving the metro timetable optimization problem with dwell time uncertainty," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 97(C), pages 22-37.
    16. Chow, Andy H.F. & Li, Shuai & Zhong, Renxin, 2017. "Multi-objective optimal control formulations for bus service reliability with traffic signals," Transportation Research Part B: Methodological, Elsevier, vol. 103(C), pages 248-268.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ying, Cheng-shuo & Chow, Andy H.F. & Nguyen, Hoa T.M. & Chin, Kwai-Sang, 2022. "Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition," Transportation Research Part B: Methodological, Elsevier, vol. 161(C), pages 36-59.
    2. Yin, Jiateng & Wang, Miao & D’Ariano, Andrea & Zhang, Jinlei & Yang, Lixing, 2023. "Synchronization of train timetables in an urban rail network: A bi-objective optimization approach," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 174(C).
    3. Yan, Dongyang & Li, Keping & Zhu, Qiaozhen & Liu, Yanyan, 2023. "A railway accident prevention method based on reinforcement learning – Active preventive strategy by multi-modal data," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    4. Nguyen, Hoa T.M. & Chow, Andy H.F. & Ying, Cheng-shuo, 2021. "Pareto routing and scheduling of dynamic urban rail transit services with multi-objective cross entropy method," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 156(C).
    5. Yan, Yimo & Chow, Andy H.F. & Ho, Chin Pang & Kuo, Yong-Hong & Wu, Qihao & Ying, Chengshuo, 2022. "Reinforcement learning for logistics and supply chain management: Methodologies, state of the art, and future opportunities," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 162(C).
    6. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    7. Wang, Xuekai & D’Ariano, Andrea & Su, Shuai & Tang, Tao, 2023. "Cooperative train control during the power supply shortage in metro system: A multi-agent reinforcement learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 170(C), pages 244-278.
    8. Ola G. El‐Taliawi & Nihit Goyal & Michael Howlett, 2021. "Holding out the promise of Lasswell's dream: Big data analytics in public policy research and teaching," Review of Policy Research, Policy Studies Organization, vol. 38(6), pages 640-660, November.
    9. Han, Zhenyu & Han, Baoming & Li, Dewei & Ning, Shangbin & Yang, Ruixia & Yin, Yonghao, 2021. "Train timetabling in rail transit network under uncertain and dynamic demand using Advanced and Adaptive NSGA-II," Transportation Research Part B: Methodological, Elsevier, vol. 154(C), pages 65-99.
    10. Angel A. Juan & Peter Keenan & Rafael Martí & Seán McGarraghy & Javier Panadero & Paula Carroll & Diego Oliva, 2023. "A review of the role of heuristics in stochastic optimisation: from metaheuristics to learnheuristics," Annals of Operations Research, Springer, vol. 320(2), pages 831-861, January.
    11. Su, Z.C. & Chow, Andy H.F. & Fang, C.L. & Liang, E.M. & Zhong, R.X., 2023. "Hierarchical control for stochastic network traffic with reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 167(C), pages 196-216.
    12. Kuo, Yong-Hong & Leung, Janny M.Y. & Yan, Yimo, 2023. "Public transport for smart cities: Recent innovations and future challenges," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1001-1026.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yin, Jiateng & Yang, Lixing & Tang, Tao & Gao, Ziyou & Ran, Bin, 2017. "Dynamic passenger demand oriented metro train scheduling with energy-efficiency and waiting time minimization: Mixed-integer linear programming approaches," Transportation Research Part B: Methodological, Elsevier, vol. 97(C), pages 182-213.
    2. Zhang, Yongxiang & D'Ariano, Andrea & He, Bisheng & Peng, Qiyuan, 2019. "Microscopic optimization model and algorithm for integrating train timetabling and track maintenance task scheduling," Transportation Research Part B: Methodological, Elsevier, vol. 127(C), pages 237-278.
    3. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    4. Liu, Renming & Li, Shukai & Yang, Lixing, 2020. "Collaborative optimization for metro train scheduling and train connections combined with passenger flow control strategy," Omega, Elsevier, vol. 90(C).
    5. Jiang, Feng & Cacchiani, Valentina & Toth, Paolo, 2017. "Train timetabling by skip-stop planning in highly congested lines," Transportation Research Part B: Methodological, Elsevier, vol. 104(C), pages 149-174.
    6. Yang, Songpo & Liao, Feixiong & Wu, Jianjun & Timmermans, Harry J.P. & Sun, Huijun & Gao, Ziyou, 2020. "A bi-objective timetable optimization model incorporating energy allocation and passenger assignment in an energy-regenerative metro system," Transportation Research Part B: Methodological, Elsevier, vol. 133(C), pages 85-113.
    7. Zhou, Leishan & Tong, Lu (Carol) & Chen, Junhua & Tang, Jinjin & Zhou, Xuesong, 2017. "Joint optimization of high-speed train timetables and speed profiles: A unified modeling approach using space-time-speed grid networks," Transportation Research Part B: Methodological, Elsevier, vol. 97(C), pages 157-181.
    8. Zhou, Li & Yang, Xin & Wang, Huan & Wu, Jianjun & Chen, Lei & Yin, Haodong & Qu, Yunchao, 2020. "A robust train timetable optimization approach for reducing the number of waiting passengers in metro systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 558(C).
    9. Shi, Jungang & Yang, Lixing & Yang, Jing & Gao, Ziyou, 2018. "Service-oriented train timetabling with collaborative passenger flow control on an oversaturated metro line: An integer linear optimization approach," Transportation Research Part B: Methodological, Elsevier, vol. 110(C), pages 26-59.
    10. Qi, Jianguo & Yang, Lixing & Di, Zhen & Li, Shukai & Yang, Kai & Gao, Yuan, 2018. "Integrated optimization for train operation zone and stop plan with passenger distributions," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 109(C), pages 151-173.
    11. Wang, Pengling & Goverde, Rob M.P., 2019. "Multi-train trajectory optimization for energy-efficient timetabling," European Journal of Operational Research, Elsevier, vol. 272(2), pages 621-635.
    12. Zhaoxiang Tan & Shaofeng Lu & Kai Bao & Shaoning Zhang & Chaoxian Wu & Jie Yang & Fei Xue, 2018. "Adaptive Partial Train Speed Trajectory Optimization," Energies, MDPI, vol. 11(12), pages 1-33, November.
    13. Xue, Hongjiao & Jia, Limin & Li, Jian & Guo, Jianyuan, 2022. "Jointly optimized demand-oriented train timetable and passenger flow control strategy for a congested subway line under a short-turning operation pattern," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 593(C).
    14. Meloni, Carlo & Pranzo, Marco & Samà, Marcella, 2021. "Risk of delay evaluation in real-time train scheduling with uncertain dwell times," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 152(C).
    15. Andreas Bärmann & Alexander Martin & Oskar Schneider, 2021. "Efficient Formulations and Decomposition Approaches for Power Peak Reduction in Railway Traffic via Timetabling," Transportation Science, INFORMS, vol. 55(3), pages 747-767, May.
    16. Blanco, Víctor & Conde, Eduardo & Hinojosa, Yolanda & Puerto, Justo, 2020. "An optimization model for line planning and timetabling in automated urban metro subway networks. A case study," Omega, Elsevier, vol. 92(C).
    17. Wang, Xuekai & Tang, Tao & Su, Shuai & Yin, Jiateng & Gao, Ziyou & Lv, Nan, 2021. "An integrated energy-efficient train operation approach based on the space-time-speed network methodology," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 150(C).
    18. Li, Wenxin & Peng, Qiyuan & Wen, Chao & Wang, Pengling & Lessan, Javad & Xu, Xinyue, 2020. "Joint optimization of delay-recovery and energy-saving in a metro system: A case study from China," Energy, Elsevier, vol. 202(C).
    19. Albrecht, Amie & Howlett, Phil & Pudney, Peter & Vu, Xuan & Zhou, Peng, 2018. "The two-train separation problem on non-level track—driving strategies that minimize total required tractive energy subject to prescribed section clearance times," Transportation Research Part B: Methodological, Elsevier, vol. 111(C), pages 135-167.
    20. Huanhuan Lv & Yuzhao Zhang & Kang Huang & Xiaotong Yu & Jianjun Wu, 2019. "An Energy-Efficient Timetable Optimization Approach in a Bi-DirectionUrban Rail Transit Line: A Mixed-Integer Linear Programming Model," Energies, MDPI, vol. 12(14), pages 1-24, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:140:y:2020:i:c:p:210-235. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.