IDEAS home Printed from https://ideas.repec.org/a/eee/transb/v161y2022icp36-59.html
   My bibliography  Save this article

Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition

Author

Listed:
  • Ying, Cheng-shuo
  • Chow, Andy H.F.
  • Nguyen, Hoa T.M.
  • Chin, Kwai-Sang

Abstract

This paper presents an adaptive control system for coordinated metro operations with flexible train composition by using a multi-agent deep reinforcement learning (MADRL) approach. The control problem is formulated as a Markov decision process (MDP) with multiple agents regulating different service lines in a metro network with passenger transfer. To ensure the overall computational effectiveness and stability of the control system, we adopt an actor–critic reinforcement learning framework in which each control agent is associated with a critic function for estimating future system states and an actor function deriving local operational decisions. The critics and actors in the MADRL are represented by multi-layer artificial neural networks (ANNs). A multi-agent deep deterministic policy gradient (MADDPG) algorithm is developed for training the actor and critic ANNs through successive simulated transitions over the entire metro network. The developed framework is tested with a real-world scenario in Bakerloo and Victoria Lines of London Underground, UK. Experiment results demonstrate that the proposed method can outperform previous centralized optimization and distributed control approaches in terms of solution quality and performance achieved. Further analysis shows the merits of MADRL for coordinated service regulation with flexible train composition. This study contributes to real-time coordinated metro network services with flexible train composition and advanced optimization techniques.

Suggested Citation

  • Ying, Cheng-shuo & Chow, Andy H.F. & Nguyen, Hoa T.M. & Chin, Kwai-Sang, 2022. "Multi-agent deep reinforcement learning for adaptive coordinated metro service operations with flexible train composition," Transportation Research Part B: Methodological, Elsevier, vol. 161(C), pages 36-59.
  • Handle: RePEc:eee:transb:v:161:y:2022:i:c:p:36-59
    DOI: 10.1016/j.trb.2022.05.001
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191261522000728
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.trb.2022.05.001?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wu, Weitiao & Liu, Ronghui & Jin, Wenzhou, 2016. "Designing robust schedule coordination scheme for transit networks with safety control margins," Transportation Research Part B: Methodological, Elsevier, vol. 93(PA), pages 495-519.
    2. Abdolmaleki, Mojtaba & Masoud, Neda & Yin, Yafeng, 2020. "Transit timetable synchronization for transfer time minimization," Transportation Research Part B: Methodological, Elsevier, vol. 131(C), pages 143-159.
    3. Ying, Cheng-shuo & Chow, Andy H.F. & Chin, Kwai-Sang, 2020. "An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 210-235.
    4. Yin, Jiateng & D’Ariano, Andrea & Wang, Yihui & Yang, Lixing & Tang, Tao, 2021. "Timetable coordination in a rail transit network with time-dependent passenger demand," European Journal of Operational Research, Elsevier, vol. 295(1), pages 183-202.
    5. B. G. Heydecker & J. D. Addison, 2005. "Analysis of Dynamic Traffic Equilibrium with Departure Time Choice," Transportation Science, INFORMS, vol. 39(1), pages 39-57, February.
    6. Haahr, Jørgen T. & Wagenaar, Joris C. & Veelenturf, Lucas P. & Kroon, Leo G., 2016. "A comparison of two exact methods for passenger railway rolling stock (re)scheduling," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 91(C), pages 15-32.
    7. Chow, Andy H.F. & Pavlides, Aris, 2018. "Cost functions and multi-objective timetabling of mixed train services," Transportation Research Part A: Policy and Practice, Elsevier, vol. 113(C), pages 335-356.
    8. Yin, Jiateng & Tang, Tao & Yang, Lixing & Gao, Ziyou & Ran, Bin, 2016. "Energy-efficient metro train rescheduling with uncertain time-variant passenger demands: An approximate dynamic programming approach," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 178-210.
    9. Nguyen, Hoa T.M. & Chow, Andy H.F. & Ying, Cheng-shuo, 2021. "Pareto routing and scheduling of dynamic urban rail transit services with multi-objective cross entropy method," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 156(C).
    10. Guo, Xin & Wu, Jianjun & Sun, Huijun & Yang, Xin & Jin, Jian Gang & Wang, David Z.W., 2020. "Scheduling synchronization in urban rail transit networks: Trade-offs between transfer passenger and last train operation," Transportation Research Part A: Policy and Practice, Elsevier, vol. 138(C), pages 463-490.
    11. Rachel C. W. Wong & Tony W. Y. Yuen & Kwok Wah Fung & Janny M. Y. Leung, 2008. "Optimizing Timetable Synchronization for Rail Mass Transit," Transportation Science, INFORMS, vol. 42(1), pages 57-69, February.
    12. Šemrov, D. & Marsetič, R. & Žura, M. & Todorovski, L. & Srdic, A., 2016. "Reinforcement learning approach for train rescheduling on a single-track railway," Transportation Research Part B: Methodological, Elsevier, vol. 86(C), pages 250-267.
    13. Fonseca, João Paiva & van der Hurk, Evelien & Roberti, Roberto & Larsen, Allan, 2018. "A matheuristic for transfer synchronization through integrated timetabling and vehicle scheduling," Transportation Research Part B: Methodological, Elsevier, vol. 109(C), pages 128-149.
    14. Guo, Xin & Sun, Huijun & Wu, Jianjun & Jin, Jiangang & Zhou, Jin & Gao, Ziyou, 2017. "Multiperiod-based timetable optimization for metro transit networks," Transportation Research Part B: Methodological, Elsevier, vol. 96(C), pages 46-67.
    15. Kang, Liujiang & Zhu, Xiaoning & Sun, Huijun & Puchinger, Jakob & Ruthmair, Mario & Hu, Bin, 2016. "Modeling the first train timetabling problem with minimal missed trains and synchronization time differences in subway networks," Transportation Research Part B: Methodological, Elsevier, vol. 93(PA), pages 17-36.
    16. Liu, Tao & Ceder, Avishai (Avi), 2018. "Integrated public transport timetable synchronization and vehicle scheduling with demand assignment: A bi-objective bi-level model using deficit function approach," Transportation Research Part B: Methodological, Elsevier, vol. 117(PB), pages 935-955.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kuo, Yong-Hong & Leung, Janny M.Y. & Yan, Yimo, 2023. "Public transport for smart cities: Recent innovations and future challenges," European Journal of Operational Research, Elsevier, vol. 306(3), pages 1001-1026.
    2. Hu, Yuting & Li, Shukai & Dessouky, Maged M. & Yang, Lixing & Gao, Ziyou, 2022. "Computationally efficient train timetable generation of metro networks with uncertain transfer walking time to reduce passenger waiting time: A generalized Benders decomposition-based method," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 210-231.
    3. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    4. Cortés, Cristián E. & Gil, Cristiam & Gschwender, Antonio & Rey, Pablo A., 2023. "The bus synchronization timetabling problem with dwelling times," Transportation Research Part B: Methodological, Elsevier, vol. 174(C).
    5. Li, Shukai & Zhou, Xuesong & Yang, Lixing & Gao, Ziyou, 2018. "Automatic train regulation of complex metro networks with transfer coordination constraints: A distributed optimal control framework," Transportation Research Part B: Methodological, Elsevier, vol. 117(PA), pages 228-253.
    6. Yin, Jiateng & Wang, Miao & D’Ariano, Andrea & Zhang, Jinlei & Yang, Lixing, 2023. "Synchronization of train timetables in an urban rail network: A bi-objective optimization approach," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 174(C).
    7. Huang, Kang & Wu, Jianjun & Sun, Huijun & Yang, Xin & Gao, Ziyou & Feng, Xujie, 2022. "Timetable synchronization optimization in a subway–bus network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 608(P1).
    8. Wu, Weitiao & Liu, Ronghui & Jin, Wenzhou & Ma, Changxi, 2019. "Stochastic bus schedule coordination considering demand assignment and rerouting of passengers," Transportation Research Part B: Methodological, Elsevier, vol. 121(C), pages 275-303.
    9. Kang, Liujiang & Li, Hao & Sun, Huijun & Wu, Jianjun & Cao, Zhiguang & Buhigiro, Nsabimana, 2021. "First train timetabling and bus service bridging in intermodal bus-and-train transit networks," Transportation Research Part B: Methodological, Elsevier, vol. 149(C), pages 443-462.
    10. Mitra Heidari & Seyyed-Mahdi Hosseini-Motlagh & Nariman Nikoo, 2020. "A subway planning bi-objective multi-period optimization model integrating timetabling and vehicle scheduling: a case study of Tehran," Transportation, Springer, vol. 47(1), pages 417-443, February.
    11. Yin, Jiateng & Pu, Fan & Yang, Lixing & D’Ariano, Andrea & Wang, Zhouhong, 2023. "Integrated optimization of rolling stock allocation and train timetables for urban rail transit networks: A benders decomposition approach," Transportation Research Part B: Methodological, Elsevier, vol. 176(C).
    12. Kang, Liujiang & Meng, Qiang, 2017. "Two-phase decomposition method for the last train departure time choice in subway networks," Transportation Research Part B: Methodological, Elsevier, vol. 104(C), pages 568-582.
    13. Pan, Hanchuan & Yang, Lixing & Liang, Zhe, 2023. "Demand-oriented integration optimization of train timetabling and rolling stock circulation planning with flexible train compositions: A column-generation-based approach," European Journal of Operational Research, Elsevier, vol. 305(1), pages 184-206.
    14. Kang, Liujiang & Sun, Huijun & Wu, Jianjun & Gao, Ziyou, 2020. "Last train station-skipping, transfer-accessible and energy-efficient scheduling in subway networks," Energy, Elsevier, vol. 206(C).
    15. Ying, Cheng-shuo & Chow, Andy H.F. & Chin, Kwai-Sang, 2020. "An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 210-235.
    16. Guo, Xin & Wu, Jianjun & Sun, Huijun & Yang, Xin & Jin, Jian Gang & Wang, David Z.W., 2020. "Scheduling synchronization in urban rail transit networks: Trade-offs between transfer passenger and last train operation," Transportation Research Part A: Policy and Practice, Elsevier, vol. 138(C), pages 463-490.
    17. Wang, Xuekai & D’Ariano, Andrea & Su, Shuai & Tang, Tao, 2023. "Cooperative train control during the power supply shortage in metro system: A multi-agent reinforcement learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 170(C), pages 244-278.
    18. Kang, Liujiang & Zhu, Xiaoning & Sun, Huijun & Wu, Jianjun & Gao, Ziyou & Hu, Bin, 2019. "Last train timetabling optimization and bus bridging service management in urban railway transit networks," Omega, Elsevier, vol. 84(C), pages 31-44.
    19. Nguyen, Hoa T.M. & Chow, Andy H.F. & Ying, Cheng-shuo, 2021. "Pareto routing and scheduling of dynamic urban rail transit services with multi-objective cross entropy method," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 156(C).
    20. Abdolmaleki, Mojtaba & Masoud, Neda & Yin, Yafeng, 2020. "Transit timetable synchronization for transfer time minimization," Transportation Research Part B: Methodological, Elsevier, vol. 131(C), pages 143-159.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:161:y:2022:i:c:p:36-59. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.