IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v257y2025ipbs0951832025000894.html
   My bibliography  Save this article

Developing a deep reinforcement learning model for safety risk prediction at subway construction sites

Author

Listed:
  • Zhou, Zhipeng
  • Zhuo, Wen
  • Cui, Jianqiang
  • Luan, Haiying
  • Chen, Yudi
  • Lin, Dong

Abstract

Underground construction work is heavily affected by surrounding hydrogeology, adjacent pipelines, and existing subway lines, which can lead to a high degree of uncertainty and generate safety risk on site. In order to overcome rigid thinking of causal factors within a structured framework and incorporate features of different accidents, this study adopted grounded theory for the investigation on factors contributing to workplace accidents in subway construction. The deep reinforcement learning model of double deep Q-network (DDQN) was developed for predicting subway construction safety risk, which integrated the advantage of reinforcement learning in decision making with the advantage of deep learning in objection perception. The findings denoted that DDQN performed better than other machine learning models inclusive of random forest, extreme gradient boosting, k-nearest neighbor, and support vector machine. Contributing factors relevant to subway construction accidents were quantitatively analyzed using permutation importance of attributes. It was beneficial for determining how the 37 contributing factors had negative effects on subway construction safety risk. Safety measures for risk reduction and controlling could be optimized according to permutation importance of individual contributing factor, which paved a new way for the promotion of safety management performance at subway construction sites.

Suggested Citation

  • Zhou, Zhipeng & Zhuo, Wen & Cui, Jianqiang & Luan, Haiying & Chen, Yudi & Lin, Dong, 2025. "Developing a deep reinforcement learning model for safety risk prediction at subway construction sites," Reliability Engineering and System Safety, Elsevier, vol. 257(PB).
  • Handle: RePEc:eee:reensy:v:257:y:2025:i:pb:s0951832025000894
    DOI: 10.1016/j.ress.2025.110885
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832025000894
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2025.110885?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Huo, Xiaosen & Yin, Yuan & Jiao, Liudan & Zhang, Yu, 2024. "A data-driven and knowledge graph-based analysis of the risk hazard coupling mechanism in subway construction accidents," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
    2. Maidana, Renan G. & Parhizkar, Tarannom & Gomola, Alojz & Utne, Ingrid B. & Mosleh, Ali, 2023. "Supervised dynamic probabilistic risk assessment: Review and comparison of methods," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    3. Anuoluwapo Ajayi & Lukumon Oyedele & Hakeem Owolabi & Olugbenga Akinade & Muhammad Bilal & Juan Manuel Davila Delgado & Lukman Akanbi, 2020. "Deep Learning Models for Health and Safety Risk Prediction in Power Infrastructure Projects," Risk Analysis, John Wiley & Sons, vol. 40(10), pages 2019-2039, October.
    4. Li, Jue & Li, Heng & Wang, Fan & Cheng, Andy S.K. & Yang, Xincong & Wang, Hongwei, 2021. "Proactive analysis of construction equipment operators’ hazard perception error based on cognitive modeling and a dynamic Bayesian network," Reliability Engineering and System Safety, Elsevier, vol. 205(C).
    5. Karar, Ahmed Noaman & Labib, Ashraf & Jones, Dylan, 2024. "A resilience-based maintenance optimisation framework using multiple criteria and Knapsack methods," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    6. Zhang, Limao & Wu, Xianguo & Skibniewski, Miroslaw J. & Zhong, Jingbing & Lu, Yujie, 2014. "Bayesian-network-based safety risk analysis in construction projects," Reliability Engineering and System Safety, Elsevier, vol. 131(C), pages 29-39.
    7. Guo, Jian & Luo, Cheng & Ma, Kaijiang, 2023. "Risk coupling analysis of road transportation accidents of hazardous materials in complicated maritime environment," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    8. Love, Peter E.D. & Tenekedjiev, Kiril, 2022. "Understanding near-miss count data on construction sites using Greedy D-vine Copula Marginal Regression: A comment," Reliability Engineering and System Safety, Elsevier, vol. 217(C).
    9. Tapas K. Das & Abhijit Gosavi & Sridhar Mahadevan & Nicholas Marchalleck, 1999. "Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning," Management Science, INFORMS, vol. 45(4), pages 560-574, April.
    10. David M Gligor & Carol L Esmark & Ismail Gölgeci, 2016. "Building international business theory: A grounded theory approach," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 47(1), pages 93-111, January.
    11. Daniel J. Mankowitz & Andrea Michi & Anton Zhernov & Marco Gelmi & Marco Selvi & Cosmin Paduraru & Edouard Leurent & Shariq Iqbal & Jean-Baptiste Lespiau & Alex Ahern & Thomas Köppe & Kevin Millikin &, 2023. "Faster sorting algorithms discovered using deep reinforcement learning," Nature, Nature, vol. 618(7964), pages 257-263, June.
    12. Talayeh Razzaghi & Ilya Safro & Joseph Ewing & Ehsan Sadrfaridpour & John D. Scott, 2019. "Predictive models for bariatric surgery risks with imbalanced medical datasets," Annals of Operations Research, Springer, vol. 280(1), pages 1-18, September.
    13. Guo, Qingjun & Amin, Shohel & Hao, Qianwen & Haas, Olivier, 2020. "Resilience assessment of safety system at subway construction sites applying analytic network process and extension cloud models," Reliability Engineering and System Safety, Elsevier, vol. 201(C).
    14. Isabelle Walsh & Frantz Rowe, 2023. "BIBGT: combining bibliometrics and grounded theory to conduct a literature review," European Journal of Information Systems, Taylor & Francis Journals, vol. 32(4), pages 653-674, July.
    15. Chang, Leilei & Zhang, Limao & Xu, Xiaobin, 2023. "Causality-based multi-model ensemble learning for safety assessment in metro tunnel construction," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    16. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    17. Lan, He & Ma, Xiaoxue & Qiao, Weiliang & Ma, Laihao, 2022. "On the causation of seafarers’ unsafe acts using grounded theory and association rule," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    18. Yang Miang Goh & Peter E. D. Love & Helen Brown & Jeffery Spickett, 2012. "Organizational Accidents: A Systemic Model of Production versus Protection," Journal of Management Studies, Wiley Blackwell, vol. 49(1), pages 52-76, January.
    19. Yongchi Ma & Martin de Jong & Joop Koppenjan, 2013. "Identifying Organizational and Contractual Drivers behind Subway Construction Accidents in China: A Tale of Three Cities," Journal of Urban Technology, Taylor & Francis Journals, vol. 20(3), pages 39-64, July.
    20. Yeh, Wei-Chang, 2023. "Building reliable budget-based binary-state networks," Reliability Engineering and System Safety, Elsevier, vol. 240(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huo, Xiaosen & Yin, Yuan & Jiao, Liudan & Zhang, Yu, 2024. "A data-driven and knowledge graph-based analysis of the risk hazard coupling mechanism in subway construction accidents," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
    2. Shen, Shui-Long & Lin, Song-Shun & Zhou, Annan, 2023. "A cloud model-based approach for risk analysis of excavation system," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    3. Behice Meltem Kayhan & Gokalp Yildiz, 2023. "Reinforcement learning applications to machine scheduling problems: a comprehensive literature review," Journal of Intelligent Manufacturing, Springer, vol. 34(3), pages 905-929, March.
    4. Wen, He & Khan, Faisal & AbouRizk, Simaan & Fu, Gui, 2024. "Understanding of causality and its mathematical representation in accident modeling," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
    5. Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
    6. Mahmoud Mahfouz & Angelos Filos & Cyrine Chtourou & Joshua Lockhart & Samuel Assefa & Manuela Veloso & Danilo Mandic & Tucker Balch, 2019. "On the Importance of Opponent Modeling in Auction Markets," Papers 1911.12816, arXiv.org.
    7. Lixiang Zhang & Yan Yan & Yaoguang Hu, 2024. "Deep reinforcement learning for dynamic scheduling of energy-efficient automated guided vehicles," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 3875-3888, December.
    8. Benjamin Heinbach & Peter Burggräf & Johannes Wagner, 2024. "gym-flp: A Python Package for Training Reinforcement Learning Algorithms on Facility Layout Problems," SN Operations Research Forum, Springer, vol. 5(1), pages 1-26, March.
    9. Woo Jae Byun & Bumkyu Choi & Seongmin Kim & Joohyun Jo, 2023. "Practical Application of Deep Reinforcement Learning to Optimal Trade Execution," FinTech, MDPI, vol. 2(3), pages 1-16, June.
    10. Lu, Yu & Xiang, Yue & Huang, Yuan & Yu, Bin & Weng, Liguo & Liu, Junyong, 2023. "Deep reinforcement learning based optimal scheduling of active distribution system considering distributed generation, energy storage and flexible load," Energy, Elsevier, vol. 271(C).
    11. Yuhong Wang & Lei Chen & Hong Zhou & Xu Zhou & Zongsheng Zheng & Qi Zeng & Li Jiang & Liang Lu, 2021. "Flexible Transmission Network Expansion Planning Based on DQN Algorithm," Energies, MDPI, vol. 14(7), pages 1-21, April.
    12. Pedro Reis & Ana Paula Serra & Jo~ao Gama, 2025. "The Role of Deep Learning in Financial Asset Management: A Systematic Review," Papers 2503.01591, arXiv.org.
    13. Yang, Bofan & Zhang, Lin & Zhang, Bo & Xiang, Yang & An, Lei & Wang, Wenfeng, 2022. "Complex equipment system resilience: Composition, measurement and element analysis," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
    14. Michelle M. LaMar, 2018. "Markov Decision Process Measurement Model," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 67-88, March.
    15. Yang, Ting & Zhao, Liyuan & Li, Wei & Zomaya, Albert Y., 2021. "Dynamic energy dispatch strategy for integrated energy system based on improved deep reinforcement learning," Energy, Elsevier, vol. 235(C).
    16. Wang, Yi & Qiu, Dawei & Sun, Mingyang & Strbac, Goran & Gao, Zhiwei, 2023. "Secure energy management of multi-energy microgrid: A physical-informed safe reinforcement learning approach," Applied Energy, Elsevier, vol. 335(C).
    17. Zhang, Huixian & Wei, Xiukun & Liu, Zhiqiang & Ding, Yaning & Guan, Qingluan, 2025. "Condition-based maintenance for multi-state systems with prognostic and deep reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 255(C).
    18. Neha Soni & Enakshi Khular Sharma & Narotam Singh & Amita Kapoor, 2019. "Impact of Artificial Intelligence on Businesses: from Research, Innovation, Market Deployment to Future Shifts in Business Models," Papers 1905.02092, arXiv.org.
    19. Ande Chang & Yuting Ji & Chunguang Wang & Yiming Bie, 2024. "CVDMARL: A Communication-Enhanced Value Decomposition Multi-Agent Reinforcement Learning Traffic Signal Control Method," Sustainability, MDPI, vol. 16(5), pages 1-17, March.
    20. Sun, Hongchang & Niu, Yanlei & Li, Chengdong & Zhou, Changgeng & Zhai, Wenwen & Chen, Zhe & Wu, Hao & Niu, Lanqiang, 2022. "Energy consumption optimization of building air conditioning system via combining the parallel temporal convolutional neural network and adaptive opposition-learning chimp algorithm," Energy, Elsevier, vol. 259(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:257:y:2025:i:pb:s0951832025000894. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.