IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v16y2024i12p5081-d1415190.html
   My bibliography  Save this article

Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data

Author

Listed:
  • Jifei Fan

    (School of Civil Engineering, Lanzhou University of Technology, Lanzhou 730050, China)

  • Daopeng Wang

    (School of Civil Engineering, Lanzhou University of Technology, Lanzhou 730050, China)

  • Ping Liu

    (School of Civil Engineering, Lanzhou University of Technology, Lanzhou 730050, China)

  • Jiaming Xu

    (School of Civil Engineering, Lanzhou University of Technology, Lanzhou 730050, China)

Abstract

Given the complexity and variability of modern construction projects, safety risk management has become increasingly challenging, while traditional methods exhibit deficiencies in handling complex dynamic environments, particularly those involving unstructured text data. Consequently, this study proposes a text data-based risk prediction method for building construction safety. Initially, heuristic Chinese automatic word segmentation, which incorporates mutual information, information entropy statistics, and the TF-IDF algorithm, preprocesses text data to extract risk factor keywords and construct accident attribute variables. At the same time, the Spearman correlation coefficient is utilized to eliminate the multicollinearity between feature variables. Next, the XGBoost algorithm is employed to develop a model for predicting the risks associated with safe production. Its performance is optimized through three experimental scenarios. The results indicate that the model achieves satisfactory overall performance after hyperparameter tuning, with the prediction accuracy and F1 score reaching approximately 86%. Finally, the SHAP model interpretation technique identifies critical factors influencing the safety production risk in building construction, highlighting project managers’ attention to safety, government regulation, safety design, and emergency response as critical determinants of accident severity. The main objective of this study is to minimize human intervention in risk assessment and to construct a text data-based risk prediction model for building construction safety production using the rich empirical knowledge embedded in unstructured accident text, with the aim of reducing safety production accidents and promoting the sustainable development of construction safety in the industry. This model not only enables a paradigm shift toward intelligent risk control in safety production but also provides theoretical and practical insights into decision-making and technical support in safety production.

Suggested Citation

  • Jifei Fan & Daopeng Wang & Ping Liu & Jiaming Xu, 2024. "Research on the Prediction of Sustainable Safety Production in Building Construction Based on Text Data," Sustainability, MDPI, vol. 16(12), pages 1-21, June.
  • Handle: RePEc:gam:jsusta:v:16:y:2024:i:12:p:5081-:d:1415190
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/16/12/5081/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/16/12/5081/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gambrill, Eileen & Shlonsky, Aron, 2000. "Risk assessment in context," Children and Youth Services Review, Elsevier, vol. 22(11-12), pages 813-837.
    2. Gandomi, Amir & Haider, Murtaza, 2015. "Beyond the hype: Big data concepts, methods, and analytics," International Journal of Information Management, Elsevier, vol. 35(2), pages 137-144.
    3. Aven, Terje, 2016. "Risk assessment and risk management: Review of recent advances on their foundation," European Journal of Operational Research, Elsevier, vol. 253(1), pages 1-13.
    4. Haiyan Chen & Yihua Mao & Yidong Xu & Rui Wang, 2023. "The Impact of Wearable Devices on the Construction Safety of Building Workers: A Systematic Review," Sustainability, MDPI, vol. 15(14), pages 1-15, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Julia Eichholz & Thorsten Knauer & Sandra Winkelmann, 2023. "Digital Maturity of Forecasting and its Impact in Times of Crisis," Schmalenbach Journal of Business Research, Springer, vol. 75(4), pages 443-481, December.
    2. de Camargo Fiorini, Paula & Roman Pais Seles, Bruno Michel & Chiappetta Jabbour, Charbel Jose & Barberio Mariano, Enzo & de Sousa Jabbour, Ana Beatriz Lopes, 2018. "Management theory and big data literature: From a review to a research agenda," International Journal of Information Management, Elsevier, vol. 43(C), pages 112-129.
    3. Amiri, Babak & Karimianghadim, Ramin, 2024. "A novel text clustering model based on topic modelling and social network analysis," Chaos, Solitons & Fractals, Elsevier, vol. 181(C).
    4. Julia Eichholz & Nicole Hoffmann & Anja Schwering, 2024. "The role of risk management orientation and the planning function of budgeting in enhancing organizational resilience and its effect on competitive advantages during times of crises," Journal of Management Control: Zeitschrift für Planung und Unternehmenssteuerung, Springer, vol. 35(1), pages 17-58, March.
    5. Adesuwa Omorede & Juan Francisco Prados-Castillo & Amalia Cristina Casas-Jurado, 2025. "Researching entrepreneurship using big data: implementation, benefits, and challenges," International Entrepreneurship and Management Journal, Springer, vol. 21(1), pages 1-22, December.
    6. repec:arp:tjssrr:2019:p:69-75 is not listed on IDEAS
    7. Acharya, Abhilash & Singh, Sanjay Kumar & Pereira, Vijay & Singh, Poonam, 2018. "Big data, knowledge co-creation and decision making in fashion industry," International Journal of Information Management, Elsevier, vol. 42(C), pages 90-101.
    8. Mussard, Stéphane & Pi Alperin, María Noel, 2021. "Accounting for risk factors on health outcomes: The case of Luxembourg," European Journal of Operational Research, Elsevier, vol. 291(3), pages 1180-1197.
    9. Zohreh Molamohamadi & Mahmoud Asad Samani & Marjan Karimi, 2024. "Reviewing the historical milestones of risk management," SN Business & Economics, Springer, vol. 4(12), pages 1-20, December.
    10. Harkaran Kava & Konstantina Spanaki & Thanos Papadopoulos & Stella Despoudi & Oscar Rodriguez-Espindola & Masoud Fakhimi, 2021. "Data Analytics Diffusion in the UK Renewable Energy Sector: An Innovation Perspective," Post-Print hal-03781046, HAL.
    11. Oesterreich, Thuy Duong & Anton, Eduard & Teuteberg, Frank & Dwivedi, Yogesh K, 2022. "The role of the social and technical factors in creating business value from big data analytics: A meta-analysis," Journal of Business Research, Elsevier, vol. 153(C), pages 128-149.
    12. Zio, E., 2018. "The future of risk assessment," Reliability Engineering and System Safety, Elsevier, vol. 177(C), pages 176-190.
    13. Zeng, Qing & Lu, Xinjie & Xu, Jin & Lin, Yu, 2024. "Macro-Driven Stock Market Volatility Prediction: Insights from a New Hybrid Machine Learning Approach," International Review of Financial Analysis, Elsevier, vol. 96(PB).
    14. Johannes Habel & Sascha Alavi & Nicolas Heinitz, 2023. "A theory of predictive sales analytics adoption," AMS Review, Springer;Academy of Marketing Science, vol. 13(1), pages 34-54, June.
    15. Chotia, Varun & Khoualdi, Kamel & Broccardo, Laura & Yaqub, Muhammad Zafar, 2025. "The role of cyber security and digital transformation in gaining competitive advantage through Strategic Management Accounting," Technology in Society, Elsevier, vol. 81(C).
    16. Tiago Carneiro & Winnie Ng Picoto & Inês Pinto, 2023. "Big Data Analytics and Firm Performance in the Hotel Sector," Tourism and Hospitality, MDPI, vol. 4(2), pages 1-13, April.
    17. Weronika Jakubczak & Anna Golebiowska & Dariusz Prokopowicz, 2021. "The Legal and Security Aspects of ICT and Industry 4.0 Importance for Financial Industry 4.0 Development," European Research Studies Journal, European Research Studies Journal, vol. 0(4 - Part ), pages 169-181.
    18. Mangirdas Morkunas & Gintaras Cernius & Gintare Giriuniene, 2019. "Assessing Business Risks of Natural Gas Trading Companies: Evidence from GET Baltic," Energies, MDPI, vol. 12(14), pages 1-14, July.
    19. Scholz, Roland W. & Czichos, Reiner & Parycek, Peter & Lampoltshammer, Thomas J., 2020. "Organizational vulnerability of digital threats: A first validation of an assessment method," European Journal of Operational Research, Elsevier, vol. 282(2), pages 627-643.
    20. Gianfranco Marotta & Phillipe Krahnhof & Cam-Duc Au, 2022. "A Critical Analysis of Budgeting Processes from the Pharmaceutical Industry and Beyond," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 12(3), pages 1-3.
    21. Dr Jason Mwanza & Nothando Tshuma, 2023. "Mitigating Business Risk in Manufacturing SMEs: A nexus between informal and formal business risk management: A case of Bulawayo, Zimbabwe," International Journal of Research and Innovation in Social Science, International Journal of Research and Innovation in Social Science (IJRISS), vol. 7(1), pages 1107-1138, January.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:16:y:2024:i:12:p:5081-:d:1415190. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.