IDEAS home Printed from https://ideas.repec.org/a/sae/risrel/v238y2024i5p945-956.html
   My bibliography  Save this article

Combining BERT with numerical variables to classify injury leave based on accident description

Author

Listed:
  • Plínio MS Ramos
  • July B Macedo
  • Caio BS Maior
  • Márcio C Moura
  • Isis D Lins

Abstract

The occurrence of work accidents may threaten the workers’ health and lead to consequences for the organizations as well, such as restructuring of work and direct/indirect costs with the absence of the worker. In this context, accident investigation reports contain information that can support companies to propose preventive and mitigative measures and identify causes and consequences of injury events. However, this information is frequently complex, redundant, and/or incomplete. Additionally, a complete human review of the entire database is arduous, considering numerous reports produced by a company. Indeed, Natural Language Processing (NLP)-based techniques are suitable for analyzing a massive amount of textual information. In this paper, we adopted NLP techniques to determine whether an injury leave would be expected from a given accident report. The methodology was applied to accident reports collected from an actual hydroelectric power company using Bidirectional Encoder Representations from Transformers (BERT), a state-of-art NLP method. The text representations provided by BERT model were combined with numerical and binary variables extracted from the accident reports. These combined variables are input to a Multilayer Perceptron (MLP) that predicts the occurrence of the accident leave for a given accident. After cross-validation, the results showed a median accuracy of 73.5%. Additionally, we discuss several reports that presented high and low proportions of correct classifications by the models tested and discussed the possible reasons. Indeed, accident investigation reports provide useful knowledge to support decisions in the safety context.

Suggested Citation

  • Plínio MS Ramos & July B Macedo & Caio BS Maior & Márcio C Moura & Isis D Lins, 2024. "Combining BERT with numerical variables to classify injury leave based on accident description," Journal of Risk and Reliability, , vol. 238(5), pages 945-956, October.
  • Handle: RePEc:sae:risrel:v:238:y:2024:i:5:p:945-956
    DOI: 10.1177/1748006X221140194
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/1748006X221140194
    Download Restriction: no

    File URL: https://libkey.io/10.1177/1748006X221140194?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bevilacqua, Maurizio & Ciarapica, Filippo Emanuele, 2018. "Human factor risk management in the process industry: A case study," Reliability Engineering and System Safety, Elsevier, vol. 169(C), pages 149-159.
    2. das Chagas Moura, Márcio & Azevedo, Rafael Valença & Droguett, Enrique López & Chaves, Leandro Rego & Lins, Isis Didier & Vilela, Romulo Fernando & Filho, Romero Sales, 2016. "Estimation of expected number of accidents and workforce unavailability through Bayesian population variability analysis and Markov-based model," Reliability Engineering and System Safety, Elsevier, vol. 150(C), pages 136-146.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ewa DUDEK & Karolina KRZYKOWSKA-PIOTROWSKA & Mirosław SIERGIEJCZYK, 2020. "Risk Management In (Air) Transport With Exemplary Risk Analysis Based On The Tolerability Matrix," Transport Problems, Silesian University of Technology, Faculty of Transport, vol. 15(2), pages 143-156, June.
    2. Maurizio Bevilacqua & Eleonora Bottani & Filippo Emanuele Ciarapica & Francesco Costantino & Luciano Di Donato & Alessandra Ferraro & Giovanni Mazzuto & Andrea Monteriù & Giorgia Nardini & Marco Orten, 2020. "Digital Twin Reference Model Development to Prevent Operators’ Risk in Process Plants," Sustainability, MDPI, vol. 12(3), pages 1-17, February.
    3. Abrishami, Shokoufeh & Khakzad, Nima & Hosseini, Seyed Mahmoud, 2020. "A data-based comparison of BN-HRA models in assessing human error probability: An offshore evacuation case study," Reliability Engineering and System Safety, Elsevier, vol. 202(C).
    4. Westreich, Sara & Perlman, Yael & Winkler, Michael, 2021. "Analysis and Implications of the Management of Near-Miss Events: A Game Theoretic Approach," Reliability Engineering and System Safety, Elsevier, vol. 212(C).
    5. Huiyue Diao & Majid Ghorbani, 2018. "Production risk caused by human factors: a multiple case study of thermal power plants," Frontiers of Business Research in China, Springer, vol. 12(1), pages 1-27, December.
    6. Che, Haiyang & Zeng, Shengkui & Guo, Jianbin, 2019. "Reliability assessment of man-machine systems subject to mutually dependent machine degradation and human errors," Reliability Engineering and System Safety, Elsevier, vol. 190(C), pages 1-1.
    7. Zhou, Jian-Lan & Tu, Ren-Fang & Xiao, Hai, 2022. "Large-scale group decision-making to facilitate inter-rater reliability of human-factors analysis for the railway system," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
    8. Wenjun Zhang & Xiangkun Meng & Xue Yang & Hongguang Lyu & Xiang-Yu Zhou & Qingwu Wang, 2022. "A Practical Risk-Based Model for Early Warning of Seafarer Errors Using Integrated Bayesian Network and SPAR-H," IJERPH, MDPI, vol. 19(16), pages 1-14, August.
    9. Jia, Xiaohui & Zhang, Donghui, 2021. "Prediction of maritime logistics service risks applying soft set based association rule: An early warning model," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    10. Antonello, Federico & Baraldi, Piero & Shokry, Ahmed & Zio, Enrico & Gentile, Ugo & Serio, Luigi, 2021. "Association rules extraction for the identification of functional dependencies in complex technical infrastructures," Reliability Engineering and System Safety, Elsevier, vol. 209(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:risrel:v:238:y:2024:i:5:p:945-956. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.