IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v6y2021i12p129-d694430.html

Shipping Accidents Dataset: Data-Driven Directions for Assessing Accident’s Impact and Improving Safety Onboard

Author

Listed:
  • Panagiotis Panagiotidis

    (Konnekt-able Technologies Ltd., X91 W0XW Waterford, Ireland)

  • Kyriakos Giannakis

    (Konnekt-able Technologies Ltd., X91 W0XW Waterford, Ireland)

  • Nikolaos Angelopoulos

    (Konnekt-able Technologies Ltd., X91 W0XW Waterford, Ireland)

  • Angelos Liapis

    (Konnekt-able Technologies Ltd., X91 W0XW Waterford, Ireland)

Abstract

Recent tragic marine incidents indicate that more efficient safety procedures and emergency management systems are needed. During the 2014–2019 period, 320 accidents cost 496 lives, and 5424 accidents caused 6210 injuries. Ideally, we need historical data from real accident cases of ships to develop data-driven solutions. According to the literature, the most critical factor to the post-incident management phase is human error. However, no structured datasets record the crew’s actions during an incident and the human factors that contributed to its occurrence. To overcome the limitations mentioned above, we decided to utilise the unstructured information from accident reports conducted by governmental organisations to create a new, well-structured dataset of maritime accidents and provide intuitions for its usage. Our dataset contains all the information that the majority of the marine datasets include, such as the place, the date, and the conditions during the post-incident phase, e.g., weather data. Additionally, the proposed dataset contains attributes related to each incident’s environmental/financial impact, as well as a concise description of the post-incident events, highlighting the crew’s actions and the human factors that contributed to the incident. We utilise this dataset to predict the incident’s impact and provide data-driven directions regarding the improvement of the post-incident safety procedures for specific types of ships.

Suggested Citation

  • Panagiotis Panagiotidis & Kyriakos Giannakis & Nikolaos Angelopoulos & Angelos Liapis, 2021. "Shipping Accidents Dataset: Data-Driven Directions for Assessing Accident’s Impact and Improving Safety Onboard," Data, MDPI, vol. 6(12), pages 1-19, December.
  • Handle: RePEc:gam:jdataj:v:6:y:2021:i:12:p:129-:d:694430
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/6/12/129/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/6/12/129/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gérard Biau & Erwan Scornet, 2016. "A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 197-227, June.
    2. Wang, Huanxin & Liu, Zhengjiang & Wang, Xinjian & Graham, Tony & Wang, Jin, 2021. "An analysis of factors affecting the severity of marine accidents," Reliability Engineering and System Safety, Elsevier, vol. 210(C).
    3. J.S. Cramer, 2002. "The Origins of Logistic Regression," Tinbergen Institute Discussion Papers 02-119/4, Tinbergen Institute.
    4. Xiaolong Wang & Boling Zhang & Xu Zhao & Lulu Wang & Ruipeng Tong, 2020. "Exploring the Underlying Causes of Chinese Eastern Star, Korean Sewol, and Thai Phoenix Ferry Accidents by Employing the HFACS-MA," IJERPH, MDPI, vol. 17(11), pages 1-19, June.
    5. Gérard Biau & Erwan Scornet, 2016. "Rejoinder on: A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 264-268, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Navneet Kaur & Lav Gupta, 2025. "Explainable AI Assisted IoMT Security in Future 6G Networks," Future Internet, MDPI, vol. 17(5), pages 1-30, May.
    2. Hou, Lei & Elsworth, Derek & Zhang, Fengshou & Wang, Zhiyuan & Zhang, Jianbo, 2023. "Evaluation of proppant injection based on a data-driven approach integrating numerical and ensemble learning models," Energy, Elsevier, vol. 264(C).
    3. Blum, Ricardo & Hiabu, Munir & Mammen, Enno & Meyer, Joseph T., 2025. "Pure interaction effects unseen by Random Forests," Computational Statistics & Data Analysis, Elsevier, vol. 212(C).
    4. Aouad, Anthony & Almaksour, Khaled & Abbes, Dhaker, 2024. "Storage management optimization based on electrical consumption and production forecast in a photovoltaic system," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 224(PB), pages 128-147.
    5. Asmae Chakir & Mohamed Tabaa, 2024. "Hybrid Renewable Production Scheduling for a PV–Wind-EV-Battery Architecture Using Sequential Quadratic Programming and Long Short-Term Memory–K-Nearest Neighbors Learning for Smart Buildings," Sustainability, MDPI, vol. 16(5), pages 1-24, March.
    6. Giacomo Caterini, 2018. "Classifying Firms with Text Mining," DEM Working Papers 2018/09, Department of Economics and Management.
    7. Ma, Zhikai & Huo, Qian & Wang, Wei & Zhang, Tao, 2023. "Voltage-temperature aware thermal runaway alarming framework for electric vehicles via deep learning with attention mechanism in time-frequency domain," Energy, Elsevier, vol. 278(C).
    8. Gordeev, Stepan & Steinbach, Sandro, 2024. "Determinants of PTA design: Insights from machine learning," International Economics, Elsevier, vol. 178(C).
    9. Patrick Krennmair & Timo Schmid, 2022. "Flexible domain prediction using mixed effects random forests," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1865-1894, November.
    10. Qiu, Yuhang & Hui, Yunze & Zhao, Pengxiang & Cai, Cheng-Hao & Dai, Baiqian & Dou, Jinxiao & Bhattacharya, Sankar & Yu, Jianglong, 2024. "A novel image expression-driven modeling strategy for coke quality prediction in the smart cokemaking process," Energy, Elsevier, vol. 294(C).
    11. Frink, Nicolas & Schmid, Timo, 2025. "Small area prediction of counts under machine learning-type mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 211(C).
    12. Ye Tian & Xiaobai Angela Yao & Marguerite Madden & Andrew Grundstein, 2024. "Synergic effects of meteorological factors on urban form-outdoor exercise relationship: A study with crowdsourced data," Journal of Geographical Systems, Springer, vol. 26(1), pages 47-72, January.
    13. Jie Shi & Arno P. J. M. Siebes & Siamak Mehrkanoon, 2023. "TransCORALNet: A Two-Stream Transformer CORAL Networks for Supply Chain Credit Assessment Cold Start," Papers 2311.18749, arXiv.org.
    14. Nela Ivković & Željana Bašić & Ivan Jerković, 2024. "Classifying age from medial clavicle using a 30-year threshold: An image analysis based approach," PLOS ONE, Public Library of Science, vol. 19(11), pages 1-16, November.
    15. Pedro Forquesato, 2022. "Who Benefits from Political Connections in Brazilian Municipalities," Papers 2204.09450, arXiv.org.
    16. Bourdouxhe, Axel & Wibail, Lionel & Claessens, Hugues & Dufrêne, Marc, 2023. "Modeling potential natural vegetation: A new light on an old concept to guide nature conservation in fragmented and degraded landscapes," Ecological Modelling, Elsevier, vol. 481(C).
    17. Park, Beomjin & Park, Changyi, 2023. "Multiclass Laplacian support vector machine with functional analysis of variance decomposition," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    18. Manuel J. García Rodríguez & Vicente Rodríguez Montequín & Francisco Ortega Fernández & Joaquín M. Villanueva Balsera, 2019. "Public Procurement Announcements in Spain: Regulations, Data Analysis, and Award Price Estimator Using Machine Learning," Complexity, Hindawi, vol. 2019, pages 1-20, November.
    19. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    20. F. Leung & M. Law & S. K. Djeng, 2024. "Deterministic modelling of implied volatility in cryptocurrency options with underlying multiple resolution momentum indicator and non-linear machine learning regression algorithm," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-25, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:6:y:2021:i:12:p:129-:d:694430. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.