IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0268327.html
   My bibliography  Save this article

A causal learning framework for the analysis and interpretation of COVID-19 clinical data

Author

Listed:
  • Elisa Ferrari
  • Luna Gargani
  • Greta Barbieri
  • Lorenzo Ghiadoni
  • Francesco Faita
  • Davide Bacciu

Abstract

We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph showing the causal connections among the analyzed features. The workflow consists in a multi-step approach that goes from identifying the main causes of patient’s outcome through BSL, to the realization of a tool suitable for clinical practice, based on a Binary Decision Tree (BDT), to recognize patients at high-risk with information available already at hospital admission time. We evaluate our approach on a feature-rich dataset of Coronavirus disease (COVID-19), showing that the proposed framework provides a schematic overview of the multi-factorial processes that jointly contribute to the outcome. We compare our findings with current literature on COVID-19, showing that this approach allows to re-discover established cause-effect relationships about the disease. Further, our approach yields to a highly interpretable tool correctly predicting the outcome of 85% of subjects based exclusively on 3 features: age, a previous history of chronic obstructive pulmonary disease and the PaO2/FiO2 ratio at the time of arrival to the hospital. The inclusion of additional information from 4 routine blood tests (Creatinine, Glucose, pO2 and Sodium) increases predictive accuracy to 94.5%.

Suggested Citation

  • Elisa Ferrari & Luna Gargani & Greta Barbieri & Lorenzo Ghiadoni & Francesco Faita & Davide Bacciu, 2022. "A causal learning framework for the analysis and interpretation of COVID-19 clinical data," PLOS ONE, Public Library of Science, vol. 17(5), pages 1-21, May.
  • Handle: RePEc:plo:pone00:0268327
    DOI: 10.1371/journal.pone.0268327
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0268327
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0268327&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0268327?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jonathan G. Richens & Ciarán M. Lee & Saurabh Johri, 2020. "Publisher Correction: Improving the accuracy of medical diagnosis with causal machine learning," Nature Communications, Nature, vol. 11(1), pages 1-1, December.
    2. Davide Bacciu & Terence Etchells & Paulo Lisboa & Joe Whittaker, 2013. "Efficient identification of independence networks using mutual information," Computational Statistics, Springer, vol. 28(2), pages 621-646, April.
    3. Uusitalo, Laura, 2007. "Advantages and challenges of Bayesian networks in environmental modelling," Ecological Modelling, Elsevier, vol. 203(3), pages 312-318.
    4. Stefano Beretta & Mauro Castelli & Ivo Gonçalves & Roberto Henriques & Daniele Ramazzotti, 2018. "Learning the Structure of Bayesian Networks: A Quantitative Assessment of the Effect of Different Algorithmic Schemes," Complexity, Hindawi, vol. 2018, pages 1-12, September.
    5. Jonathan G. Richens & Ciarán M. Lee & Saurabh Johri, 2020. "Improving the accuracy of medical diagnosis with causal machine learning," Nature Communications, Nature, vol. 11(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seou Choi & Yannick Salamin & Charles Roques-Carmes & Rumen Dangovski & Di Luo & Zhuo Chen & Michael Horodynski & Jamison Sloan & Shiekh Zia Uddin & Marin Soljačić, 2024. "Photonic probabilistic machine learning using quantum vacuum noise," Nature Communications, Nature, vol. 15(1), pages 1-8, December.
    2. Alexander Lavin & Ciarán M. Gilligan-Lee & Alessya Visnjic & Siddha Ganju & Dava Newman & Sujoy Ganguly & Danny Lange & Atílím Güneş Baydin & Amit Sharma & Adam Gibson & Stephan Zheng & Eric P. Xing &, 2022. "Technology readiness levels for machine learning systems," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    3. Katerina Rigana & Ernst C. Wit & Samantha Cook, 2024. "Navigating Market Turbulence: Insights from Causal Network Contagion Value at Risk," Papers 2402.06032, arXiv.org.
    4. Shilin Zheng & Mengdan Li, 2022. "Does aggressive tweeting by the government help to control the COVID‐19 outbreak? Evidence from China," Economics of Transition and Institutional Change, John Wiley & Sons, vol. 30(4), pages 691-713, October.
    5. Raafat M Munshi, 2024. "Novel ensemble learning approach with SVM-imputed ADASYN features for enhanced cervical cancer prediction," PLOS ONE, Public Library of Science, vol. 19(1), pages 1-20, January.
    6. Bangfeng Wang & Yiwei Li & Mengfan Zhou & Yulong Han & Mingyu Zhang & Zhaolong Gao & Zetai Liu & Peng Chen & Wei Du & Xingcai Zhang & Xiaojun Feng & Bi-Feng Liu, 2023. "Smartphone-based platforms implementing microfluidic detection with image-based artificial intelligence," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    7. Forney Andrew & Mueller Scott, 2022. "Causal inference in AI education: A primer," Journal of Causal Inference, De Gruyter, vol. 10(1), pages 141-173, January.
    8. Di Zhang & Xinping Yan & Zaili Yang & Jin Wang, 2014. "An accident data–based approach for congestion risk assessment of inland waterways: A Yangtze River case," Journal of Risk and Reliability, , vol. 228(2), pages 176-188, April.
    9. Zhang, Quanzhong & Wei, Haiyan & Liu, Jing & Zhao, Zefang & Ran, Qiao & Gu, Wei, 2021. "A Bayesian network with fuzzy mathematics for species habitat suitability analysis: A case with limited Angelica sinensis (Oliv.) Diels data," Ecological Modelling, Elsevier, vol. 450(C).
    10. Jim Lewis & Kerrie Mengersen & Laurie Buys & Desley Vine & John Bell & Peter Morris & Gerard Ledwich, 2015. "Systems Modelling of the Socio-Technical Aspects of Residential Electricity Use and Network Peak Demand," PLOS ONE, Public Library of Science, vol. 10(7), pages 1-21, July.
    11. Nicholson, Ann E. & Flores, M. Julia, 2011. "Combining state and transition models with dynamic Bayesian networks," Ecological Modelling, Elsevier, vol. 222(3), pages 555-566.
    12. Moe, S. Jannicke & Haande, Sigrid & Couture, Raoul-Marie, 2016. "Climate change, cyanobacteria blooms and ecological status of lakes: A Bayesian network approach," Ecological Modelling, Elsevier, vol. 337(C), pages 330-347.
    13. Meineri, Eric & Dahlberg, C. Johan & Hylander, Kristoffer, 2015. "Using Gaussian Bayesian Networks to disentangle direct and indirect associations between landscape physiography, environmental variables and species distribution," Ecological Modelling, Elsevier, vol. 313(C), pages 127-136.
    14. Mostafa Shaaban & Carmen Schwartz & Joseph Macpherson & Annette Piorr, 2021. "A Conceptual Model Framework for Mapping, Analyzing and Managing Supply–Demand Mismatches of Ecosystem Services in Agricultural Landscapes," Land, MDPI, vol. 10(2), pages 1-19, January.
    15. De Iuliis, Melissa & Kammouh, Omar & Cimellaro, Gian Paolo & Tesfamariam, Solomon, 2021. "Quantifying restoration time of power and telecommunication lifelines after earthquakes using Bayesian belief network model," Reliability Engineering and System Safety, Elsevier, vol. 208(C).
    16. Dayong Li & Zengchuan Dong & Liyao Shi & Jintao Liu & Zhenye Zhu & Wei Xu, 2019. "Risk Probability Assessment of Sudden Water Pollution in the Plain River Network Based on Random Discharge from Multiple Risk Sources," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(12), pages 4051-4065, September.
    17. Tiller, Rachel Gjelsvik & Hansen, Lillian & Richards, Russell & Strand, Hillevi, 2015. "Work segmentation in the Norwegian salmon industry: The application of segmented labor market theory to work migrants on the island community of Frøya, Norway," Marine Policy, Elsevier, vol. 51(C), pages 563-572.
    18. Leonel Lara-Estrada & Livia Rasche & L. Enrique Sucar & Uwe A. Schneider, 2018. "Inferring Missing Climate Data for Agricultural Planning Using Bayesian Networks," Land, MDPI, vol. 7(1), pages 1-13, January.
    19. Jinjia Zhang & Yiping Zeng & Genserik Reniers & Jie Liu, 2022. "Analysis of the Interaction Mechanism of the Risk Factors of Gas Explosions in Chinese Underground Coal Mines," IJERPH, MDPI, vol. 19(2), pages 1-18, January.
    20. Nguyen, Minh-Hoang, 2023. "Investigating urban residents' involvement in biodiversity conservation in protected areas: Empirical evidence from Vietnam," Thesis Commons z2hjv, Center for Open Science.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0268327. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.