IDEAS home Printed from https://ideas.repec.org/a/plo/pntd00/0013618.html
   My bibliography  Save this article

Unraveling the drivers of leptospirosis risk in Thailand using machine learning

Author

Listed:
  • Pikkanet Suttirat
  • Sudarat Chadsuthi
  • Charin Modchang
  • Joacim Rocklöv

Abstract

Leptospirosis poses a significant public health challenge in Thailand, driven by a complex mix of environmental and socioeconomic factors. This study develops an XGBoost machine learning model to predict leptospirosis outbreak risk at the provincial level in Thailand, integrating climatic, socioeconomic, and agricultural features. Using national surveillance data from 2007-2022, the model was trained to classify provinces as high or low risk based on the median incidence rate. The model’s predictive performance was validated for the years 2018–2022, spanning pre-COVID-19, COVID-19, and post-COVID-19 periods. SHapley Additive exPlanation (SHAP) analysis was employed to identify key predictive factors. The optimized XGBoost model achieved high predictive accuracy for the pre-pandemic (AUC = 0.937 with 95% CI: 0.878 – 0.976) and post-pandemic (AUC = 0.951 with 95% CI: 0.861 – 0.999) testing periods. SHAP analysis revealed rice production factors, household size, and specific climatic variables as the strongest predictors of leptospirosis risk. However, model performance declined during the COVID-19 pandemic (2020–2021), suggesting surveillance disruption and potential underreporting. This study demonstrates the utility of machine learning for predicting leptospirosis risk in Thailand and highlights the complex interplay of environmental and socioeconomic factors in driving outbreaks. The adaptable modeling framework provides a foundation for developing early warning systems and targeted interventions to reduce the burden of this neglected tropical disease.Author summary: Leptospirosis, a disease caused by Leptospira bacteria, poses a significant public health challenge in Thailand. The bacteria thrive in contaminated environments, particularly those associated with rice farming. In this study, we developed a machine learning model to predict the risk of leptospirosis outbreaks in Thailand based on climatic, socioeconomic, and agricultural factors. Our analysis revealed that rice production practices, household size, and specific climatic variables were the strongest predictors of leptospirosis risk. We also observed a reduction in model performance during the COVID-19 pandemic, suggesting surveillance disruptions and potential underreporting. These findings highlight and explain the complex interplay of environmental and socioeconomic factors in driving leptospirosis outbreaks. Our adaptable modeling framework provides a foundation for developing early warning systems and targeted interventions to reduce the burden of this often-overlooked tropical disease. Better understanding the factors that contribute to leptospirosis risk can guide responses to protecting vulnerable populations and improving public health outcomes in Thailand and beyond in times of socio-environmental changes.

Suggested Citation

  • Pikkanet Suttirat & Sudarat Chadsuthi & Charin Modchang & Joacim Rocklöv, 2025. "Unraveling the drivers of leptospirosis risk in Thailand using machine learning," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 19(10), pages 1-16, October.
  • Handle: RePEc:plo:pntd00:0013618
    DOI: 10.1371/journal.pntd.0013618
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0013618
    Download Restriction: no

    File URL: https://journals.plos.org/plosntds/article/file?id=10.1371/journal.pntd.0013618&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pntd.0013618?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:plo:pntd00:0003898 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pntd00:0013618. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosntds (email available below). General contact details of provider: https://journals.plos.org/plosntds/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.