IDEAS home Printed from https://ideas.repec.org/a/plo/pntd00/0012026.html
   My bibliography  Save this article

Machine learning for predicting Chagas disease infection in rural areas of Brazil

Author

Listed:
  • Fabio De Rose Ghilardi
  • Gabriel Silva
  • Thallyta Maria Vieira
  • Ariela Mota
  • Ana Luiza Bierrenbach
  • Renata Fiuza Damasceno
  • Lea Campos de Oliveira
  • Alexandre Dias Porto Chiavegatto Filho
  • Ester Sabino

Abstract

Introduction: Chagas disease is a severe parasitic illness that is prevalent in Latin America and often goes unaddressed. Early detection and treatment are critical in preventing the progression of the illness and its associated life-threatening complications. In recent years, machine learning algorithms have emerged as powerful tools for disease prediction and diagnosis. Methods: In this study, we developed machine learning algorithms to predict the risk of Chagas disease based on five general factors: age, gender, history of living in a mud or wooden house, history of being bitten by a triatomine bug, and family history of Chagas disease. We analyzed data from the Retrovirus Epidemiology Donor Study (REDS) to train five popular machine learning algorithms. The sample comprised 2,006 patients, divided into 75% for training and 25% for testing algorithm performance. We evaluated the model performance using precision, recall, and AUC-ROC metrics. Results: The Adaboost algorithm yielded an AUC-ROC of 0.772, a precision of 0.199, and a recall of 0.612. We simulated the decision boundary using various thresholds and observed that in this dataset a threshold of 0.45 resulted in a 100% recall. This finding suggests that employing such a threshold could potentially save 22.5% of the cost associated with mass testing of Chagas disease. Conclusion: Our findings highlight the potential of applying machine learning to improve the sensitivity and effectiveness of Chagas disease diagnosis and prevention. Furthermore, we emphasize the importance of integrating socio-demographic and environmental factors into neglected disease prediction models to enhance their performance. Author summary: Chagas disease, a severe parasitic illness prevalent in Latin America, poses significant challenges due to delayed detection and treatment. Machine learning algorithms, advanced computer programs, have emerged as valuable tools for disease prediction and diagnosis. In our study, we utilized these algorithms to forecast Chagas disease risk based on factors such as age, gender, and living conditions. Drawing on data from the Retrovirus Epidemiology Donor Study (REDS), we trained five algorithms, with one showing promising results, achieving an impressive score of 0.772 out of 1. By establishing a specific threshold, we could potentially reduce testing costs while maintaining high detection rates. This research highlights the potential of machine learning in improving Chagas disease diagnosis and prevention by incorporating socio-demographic and environmental factors. Integrating these elements into predictive models has the potential to enhance their effectiveness and sensitivity, thereby improving disease management outcomes and ultimately reducing the burden of Chagas disease in affected regions.

Suggested Citation

  • Fabio De Rose Ghilardi & Gabriel Silva & Thallyta Maria Vieira & Ariela Mota & Ana Luiza Bierrenbach & Renata Fiuza Damasceno & Lea Campos de Oliveira & Alexandre Dias Porto Chiavegatto Filho & Ester , 2024. "Machine learning for predicting Chagas disease infection in rural areas of Brazil," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 18(4), pages 1-11, April.
  • Handle: RePEc:plo:pntd00:0012026
    DOI: 10.1371/journal.pntd.0012026
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0012026
    Download Restriction: no

    File URL: https://journals.plos.org/plosntds/article/file?id=10.1371/journal.pntd.0012026&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pntd.0012026?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pntd00:0012026. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosntds (email available below). General contact details of provider: https://journals.plos.org/plosntds/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.