IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0330899.html
   My bibliography  Save this article

Optimizing ensemble machine learning models for accurate liver disease prediction in healthcare

Author

Listed:
  • W El Atifi
  • O El Rhazouani
  • Fida Muhammad Khan
  • H Sekkat

Abstract

Liver disease encompasses a range of conditions affecting the liver, including hepatitis, cirrhosis, fatty liver, and liver cancer. It can be caused by infections, alcohol abuse, obesity, or genetic factors, and it often progresses silently until advanced stages. Early detection and lifestyle adjustments are essential for effective management and to prevent severe liver damage. This study explores the application of machine learning (ML) techniques to predict liver disease, leveraging a dataset to compare the performance of several ensemble classifiers. The algorithms include the Random Forrest Classifier, Ada Boost Classifier, and Gradient Boosting Classifier. After a series of feature extraction and selection, hyperparameter tuning by Randomized Search CV and GridSearchCV, we aimed to determine the best model for liver disease prediction in terms of accuracy, precision, recall, and F1-score. The results showed that the Random Forest Classifier, optimized with GridSearchCV, achieved the highest accuracy at just over 85.17%. The considerations presented in this classifier can be considered for potential use as a precise diagnostic tool for liver disease diagnostics as these measurements indicate that this classifier works balanced with precision at 0.85 for both the presence and absence of the given disease as well as recall of 0.81 for its presence and 0.87 for its absence and F1-measure of 0.83 and 0.85 respectively. There were also relatively high performances of AdaBoost Classifier and Gradient Boosting Classifier, though none of the classifiers outperformed Random Forest Classifier significantly. The research has shown the potential of ensemble ML techniques, especially in the diagnosis of medical conditions, including liver diseases which, if diagnosed early, are critical. The results add evidence regarding the applicability of the ML models in clinical practices with the potential to improve diagnostic activities and consequently the outcomes of patients. Future studies will build on these models, testing them on larger and more diverse sets of data, including aspects of deep learning, and apply the research to other disease domains. The work presented in this research offers a starting point for carrying out innovations with ML in the sphere of healthcare to progress the methods of diagnosing diseases and treatment.

Suggested Citation

  • W El Atifi & O El Rhazouani & Fida Muhammad Khan & H Sekkat, 2025. "Optimizing ensemble machine learning models for accurate liver disease prediction in healthcare," PLOS ONE, Public Library of Science, vol. 20(8), pages 1-20, August.
  • Handle: RePEc:plo:pone00:0330899
    DOI: 10.1371/journal.pone.0330899
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0330899
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0330899&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0330899?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0330899. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.