IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0310218.html
   My bibliography  Save this article

Unveiling diabetes onset: Optimized XGBoost with Bayesian optimization for enhanced prediction

Author

Listed:
  • Muhammad Rizwan Khurshid
  • Sadaf Manzoor
  • Touseef Sadiq
  • Lal Hussain
  • Mohammed Shahbaz Khan
  • Ashit Kumar Dutta

Abstract

Diabetes, a chronic condition affecting millions worldwide, necessitates early intervention to prevent severe complications. While accurately predicting diabetes onset or progression remains challenging due to complex and imbalanced datasets, recent advancements in machine learning offer potential solutions. Traditional prediction models, often limited by default parameters, have been superseded by more sophisticated approaches. Leveraging Bayesian optimization to fine-tune XGBoost, researchers can harness the power of complex data analysis to improve predictive accuracy. By identifying key factors influencing diabetes risk, personalized prevention strategies can be developed, ultimately enhancing patient outcomes. Successful implementation requires meticulous data management, stringent ethical considerations, and seamless integration into healthcare systems. This study focused on optimizing the hyperparameters of an XGBoost ensemble machine learning model using Bayesian optimization. Compared to grid search XGBoost (accuracy: 97.24%, F1-score: 95.72%, MCC: 81.02%), the XGBoost with Bayesian optimization achieved slightly improved performance (accuracy: 97.26%, F1-score: 95.72%, MCC:81.18%). Although the improvements observed in this study are modest, the optimized XGBoost model with Bayesian optimization represents a promising step towards revolutionizing diabetes prevention and treatment. This approach holds significant potential to improve outcomes for individuals at risk of developing diabetes.

Suggested Citation

  • Muhammad Rizwan Khurshid & Sadaf Manzoor & Touseef Sadiq & Lal Hussain & Mohammed Shahbaz Khan & Ashit Kumar Dutta, 2025. "Unveiling diabetes onset: Optimized XGBoost with Bayesian optimization for enhanced prediction," PLOS ONE, Public Library of Science, vol. 20(1), pages 1-29, January.
  • Handle: RePEc:plo:pone00:0310218
    DOI: 10.1371/journal.pone.0310218
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0310218
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0310218&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0310218?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0310218. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.