IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0341436.html

Developing count regression techniques for predicting the number of new type 2 diabetes cases in Saudi Arabia

Author

Listed:
  • Faten Al-hussein
  • Laleh Tafakori
  • Mali Abdollahian
  • Khalid Al-Shali

Abstract

Type 2 diabetes (T2D) is a chronic condition affecting millions globally. A robust predictive model to estimate the number of new cases of T2D can facilitate precise monitoring and effective intervention strategies. This study aims to predict the number of new T2D cases per month in Saudi Arabia and identify the Key Performance Indicators (KPIs) associated with T2D, using count regression models, Poisson Regression (PR), Negative Binomial Regression (NBR), Poisson Inverse Gaussian Regression (PIGR), and Bell Regression (BR). De-identified data from 1,000 patients with T2D in Saudi Arabia were used to develop the models. The performance of the full models, which include recommended Key Performance Indicators (KPIs), is compared using metrics such as the coefficient of determination (R2), root mean squared error (RMSE), mean absolute error (MAE), 10-fold cross-validation (CV-10), Akaike information criterion (AIC), and Bayesian information criterion (BIC). The most significant KPIs identified by the full models were utilized to develop the reduced models. The full NBR model outperformed other models, achieving R² of 0.88, RMSE of 0.93, MAE of 0.69, CV-10 of 1.21, AIC = 873.23, and BIC = 880. The reduced NBR model, focusing solely on the five most influential variables (marital status, age, body mass index (BMI), total cholesterol (TC), and high-density lipoprotein (HDL)), with R² = 0.84, RMSE = 1.10, MAE = 0.86, CV-10 = 1.37, AIC = 899, and BIC = 910, also outperformed other reduced models. The Likelihood Ratio Test (LRT) did not show a significant difference between the full and reduced NBR models (p = 0.694), supporting the adequacy of the reduced model. The proposed reduced model, utilizing only five significant KPIs, can help healthcare providers develop effective, targeted strategies by monitoring a smaller number of KPIs to reduce the rising number of T2D cases in Saudi Arabia.

Suggested Citation

  • Faten Al-hussein & Laleh Tafakori & Mali Abdollahian & Khalid Al-Shali, 2026. "Developing count regression techniques for predicting the number of new type 2 diabetes cases in Saudi Arabia," PLOS ONE, Public Library of Science, vol. 21(1), pages 1-21, January.
  • Handle: RePEc:plo:pone00:0341436
    DOI: 10.1371/journal.pone.0341436
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0341436
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0341436&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0341436?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dexter Cahoy & Elvira Di Nardo & Federico Polito, 2021. "Flexible models for overdispersed and underdispersed count data," Statistical Papers, Springer, vol. 62(6), pages 2969-2990, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Célestin C. Kokonendji & Sobom M. Somé & Youssef Esstafa & Marcelo Bourguignon, 2023. "On Underdispersed Count Kernels for Smoothing Probability Mass Functions," Stats, MDPI, vol. 6(4), pages 1-15, November.
    2. Seng Huat Ong & Shin Zhu Sim & Shuangzhe Liu & Hari M. Srivastava, 2023. "A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data," Stats, MDPI, vol. 6(3), pages 1-14, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0341436. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.