Author
Listed:
- Oliver Mendoza-Cano
- Xóchitl Trujillo
- Miguel Huerta
- Mónica Ríos-Silva
- Agustin Lugo-Radillo
- Jaime Alberto Bricio-Barrios
- Verónica Benites-Godínez
- Herguin Benjamin Cuevas-Arellano
- Juan Manuel Uribe-Ramos
- Ramón Solano-Barajas
- Jesús Venegas-Ramírez
- Eder Fernando Ríos-Bracamontes
- Luis A García-Solórzano
- Arlette A Camacho-delaCruz
- Efrén Murillo-Zamora
Abstract
Uncontrolled hypertension (HTN) increases the risk of adverse health events. This study aimed to identify key predictors of uncontrolled HTN in 1,308 Mexican adults with a prior diagnosis of HTN who were undergoing pharmacological treatment. We utilized data from the 2022 National Health and Nutrition Survey and applied data-driven algorithms within an artificial intelligence framework to enhance predictive accuracy and interpretability. Specifically, we integrated Random Forest, XGBoost, LASSO regression, and SHAP analysis. Uncontrolled HTN was defined as systolic blood pressure ≥ 130 mmHg or diastolic blood pressure ≥ 80 mmHg based on two readings. We applied LASSO regression to exclude unrelated factors and trained Random Forest and XGBoost algorithms to identify the most important predictors. The key contributors to model accuracy in Random Forest were years since HTN diagnosis (11.9), age (9.4), and source of medical care (4.6), while SHAP analysis in XGBoost further highlighted age (0.115) and source of medical care (0.065) as significant factors. When compared to a traditional logistic regression model, the data-driven approach demonstrated superior predictive performance, with Random Forest achieving an AUC of 0.75 (95% CI 0.72–0.77) versus logistic regression (AUC = 0.61, 95% CI 0.59–0.64). XGBoost exhibited lower predictive capacity (AUC = 0.54, 95% CI 0.49–0.60). These findings underscore the importance of age, duration since diagnosis, and source of medical care in predicting uncontrolled HTN. If replicated, this evidence can inform public health strategies to better target at-risk populations and optimize HTN management through data-driven interventions.
Suggested Citation
Oliver Mendoza-Cano & Xóchitl Trujillo & Miguel Huerta & Mónica Ríos-Silva & Agustin Lugo-Radillo & Jaime Alberto Bricio-Barrios & Verónica Benites-Godínez & Herguin Benjamin Cuevas-Arellano & Juan Ma, 2025.
"Data-driven identification of key predictors of uncontrolled hypertension: A cross-sectional study,"
PLOS ONE, Public Library of Science, vol. 20(9), pages 1-13, September.
Handle:
RePEc:plo:pone00:0331565
DOI: 10.1371/journal.pone.0331565
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0331565. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.