Author
Listed:
- Jaclyn M Hall
- Jie Xu
- Marta G Walsh
- Hee-Deok Cho
- Grant Harrell
- Shailina A Keshwani
- Steven M Smith
- Stephanie A S Staras
Abstract
Background: Hypertension (HTN) is a complex condition with significant heterogeneity in presentation and treatment response. Identifying distinct subphenotypes of HTN may improve our understanding of its underlying mechanisms and guide more precise treatment or public health initiatives. Methods: Using EHR and Medicaid claims data from the OneFlorida+ research consortium (2012–2021), we identified a cohort of adult Floridians with newly diagnosed HTN (first diagnosis following two outpatient blood pressures ≥140/90 mmHg & no prior anti-HTN treatment). We extracted demographic and clinical data from the diagnosis visit and ≤1 year prior. We used hierarchical clustering (unsupervised machine learning) to identify distinct subphenotypes within the OneFlorida+ HTN population. Results: A total of 40,686 patients were included (mean ± SD age, 60.9 ± 17.5 y; 55% women). Five subphenotypes (S1-5) were identified. S1 was characterized by older age, higher Body Mass Index (BMI), and prevalent type 2 diabetes. S2 included over 50% of Black patients who were primarily women, younger, with higher BMI, but living in communities with higher levels of socioeconomic vulnerabilities. S3 contained a higher percentage of Hispanic patients with comparatively lower BMI. S4 is characterized by higher age and co-morbidities. S5 had 94% of patients with chronic kidney disease. Distinctions in social determinants of health factors were also observed. Conclusions: Unsupervised learning identified 5 HTN subphenotypes varying in demographic, socioeconomic, and risk profiles. Further investigation into the biological mechanisms of these subphenotypes and the relationships to social factors may enhance our ability to deliver targeted interventions that consider social policy implications in addition to the traditional behavioral and physiological interventions.
Suggested Citation
Jaclyn M Hall & Jie Xu & Marta G Walsh & Hee-Deok Cho & Grant Harrell & Shailina A Keshwani & Steven M Smith & Stephanie A S Staras, 2025.
"Unsupervised learning using EHR and census data to identify distinct subphenotypes of newly diagnosed hypertension patients,"
PLOS ONE, Public Library of Science, vol. 20(7), pages 1-14, July.
Handle:
RePEc:plo:pone00:0326776
DOI: 10.1371/journal.pone.0326776
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0326776. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.