Authors Listed:
- Xi Lu
(Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX 77004, USA;
Population Health Outcomes and Pharmacoepidemiology Education and Research Center (P-HOPER Center), University of Houston, Houston, TX 77004, USA)
- Jieni Li
(Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX 77004, USA;
Population Health Outcomes and Pharmacoepidemiology Education and Research Center (P-HOPER Center), University of Houston, Houston, TX 77004, USA)
- Rajender R. Aparasu
(Department of Pharmaceutical Health Outcomes and Policy, College of Pharmacy, University of Houston, Houston, TX 77004, USA;
Population Health Outcomes and Pharmacoepidemiology Education and Research Center (P-HOPER Center), University of Houston, Houston, TX 77004, USA)
- Nebil Yusuf
(Department of Computer Science, University of Houston, Houston, TX 77004, USA)
- Cen Wu
(Department of Statistics, Kansas State University, Manhattan, KS 66506, USA)
Abstract
There is growing interest in applying statistical machine learning methods, such as LASSO regression and its extensions, to healthcare datasets. Existing studies have examined LASSO and group LASSO regression with categorical predictors, which are widely used in healthcare studies to represent variables with nominal or ordinal categories. Despite the success of these studies, statistical inference and uncertainty quantification for regression with categorical predictors have largely been overlooked, partly due to the theoretical challenges practitioners face when applying these methods in behavioral research. In this article, we aim to fill this gap from a Bayesian perspective. Specifically, we conduct Bayesian LASSO analysis with categorical predictors and thoroughly investigate the impact of four representative coding strategies on variable selection and prediction. In particular, we quantify uncertainty through marginal Bayesian credible intervals, leveraging the fact that fully Bayesian analysis enables exact statistical inference even in finite samples. Using real-world Medical Expenditure Panel Survey (MEPS) data, we demonstrate that the variable selection, estimation, and prediction of Bayesian LASSO are influenced by the coding strategy. The performance of Bayesian LASSO is also compared with that of LASSO and linear regression.
Suggested Citation
Xi Lu & Jieni Li & Rajender R. Aparasu & Nebil Yusuf & Cen Wu, 2025.
"Bayesian LASSO with Categorical Predictors: Coding Strategies, Uncertainty Quantification, and Healthcare Applications,"
Forecasting, MDPI, vol. 7(4), pages 1-27, November.
Handle:
RePEc:gam:jforec:v:7:y:2025:i:4:p:69-:d:1799735