IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i14p2287-d1702968.html
   My bibliography  Save this article

Robust Variable Selection via Bayesian LASSO-Composite Quantile Regression with Empirical Likelihood: A Hybrid Sampling Approach

Author

Listed:
  • Ruisi Nan

    (School of Science, Hubei University of Technology, Wuhan 430068, China)

  • Jingwei Wang

    (School of Science, Hubei University of Technology, Wuhan 430068, China)

  • Hanfang Li

    (School of Science, Hubei University of Technology, Wuhan 430068, China)

  • Youxi Luo

    (School of Science, Hubei University of Technology, Wuhan 430068, China)

Abstract

Since the advent of composite quantile regression (CQR), its inherent robustness has established it as a pivotal methodology for high-dimensional data analysis. High-dimensional outlier contamination refers to data scenarios where the number of observed dimensions ( p ) is much greater than the sample size ( n ) and there are extreme outliers in the response variables or covariates (e.g., p / n > 0.1). Traditional penalized regression techniques, however, exhibit notable vulnerability to data outliers during high-dimensional variable selection, often leading to biased parameter estimates and compromised resilience. To address this critical limitation, we propose a novel empirical likelihood (EL)-based variable selection framework that integrates a Bayesian LASSO penalty within the composite quantile regression framework. By constructing a hybrid sampling mechanism that incorporates the Expectation–Maximization (EM) algorithm and Metropolis–Hastings (M-H) algorithm within the Gibbs sampling scheme, this approach effectively tackles variable selection in high-dimensional settings with outlier contamination. This innovative design enables simultaneous optimization of regression coefficients and penalty parameters, circumventing the need for ad hoc selection of optimal penalty parameters—a long-standing challenge in conventional LASSO estimation. Moreover, the proposed method imposes no restrictive assumptions on the distribution of random errors in the model. Through Monte Carlo simulations under outlier interference and empirical analysis of two U.S. house price datasets, we demonstrate that the new approach significantly enhances variable selection accuracy, reduces estimation bias for key regression coefficients, and exhibits robust resistance to data outlier contamination.

Suggested Citation

  • Ruisi Nan & Jingwei Wang & Hanfang Li & Youxi Luo, 2025. "Robust Variable Selection via Bayesian LASSO-Composite Quantile Regression with Empirical Likelihood: A Hybrid Sampling Approach," Mathematics, MDPI, vol. 13(14), pages 1-20, July.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:14:p:2287-:d:1702968
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/14/2287/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/14/2287/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Kai-Tai Fang & Rahul Mukerjee, 2006. "Empirical-type likelihoods allowing posterior credible sets with frequentist validity: Higher-order asymptotics," Biometrika, Biometrika Trust, vol. 93(3), pages 723-733, September.
    3. Mahdieh Bayati & Seyed Kamran Ghoreishi & Jingjing Wu, 2021. "Bayesian analysis of restricted penalized empirical likelihood," Computational Statistics, Springer, vol. 36(2), pages 1321-1339, June.
    4. Bedoui, Adel & Lazar, Nicole A., 2020. "Bayesian empirical likelihood for ridge and lasso regressions," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    5. Puying Zhao & Malay Ghosh & J. N. K. Rao & Changbao Wu, 2020. "Bayesian empirical likelihood inference with complex survey data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(1), pages 155-174, February.
    6. Sanjay Chaudhuri & Debashis Mondal & Teng Yin, 2017. "Hamiltonian Monte Carlo sampling in Bayesian empirical likelihood computation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 293-320, January.
    7. Huilan Liu & Hu Yang & Changgen Peng, 2019. "Weighted composite quantile regression for single index model with missing covariates at random," Computational Statistics, Springer, vol. 34(4), pages 1711-1740, December.
    8. Nardi, Y. & Rinaldo, A., 2011. "Autoregressive process modeling via the Lasso procedure," Journal of Multivariate Analysis, Elsevier, vol. 102(3), pages 528-549, March.
    9. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    10. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    11. Taddy, Matthew A. & Kottas, Athanasios, 2010. "A Bayesian Nonparametric Approach to Inference for Quantile Regression," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(3), pages 357-369.
    12. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcelo C. Medeiros & Eduardo F. Mendes, 2015. "l1-Regularization of High-Dimensional Time-Series Models with Flexible Innovations," Textos para discussão 636, Department of Economics PUC-Rio (Brazil).
    2. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    3. Kai Yang & Xue Ding & Xiaohui Yuan, 2022. "Bayesian empirical likelihood inference and order shrinkage for autoregressive models," Statistical Papers, Springer, vol. 63(1), pages 97-121, February.
    4. Dengluan Dai & Anmin Tang & Jinli Ye, 2023. "High-Dimensional Variable Selection for Quantile Regression Based on Variational Bayesian Method," Mathematics, MDPI, vol. 11(10), pages 1-22, May.
    5. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    6. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    7. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    8. Bartosz Uniejewski, 2024. "Regularization for electricity price forecasting," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 34(3), pages 267-286.
    9. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    10. Capanu, Marinela & Giurcanu, Mihai & Begg, Colin B. & Gönen, Mithat, 2023. "Subsampling based variable selection for generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 184(C).
    11. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    12. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    13. Zeyu Bian & Erica E. M. Moodie & Susan M. Shortreed & Sahir Bhatnagar, 2023. "Variable selection in regression‐based estimation of dynamic treatment regimes," Biometrics, The International Biometric Society, vol. 79(2), pages 988-999, June.
    14. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    15. Christidis, Anthony-Alexander & Van Aelst, Stefan & Zamar, Ruben, 2025. "Multi-model subset selection," Computational Statistics & Data Analysis, Elsevier, vol. 203(C).
    16. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.
    17. Yongjin Li & Qingzhao Zhang & Qihua Wang, 2017. "Penalized estimation equation for an extended single-index model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 169-187, February.
    18. Yang, Yanlin & Hu, Xuemei & Jiang, Huifeng, 2022. "Group penalized logistic regressions predict up and down trends for stock prices," The North American Journal of Economics and Finance, Elsevier, vol. 59(C).
    19. Zhang, Tonglin, 2024. "Variables selection using L0 penalty," Computational Statistics & Data Analysis, Elsevier, vol. 190(C).
    20. Takumi Saegusa & Tianzhou Ma & Gang Li & Ying Qing Chen & Mei-Ling Ting Lee, 2020. "Variable Selection in Threshold Regression Model with Applications to HIV Drug Adherence Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 376-398, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:14:p:2287-:d:1702968. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.