IDEAS home Printed from https://ideas.repec.org/a/spr/alstar/v106y2022i1d10.1007_s10182-021-00403-x.html
   My bibliography  Save this article

Variable selection and collinearity processing for multivariate data via row-elastic-net regularization

Author

Listed:
  • Bingzhen Chen

    (Renmin University of China)

  • Wenjuan Zhai

    (Cangzhou Jiaotong College)

  • Lingchen Kong

    (Beijing Jiaotong University)

Abstract

Multivariate data is collected in many fields, such as chemometrics, econometrics, financial engineering and genetics. In multivariate data, heteroscedasticity and collinearity occur frequently. And selecting material predictors is also a key issue when analyzing multivariate data. To accomplish these tasks, multivariate linear regression model is often constructed. We thus propose row-sparse elastic-net regularized multivariate Huber regression model in this paper. For this new model, we proof its grouping effect property and the property of resisting sample outliers. Based on the KKT condition, an accelerated proximal sub-gradient algorithm is designed to solve the proposed model and its convergency is also established. To demonstrate the accuracy and efficiency, simulation and real data experiments are carried out. The numerical results show that the new model can deal with heteroscedasticity and collinearity well.

Suggested Citation

  • Bingzhen Chen & Wenjuan Zhai & Lingchen Kong, 2022. "Variable selection and collinearity processing for multivariate data via row-elastic-net regularization," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(1), pages 79-96, March.
  • Handle: RePEc:spr:alstar:v:106:y:2022:i:1:d:10.1007_s10182-021-00403-x
    DOI: 10.1007/s10182-021-00403-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10182-021-00403-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10182-021-00403-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    2. Simila, Timo & Tikka, Jarkko, 2007. "Input selection and shrinkage in multiresponse linear regression," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 406-422, September.
    3. Jordi Barretina & Giordano Caponigro & Nicolas Stransky & Kavitha Venkatesan & Adam A. Margolin & Sungjoon Kim & Christopher J.Wilson & Joseph Lehár & Gregory V. Kryukov & Dmitriy Sonkin & Anupama Red, 2012. "Addendum: The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity," Nature, Nature, vol. 492(7428), pages 290-290, December.
    4. Jordi Barretina & Giordano Caponigro & Nicolas Stransky & Kavitha Venkatesan & Adam A. Margolin & Sungjoon Kim & Christopher J. Wilson & Joseph Lehár & Gregory V. Kryukov & Dmitriy Sonkin & Anupama Re, 2012. "The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity," Nature, Nature, vol. 483(7391), pages 603-607, March.
    5. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    6. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    7. Leo Breiman & Jerome H. Friedman, 1997. "Predicting Multivariate Responses in Multiple Linear Regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(1), pages 3-54.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wentao Qu & Xianchao Xiu & Huangyue Chen & Lingchen Kong, 2023. "A Survey on High-Dimensional Subspace Clustering," Mathematics, MDPI, vol. 11(2), pages 1-39, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seungyeul Yoo & Abhilasha Sinha & Dawei Yang & Nasser K. Altorki & Radhika Tandon & Wenhui Wang & Deebly Chavez & Eunjee Lee & Ayushi S. Patel & Takashi Sato & Ranran Kong & Bisen Ding & Eric E. Schad, 2022. "Integrative network analysis of early-stage lung adenocarcinoma identifies aurora kinase inhibition as interceptor of invasion and progression," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    2. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    3. Yu-Zhu Tian & Man-Lai Tang & Wai-Sum Chan & Mao-Zai Tian, 2021. "Bayesian bridge-randomized penalized quantile regression for ordinal longitudinal data, with application to firm’s bond ratings," Computational Statistics, Springer, vol. 36(2), pages 1289-1319, June.
    4. Yu, Dengdeng & Zhang, Li & Mizera, Ivan & Jiang, Bei & Kong, Linglong, 2019. "Sparse wavelet estimation in quantile regression with multiple functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 12-29.
    5. Torossian, Léonard & Picheny, Victor & Faivre, Robert & Garivier, Aurélien, 2020. "A review on quantile regression for stochastic computer experiments," Reliability Engineering and System Safety, Elsevier, vol. 201(C).
    6. Jiang, Liewen & Bondell, Howard D. & Wang, Huixia Judy, 2014. "Interquantile shrinkage and variable selection in quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 69(C), pages 208-219.
    7. Yu-Zhu Tian & Man-Lai Tang & Mao-Zai Tian, 2021. "Bayesian joint inference for multivariate quantile regression model with L $$_{1/2}$$ 1 / 2 penalty," Computational Statistics, Springer, vol. 36(4), pages 2967-2994, December.
    8. Bousebata, Meryem & Enjolras, Geoffroy & Girard, Stéphane, 2023. "Extreme partial least-squares," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    9. Bonaccolto, Giovanni & Borri, Nicola & Consiglio, Andrea, 2023. "Breakup and default risks in the great lockdown," Journal of Banking & Finance, Elsevier, vol. 147(C).
    10. Yazhao Lv & Riquan Zhang & Weihua Zhao & Jicai Liu, 2014. "Quantile regression and variable selection for the single-index model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(7), pages 1565-1577, July.
    11. Bayer, Sebastian, 2018. "Combining Value-at-Risk forecasts using penalized quantile regressions," Econometrics and Statistics, Elsevier, vol. 8(C), pages 56-77.
    12. Jiang, He & Tao, Changqi & Dong, Yao & Xiong, Ren, 2021. "Robust low-rank multiple kernel learning with compound regularization," European Journal of Operational Research, Elsevier, vol. 295(2), pages 634-647.
    13. Tian, Yuzhu & Song, Xinyuan, 2020. "Bayesian bridge-randomized penalized quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    14. Park, Seyoung & Kim, Hyunjin & Lee, Eun Ryung, 2023. "Regional quantile regression for multiple responses," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    15. Konrad Bogner & Florian Pappenberger & Massimiliano Zappa, 2019. "Machine Learning Techniques for Predicting the Energy Consumption/Production and Its Uncertainties Driven by Meteorological Observations and Forecasts," Sustainability, MDPI, vol. 11(12), pages 1-22, June.
    16. Xu, Qifa & Zhou, Yingying & Jiang, Cuixia & Yu, Keming & Niu, Xufeng, 2016. "A large CVaR-based portfolio selection model with weight constraints," Economic Modelling, Elsevier, vol. 59(C), pages 436-447.
    17. Guangrui Tang & Neng Fan, 2022. "A Survey of Solution Path Algorithms for Regression and Classification Models," Annals of Data Science, Springer, vol. 9(4), pages 749-789, August.
    18. Petrella, Lea & Raponi, Valentina, 2019. "Joint estimation of conditional quantiles in multivariate linear regression models with an application to financial distress," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 70-84.
    19. Liqun Yu & Nan Lin, 2017. "ADMM for Penalized Quantile Regression in Big Data," International Statistical Review, International Statistical Institute, vol. 85(3), pages 494-518, December.
    20. Jhong, Jae-Hwan & Koo, Ja-Yong, 2019. "Simultaneous estimation of quantile regression functions using B-splines and total variation penalty," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 228-244.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:alstar:v:106:y:2022:i:1:d:10.1007_s10182-021-00403-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.