IDEAS home Printed from https://ideas.repec.org/p/aim/wpaimx/1852.html
   My bibliography  Save this paper

A Review on Variable Selection in Regression Analysis

Author

Abstract

In this paper, we investigate on 39 Variable Selection procedures to give an overview of the existing literature for practitioners. "Let the data speak for themselves" has become the motto of many applied researchers since the amount of data has significantly grew. Automatic model selection have been raised by the search for data-driven theories for quite a long time now. However while great extensions have been made on the theoretical side still basic procedures are used in most empirical work, eg. Stepwise Regression. Some reviews are already available in the literature for variable selection, but always focus on a specific topic like linear regression, groups of variables or smoothly varying coefficients. Here we provide a review of main methods and state-of-the art extensions as well as a topology of them over a wide range of model structures (linear, grouped, additive, partially linear and non-parametric). We provide explanations for which methods to use for different model purposes and what are key differences among them. We also review two methods for improving variable selection in the general sense.

Suggested Citation

  • Loann D. Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," AMSE Working Papers 1852, Aix-Marseille School of Economics, France.
  • Handle: RePEc:aim:wpaimx:1852
    as

    Download full text from publisher

    File URL: https://www.amse-aixmarseille.fr/sites/default/files/_dt/2012/wp_2018_-_nr_52.pdf
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Eduardo Correia & Rodrigo Calili & José Francisco Pessanha & Maria Fatima Almeida, 2023. "Definition of Regulatory Targets for Electricity Non-Technical Losses: Proposition of an Automatic Model-Selection Technique for Panel Data Regressions," Energies, MDPI, vol. 16(6), pages 1-22, March.
    3. Rahi Jain & Wei Xu, 2021. "HDSI: High dimensional selection with interactions algorithm on feature selection and testing," PLOS ONE, Public Library of Science, vol. 16(2), pages 1-17, February.
    4. Fakhri J. Hasanov & Elchin Suleymanov & Heyran Aliyeva & Hezi Eynalov & Sa'd Shannak, 2022. "What Drives the Agricultural Growth in Azerbaijan? Insights from Autometrics with Super Saturation," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 70(3), pages 147-174.
    5. Fakhri J. Hasanov & Muhammad Javid & Frederick L. Joutz, 2022. "Saudi Non-Oil Exports before and after COVID-19: Historical Impacts of Determinants and Scenario Analysis," Sustainability, MDPI, vol. 14(4), pages 1-38, February.
    6. Iztok Podbregar & Goran Šimić & Mirjana Radovanović & Sanja Filipović & Polona Šprajc, 2020. "International Energy Security Risk Index—Analysis of the Methodological Settings," Energies, MDPI, vol. 13(12), pages 1-15, June.
    7. Berndt Jesenko & Christian Schlögl, 2021. "The effect of web of science subject categories on clustering: the case of data-driven methods in business and economic sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6785-6801, August.
    8. Gonzalo García-Donato & María Eugenia Castellanos & Alicia Quirós, 2021. "Bayesian Variable Selection with Applications in Health Sciences," Mathematics, MDPI, vol. 9(3), pages 1-16, January.
    9. Mukhtarov, Shahriyar & Mikayilov, Jeyhun I., 2023. "Could financial development eliminate energy poverty through renewable energy in Poland?," Energy Policy, Elsevier, vol. 182(C).
    10. Gao Wang & Abhishek Sarkar & Peter Carbonetto & Matthew Stephens, 2020. "A simple new approach to variable selection in regression, with application to genetic fine mapping," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1273-1300, December.
    11. Kimia Keshanian & Daniel Zantedeschi & Kaushik Dutta, 2022. "Features Selection as a Nash-Bargaining Solution: Applications in Online Advertising and Information Systems," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2485-2501, September.
    12. Robert Giel & Alicja Dąbrowska, 2021. "Estimating Time Spent at the Waste Collection Point by A Garbage Truck with A Multiple Regression Model," Sustainability, MDPI, vol. 13(8), pages 1-14, April.

    More about this item

    Keywords

    variable selection; automatic modelling; sparse models;
    All these keywords.

    JEL classification:

    • C50 - Mathematical and Quantitative Methods - - Econometric Modeling - - - General
    • C59 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Other

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aim:wpaimx:1852. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Gregory Cornu (email available below). General contact details of provider: https://edirc.repec.org/data/amseafr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.