IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2311.02299.html
   My bibliography  Save this paper

The Fragility of Sparsity

Author

Listed:
  • Michal Koles'ar
  • Ulrich K. Muller
  • Sebastian T. Roelsgaard

Abstract

We show, using three empirical applications, that linear regression estimates which rely on the assumption of sparsity are fragile in two ways. First, we document that different choices of the regressor matrix that do not impact ordinary least squares (OLS) estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates two standard errors or more. Second, we develop two tests of the sparsity assumption based on comparing sparsity-based estimators with OLS. The tests tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust results at little efficiency cost.

Suggested Citation

  • Michal Koles'ar & Ulrich K. Muller & Sebastian T. Roelsgaard, 2023. "The Fragility of Sparsity," Papers 2311.02299, arXiv.org, revised Jan 2024.
  • Handle: RePEc:arx:papers:2311.02299
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2311.02299
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John J. Donohue III & Steven D. Levitt, 2001. "The Impact of Legalized Abortion on Crime," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 116(2), pages 379-420.
    2. Matias D. Cattaneo & Michael Jansson & Whitney K. Newey, 2018. "Inference in Linear Regression Models with Many Covariates and Heteroscedasticity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1350-1361, July.
    3. Anatolyev, Stanislav & Sølvsten, Mikkel, 2023. "Testing many restrictions under heteroskedasticity," Journal of Econometrics, Elsevier, vol. 236(1).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ahrens, Achim & Hansen, Christian B. & Schaffer, Mark E & Wiemann, Thomas, 2024. "Model Averaging and Double Machine Learning," IZA Discussion Papers 16714, Institute of Labor Economics (IZA).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Patrick Kline & Raffaele Saggio & Mikkel Sølvsten, 2020. "Leave‐Out Estimation of Variance Components," Econometrica, Econometric Society, vol. 88(5), pages 1859-1898, September.
    2. Ng Cheuk Fai, 2022. "Robust Inference in High Dimensional Linear Model with Cluster Dependence," Papers 2212.05554, arXiv.org.
    3. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    4. Grönqvist, Hans & Niknami, Susan, 2014. "Alcohol availability and crime: Lessons from liberalized weekend sales restrictions," Journal of Urban Economics, Elsevier, vol. 81(C), pages 77-84.
    5. Hyunseok Jung & Xiaodong Liu, 2023. "Testing for Peer Effects without Specifying the Network Structure," Papers 2306.09806, arXiv.org, revised Mar 2024.
    6. Alejandro Gaviria & Carlos Medina & Leonardo Morales & Jairo Núñez, 2010. "The Cost of Avoiding Crime: The Case of Bogotá," NBER Chapters, in: The Economics of Crime: Lessons For and From Latin America, pages 101-132, National Bureau of Economic Research, Inc.
    7. Chenchuan (Mark) Li & Ulrich K. Müller, 2021. "Linear regression with many controls of limited explanatory power," Quantitative Economics, Econometric Society, vol. 12(2), pages 405-442, May.
    8. Alejandro Gaviria & Carlos Medina & Jorge Tamayo, 2010. "Assessing the Link between Adolescent Fertility and Urban Crime," Borradores de Economia 6860, Banco de la Republica.
    9. Nouman Khaliq & Muhammad Shabbir & Zahira Batool, 2019. "Exploring the Influence of Unemployment on Criminal Behavior in Punjab, Pakistan," Global Regional Review, Humanity Only, vol. 4(1), pages 402-409, March.
    10. Annette N. Brown & Drew B. Cameron & Benjamin D. K. Wood, 2014. "Quality evidence for policymaking: I'll believe it when I see the replication," Journal of Development Effectiveness, Taylor & Francis Journals, vol. 6(3), pages 215-235, September.
    11. Griffin Edwards, 2014. "Doing Their Duty: An Empirical Analysis of the Unintended Effect of Tarasoff v. Regents on Homicidal Activity," Journal of Law and Economics, University of Chicago Press, vol. 57(2), pages 321-348.
    12. Geert Dhaene & Koen Jochmans, 2015. "Split-panel Jackknife Estimation of Fixed-effect Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(3), pages 991-1030.
    13. Mitrut, Andreea & Wolff, François-Charles, 2011. "The impact of legalized abortion on child health outcomes and abandonment. Evidence from Romania," Journal of Health Economics, Elsevier, vol. 30(6), pages 1219-1231.
    14. Hussey, Andrew & Nikolsko-Rzhevskyy, Alex & Walker, Jay, 2010. "AIDing Contraception: HIV and Recent Trends in Abortion Rates," MPRA Paper 20895, University Library of Munich, Germany.
    15. Andrew Leigh, 2020. "The Second Convict Age: Explaining the Return of Mass Imprisonment in Australia," The Economic Record, The Economic Society of Australia, vol. 96(313), pages 187-208, June.
    16. Riccardo D'Adamo, 2018. "Cluster-Robust Standard Errors for Linear Regression Models with Many Controls," Papers 1806.07314, arXiv.org, revised Apr 2019.
    17. McCrary, Justin & Lee, David S., 2009. "The Deterrence Effect of Prison: Dynamic Theory and Evidence," Berkeley Olin Program in Law & Economics, Working Paper Series qt2gh1r30h, Berkeley Olin Program in Law & Economics.
    18. Smith, Simon C. & Timmermann, Allan & Zhu, Yinchu, 2019. "Variable selection in panel models with breaks," Journal of Econometrics, Elsevier, vol. 212(1), pages 323-344.
    19. Guo, Xu & Li, Runze & Liu, Jingyuan & Zeng, Mudong, 2023. "Statistical inference for linear mediation models with high-dimensional mediators and application to studying stock reaction to COVID-19 pandemic," Journal of Econometrics, Elsevier, vol. 235(1), pages 166-179.
    20. Bowmaker, Simon W. & Emerson, Patrick M., 2009. "Still Waiting for Mister Right? Asymmetric Information, Abortion Laws and the Timing of Marriage," IZA Discussion Papers 4176, Institute of Labor Economics (IZA).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2311.02299. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.