IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v49y2022i4p1636-1668.html

On the sign recovery by least absolute shrinkage and selection operator, thresholded least absolute shrinkage and selection operator, and thresholded basis pursuit denoising

Author

Listed:
  • Patrick J.C. Tardivel
  • Małgorzata Bogdan

Abstract

Basis pursuit (BP), basis pursuit deNoising (BPDN), and least absolute shrinkage and selection operator (LASSO) are popular methods for identifying important predictors in the high‐dimensional linear regression model Y=Xβ+ε. By definition, when ε=0, BP uniquely recovers β when Xβ=Xb and β≠b implies ‖b‖1>‖β‖1 (identifiability condition). Furthermore, LASSO can recover the sign of β only under a much stronger irrepresentability condition. Meanwhile, it is known that the model selection properties of LASSO can be improved by hard thresholding its estimates. This article supports these findings by proving that thresholded LASSO, thresholded BPDN, and thresholded BP recover the sign of β in both the noisy and noiseless cases if and only if β is identifiable and large enough. In particular, if X has iid Gaussian entries and the number of predictors grows linearly with the sample size, then these thresholded estimators can recover the sign of β when the signal sparsity is asymptotically below the Donoho–Tanner transition curve. This is in contrast to the regular LASSO, which asymptotically, recovers the sign of β only when the signal sparsity tends to 0. Numerical experiments show that the identifiability condition, unlike the irrepresentability condition, does not seem to be affected by the structure of the correlations in the X matrix.

Suggested Citation

  • Patrick J.C. Tardivel & Małgorzata Bogdan, 2022. "On the sign recovery by least absolute shrinkage and selection operator, thresholded least absolute shrinkage and selection operator, and thresholded basis pursuit denoising," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1636-1668, December.
  • Handle: RePEc:bla:scjsta:v:49:y:2022:i:4:p:1636-1668
    DOI: 10.1111/sjos.12568
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/sjos.12568
    Download Restriction: no

    File URL: https://libkey.io/10.1111/sjos.12568?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pierre Bertrand & Michel Broniatowski & Wolfgang Stummer, 2025. "Hybrid Random Concentrated Optimization Without Convexity Assumption," AMSE Working Papers 2524, Aix-Marseille School of Economics, France.
    2. Piotr Graczyk & Ulrike Schneider & Tomasz Skalski & Patrick Tardivel, 2026. "A Unified Framework for Pattern Recovery in Penalized and Thresholded Estimation and its Geometry," Journal of Optimization Theory and Applications, Springer, vol. 208(1), pages 1-41, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    2. Gozgor, Giray & Li, Jing & Saleem, Irfan & Shinwari, Riazullah, 2025. "The impact of women's political empowerment on renewable energy demand: Evidence from OECD countries," Energy Economics, Elsevier, vol. 141(C).
    3. Margherita Giuzio, 2017. "Genetic algorithm versus classical methods in sparse index tracking," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 40(1), pages 243-256, November.
    4. Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
    5. Emmanouil Androulakis & Christos Koukouvinos & Kalliopi Mylona & Filia Vonta, 2010. "A real survival analysis application via variable selection methods for Cox's proportional hazards model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(8), pages 1399-1406.
    6. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    7. Yiping Yang & Peixin Zhao & Dongsheng Wu, 2025. "Robust variable selection with exponential squared loss for linear mixed-effects models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 88(6), pages 1023-1049, August.
    8. Ying Huang & Shibasish Dasgupta, 2019. "Likelihood-Based Methods for Assessing Principal Surrogate Endpoints in Vaccine Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(3), pages 504-523, December.
    9. Linton, Oliver & Seo, Myung Hwan & Whang, Yoon-Jae, 2023. "Testing stochastic dominance with many conditioning variables," Journal of Econometrics, Elsevier, vol. 235(2), pages 507-527.
    10. Zhongnan Jin & Jie Min & Yili Hong & Pang Du & Qingyu Yang, 2024. "Multivariate Functional Clustering with Variable Selection and Application to Sensor Data from Engineering Systems," INFORMS Joural on Data Science, INFORMS, vol. 3(2), pages 203-218, October.
    11. Mert Demirer & Francis X. Diebold & Laura Liu & Kamil Yilmaz, 2018. "Estimating global bank network connectedness," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(1), pages 1-15, January.
    12. Mandys, F., 2021. "Electric vehicles and consumer choices," Renewable and Sustainable Energy Reviews, Elsevier, vol. 142(C).
    13. Mkhadri, Abdallah & Ouhourane, Mohamed, 2013. "An extended variable inclusion and shrinkage algorithm for correlated variables," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 631-644.
    14. Ni, Xiao & Zhang, Hao Helen & Zhang, Daowen, 2009. "Automatic model selection for partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 2100-2111, October.
    15. David Kohns & Tibor Szendrei, 2021. "Decoupling Shrinkage and Selection for the Bayesian Quantile Regression," Papers 2107.08498, arXiv.org.
    16. Avagyan, Vahe & Alonso Fernández, Andrés Modesto & Nogales, Francisco J., 2015. "D-trace Precision Matrix Estimation Using Adaptive Lasso Penalties," DES - Working Papers. Statistics and Econometrics. WS 21775, Universidad Carlos III de Madrid. Departamento de Estadística.
    17. Diego Vidaurre & Concha Bielza & Pedro Larrañaga, 2013. "A Survey of L1 Regression," International Statistical Review, International Statistical Institute, vol. 81(3), pages 361-387, December.
    18. Xiong, Wei & Wang, Dehui & Deng, Dianliang & Wang, Xinyang & Zhang, Wanying, 2022. "Penalized multiply robust estimation in high-order autoregressive processes with missing explanatory variables," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    19. Juan Antonio Duro & Alejandro Perez‐Laborda & Markus Löw & Sarah Matej & Barbara Plank & Fridolin Krausmann & Dominik Wiedenhofer & Helmut Haberl, 2024. "Spatial patterns of built structures co‐determine nations’ level of resource demand," Journal of Industrial Ecology, Yale University, vol. 28(2), pages 289-302, April.
    20. Yanlin Tang & Xinyuan Song & Zhongyi Zhu, 2015. "Variable selection via composite quantile regression with dependent errors," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 69(1), pages 1-20, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:49:y:2022:i:4:p:1636-1668. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.