Bootstrap for inference after model selection and model averaging for likelihood models

My bibliography Save this article

Bootstrap for inference after model selection and model averaging for likelihood models

Author

Listed:

Andrea C. Garcia-Angulo
(Escuela Superior Politécnica del Litoral, ESPOL)
Gerda Claeskens
(KU Leuven)

Registered:

Abstract

A one-step semiparametric bootstrap procedure is constructed to estimate the distribution of estimators after model selection and of model averaging estimators with data-dependent weights. The method is generally applicable to non-normal models. Misspecification is allowed for all candidate parametric models. The semiparametric bootstrap estimator is shown to be consistent within specific regions such that the good and the bad candidate models are separated. Simulation studies exemplify that the bootstrap procedure leads to short confidence intervals with a good coverage.

Suggested Citation

Andrea C. Garcia-Angulo & Gerda Claeskens, 2025. "Bootstrap for inference after model selection and model averaging for likelihood models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 88(3), pages 311-340, April.

Handle: RePEc:spr:metrik:v:88:y:2025:i:3:d:10.1007_s00184-024-00956-2
DOI: 10.1007/s00184-024-00956-2

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
Paul Kabaila, 2009. "The Coverage Properties of Confidence Regions After Model Selection," International Statistical Review, International Statistical Institute, vol. 77(3), pages 405-414, December.
Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
Sin, Chor-Yiu & White, Halbert, 1996. "Information criteria for selecting possibly misspecified parametric models," Journal of Econometrics, Elsevier, vol. 71(1-2), pages 207-225.
Leeb, Hannes & Pötscher, Benedikt M., 2008. "Can One Estimate The Unconditional Distribution Of Post-Model-Selection Estimators?," Econometric Theory, Cambridge University Press, vol. 24(2), pages 338-376, April.
- Hannes Leeb & Benedikt M. Potscher, 2003. "Can One Estimate the Conditional Distribution of Post-Model-Selection Estimators?," Cowles Foundation Discussion Papers 1444, Cowles Foundation for Research in Economics, Yale University.
- Leeb, Hannes & Pötscher, Benedikt M., 2005. "Can One Estimate the Unconditional Distribution of Post-Model-Selection Estimators ?," MPRA Paper 72, University Library of Munich, Germany.
W. Lu & Y. Goldberg & J. P. Fine, 2012. "On the robustness of the adaptive lasso to model misspecification," Biometrika, Biometrika Trust, vol. 99(3), pages 717-731.
Aerts, Marc & Claeskens, Gerda, 2001. "Bootstrap tests for misspecified models, with application to clustered binary data," Computational Statistics & Data Analysis, Elsevier, vol. 36(3), pages 383-401, May.
White,Halbert, 1996. "Estimation, Inference and Specification Analysis," Cambridge Books, Cambridge University Press, number 9780521574464, January.
- White,Halbert, 1994. "Estimation, Inference and Specification Analysis," Cambridge Books, Cambridge University Press, number 9780521252805, November.
Giurcanu, Mihai C., 2012. "Bootstrapping in non-regular smooth function models," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 78-93.
Andrea C. Garcia‐Angulo & Gerda Claeskens, 2023. "Exact uniformly most powerful postselection confidence distributions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 358-382, March.
Paul Kabaila & A. H. Welsh & Waruni Abeysekera, 2016. "Model-Averaged Confidence Intervals," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 35-48, March.
S M S Lee & Y Wu, 2018. "A bootstrap recipe for post-model-selection inference under linear regression models," Biometrika, Biometrika Trust, vol. 105(4), pages 873-890.
Ali Charkhi & Gerda Claeskens, 2018. "Asymptotic post-selection inference for the Akaike information criterion," Biometrika, Biometrika Trust, vol. 105(3), pages 645-664.
Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
Danilov, Dmitry & Magnus, J.R.Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.
L. Camponovo, 2015. "On the validity of the pairs bootstrap for lasso estimators," Biometrika, Biometrika Trust, vol. 102(4), pages 981-987.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Tae-Hwy Lee & Zhou Xi & Ru Zhang, 2013. "Testing for Neglected Nonlinearity Using Regularized Artificial Neural Networks," Working Papers 201422, University of California at Riverside, Department of Economics, revised Apr 2012.
Pirenne, Sarah & Claeskens, Gerda, 2024. "Exact post-selection inference for adjusted R squared selection," Statistics & Probability Letters, Elsevier, vol. 211(C).
Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers CWP77/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2015. "Program evaluation with high-dimensional data," CeMMAP working papers 55/15, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2014. "Program evaluation with high-dimensional data," CeMMAP working papers CWP33/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers 57/13, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2015. "Program evaluation with high-dimensional data," CeMMAP working papers CWP55/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2014. "Program evaluation with high-dimensional data," CeMMAP working papers 33/14, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers 77/13, Institute for Fiscal Studies.
- Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers CWP57/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "High-Dimensional Methods and Inference on Structural and Treatment Effects," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 29-50, Spring.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers CWP59/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers 59/13, Institute for Fiscal Studies.
Mihai C. Giurcanu, 2017. "Oracle M-Estimation for Time Series Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(3), pages 479-504, May.
Paul Kabaila & Christeen Wijethunga, 2024. "Confidence intervals centred on bootstrap smoothed estimators: an impossibility result," Statistical Papers, Springer, vol. 65(3), pages 1531-1551, May.
Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
- Max H. Farrell, 2013. "Robust Inference on Average Treatment Effects with Possibly More Covariates than Observations," Papers 1309.4686, arXiv.org, revised Feb 2018.
Wan, Alan T.K. & Zhang, Xinyu & Wang, Shouyang, 2014. "Frequentist model averaging for multinomial and ordered logit models," International Journal of Forecasting, Elsevier, vol. 30(1), pages 118-128.
Pötscher, Benedikt M. & Leeb, Hannes, 2009. "On the distribution of penalized maximum likelihood estimators: The LASSO, SCAD, and thresholding," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 2065-2082, October.
- Pötscher, Benedikt M. & Leeb, Hannes, 2007. "On the distribution of penalized maximum likelihood estimators: The LASSO, SCAD, and thresholding," MPRA Paper 5615, University Library of Munich, Germany.
Ghosh, D. & Yuan, Z., 2009. "An improved model averaging scheme for logistic regression," Journal of Multivariate Analysis, Elsevier, vol. 100(8), pages 1670-1681, September.
Susan M. Shortreed & Ashkan Ertefaie, 2017. "Outcome‐adaptive lasso: Variable selection for causal inference," Biometrics, The International Biometric Society, vol. 73(4), pages 1111-1122, December.
Gueuning, Thomas & Claeskens, Gerda, 2016. "Confidence intervals for high-dimensional partially linear single-index models," Journal of Multivariate Analysis, Elsevier, vol. 149(C), pages 13-29.
Ng, Serena, 2013. "Variable Selection in Predictive Regressions," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 752-789, Elsevier.
Andrea C. Garcia‐Angulo & Gerda Claeskens, 2023. "Exact uniformly most powerful postselection confidence distributions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 358-382, March.
Christian Hansen & Damian Kozbur & Sanjog Misra, 2016. "Targeted undersmoothing," ECON - Working Papers 282, Department of Economics - University of Zurich, revised Apr 2018.
Aman Ullah & Huansha Wang, 2013. "Parametric and Nonparametric Frequentist Model Selection and Model Averaging," Econometrics, MDPI, vol. 1(2), pages 1-23, September.
Tang, Niansheng & Yan, Xiaodong & Zhao, Puying, 2018. "Exponentially tilted likelihood inference on growing dimensional unconditional moment models," Journal of Econometrics, Elsevier, vol. 202(1), pages 57-74.
Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
Margherita Giuzio, 2017. "Genetic algorithm versus classical methods in sparse index tracking," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 40(1), pages 243-256, November.
Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:88:y:2025:i:3:d:10.1007_s00184-024-00956-2. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Bootstrap for inference after model selection and model averaging for likelihood models

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data