Asymptotic post-selection inference for the Akaike information criterion

Asymptotic post-selection inference for the Akaike information criterion

Author

Listed:

Ali Charkhi
Gerda Claeskens

Abstract

SummaryIgnoring the model selection step in inference after selection is harmful. In this paper we study the asymptotic distribution of estimators after model selection using the Akaike information criterion. First, we consider the classical setting in which a true model exists and is included in the candidate set of models. We exploit the overselection property of this criterion in constructing a selection region, and we obtain the asymptotic distribution of estimators and linear combinations thereof conditional on the selected model. The limiting distribution depends on the set of competitive models and on the smallest overparameterized model. Second, we relax the assumption on the existence of a true model and obtain uniform asymptotic results. We use simulation to study the resulting post-selection distributions and to calculate confidence regions for the model parameters, and we also apply the method to a diabetes dataset.

Suggested Citation

Ali Charkhi & Gerda Claeskens, 2018. "Asymptotic post-selection inference for the Akaike information criterion," Biometrika, Biometrika Trust, vol. 105(3), pages 645-664.

Handle: RePEc:oup:biomet:v:105:y:2018:i:3:p:645-664.

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
Maarten Jansen, 2014. "Information criteria for variable selection under sparsity," Biometrika, Biometrika Trust, vol. 101(1), pages 37-55.
Kabaila, Paul & Leeb, Hannes, 2006. "On the Large-Sample Minimal Coverage Probability of Confidence Intervals After Model Selection," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 619-629, June.
Gerda Claeskens & Nils Lid Hjort, 2004. "Goodness of Fit via Non‐parametric Likelihood Ratios," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 31(4), pages 487-513, December.
Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Annual Review of Economics, Annual Reviews, vol. 7(1), pages 649-688, August.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Papers 1501.03430, arXiv.org, revised Aug 2015.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers 36/16, Institute for Fiscal Studies.
- Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "Valid post-selection and post-regularization inference: An elementary, general approach," CeMMAP working papers CWP36/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
Leeb, Hannes & Pötscher, Benedikt M., 2005. "Model Selection And Inference: Facts And Fiction," Econometric Theory, Cambridge University Press, vol. 21(1), pages 21-59, February.
Kabaila, Paul, 1998. "Valid Confidence Intervals In Regression After Variable Selection," Econometric Theory, Cambridge University Press, vol. 14(4), pages 463-482, August.
A. Belloni & V. Chernozhukov & K. Kato, 2015. "Uniform post-selection inference for least absolute deviation regression and other Z-estimation problems," Biometrika, Biometrika Trust, vol. 102(1), pages 77-94.
Hjort N.L. & Claeskens G., 2003. "Frequentist Model Average Estimators," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 879-899, January.
Kabaila, Paul, 1995. "The Effect of Model Selection on Confidence Regions and Prediction Regions," Econometric Theory, Cambridge University Press, vol. 11(3), pages 537-549, June.
Ryan J. Tibshirani & Jonathan Taylor & Richard Lockhart & Robert Tibshirani, 2016. "Exact Post-Selection Inference for Sequential Regression Procedures," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 600-620, April.
Danilov, Dmitry & Magnus, J.R.Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Andrea C. Garcia-Angulo & Gerda Claeskens, 2025. "Bootstrap for inference after model selection and model averaging for likelihood models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 88(3), pages 311-340, April.
Kramlinger, Peter & Schneider, Ulrike & Krivobokova, Tatyana, 2023. "Uniformly valid inference based on the Lasso in linear mixed models," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
Lasanthi C. R. Pelawa Watagoda & David J. Olive, 2021. "Comparing six shrinkage estimators with large sample theory and asymptotically optimal prediction intervals," Statistical Papers, Springer, vol. 62(5), pages 2407-2431, October.
Rügamer, David & Baumann, Philipp F.M. & Greven, Sonja, 2022. "Selective inference for additive and linear mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
Jelle J Goeman & Aldo Solari, 2024. "On selection and conditioning in multiple testing and selective inference," Biometrika, Biometrika Trust, vol. 111(2), pages 393-416.
Pirenne, Sarah & Claeskens, Gerda, 2024. "Exact post-selection inference for adjusted R squared selection," Statistics & Probability Letters, Elsevier, vol. 211(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Leeb, Hannes & Pötscher, Benedikt M., 2008. "Can One Estimate The Unconditional Distribution Of Post-Model-Selection Estimators?," Econometric Theory, Cambridge University Press, vol. 24(2), pages 338-376, April.
- Hannes Leeb & Benedikt M. Potscher, 2003. "Can One Estimate the Conditional Distribution of Post-Model-Selection Estimators?," Cowles Foundation Discussion Papers 1444, Cowles Foundation for Research in Economics, Yale University.
- Leeb, Hannes & Pötscher, Benedikt M., 2005. "Can One Estimate the Unconditional Distribution of Post-Model-Selection Estimators ?," MPRA Paper 72, University Library of Munich, Germany.
Paul Kabaila, 2009. "The Coverage Properties of Confidence Regions After Model Selection," International Statistical Review, International Statistical Institute, vol. 77(3), pages 405-414, December.
Doko Tchatoka, Firmin & Dufour, Jean-Marie, 2025. "Exogeneity tests and weak identification in IV regressions: Asymptotic theory and point estimation," Journal of Econometrics, Elsevier, vol. 248(C).
Francis DiTraglia, 2011. "Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM, Second Version," PIER Working Paper Archive 15-027, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 10 Aug 2015.
- Francis J. DiTraglia, 2011. "Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM, Second Version," PIER Working Paper Archive 14-045, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 09 Dec 2014.
DiTraglia, Francis J., 2016. "Using invalid instruments on purpose: Focused moment selection and averaging for GMM," Journal of Econometrics, Elsevier, vol. 195(2), pages 187-208.
- Francis J. DiTraglia, 2011. "Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM," PIER Working Paper Archive 14-037, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 04 Aug 2014.
- Francis J. DiTraglia, 2014. "Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM," Papers 1408.0705, arXiv.org, revised Nov 2020.
Pötscher, Benedikt M., 2007. "Confidence Sets Based on Sparse Estimators Are Necessarily Large," MPRA Paper 5677, University Library of Munich, Germany.
Shaobo Jin, 2022. "Frequentist Model Averaging in Structure Equation Model With Ordinal Data," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 1130-1145, September.
Firmin Doko Tchatoka & Wenjie Wang, 2020. "Uniform Inference after Pretesting for Exogeneity," Adelaide Economics Working Papers 2020-05, Adelaide University, School of Economics.
- Doko Tchatoka, Firmin & Wang, Wenjie, 2020. "Uniform Inference after Pretesting for Exogeneity," MPRA Paper 99243, University Library of Munich, Germany.
John Copas & Shinto Eguchi, 2020. "Strong model dependence in statistical analysis: goodness of fit is not enough for model choice," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(2), pages 329-352, April.
Shaobo Jin & Sebastian Ankargren, 2019. "Frequentist Model Averaging in Structural Equation Modelling," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 84-104, March.
Lenard Lieb & Stephan Smeekes, 2017. "Inference for Impulse Responses under Model Uncertainty," Papers 1709.09583, arXiv.org, revised Oct 2019.
- Lieb, Lenard & Smeekes, Stephan, 2017. "Inference for Impulse Responses under Model Uncertainty," Research Memorandum 022, Maastricht University, Graduate School of Business and Economics (GSBE).
Ruoyao Shi & Zhipeng Liao, 2018. "An Averaging GMM Estimator Robust to Misspecification," Working Papers 201803, University of California at Riverside, Department of Economics.
Leeb, Hannes & Pötscher, Benedikt M. & Ewald, Karl, 2014. "On various confidence intervals post-model-selection," MPRA Paper 52858, University Library of Munich, Germany.
- Leeb, Hannes & Pötscher, Benedikt M. & Ewald, Karl, 2014. "On various confidence intervals post-model-selection," MPRA Paper 58326, University Library of Munich, Germany, revised 2014.
Doko Tchatoka, Firmin & Wang, Wenjie, 2021. "Uniform Inference after Pretesting for Exogeneity with Heteroskedastic Data," MPRA Paper 106408, University Library of Munich, Germany.
Liu, Chu-An, 2012. "A plug-in averaging estimator for regressions with heteroskedastic errors," MPRA Paper 41414, University Library of Munich, Germany.
Liu, Chu-An, 2015. "Distribution theory of the least squares averaging estimator," Journal of Econometrics, Elsevier, vol. 186(1), pages 142-159.
- Liu, Chu-An, 2013. "Distribution Theory of the Least Squares Averaging Estimator," MPRA Paper 54201, University Library of Munich, Germany.
Magnus, Jan R. & Wan, Alan T.K. & Zhang, Xinyu, 2011. "Weighted average least squares estimation with nonspherical disturbances and an application to the Hong Kong housing market," Computational Statistics & Data Analysis, Elsevier, vol. 55(3), pages 1331-1341, March.
Claeskens, Gerda & Magnus, Jan R. & Vasnev, Andrey L. & Wang, Wendun, 2016. "The forecast combination puzzle: A simple theoretical explanation," International Journal of Forecasting, Elsevier, vol. 32(3), pages 754-762.
- Gerda Claeskens & Jan Magnus & Andrey Vasnev & Wendun Wang, 2014. "The Forecast Combination Puzzle: A Simple Theoretical Explanation," Tinbergen Institute Discussion Papers 14-127/III, Tinbergen Institute.
- Gerda Claeskens & Jan Magnus & Andrey Vasnev & Wendun Wang, 2016. "The forecast combination puzzle: a simple theoretical explanation," Working Papers of Department of Decision Sciences and Information Management, Leuven 532152, KU Leuven, Faculty of Economics and Business (FEB), Department of Decision Sciences and Information Management, Leuven.
Bruce E. Hansen, 2007. "Least Squares Model Averaging," Econometrica, Econometric Society, vol. 75(4), pages 1175-1189, July.
Ruth M. Pfeiffer & Andrew Redd & Raymond J. Carroll, 2017. "On the impact of model selection on predictor identification and parameter inference," Computational Statistics, Springer, vol. 32(2), pages 667-690, June.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:biomet:v:105:y:2018:i:3:p:645-664.. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://academic.oup.com/biomet .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Asymptotic post-selection inference for the Akaike information criterion

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data