IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v235y2023i1p280-301.html
   My bibliography  Save this article

Model averaging prediction by K-fold cross-validation

Author

Listed:
  • Zhang, Xinyu
  • Liu, Chu-An

Abstract

This paper considers the model averaging prediction in a quasi-likelihood framework that allows for parameter uncertainty and model misspecification. We propose an averaging prediction that selects the data-driven weights by minimizing a K-fold cross-validation. We provide two theoretical justifications for the proposed method. First, when all candidate models are misspecified, we show that the proposed averaging prediction using K-fold cross-validation weights is asymptotically optimal in the sense of achieving the lowest possible prediction risk. Second, when the model set includes correctly specified models, we demonstrate that the proposed K-fold cross-validation asymptotically assigns all weights to the correctly specified models. Monte Carlo simulations show that the proposed averaging prediction achieves lower empirical risk than other existing model averaging methods. As an empirical illustration, the proposed method is applied to credit card default prediction.

Suggested Citation

  • Zhang, Xinyu & Liu, Chu-An, 2023. "Model averaging prediction by K-fold cross-validation," Journal of Econometrics, Elsevier, vol. 235(1), pages 280-301.
  • Handle: RePEc:eee:econom:v:235:y:2023:i:1:p:280-301
    DOI: 10.1016/j.jeconom.2022.04.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407622000975
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2022.04.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gao, Yan & Zhang, Xinyu & Wang, Shouyang & Zou, Guohua, 2016. "Model averaging based on leave-subject-out cross-validation," Journal of Econometrics, Elsevier, vol. 192(1), pages 139-151.
    2. Zhang, Xinyu, 2015. "Consistency of model averaging estimators," Economics Letters, Elsevier, vol. 130(C), pages 120-123.
    3. Yuan, Zheng & Yang, Yuhong, 2005. "Combining Linear Regression Models: When and How?," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1202-1214, December.
    4. Yuhong Yang, 2005. "Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation," Biometrika, Biometrika Trust, vol. 92(4), pages 937-950, December.
    5. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    6. Fang, Fang & Chen, Yuanyuan, 2019. "A new approach for credit scoring by directly maximizing the Kolmogorov–Smirnov statistic," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 180-194.
    7. Yang, Yuhong, 2000. "Combining Different Procedures for Adaptive Regression," Journal of Multivariate Analysis, Elsevier, vol. 74(1), pages 135-161, July.
    8. Yan Gao & Xinyu Zhang & Shouyang Wang & Terence Tai-leung Chong & Guohua Zou, 2019. "Frequentist model averaging for threshold models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 275-306, April.
    9. Fernandez, Carmen & Ley, Eduardo & Steel, Mark F. J., 2001. "Benchmark priors for Bayesian model averaging," Journal of Econometrics, Elsevier, vol. 100(2), pages 381-427, February.
    10. Sun, Yuying & Hong, Yongmiao & Lee, Tae-Hwy & Wang, Shouyang & Zhang, Xinyu, 2021. "Time-varying model averaging," Journal of Econometrics, Elsevier, vol. 222(2), pages 974-992.
    11. Hansen, Bruce E., 2008. "Least-squares forecast averaging," Journal of Econometrics, Elsevier, vol. 146(2), pages 342-350, October.
    12. Cheng, Xu & Hansen, Bruce E., 2015. "Forecasting with factor-augmented regression: A frequentist model averaging approach," Journal of Econometrics, Elsevier, vol. 186(2), pages 280-293.
    13. Qingfeng Liu & Ryo Okui, 2013. "Heteroscedasticity‐robust C(p) model averaging," Econometrics Journal, Royal Economic Society, vol. 16(3), pages 463-472, October.
    14. Bruce E. Hansen, 2014. "Model averaging, asymptotic risk, and regressor groups," Quantitative Economics, Econometric Society, vol. 5(3), pages 495-530, November.
    15. Andrews, Donald W. K., 1991. "Asymptotic optimality of generalized CL, cross-validation, and generalized cross-validation in regression with heteroskedastic errors," Journal of Econometrics, Elsevier, vol. 47(2-3), pages 359-377, February.
    16. Xinyu Zhang & Dalei Yu & Guohua Zou & Hua Liang, 2016. "Optimal Model Averaging Estimation for Generalized Linear Models and Generalized Linear Mixed-Effects Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1775-1790, October.
    17. Xinyu Zhang & Guohua Zou & Hua Liang, 2014. "Model averaging and weight choice in linear mixed-effects models," Biometrika, Biometrika Trust, vol. 101(1), pages 205-218.
    18. Liu, Chu-An, 2015. "Distribution theory of the least squares averaging estimator," Journal of Econometrics, Elsevier, vol. 186(1), pages 142-159.
    19. Tomohiro Ando & Ker-Chau Li, 2014. "A Model-Averaging Approach for High-Dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 254-265, March.
    20. Lu, Xun & Su, Liangjun, 2015. "Jackknife model averaging for quantile regressions," Journal of Econometrics, Elsevier, vol. 188(1), pages 40-58.
    21. Zhang, Xinyu & Wan, Alan T.K. & Zou, Guohua, 2013. "Model averaging by jackknife criterion in models with dependent data," Journal of Econometrics, Elsevier, vol. 174(2), pages 82-94.
    22. Yang Y., 2001. "Adaptive Regression by Mixing," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 574-588, June.
    23. Xu Cheng & Zhipeng Liao & Ruoyao Shi, 2019. "On uniform asymptotic risk of averaging GMM estimators," Quantitative Economics, Econometric Society, vol. 10(3), pages 931-979, July.
    24. Wan, Alan T.K. & Zhang, Xinyu & Zou, Guohua, 2010. "Least squares model averaging by Mallows criterion," Journal of Econometrics, Elsevier, vol. 156(2), pages 277-283, June.
    25. Claeskens,Gerda & Hjort,Nils Lid, 2008. "Model Selection and Model Averaging," Cambridge Books, Cambridge University Press, number 9780521852258.
    26. Hansen, Bruce E. & Racine, Jeffrey S., 2012. "Jackknife model averaging," Journal of Econometrics, Elsevier, vol. 167(1), pages 38-46.
    27. Enrique Moral-Benito, 2015. "Model Averaging In Economics: An Overview," Journal of Economic Surveys, Wiley Blackwell, vol. 29(1), pages 46-75, February.
    28. Bruce E. Hansen, 2007. "Least Squares Model Averaging," Econometrica, Econometric Society, vol. 75(4), pages 1175-1189, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rongjun Cheng & Qinyin Li & Fuzhou Chen & Baobin Miao, 2024. "A Dual-Stage Attention-Based Vehicle Speed Prediction Model Considering Driver Heterogeneity with Fuel Consumption and Emissions Analysis," Sustainability, MDPI, vol. 16(4), pages 1-24, February.
    2. Wang, Hong & Sun, Fubao & Liu, Fa & Wang, Tingting & Liu, Wenbin & Feng, Yao, 2023. "Reconstruction of the pan evaporation based on meteorological factors with machine learning method over China," Agricultural Water Management, Elsevier, vol. 287(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liao, Jun & Zou, Guohua, 2020. "Corrected Mallows criterion for model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    2. Sun, Yuying & Hong, Yongmiao & Lee, Tae-Hwy & Wang, Shouyang & Zhang, Xinyu, 2021. "Time-varying model averaging," Journal of Econometrics, Elsevier, vol. 222(2), pages 974-992.
    3. Chen, Yi-Ting & Liu, Chu-An, 2023. "Model averaging for asymptotically optimal combined forecasts," Journal of Econometrics, Elsevier, vol. 235(2), pages 592-607.
    4. Lu, Xun & Su, Liangjun, 2015. "Jackknife model averaging for quantile regressions," Journal of Econometrics, Elsevier, vol. 188(1), pages 40-58.
    5. Shou-Yung Yin & Chu-An Liu & Chang-Ching Lin, 2021. "Focused Information Criterion and Model Averaging for Large Panels With a Multifactor Error Structure," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(1), pages 54-68, January.
    6. Liao, Jun & Zou, Guohua & Gao, Yan & Zhang, Xinyu, 2021. "Model averaging prediction for time series models with a diverging number of parameters," Journal of Econometrics, Elsevier, vol. 223(1), pages 190-221.
    7. Sun, Yuying & Hong, Yongmiao & Wang, Shouyang & Zhang, Xinyu, 2023. "Penalized time-varying model averaging," Journal of Econometrics, Elsevier, vol. 235(2), pages 1355-1377.
    8. Yuting Wei & Qihua Wang & Wei Liu, 2021. "Model averaging for linear models with responses missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 535-553, June.
    9. Liao, Jun & Zong, Xianpeng & Zhang, Xinyu & Zou, Guohua, 2019. "Model averaging based on leave-subject-out cross-validation for vector autoregressions," Journal of Econometrics, Elsevier, vol. 209(1), pages 35-60.
    10. Chu-An Liu & Biing-Shen Kuo & Wen-Jen Tsay, 2017. "Autoregressive Spectral Averaging Estimator," IEAS Working Paper : academic research 17-A013, Institute of Economics, Academia Sinica, Taipei, Taiwan.
    11. Shangwei Zhao & Jun Liao & Dalei Yu, 2020. "Model averaging estimator in ridge regression and its large sample properties," Statistical Papers, Springer, vol. 61(4), pages 1719-1739, August.
    12. Jingwen Tu & Hu Yang & Chaohui Guo & Jing Lv, 2021. "Model averaging marginal regression for high dimensional conditional quantile prediction," Statistical Papers, Springer, vol. 62(6), pages 2661-2689, December.
    13. Qingfeng Liu & Ryo Okui & Arihiro Yoshimura, 2016. "Generalized Least Squares Model Averaging," Econometric Reviews, Taylor & Francis Journals, vol. 35(8-10), pages 1692-1752, December.
    14. Haili Zhang & Guohua Zou, 2020. "Cross-Validation Model Averaging for Generalized Functional Linear Model," Econometrics, MDPI, vol. 8(1), pages 1-35, February.
    15. Rongjie Jiang & Liming Wang & Yang Bai, 2021. "Optimal model averaging estimator for semi-functional partially linear models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(2), pages 167-194, February.
    16. Yang Feng & Qingfeng Liu, 2020. "Nested Model Averaging on Solution Path for High-dimensional Linear Regression," Papers 2005.08057, arXiv.org.
    17. Ruoyao Shi, 2021. "An Averaging Estimator for Two Step M Estimation in Semiparametric Models," Working Papers 202105, University of California at Riverside, Department of Economics.
    18. Steven F. Lehrer & Tian Xie, 2022. "The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success," Management Science, INFORMS, vol. 68(1), pages 189-210, January.
    19. Liu, Chu-An, 2015. "Distribution theory of the least squares averaging estimator," Journal of Econometrics, Elsevier, vol. 186(1), pages 142-159.
    20. Wei, Yuting & Wang, Qihua, 2021. "Cross-validation-based model averaging in linear models with response missing at random," Statistics & Probability Letters, Elsevier, vol. 171(C).

    More about this item

    Keywords

    Asymptotic optimality; Cross-validation; Model averaging; Weight convergence;
    All these keywords.

    JEL classification:

    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:235:y:2023:i:1:p:280-301. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.