IDEAS home Printed from https://ideas.repec.org/a/taf/gnstxx/v27y2015i2p167-179.html
   My bibliography  Save this article

A K -fold averaging cross-validation procedure

Author

Listed:
  • Yoonsuh Jung
  • Jianhua Hu

Abstract

Cross-validation (CV) type of methods have been widely used to facilitate model estimation and variable selection. In this work, we suggest a new K -fold CV procedure to select a candidate 'optimal' model from each hold-out fold and average the K candidate 'optimal' models to obtain the ultimate model. Due to the averaging effect, the variance of the proposed estimates can be significantly reduced. This new procedure results in more stable and efficient parameter estimation than the classical K -fold CV procedure. In addition, we show the asymptotic equivalence between the proposed and classical CV procedures in the linear regression setting. We also demonstrate the broad applicability of the proposed procedure via two examples of parameter sparsity regularisation and quantile smoothing splines modelling. We illustrate the promise of the proposed method through simulations and a real data example.

Suggested Citation

  • Yoonsuh Jung & Jianhua Hu, 2015. "A K -fold averaging cross-validation procedure," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(2), pages 167-179, June.
  • Handle: RePEc:taf:gnstxx:v:27:y:2015:i:2:p:167-179
    DOI: 10.1080/10485252.2015.1010532
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/10485252.2015.1010532
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/10485252.2015.1010532?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    2. Patrick Royston & Douglas G. Altman, 1994. "Regression Using Fractional Polynomials of Continuous Covariates: Parsimonious Parametric Modelling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 43(3), pages 429-453, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yingtao Zhang & Tao Wang & Kangkang Liu & Yao Xia & Yi Lu & Qinlong Jing & Zhicong Yang & Wenbiao Hu & Jiahai Lu, 2016. "Developing a Time Series Predictive Model for Dengue in Zhongshan, China Based on Weather and Guangzhou Dengue Surveillance Data," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 10(2), pages 1-17, February.
    2. Yang, Yadong & Shahbeik, Hossein & Shafizadeh, Alireza & Rafiee, Shahin & Hafezi, Amir & Du, Xinyi & Pan, Junting & Tabatabaei, Meisam & Aghbashlo, Mortaza, 2023. "Predicting municipal solid waste gasification using machine learning: A step toward sustainable regional planning," Energy, Elsevier, vol. 278(PB).
    3. Dhan Lord B. Fortela & Armani Travis & Ashley P. Mikolajczyk & Wayne Sharp & Emmanuel Revellame & William Holmes & Rafael Hernandez & Mark E. Zappi, 2023. "Quantitating Wastewater Characteristic Parameters Using Neural Network Regression Modeling on Spectral Reflectance," Clean Technol., MDPI, vol. 5(4), pages 1-17, September.
    4. A. Costa & G. Buffa & D. Palmeri & G. Pollara & L. Fratini, 2022. "Hybrid prediction-optimization approaches for maximizing parts density in SLM of Ti6Al4V titanium alloy," Journal of Intelligent Manufacturing, Springer, vol. 33(7), pages 1967-1989, October.
    5. Yang, Yadong & Shahbeik, Hossein & Shafizadeh, Alireza & Masoudnia, Nima & Rafiee, Shahin & Zhang, Yijia & Pan, Junting & Tabatabaei, Meisam & Aghbashlo, Mortaza, 2022. "Biomass microwave pyrolysis characterization by machine learning for sustainable rural biorefineries," Renewable Energy, Elsevier, vol. 201(P2), pages 70-86.
    6. Li, Guosheng & Ma, Shuaichao & Zhang, Dequan & Yang, Leping & Zhang, Weihua & Wu, Zeping, 2024. "An efficient sequential anisotropic RBF reliability analysis method with fast cross-validation and parallelizability," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    7. Leonardo Brain García Fernández & Anna Diva Plasencia Lotufo & Carlos Roberto Minussi, 2023. "Development of a Short-Term Electrical Load Forecasting in Disaggregated Levels Using a Hybrid Modified Fuzzy-ARTMAP Strategy," Energies, MDPI, vol. 16(10), pages 1-30, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcelo Cajias & Philipp Freudenreich & Anna Heller & Wolfgang Schaefers, 2018. "Censored Quantile Regressions and the Determinants of Real Estate Liquidity," ERES eres2018_203, European Real Estate Society (ERES).
    2. Kuk, Anthony Y.C., 2017. "Function compositional adjustments of conditional quantile curves," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 281-293.
    3. Petrella, Lea & Raponi, Valentina, 2019. "Joint estimation of conditional quantiles in multivariate linear regression models with an application to financial distress," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 70-84.
    4. Yu, Keming & Moyeed, Rana A., 2001. "Bayesian quantile regression," Statistics & Probability Letters, Elsevier, vol. 54(4), pages 437-447, October.
    5. Akosah, Nana Kwame & Alagidede, Imhotep Paul & Schaling, Eric, 2020. "Testing for asymmetry in monetary policy rule for small-open developing economies: Multiscale Bayesian quantile evidence from Ghana," The Journal of Economic Asymmetries, Elsevier, vol. 22(C).
    6. Molyneux, Philip & Pancotto, Livia & Reghezza, Alessio & Rodriguez d'Acri, Costanza, 2022. "Interest rate risk and monetary policy normalisation in the euro area," Journal of International Money and Finance, Elsevier, vol. 124(C).
    7. Paul Hewson & Keming Yu, 2008. "Quantile regression for binary performance indicators," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 24(5), pages 401-418, September.
    8. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    9. Georgios Bertsatos & Plutarchos Sakellaris & Mike G. Tsionas, 2022. "Extensions of the Pesaran, Shin and Smith (2001) bounds testing procedure," Empirical Economics, Springer, vol. 62(2), pages 605-634, February.
    10. Salimata Sissoko, 2011. "Working Paper 03-11 - Niveau de décentralisation de la négociation et structure des salaires," Working Papers 1103, Federal Planning Bureau, Belgium.
    11. Korom, Philipp, 2016. "Inherited advantage: The importance of inheritance for private wealth accumulation in Europe," MPIfG Discussion Paper 16/11, Max Planck Institute for the Study of Societies.
    12. Daniele, Vittorio, 2007. "Criminalità e investimenti esteri. Un’analisi per le province italiane [The effect of organized crime on Foreign Investments. An Empirical Analysis for the Italian Provinces]," MPRA Paper 6417, University Library of Munich, Germany.
    13. Ma, Lingjie & Koenker, Roger, 2006. "Quantile regression methods for recursive structural equation models," Journal of Econometrics, Elsevier, vol. 134(2), pages 471-506, October.
    14. Cuesta, Lizeth & Ruiz, Yomara, 2021. "Efecto de la globalización sobre la desigualdad. Un estudio global para 104 países usando regresiones cuantílicas [Effect of globalization on inequality. A global study for 104 countries using quan," MPRA Paper 111022, University Library of Munich, Germany.
    15. Dutta, Anupam & Bouri, Elie & Rothovius, Timo & Uddin, Gazi Salah, 2023. "Climate risk and green investments: New evidence," Energy, Elsevier, vol. 265(C).
    16. Proto, Eugenio & Rustichini, Aldo, 2012. "Life Satisfaction, Household Income and Personality Traits," The Warwick Economics Research Paper Series (TWERPS) 988, University of Warwick, Department of Economics.
    17. Cowling, Marc & Ughetto, Elisa & Lee, Neil, 2018. "The innovation debt penalty: Cost of debt, loan default, and the effects of a public loan guarantee on high-tech firms," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 166-176.
    18. Guili Liao & Qimeng Liu & Rongmao Zhang & Shifang Zhang, 2022. "Rank test of unit‐root hypothesis with AR‐GARCH errors," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(5), pages 695-719, September.
    19. Shweta Bahl & Ajay Sharma, 2021. "Education–Occupation Mismatch and Dispersion in Returns to Education: Evidence from India," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 153(1), pages 251-298, January.
    20. Nguyen, Thao & Bai, Min & Hou, Greg & Vu, Manh-Chien, 2020. "State ownership and adjustment speed toward target leverage: Evidence from a transitional economy," Research in International Business and Finance, Elsevier, vol. 53(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:gnstxx:v:27:y:2015:i:2:p:167-179. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/GNST20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.