This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

No unbiased Estimator of the Variance of K-Fold Cross-Validation

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Yoshua Bengio
Yves Grandvalet
Abstract

In statistical machine learning, the standard measure of accuracy for models is the prediction error, i.e. the expected loss on future examples. When the data distribution is unknown, it cannot be computed but several resampling methods, such as K-fold cross-validation can be used to obtain an unbiased estimator of prediction error. However, to compare learning algorithms one needs to also estimate the uncertainty around the cross-validation estimator, which is important because it can be very large. However, the usual variance estimates for means of independent samples cannot be used because of the reuse of the data used to form the cross-validation estimator. The main result of this paper is that there is no universal (distribution independent) unbiased estimator of the variance of the K-fold cross-validation estimator, based only on the empirical results of the error measurements obtained through the cross-validation procedure. The analysis provides a theoretical understanding showing the difficulty of this estimation. These results generalize to other resampling methods, as long as data are reused for training or testing.

L'erreur de prédiction, donc la perte attendue sur des données futures, est la mesure standard pour la qualité des modèles d'apprentissage statistique. Quand la distribution des données est inconnue, cette erreur ne peut être calculée mais plusieurs méthodes de rééchantillonnage, comme la validation croisée, peuvent être utilisées pour obtenir un estimateur non-biaisé de l'erreur de prédiction. Cependant pour comparer des algorithmes d'apprentissage, il faut aussi estimer l'incertitude autour de cet estimateur d'erreur future, car cette incertitude peut être très grande. Cependant, les estimateurs ordinaires de variance d'une moyenne pour des échantillons indépendants ne peuvent être utilisés à cause du recoupement des ensembles d'apprentissage utilisés pour effectuer la validation croisée. Le résultat principal de cet article est qu'il n'existe pas d'estimateur non-biaisé universel (indépendant de la distribution) de la variance de la validation croisée, en se basant sur les mesures d'erreur faites durant la validation croisée. L'analyse fournit une meilleure compréhension de la difficulté d'estimer l'incertitude autour de la validation croisée. Ces résultats se généralisent à d'autres méthodes de rééchantillonnage pour lesquelles des données sont réutilisées pour l'apprentissage ou le test.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.cirano.qc.ca/pdf/publication/2003s-22.pdf
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by CIRANO in its series CIRANO Working Papers with number 2003s-22.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 01 May 2003
Date of revision:
Handle: RePEc:cir:cirwor:2003s-22

Contact details of provider:
Postal: 2020 rue University, 25e �tage, Montr�al, Qu�c, H3A 2A5
Phone: (514) 985-4000
Fax: (514) 985-4039
Email:
Web page: http://www.cirano.qc.ca/
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Webmaster).

Related research
Keywords: Prediction error; cross-validation; multivariate variance estimators; statistical comparison of algorithms; Erreur de prédiction; validation croisée; estimateur de variance multivariée; comparaison statistique des algorithmes;

This paper has been announced in the following NEP Reports:

Statistics
Access and download statistics

Did you know? All bibliographic data on IDEAS has been put in the public domain by the publishers.

This page was last updated on 2009-12-19.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.