IDEAS home Printed from https://ideas.repec.org/a/taf/japsta/v28y2001i5p585-595.html
   My bibliography  Save this article

Data-based interval estimation of classification error rates

Author

Listed:
  • W. J. Krzanowski

Abstract

Leave-one-out and 632 bootstrap are popular data-based methods of estimating the true error rate of a classification rule, but practical applications almost exclusively quote only point estimates. Interval estimation would provide better assessment of the future performance of the rule, but little has been published on this topic. We first review general-purpose jackknife and bootstrap methodology that can be used in conjunction with leave-one-out estimates to provide prediction intervals for true error rates of classification rules. Monte Carlo simulation is then used to investigate coverage rates of the resulting intervals for normal data, but the results are disappointing; standard intervals show considerable overinclusion, intervals based on Edgeworth approximations or random weighting do not perform well, and while a bootstrap approach provides intervals with coverage rates closer to the nominal ones there is still marked underinclusion. We then turn to intervals constructed from 632 bootstrap estimates, and show that much better results are obtained. Although there is now some overinclusion, particularly for large training samples, the actual coverage rates are sufficiently close to the nominal rates for the method to be recommended. An application to real data illustrates the considerable variability that can arise in practical estimation of error rates.

Suggested Citation

  • W. J. Krzanowski, 2001. "Data-based interval estimation of classification error rates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(5), pages 585-595.
  • Handle: RePEc:taf:japsta:v:28:y:2001:i:5:p:585-595
    DOI: 10.1080/02664760120047915
    as

    Download full text from publisher

    File URL: http://www.tandfonline.com/doi/abs/10.1080/02664760120047915
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/02664760120047915?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Richard M. Heiberger, 1978. "Generation of Random Orthogonal Matrices," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 27(2), pages 199-206, June.
    2. G. Fitzmaurice & W. Krzanowski & D. Hand, 1991. "A Monte Carlo study of the 632 bootstrap estimator of error rate," Journal of Classification, Springer;The Classification Society, vol. 8(2), pages 239-250, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pausinger, Florian, 2019. "Uniformly distributed sequences in the orthogonal group and on the Grassmannian manifold," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 160(C), pages 13-22.
    2. Kai-Tai Fang & Run-Ze Li, 1997. "Some methods for generating both an NT-net and the uniform distribution on a Stiefel manifold and their applications," Computational Statistics & Data Analysis, Elsevier, vol. 24(1), pages 29-46, March.
    3. Kim, Ji-Hyun, 2009. "Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap," Computational Statistics & Data Analysis, Elsevier, vol. 53(11), pages 3735-3745, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:japsta:v:28:y:2001:i:5:p:585-595. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/CJAS20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.