IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v67y2005i3p363-379.html
   My bibliography  Save this article

Properties of bagged nearest neighbour classifiers

Author

Listed:
  • Peter Hall
  • Richard J. Samworth

Abstract

Summary. It is shown that bagging, a computationally intensive method, asymptotically improves the performance of nearest neighbour classifiers provided that the resample size is less than 69% of the actual sample size, in the case of with‐replacement bagging, or less than 50% of the sample size, for without‐replacement bagging. However, for larger sampling fractions there is no asymptotic difference between the risk of the regular nearest neighbour classifier and its bagged version. In particular, neither achieves the large sample performance of the Bayes classifier. In contrast, when the sampling fractions converge to 0, but the resample sizes diverge to ∞, the bagged classifier converges to the optimal Bayes rule and its risk converges to the risk of the latter. These results are most readily seen when the two populations have well‐defined densities, but they may also be derived in other cases, where densities exist in only a relative sense. Cross‐validation can be used effectively to choose the sampling fraction. Numerical calculation is used to illustrate these theoretical properties.

Suggested Citation

  • Peter Hall & Richard J. Samworth, 2005. "Properties of bagged nearest neighbour classifiers," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(3), pages 363-379, June.
  • Handle: RePEc:bla:jorssb:v:67:y:2005:i:3:p:363-379
    DOI: 10.1111/j.1467-9868.2005.00506.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2005.00506.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2005.00506.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Petersen, Maya L. & Molinaro, Annette M. & Sinisi, Sandra E. & van der Laan, Mark J., 2007. "Cross-validated bagged learning," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1693-1704, October.
    2. Timothy I. Cannings & Richard J. Samworth, 2017. "Random-projection ensemble classification," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 959-1035, September.
    3. Asma Gul & Aris Perperoglou & Zardad Khan & Osama Mahmoud & Miftahuddin Miftahuddin & Werner Adler & Berthold Lausen, 2018. "Ensemble of a subset of kNN classifiers," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(4), pages 827-840, December.
    4. Will Wei Sun & Xingye Qiao & Guang Cheng, 2016. "Stabilized Nearest Neighbor Classifier and its Statistical Properties," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1254-1265, July.
    5. Pere Marti-Puig & Alejandro Blanco-M & Juan José Cárdenas & Jordi Cusidó & Jordi Solé-Casals, 2019. "Feature Selection Algorithms for Wind Turbine Failure Prediction," Energies, MDPI, vol. 12(3), pages 1-18, January.
    6. Cholaquidis, Alejandro & Fraiman, Ricardo & Kalemkerian, Juan & Llop, Pamela, 2016. "A nonlinear aggregation type classifier," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 269-281.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:67:y:2005:i:3:p:363-379. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.