Ensemble classification of paired data
AbstractIn many medical applications, data are taken from paired organs or from repeated measurements of the same organ or subject. Subject based as opposed to observation based evaluation of these data results in increased efficiency of the estimation of the misclassification rate. A subject based approach for classification in the generation of bootstrap samples of bagging and bundling methods is analyzed. A simulation model is used to compare the performance of different strategies to create the bootstrap samples which are used to grow individual trees. The proposed approach is compared to linear discriminant analysis, logistic regression, random forests and gradient boosting. Finally, the simulation results are applied to glaucoma diagnosis using both eyes of glaucoma patients and healthy controls. It is demonstrated that the proposed subject based resampling reduces the misclassification rate.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoArticle provided by Elsevier in its journal Computational Statistics & Data Analysis.
Volume (Year): 55 (2011)
Issue (Month): 5 (May)
Contact details of provider:
Web page: http://www.elsevier.com/locate/csda
Ensemble classification Glaucoma diagnosis Paired data;
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Hothorn, Torsten & Lausen, Berthold, 2005. "Bundling classifiers by bagging trees," Computational Statistics & Data Analysis, Elsevier, vol. 49(4), pages 1068-1078, June.
- De Bock, Koen W & Coussement, Kristof & Van den Poel, Dirk, 2010.
"Ensemble classification based on generalized additive models,"
2010/02, Hogeschool-Universiteit Brussel, Faculteit Economie en Management.
- De Bock, Koen W. & Coussement, Kristof & Van den Poel, Dirk, 2010. "Ensemble classification based on generalized additive models," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1535-1546, June.
- K. W. De Bock & K. Coussement & D. Van Den Poel & -, 2009. "Ensemble classification based on generalized additive models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 09/625, Ghent University, Faculty of Economics and Business Administration.
- Rokach, Lior, 2009. "Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4046-4072, October.
- Zhang, Chun-Xia & Zhang, Jiang-She & Zhang, Gai-Ying, 2009. "Using Boosting to prune Double-Bagging ensembles," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1218-1231, February.
- Iranpanah, N. & Mohammadzadeh, M. & Taylor, C.C., 2011. "A comparison of block and semi-parametric bootstrap methods for variance estimation in spatial statistics," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 578-587, January.
- Adler, Werner & Lausen, Berthold, 2009. "Bootstrap estimated true and false positive rates and ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 718-729, January.
- Werner Adler & Sergej Potapov & Berthold Lausen, 2011. "Classification of repeated measurements data using tree-based ensemble methods," Computational Statistics, Springer, vol. 26(2), pages 355-369, June.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Zhang, Lei).
If references are entirely missing, you can add them using this form.