Kernel Factory: An Ensemble of Kernel Machines
AbstractWe propose an ensemble method for kernel machines. The training data is randomly split into a number of mutually exclusive partitions defined by a row and column parameter. Each partition forms an input space and is transformed by a kernel function into a kernel matrix K. Subsequently, each K is used as training data for a base binary classifier (Random Forest). This results in a number of predictions equal to the number of partitions. A weighted average combines the predictions into one final prediction. To optimize the weights, a genetic algorithm is used. This approach has the advantage of simultaneously promoting (1) diversity, (2) accuracy, and (3) computational speed. (1) Diversity is fostered because the individual K’s are based on a subset of features and observations, (2) accuracy is sought by optimizing the weights with the genetic algorithm, and (3) computational speed is obtained because the computation of each K can be parallelized. Using five times two-fold cross validation we benchmark the classification performance of Kernel Factory against Random Forest and Kernel-Induced Random Forest (KIRF). We find that Kernel Factory has significantly better performance than Kernel-Induced Random Forest. When the right kernel is specified Kernel Factory is also significantly better than Random Forest. In addition, an open-source Rsoftware package of the algorithm (kernelFactory) is available from CRAN.
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by Ghent University, Faculty of Economics and Business Administration in its series Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium with number 12/825.
Length: 24 pages
Date of creation: Dec 2012
Date of revision:
This paper has been announced in the following NEP Reports:
- NEP-ALL-2013-02-03 (All new papers)
- NEP-CMP-2013-02-03 (Computational Economics)
- NEP-ECM-2013-02-03 (Econometrics)
- NEP-FOR-2013-02-03 (Forecasting)
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Alexandros Karatzoglou & Alexandros Smola & Kurt Hornik & Achim Zeileis, . "kernlab - An S4 Package for Kernel Methods in R," Journal of Statistical Software, American Statistical Association, vol. 11(i09).
- W. Buckinx & D. Van Den Poel, 2003.
"Customer Base Analysis: Partial Defection of Behaviorally-Loyal Clients in a Non-Contractual FMCG Retail Setting,"
Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium
03/178, Ghent University, Faculty of Economics and Business Administration.
- Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
- Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
- K. Coussement & D. Van Den Poel, 2006. "Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-Selection Techniques," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 06/412, Ghent University, Faculty of Economics and Business Administration.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Nathalie Verhaeghe).
If references are entirely missing, you can add them using this form.