Recombining partitions from multivariate data: a clustering method on Bayes factors
AbstractWe introduce SAGRA (Split And Group Recombining Algorithm), a cluster analysis methodology which split the data set into small homogeneous groups and later recombine those groups using Bayes factors. We compare the performance of SAGRA with other three cluster analysis algorithms: SAR, M-clust and K-means, using five quality measures: Purity, number of groups, Rand index, adjusted Rand index, and F1, over four different data configurations. Results indicate that the SAGRA algorithm obtain consistently similar or better indexes than the other algorithms over all measures and data configurations
Download InfoIf you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
Bibliographic InfoPaper provided by Universidad Carlos III, Departamento de Estadística y Econometría in its series Statistics and Econometrics Working Papers with number ws1450804.
Date of creation: Mar 2014
Date of revision:
Contact details of provider:
Postal: C/ Madrid, 126 - 28903 GETAFE (MADRID)
Web page: http://www.uc3m.es/uc3m/dpto/DEE/departamento.html
More information through EDIRC
Cluster analysis; Bayes factors; SAR; SAGRA;
This paper has been announced in the following NEP Reports:
- NEP-ALL-2014-04-18 (All new papers)
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Atkinson, A.C. & Riani, M., 2007. "Exploratory tools for clustering multivariate data," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 272-285, September.
- Christian Hennig, 2010. "Methods for merging Gaussian mixture components," Advances in Data Analysis and Classification, Springer, vol. 4(1), pages 3-34, April.
- Mark Culp, . "spa: Semi-Supervised Semi-Parametric Graph-Based Estimation in R," Journal of Statistical Software, American Statistical Association, vol. 40(i10).
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ().
If references are entirely missing, you can add them using this form.