Global Optimization strategies for two-mode clustering

Author

Listed:

van Rosmalen, J.M.
Groenen, P.J.F.
Trejos, J.
Castillo, W.

Registered:

Patrick Groenen

Abstract

Two-mode clustering is a relatively new form of clustering that clusters both rows and columns of a data matrix. To do so, a criterion similar to k-means is optimized. However, it is still unclear which optimization method should be used to perform two-mode clustering, as various methods may lead to non-global optima. This paper reviews and compares several optimization methods for two-mode clustering. Several known algorithms are discussed and a new, fuzzy algorithm is introduced. The meta-heuristics Multistart, Simulated Annealing, and Tabu Search are used in combination with these algorithms. The new, fuzzy algorithm is based on the fuzzy c-means algorithm of Bezdek (1981) and the Fuzzy Steps approach to avoid local minima of Heiser and Groenen (1997) and Groenen and Jajuga (2001). The performance of all methods is compared in a large simulation study. It is found that using a Multistart meta-heuristic in combination with a two-mode k-means algorithm or the fuzzy algorithm often gives the best results. Finally, an empirical data set is used to give a practical example of two-mode clustering.
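The combination of a two-mode k-means algorithm with a Multistart meta-heuristic, as described in the abstract, can be illustrated with a minimal sketch. It assumes the usual least-squares criterion, in which each cell of the data matrix is approximated by the mean of its (row cluster, column cluster) block; the function names, the handling of empty blocks, and the stopping rule are illustrative assumptions and are not taken from the paper.

```python
import random

def block_means(X, rows, cols, K, L, prev=None):
    """Mean of X over each (row cluster, column cluster) block."""
    sums = [[0.0] * L for _ in range(K)]
    counts = [[0] * L for _ in range(K)]
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            sums[r][c] += X[i][j]
            counts[r][c] += 1
    # Assumption: an empty block keeps its previous center (0.0 on the first pass).
    return [[sums[k][l] / counts[k][l] if counts[k][l]
             else (prev[k][l] if prev else 0.0)
             for l in range(L)] for k in range(K)]

def loss(X, rows, cols, V):
    """Sum of squared deviations of each cell from its block center."""
    return sum((X[i][j] - V[r][c]) ** 2
               for i, r in enumerate(rows) for j, c in enumerate(cols))

def two_mode_kmeans(X, K, L, rng, max_iter=100):
    """Alternate row and column reassignments from one random start."""
    n, m = len(X), len(X[0])
    rows = [rng.randrange(K) for _ in range(n)]
    cols = [rng.randrange(L) for _ in range(m)]
    V = block_means(X, rows, cols, K, L)
    best = loss(X, rows, cols, V)
    for _ in range(max_iter):
        # Move each row to the row cluster whose centers fit it best.
        rows = [min(range(K), key=lambda k: sum(
                    (X[i][j] - V[k][cols[j]]) ** 2 for j in range(m)))
                for i in range(n)]
        V = block_means(X, rows, cols, K, L, V)
        # Move each column to the column cluster whose centers fit it best.
        cols = [min(range(L), key=lambda l: sum(
                    (X[i][j] - V[rows[i]][l]) ** 2 for i in range(n)))
                for j in range(m)]
        V = block_means(X, rows, cols, K, L, V)
        current = loss(X, rows, cols, V)
        if current >= best - 1e-12:  # no further improvement: local optimum
            break
        best = current
    return loss(X, rows, cols, V), rows, cols, V

def multistart(X, K, L, n_starts=20, seed=0):
    """Run the local search from several random starts and keep the best."""
    rng = random.Random(seed)
    runs = (two_mode_kmeans(X, K, L, rng) for _ in range(n_starts))
    return min(runs, key=lambda res: res[0])

if __name__ == "__main__":
    X = [[1.0, 1.2, 5.0, 5.1],
         [0.9, 1.1, 4.8, 5.2],
         [9.0, 9.1, 3.0, 3.2],
         [8.8, 9.2, 2.9, 3.1]]
    value, rows, cols, centers = multistart(X, K=2, L=2)
    print(value, rows, cols)
```

Each run can only decrease the criterion, so it stops at a local optimum that depends on the random start. Taking the minimum over several starts is the simplest way to reduce that dependence, at the cost of repeating the local search.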

Suggested Citation

van Rosmalen, J.M. & Groenen, P.J.F. & Trejos, J. & Castillo, W., 2005. "Global Optimization strategies for two-mode clustering," Econometric Institute Research Papers EI 2005-33, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.

Handle: RePEc:ems:eureir:7022

References listed on IDEAS

Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
Willem Heiser & Patrick Groenen, 1997. "Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima," Psychometrika, Springer;The Psychometric Society, vol. 62(1), pages 63-83, March.
Glenn Milligan, 1980. "An examination of the effect of six types of error perturbation on fifteen clustering algorithms," Psychometrika, Springer;The Psychometric Society, vol. 45(3), pages 325-342, September.
Wayne Desarbo, 1982. "Gennclus: New models for general nonhierarchical clustering analysis," Psychometrika, Springer;The Psychometric Society, vol. 47(4), pages 449-475, December.

Citations

Cited by:

Vera, J. Fernando & Macías, Rodrigo & Heiser, Willem J., 2009. "A dual latent class unfolding model for two-way two-mode preference rating data," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 3231-3244, June.
Jan Schepers & Eva Ceulemans & Iven Mechelen, 2008. "Selecting Among Multi-Mode Partitioning Models of Different Complexities: A Comparison of Four Model Selection Criteria," Journal of Classification, Springer;The Classification Society, vol. 25(1), pages 67-85, June.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Joost Rosmalen & Patrick Groenen & Javier Trejos & William Castillo, 2009. "Optimization Strategies for Two-Mode Partitioning," Journal of Classification, Springer;The Classification Society, vol. 26(2), pages 155-181, August.
DeSarbo, Wayne S. & Selin Atalay, A. & Blanchard, Simon J., 2009. "A three-way clusterwise multidimensional unfolding procedure for the spatial representation of context dependent preferences," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 3217-3230, June.
- Selin Atalay & Wayne S. Desarbo & Simon J. Blanchard, 2009. "A three-way clusterwise multidimensional unfolding procedure for the spatial representation of context dependent preferences," Post-Print hal-00458377, HAL.
Aurora Torrente & Juan Romo, 2021. "Initializing k-means Clustering by Bootstrap and Data Depth," Journal of Classification, Springer;The Classification Society, vol. 38(2), pages 232-256, July.
J. Fernando Vera & Rodrigo Macías, 2021. "On the Behaviour of K-Means Clustering of a Dissimilarity Matrix by Means of Full Multidimensional Scaling," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 489-513, June.
Michael Brusco & Douglas Steinley, 2015. "Affinity Propagation and Uncapacitated Facility Location Problems," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 443-480, October.
Douglas Steinley & Michael Brusco, 2008. "Selection of Variables in Cluster Analysis: An Empirical Comparison of Eight Procedures," Psychometrika, Springer;The Psychometric Society, vol. 73(1), pages 125-144, March.
Michael Brusco & Douglas Steinley, 2007. "A Comparison of Heuristic Procedures for Minimum Within-Cluster Sums of Squares Partitioning," Psychometrika, Springer;The Psychometric Society, vol. 72(4), pages 583-600, December.
Florian Schreiber, 2017. "Identification of customer groups in the German term life market: a benefit segmentation," Annals of Operations Research, Springer, vol. 254(1), pages 365-399, July.
Dolnicar, Sara & Grün, Bettina & Leisch, Friedrich, 2016. "Increasing sample size compensates for data problems in segmentation studies," Journal of Business Research, Elsevier, vol. 69(2), pages 992-999.
Rocci, Roberto & Vichi, Maurizio, 2008. "Two-mode multi-partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 1984-2003, January.
Simon Blanchard & Daniel Aloise & Wayne DeSarbo, 2012. "The Heterogeneous P-Median Problem for Categorization Based Clustering," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 741-762, October.
Chia-Yi Chiu & Jeffrey Douglas & Xiaodong Li, 2009. "Cluster Analysis for Cognitive Diagnosis: Theory and Applications," Psychometrika, Springer;The Psychometric Society, vol. 74(4), pages 633-665, December.
Weinand, J.M. & McKenna, R. & Fichtner, W., 2019. "Developing a municipality typology for modelling decentralised energy systems," Utilities Policy, Elsevier, vol. 57(C), pages 75-96.
J. Fernando Vera & Ricardo Subiabre & Rodrigo Macías, 2025. "Clustering and Geodesic Scaling of Dissimilarities on the Spherical Surface," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 30(1), pages 172-192, March.
Donatella Vicari, 2014. "Classification of Asymmetric Proximity Data," Journal of Classification, Springer;The Classification Society, vol. 31(3), pages 386-420, October.
Efthymios Costa & Ioanna Papatsouma & Angelos Markos, 2023. "Benchmarking distance-based partitioning methods for mixed-type data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(3), pages 701-724, September.
Christian Hennig, 2022. "An empirical comparison and characterisation of nine popular clustering methods," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(1), pages 201-229, March.
Schepers, Jan & van Mechelen, Iven & Ceulemans, Eva, 2006. "Three-mode partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 1623-1642, December.
Pieter C. Schoonees & Patrick J. F. Groenen & Michel Velden, 2022. "Least-squares bilinear clustering of three-way data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(4), pages 1001-1037, December.
Paulina Pankowska & Daniel Oberski & Mauricio Garnier-Villarreal & Dimitris Pavlopoulos, 2025. "The effect of measurement error on clustering," Quality & Quantity: International Journal of Methodology, Springer, vol. 59(5), pages 4825-4860, October.