IDEAS home Printed from https://ideas.repec.org/a/spr/jglopt/v49y2011i3p449-465.html
   My bibliography  Save this article

Evaluating a branch-and-bound RLT-based algorithm for minimum sum-of-squares clustering

Author

Listed:
  • Daniel Aloise
  • Pierre Hansen

Abstract

Minimum sum-of-squares clustering consists in partitioning a given set of n points into c clusters in order to minimize the sum of squared distances from the points to the centroid of their cluster. Recently, Sherali and Desai (JOGO, 2005) proposed a reformulation-linearization based branch-and-bound algorithm for this problem, claiming to solve instances with up to 1,000 points. In this paper, their algorithm is investigated in further detail, reproducing some of their computational experiments. However, our computational times turn out to be drastically larger. Indeed, for two data sets from the literature only instances with up to 20 points could be solved in less than 10 h of computer time. Possible reasons for this discrepancy are discussed. The effect of a symmetry breaking rule due to Plastria (EJOR, 2002) and of the introduction of valid inequalities of the convex hull of points in two dimensions which may belong to each cluster is also explored. Copyright Springer Science+Business Media, LLC. 2011

Suggested Citation

  • Daniel Aloise & Pierre Hansen, 2011. "Evaluating a branch-and-bound RLT-based algorithm for minimum sum-of-squares clustering," Journal of Global Optimization, Springer, vol. 49(3), pages 449-465, March.
  • Handle: RePEc:spr:jglopt:v:49:y:2011:i:3:p:449-465
    DOI: 10.1007/s10898-010-9571-3
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10898-010-9571-3
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10898-010-9571-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Michael Brusco & Douglas Steinley, 2007. "A Comparison of Heuristic Procedures for Minimum Within-Cluster Sums of Squares Partitioning," Psychometrika, Springer;The Psychometric Society, vol. 72(4), pages 583-600, December.
    2. Pierre Hansen & Eric Ngai & Bernard K. Cheung & Nenad Mladenovic, 2005. "Analysis of Global k-Means, an Incremental Heuristic for Minimum Sum-of-Squares Clustering," Journal of Classification, Springer;The Classification Society, vol. 22(2), pages 287-310, September.
    3. Plastria, Frank, 2002. "Formulating logical implications in combinatorial optimisation," European Journal of Operational Research, Elsevier, vol. 140(2), pages 338-353, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel Aloise & Nielsen Castelo Damasceno & Nenad Mladenović & Daniel Nobre Pinheiro, 2017. "On Strategies to Fix Degenerate k-means Solutions," Journal of Classification, Springer;The Classification Society, vol. 34(2), pages 165-190, July.
    2. Plastria, F., 2012. "A note towards improved homeland defense," Omega, Elsevier, vol. 40(2), pages 244-248, April.
    3. Roe, R.A. & Smeelen, M. & Hoefeld, C., 2005. "Outsourcing and organizational change : an employee perspective," Research Memorandum 045, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    4. Federico Della Croce, 2016. "MP or not MP: that is the question," Journal of Scheduling, Springer, vol. 19(1), pages 33-42, February.
    5. J. Fernando Vera & Rodrigo Macías, 2021. "On the Behaviour of K-Means Clustering of a Dissimilarity Matrix by Means of Full Multidimensional Scaling," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 489-513, June.
    6. Michael Brusco & Douglas Steinley, 2015. "Affinity Propagation and Uncapacitated Facility Location Problems," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 443-480, October.
    7. Aurore Delaigle & Peter Hall & Tung Pham, 2019. "Clustering functional data into groups by using projections," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 271-304, April.
    8. Luyao Wang & Hong Fan & Tianren Gong, 2018. "The Consumer Demand Estimating and Purchasing Strategies Optimizing of FMCG Retailers Based on Geographic Methods," Sustainability, MDPI, vol. 10(2), pages 1-17, February.
    9. Stéphane Bonhomme & Elena Manresa, 2015. "Grouped Patterns of Heterogeneity in Panel Data," Econometrica, Econometric Society, vol. 83(3), pages 1147-1184, May.
    10. Duchenne, Éric & Laporte, Gilbert & Semet, Frédéric, 2012. "The undirected m-Capacitated Peripatetic Salesman Problem," European Journal of Operational Research, Elsevier, vol. 223(3), pages 637-643.
    11. Raf Jans, 2009. "Solving Lot-Sizing Problems on Parallel Identical Machines Using Symmetry-Breaking Constraints," INFORMS Journal on Computing, INFORMS, vol. 21(1), pages 123-136, February.
    12. Michael Brusco & Patrick Doreian & Andrej Mrvar & Douglas Steinley, 2011. "Two Algorithms for Relaxed Structural Balance Partitioning: Linking Theory, Models, and Data to Understand Social Network Phenomena," Sociological Methods & Research, , vol. 40(1), pages 57-87, February.
    13. Michael Brusco & Hans-Friedrich Köhn, 2009. "Exemplar-Based Clustering via Simulated Annealing," Psychometrika, Springer;The Psychometric Society, vol. 74(3), pages 457-475, September.
    14. Carrizosa, E. & Martin-Barragán, B. & Plastria, F. & Romero Morales, M.D., 2002. "A Dissimilarity-based approach for Classification," Research Memorandum 027, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    15. Amaldi, Edoardo & Coniglio, Stefano, 2013. "A distance-based point-reassignment heuristic for the k-hyperplane clustering problem," European Journal of Operational Research, Elsevier, vol. 227(1), pages 22-29.
    16. Jan Pablo Burgard & Carina Moreira Costa & Christopher Hojny & Thomas Kleinert & Martin Schmidt, 2023. "Mixed-integer programming techniques for the minimum sum-of-squares clustering problem," Journal of Global Optimization, Springer, vol. 87(1), pages 133-189, September.
    17. Turkensteen, Marcel & Klose, Andreas, 2012. "Demand dispersion and logistics costs in one-to-many distribution systems," European Journal of Operational Research, Elsevier, vol. 223(2), pages 499-507.
    18. Carrizosa, Emilio & Mladenović, Nenad & Todosijević, Raca, 2013. "Variable neighborhood search for minimum sum-of-squares clustering on networks," European Journal of Operational Research, Elsevier, vol. 230(2), pages 356-363.
    19. Adib Roshani & Mehdi Hamidi, 2022. "Groundwater Level Fluctuations in Coastal Aquifer: Using Artificial Neural Networks to Predict the Impacts of Climatical CMIP6 Scenarios," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(11), pages 3981-4001, September.
    20. Ahmed Albatineh & Magdalena Niewiadomska-Bugaj, 2011. "MCS: A Method for Finding the Number of Clusters," Journal of Classification, Springer;The Classification Society, vol. 28(2), pages 184-209, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jglopt:v:49:y:2011:i:3:p:449-465. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.