IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v38y2026i3p905-920.html

A Column Generation Algorithm with Dynamic Constraint Aggregation for Minimum Sum-of-Squares Clustering

Author

Listed:
  • Antonio M. Sudoso

    (Department of Computer, Control and Management Engineering “A. Ruberti”, Sapienza University of Rome, 00185 Rome, Italy)

  • Daniel Aloise

    (Department of Computer and Software Engineering, Polytechnique Montréal, Montréal, Quebec H3T 1J4, Canada)

Abstract

The minimum sum-of-squares clustering problem (MSSC), also known as k -means clustering, refers to the problem of partitioning n data points into k clusters, with the objective of minimizing the total sum of squared Euclidean distances between each point and the center of its assigned cluster. We propose an efficient algorithm for solving large-scale MSSC instances, which combines column generation (CG) with dynamic constraint aggregation (DCA) to effectively reduce the number of constraints considered in the CG master problem. DCA was originally conceived to reduce degeneracy in set partitioning problems by utilizing an aggregated restricted master problem obtained from a partition of the set partitioning constraints into disjoint clusters. In this work, we explore the use of DCA within a CG algorithm for MSSC exact solution. Our method is fine-tuned by a series of ablation studies on DCA design choices, and is demonstrated to significantly outperform existing state-of-the-art exact approaches available in the literature.

Suggested Citation

  • Antonio M. Sudoso & Daniel Aloise, 2026. "A Column Generation Algorithm with Dynamic Constraint Aggregation for Minimum Sum-of-Squares Clustering," INFORMS Journal on Computing, INFORMS, vol. 38(3), pages 905-920, May.
  • Handle: RePEc:inm:orijoc:v:38:y:2026:i:3:p:905-920
    DOI: 10.1287/ijoc.2024.0938
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijoc.2024.0938
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2024.0938?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:38:y:2026:i:3:p:905-920. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.