Printed from https://ideas.repec.org/a/eee/ejores/v253y2016i3p659-672.html

A model for clustering data from heterogeneous dissimilarities

Author

Listed:
  • Santi, Éverton
  • Aloise, Daniel
  • Blanchard, Simon J.

Abstract

Clustering algorithms partition a set of n objects into p groups (called clusters) such that objects assigned to the same group are homogeneous according to some criterion. To derive these clusters, the required input is often a single n × n dissimilarity matrix. Yet in many applications more than one dissimilarity matrix is available, and to conform to model requirements it is common practice to aggregate the matrices (e.g., by summing or averaging them). This aggregation results in clustering solutions that mask the true nature of the original data. In this paper we introduce a clustering model that handles this heterogeneity by using all available dissimilarity matrices and identifying groups of individuals who cluster the objects in a similar way. The model is a nonconvex problem that is difficult to solve exactly, so we introduce a Variable Neighborhood Search heuristic to provide solutions efficiently. Computational experiments and an empirical application to perception of chocolate candy show that the heuristic algorithm is efficient and that the proposed model is suited for recovering heterogeneous data. Implications for clustering researchers are discussed.
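
The masking effect of aggregation described in the abstract can be illustrated with a toy sketch. This is not the authors' model or their Variable Neighborhood Search heuristic; the 0/1 dissimilarities, the within-cluster sum criterion, and the names `within_cost` and `block_matrix` are assumptions made only for this illustration. Two hypothetical individuals sort four objects into two clusters in different ways, and their matrices are then averaged.

```python
from itertools import combinations


def within_cost(d, clusters):
    """Sum of pairwise dissimilarities inside each cluster (assumed criterion)."""
    return sum(d[i][j] for c in clusters for i, j in combinations(c, 2))


def block_matrix(n, clusters):
    """Toy dissimilarity matrix: 0 within a cluster, 1 across clusters."""
    label = {i: k for k, c in enumerate(clusters) for i in c}
    return [[0.0 if label[i] == label[j] else 1.0 for j in range(n)]
            for i in range(n)]


# Two hypothetical individuals who group the same four objects differently.
part_a = [(0, 1), (2, 3)]
part_b = [(0, 2), (1, 3)]
d_a = block_matrix(4, part_a)
d_b = block_matrix(4, part_b)

# Common practice: average the matrices into a single n x n input.
d_avg = [[(d_a[i][j] + d_b[i][j]) / 2 for j in range(4)] for i in range(4)]

# Each individual matrix clearly prefers its own partition ...
cost_a_own = within_cost(d_a, part_a)
cost_a_other = within_cost(d_a, part_b)

# ... but the averaged matrix scores both partitions identically.
cost_avg_a = within_cost(d_avg, part_a)
cost_avg_b = within_cost(d_avg, part_b)

print(cost_a_own, cost_a_other, cost_avg_a, cost_avg_b)
```

Under these assumptions, each individual's matrix separates the two partitions, while the averaged matrix cannot tell them apart, which is the kind of information loss that motivates using all matrices jointly.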

Suggested Citation

  • Santi, Éverton & Aloise, Daniel & Blanchard, Simon J., 2016. "A model for clustering data from heterogeneous dissimilarities," European Journal of Operational Research, Elsevier, vol. 253(3), pages 659-672.
  • Handle: RePEc:eee:ejores:v:253:y:2016:i:3:p:659-672
    DOI: 10.1016/j.ejor.2016.03.033

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221716301618
    Download Restriction: Full text for ScienceDirect subscribers only



    Citations

    Cited by:

    1. Zhu, Shan & Hu, Xiangpei & Huang, Kai & Yuan, Yufei, 2021. "Optimization of product category allocation in multiple warehouses to minimize splitting of online supermarket customer orders," European Journal of Operational Research, Elsevier, vol. 290(2), pages 556-571.
    2. Karmitsa, Napsu & Bagirov, Adil M. & Taheri, Sona, 2017. "New diagonal bundle method for clustering problems in large data sets," European Journal of Operational Research, Elsevier, vol. 263(2), pages 367-379.
    3. Radek Hrebik & Jaromir Kukal & Josef Jablonsky, 2019. "Optimal unions of hidden classes," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 27(1), pages 161-177, March.
    4. Huerta-Muñoz, Diana L. & Ríos-Mercado, Roger Z. & Ruiz, Rubén, 2017. "An iterated greedy heuristic for a market segmentation problem with multiple attributes," European Journal of Operational Research, Elsevier, vol. 261(1), pages 75-87.
    5. Chen, Yi-Ting & Sun, Edward W. & Lin, Yi-Bing, 2020. "Merging anomalous data usage in wireless mobile telecommunications: Business analytics with a strategy-focused data-driven approach for sustainability," European Journal of Operational Research, Elsevier, vol. 281(3), pages 687-705.
    6. Rota Bulò, Samuel & Pelillo, Marcello, 2017. "Dominant-set clustering: A review," European Journal of Operational Research, Elsevier, vol. 262(1), pages 1-13.
    7. Gambella, Claudio & Ghaddar, Bissan & Naoum-Sawaya, Joe, 2021. "Optimization problems for machine learning: A survey," European Journal of Operational Research, Elsevier, vol. 290(3), pages 807-828.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Blanchard & Daniel Aloise & Wayne DeSarbo, 2012. "The Heterogeneous P-Median Problem for Categorization Based Clustering," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 741-762, October.
    2. Michael Brusco & Douglas Steinley, 2015. "Affinity Propagation and Uncapacitated Facility Location Problems," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 443-480, October.
    3. Shao, Wei & Lye, Ashley & Rundle-Thiele, Sharyn, 2009. "Different strokes for different folks: A method to accommodate decision -making heterogeneity," Journal of Retailing and Consumer Services, Elsevier, vol. 16(6), pages 495-501.
    4. Aby Abraham & Sanjay Patro, 2014. "‘Country-of-Origin’ Effect and Consumer Decision-making," Management and Labour Studies, XLRI Jamshedpur, School of Business Management & Human Resources, vol. 39(3), pages 309-318, August.
    5. Song Lin & Juanjuan Zhang & John R. Hauser, 2015. "Learning from Experience, Simply," Marketing Science, INFORMS, vol. 34(1), pages 1-19, January.
    6. Hauser, John R., 2014. "Consideration-set heuristics," Journal of Business Research, Elsevier, vol. 67(8), pages 1688-1699.
    7. Christina Schamp & Mark Heitmann & Robin Katzenstein, 2019. "Consideration of ethical attributes along the consumer decision-making journey," Journal of the Academy of Marketing Science, Springer, vol. 47(2), pages 328-348, March.
    8. Gabriele Pizzi & Gian Luca Marzocchi, 2020. "Consumer-defined assortments: application of card-sorting to category management," Italian Journal of Marketing, Springer, vol. 2020(1), pages 67-84, March.
    9. Irawan, Chandra Ade & Salhi, Said & Scaparra, Maria Paola, 2014. "An adaptive multiphase approach for large unconditional and conditional p-median problems," European Journal of Operational Research, Elsevier, vol. 237(2), pages 590-605.
    10. Dost, Florian & Wilken, Robert, 2012. "Measuring willingness to pay as a range, revisited: When should we care?," International Journal of Research in Marketing, Elsevier, vol. 29(2), pages 148-166.
    11. Dimitrios Tsekouras & Benedict G. C. Dellaert & Bas Donkers & Gerald Häubl, 2020. "Product set granularity and consumer response to recommendations," Journal of the Academy of Marketing Science, Springer, vol. 48(2), pages 186-202, March.
    12. Timothy J. Gilbride & Greg M. Allenby, 2006. "Estimating Heterogeneous EBA and Economic Screening Rule Choice Models," Marketing Science, INFORMS, vol. 25(5), pages 494-509, September.
    13. Moon, HyungBin & Park, Stephen Youngjun & Woo, JongRoul, 2021. "Staying on convention or leapfrogging to eco-innovation?: Identifying early adopters of hydrogen-powered vehicles," Technological Forecasting and Social Change, Elsevier, vol. 171(C).
    14. Mark Heitmann & Andreas Herrmann, 2007. "Die Zufriedenheit mit dem Entscheidungsprozess als Determinante der Kundenbindung," Schmalenbach Journal of Business Research, Springer, vol. 59(5), pages 530-566, August.
    15. Mark Heitmann & Andreas Herrmann & Christian Kaiser, 2007. "The effect of product variety on purchase probability," Review of Managerial Science, Springer, vol. 1(2), pages 111-131, August.
    16. S. Iglesias-Parro & A. Ortega & E. De la Fuente & I. Martín, 2001. "Context Variables as Cognitive Effort Modulators in Decision Making Using an Alternative-Based Processing Strategy," Quality & Quantity: International Journal of Methodology, Springer, vol. 35(3), pages 311-323, August.
    17. Mueller, Michel G. & de Haan, Peter, 2009. "How much do incentives affect car purchase? Agent-based microsimulation of consumer choice of new cars--Part I: Model structure, simulation of bounded rationality, and model validation," Energy Policy, Elsevier, vol. 37(3), pages 1072-1082, March.
    18. Wästlund, Erik & Otterbring, Tobias & Gustafsson, Anders & Shams, Poja, 2015. "Heuristics and resource depletion: eye-tracking customers’ in situ gaze behavior in the field," Journal of Business Research, Elsevier, vol. 68(1), pages 95-101.
    19. Cervi, Cleber & Brei, Vinicius Andrade, 2022. "Choice deferral: The interaction effects of visual boundaries and consumer knowledge," Journal of Retailing and Consumer Services, Elsevier, vol. 68(C).
    20. Timothy J. Gilbride & Greg M. Allenby, 2004. "A Choice Model with Conjunctive, Disjunctive, and Compensatory Screening Rules," Marketing Science, INFORMS, vol. 23(3), pages 391-406, October.
