IDEAS home Printed from https://ideas.repec.org/p/cte/wsrepe/ws121913.html
   My bibliography  Save this paper

A p-median problem with distance selection

Author

Listed:
  • Benati, Stefano
  • García, Sergio

Abstract

This paper introduces an extension of the p-median problem and its application to clustering, in which the distance/dissimilarity function between units is calculated as the distance sum on the q most important variables. These variables are to be chosen from a set of m elements, so a new combinatorial feature has been added to the problem, that we call the p-median model with distance selection. This problem has its origin in cluster analysis, often applied to sociological surveys, where it is common practice for a researcher to select the q statistical variables they predict will be the most important in discriminating the statistical units before applying the clustering algorithm. Here we show how this selection can be formulated as a non-linear mixed integer optimization mode and we show how this model can be linearized in several different ways. These linearizations are compared in a computational study and the results outline that the radius formulation of the p-median is the most efficient model for solving this problem.

Suggested Citation

  • Benati, Stefano & García, Sergio, 2012. "A p-median problem with distance selection," DES - Working Papers. Statistics and Econometrics. WS ws121913, Universidad Carlos III de Madrid. Departamento de Estadística.
  • Handle: RePEc:cte:wsrepe:ws121913
    as

    Download full text from publisher

    File URL: https://e-archivo.uc3m.es/bitstream/handle/10016/14672/ws121913.pdf?sequence=1
    Download Restriction: no

    References listed on IDEAS

    as
    1. John M. Mulvey & Harlan P. Crowder, 1979. "Cluster Analysis: An Application of Lagrangian Relaxation," Management Science, INFORMS, vol. 25(4), pages 329-340, April.
    2. T. D. Klastorin, 1985. "The p-Median Problem for Cluster Analysis: A Comparative Test Using the Mixture Model Approach," Management Science, INFORMS, vol. 31(1), pages 84-95, January.
    3. Stefano Benati & Silvana Stefani, 2011. "The Academic Journal Ranking Problem: A Fuzzy-Clustering Approach," Journal of Classification, Springer;The Classification Society, vol. 28(1), pages 7-20, April.
    Full references (including those not matched with items on IDEAS)

    More about this item

    Keywords

    p-median problem;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cte:wsrepe:ws121913. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Ana Poveda). General contact details of provider: http://portal.uc3m.es/portal/page/portal/dpto_estadistica .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.