IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v61y2013i1p184-198.html
   My bibliography  Save this article

A Polynomial Time Algorithm for Rayleigh Ratio on Discrete Variables: Replacing Spectral Techniques for Expander Ratio, Normalized Cut, and Cheeger Constant

Author

Listed:
  • Dorit S. Hochbaum

    (Department of Industrial Engineering and Operations Research, University of California, Berkeley, Berkeley, California 94720)

Abstract

A general form of minimizing the Rayleigh ratio on discrete variables is shown here, for the first time, to be polynomial time solvable. This is significant because major problems in clustering, partitioning, and imaging can be presented as the Rayleigh ratio minimization on discrete variables and an orthogonality constraint. These challenging problems are modeled as the normalized cut problem, the graph expander ratio problem, the Cheeger constant problem, or the conductance problem, all of which are NP-hard. These problems have traditionally been solved, heuristically, using the “spectral technique.” A unified framework is provided here whereby all these problems are formulated as a constrained minimization form of a quadratic ratio, referred to here as the Rayleigh ratio . The quadratic ratio is to be minimized on discrete variables and a single sum constraint that we call the balance or orthogonality constraint. When the discreteness constraints on the variables are disregarded, the resulting continuous relaxation is solved by the spectral method. It is shown here that the Rayleigh ratio minimization subject to the discreteness constraints requiring each variable to assume one of two values in {- b ,1} is solvable in strongly polynomial time, equivalent to a single minimum s , t cut algorithm on a graph of same size as the input graph, for any nonnegative value of b . This discrete form for the Rayleigh ratio problem was often assumed to be NP-hard. Not only is it shown here that the discrete Rayleigh ratio problem is polynomial time solvable, but also the algorithm is more efficient than the spectral algorithm. Furthermore, an experimental study demonstrates that the new algorithm provides in practice an improvement, often dramatic, on the quality of the results of the spectral method, both in terms of approximating the true optimum of the Rayleigh ratio problem on both the discrete variables and the balance constraint, and in terms of the subjective partition quality. A further contribution here is the introduction of a problem, the quantity-normalized cut , generalizing all the Rayleigh ratio problems. The discrete version of that problem is also solved with the efficient algorithm presented. This problem is shown, in a companion paper, to enable the modeling of features essential to clustering that are valuable in practical applications.

Suggested Citation

  • Dorit S. Hochbaum, 2013. "A Polynomial Time Algorithm for Rayleigh Ratio on Discrete Variables: Replacing Spectral Techniques for Expander Ratio, Normalized Cut, and Cheeger Constant," Operations Research, INFORMS, vol. 61(1), pages 184-198, February.
  • Handle: RePEc:inm:oropre:v:61:y:2013:i:1:p:184-198
    DOI: 10.1287/opre.1120.1126
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.1120.1126
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.1120.1126?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dorit S. Hochbaum, 2008. "The Pseudoflow Algorithm: A New Algorithm for the Maximum-Flow Problem," Operations Research, INFORMS, vol. 56(4), pages 992-1009, August.
    2. Kenneth M. Hall, 1970. "An r-Dimensional Quadratic Placement Algorithm," Management Science, INFORMS, vol. 17(3), pages 219-229, November.
    3. Eitan Sharon & Meirav Galun & Dahlia Sharon & Ronen Basri & Achi Brandt, 2006. "Hierarchy and adaptivity in segmenting visual scenes," Nature, Nature, vol. 442(7104), pages 810-813, August.
    4. Manikandan Narayanan & Adrian Vetta & Eric E Schadt & Jun Zhu, 2010. "Simultaneous Clustering of Multiple Gene Expression and Physical Interaction Datasets," PLOS Computational Biology, Public Library of Science, vol. 6(4), pages 1-13, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hugo Harry Kramer & Eduardo Uchoa & Marcia Fampa & Viviane Köhler & François Vanderbeck, 2016. "Column generation approaches for the software clustering problem," Computational Optimization and Applications, Springer, vol. 64(3), pages 843-864, July.
    2. Baumann, P. & Hochbaum, D.S. & Yang, Y.T., 2019. "A comparative study of the leading machine learning techniques and two new optimization algorithms," European Journal of Operational Research, Elsevier, vol. 272(3), pages 1041-1057.
    3. Matsypura, Dmytro & Thompson, Ryan & Vasnev, Andrey L., 2018. "Optimal selection of expert forecasts with integer programming," Omega, Elsevier, vol. 78(C), pages 165-175.
    4. Roberto Asín Achá & Dorit S. Hochbaum & Quico Spaen, 2020. "HNCcorr: combinatorial optimization for neuron identification," Annals of Operations Research, Springer, vol. 289(1), pages 5-32, June.
    5. Ruriko Yoshida & Kenji Fukumizu & Chrysafis Vogiatzis, 2019. "Multilocus phylogenetic analysis with gene tree clustering," Annals of Operations Research, Springer, vol. 276(1), pages 293-313, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberto Asín Achá & Dorit S. Hochbaum & Quico Spaen, 2020. "HNCcorr: combinatorial optimization for neuron identification," Annals of Operations Research, Springer, vol. 289(1), pages 5-32, June.
    2. Gautier M Krings & Jean-François Carpantier & Jean-Charles Delvenne, 2014. "Trade Integration and Trade Imbalances in the European Union: A Network Perspective," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-14, January.
    3. Amina Lamghari & Roussos Dimitrakopoulos & Jacques Ferland, 2015. "A hybrid method based on linear programming and variable neighborhood descent for scheduling production in open-pit mines," Journal of Global Optimization, Springer, vol. 63(3), pages 555-582, November.
    4. Armin Fügenschuh & Marzena Fügenschuh, 2008. "Integer linear programming models for topology optimization in sheet metal design," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 68(2), pages 313-331, October.
    5. Enrique Jelvez & Nelson Morales & Julian M. Ortiz, 2021. "Stochastic Final Pit Limits: An Efficient Frontier Analysis under Geological Uncertainty in the Open-Pit Mining Industry," Mathematics, MDPI, vol. 10(1), pages 1-16, December.
    6. Renaud Chicoisne & Daniel Espinoza & Marcos Goycoolea & Eduardo Moreno & Enrique Rubio, 2012. "A New Algorithm for the Open-Pit Mine Production Scheduling Problem," Operations Research, INFORMS, vol. 60(3), pages 517-528, June.
    7. Yan T. Yang & Barak Fishbain & Dorit S. Hochbaum & Eric B. Norman & Erik Swanberg, 2014. "The Supervised Normalized Cut Method for Detecting, Classifying, and Identifying Special Nuclear Materials," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 45-58, February.
    8. Laleh Behjat & Dorothy Kucar & Anthony Vannelli, 2002. "A Novel Eigenvector Technique for Large Scale Combinatorial Problems in VLSI Layout," Journal of Combinatorial Optimization, Springer, vol. 6(3), pages 271-286, September.
    9. Ruriko Yoshida & Kenji Fukumizu & Chrysafis Vogiatzis, 2019. "Multilocus phylogenetic analysis with gene tree clustering," Annals of Operations Research, Springer, vol. 276(1), pages 293-313, May.
    10. Nancel-Penard, Pierre & Morales, Nelson & Cornillier, Fabien, 2022. "A recursive time aggregation-disaggregation heuristic for the multidimensional and multiperiod precedence-constrained knapsack problem: An application to the open-pit mine block sequencing problem," European Journal of Operational Research, Elsevier, vol. 303(3), pages 1088-1099.
    11. André R. S. Amaral, 2008. "An Exact Approach to the One-Dimensional Facility Layout Problem," Operations Research, INFORMS, vol. 56(4), pages 1026-1033, August.
    12. Kwame Awuah-Offei & Sisi Que & Atta Ur Rehman, 2021. "Evaluating Mine Design Alternatives for Social Risks Using Discrete Choice Analysis," Sustainability, MDPI, vol. 13(16), pages 1-15, August.
    13. Bhupatiraju, Samyukta & Nomaler, Önder & Triulzi, Giorgio & Verspagen, Bart, 2012. "Knowledge flows – Analyzing the core literature of innovation, entrepreneurship and science and technology studies," Research Policy, Elsevier, vol. 41(7), pages 1205-1218.
    14. Jélvez, Enrique & Morales, Nelson & Nancel-Penard, Pierre & Cornillier, Fabien, 2020. "A new hybrid heuristic algorithm for the Precedence Constrained Production Scheduling Problem: A mining application," Omega, Elsevier, vol. 94(C).
    15. Gonzalo Muñoz & Daniel Espinoza & Marcos Goycoolea & Eduardo Moreno & Maurice Queyranne & Orlando Rivera Letelier, 2018. "A study of the Bienstock–Zuckerberg algorithm: applications in mining and resource constrained project scheduling," Computational Optimization and Applications, Springer, vol. 69(2), pages 501-534, March.
    16. Whittle, D. & Brazil, M. & Grossman, P.A. & Rubinstein, J.H. & Thomas, D.A., 2018. "Combined optimisation of an open-pit mine outline and the transition depth to underground mining," European Journal of Operational Research, Elsevier, vol. 268(2), pages 624-634.
    17. Madziwa, Lawrence & Pillalamarry, Mallikarjun & Chatterjee, Snehamoy, 2023. "Integrating stochastic mine planning model with ARDL commodity price forecasting," Resources Policy, Elsevier, vol. 85(PB).
    18. Ravi Kumar, K. & Hadjinicola, George C. & Lin, Ting-li, 1995. "A heuristic procedure for the single-row facility layout problem," European Journal of Operational Research, Elsevier, vol. 87(1), pages 65-73, November.
    19. Jintae Park & Sungha Yoon & Chaeyoung Lee & Junseok Kim, 2020. "A Simple Method for Network Visualization," Mathematics, MDPI, vol. 8(6), pages 1-13, June.
    20. Li, Xiangyong & Aneja, Y.P., 2017. "Regenerator location problem: Polyhedral study and effective branch-and-cut algorithms," European Journal of Operational Research, Elsevier, vol. 257(1), pages 25-40.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:61:y:2013:i:1:p:184-198. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.