IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v248y2016i2p593-606.html
   My bibliography  Save this article

A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning

Author

Listed:
  • Caserta, Marco
  • Reiners, Torsten

Abstract

In this paper, we address the binary classification problem, in which one is given a set of observations, characterized by a number of (binary and non-binary) attributes and wants to determine which class each observation belongs to. The proposed classification algorithm is based on the Logical Analysis of Data (LAD) technique and belongs to the class of supervised learning algorithms. We introduce a novel metaheuristic-based approach for pattern generation within LAD. The key idea relies on the generation of a pool of patterns for each given observation of the training set. Such a pool is built with one or more criteria in mind (e.g., diversity, homogeneity, coverage, etc.), and is paramount in the achievement of high classification accuracy, as shown by the computational results we obtained. In addition, we address one of the major concerns of many data mining algorithms, i.e., the fine-tuning and calibration of parameters. We employ here a novel technique, called biased Random-Key Genetic Algorithm that allows the calibration of all the parameters of the algorithm in an automatic fashion, hence reducing the fine-tuning effort required and enhancing the performance of the algorithm itself. We tested the proposed approach on 10 benchmark instances from the UCI repository and we proved that the algorithm is competitive, both in terms of classification accuracy and running time.

Suggested Citation

  • Caserta, Marco & Reiners, Torsten, 2016. "A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning," European Journal of Operational Research, Elsevier, vol. 248(2), pages 593-606.
  • Handle: RePEc:eee:ejores:v:248:y:2016:i:2:p:593-606
    DOI: 10.1016/j.ejor.2015.05.078
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221715004907
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2015.05.078?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peter Hammer & Tibérius Bonates, 2006. "Logical analysis of data—An overview: From combinatorial optimization to medical applications," Annals of Operations Research, Springer, vol. 148(1), pages 203-225, November.
    2. Pieter-Tjerk de Boer & Dirk Kroese & Shie Mannor & Reuven Rubinstein, 2005. "A Tutorial on the Cross-Entropy Method," Annals of Operations Research, Springer, vol. 134(1), pages 19-67, February.
    3. Greco, Salvatore & Matarazzo, Benedetto & Slowinski, Roman, 2001. "Rough sets theory for multicriteria decision analysis," European Journal of Operational Research, Elsevier, vol. 129(1), pages 1-47, February.
    4. Hammer, P.L. & Kogan, A. & Lejeune, M.A., 2006. "Modeling country risk ratings using partial orders," European Journal of Operational Research, Elsevier, vol. 175(2), pages 836-859, December.
    5. Thiago Noronha & Mauricio Resende & Celso Ribeiro, 2011. "A biased random-key genetic algorithm for routing and wavelength assignment," Journal of Global Optimization, Springer, vol. 50(3), pages 503-518, July.
    6. Endre Boros & Yves Crama & Peter Hammer & Toshihide Ibaraki & Alexander Kogan & Kazuhisa Makino, 2011. "Logical analysis of data: classification with justification," Annals of Operations Research, Springer, vol. 188(1), pages 33-61, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lejeune, Miguel & Lozin, Vadim & Lozina, Irina & Ragab, Ahmed & Yacout, Soumaya, 2019. "Recent advances in the theory and practice of Logical Analysis of Data," European Journal of Operational Research, Elsevier, vol. 275(1), pages 1-15.
    2. Pessoa, Luciana S. & Andrade, Carlos E., 2018. "Heuristics for a flowshop scheduling problem with stepwise job objective function," European Journal of Operational Research, Elsevier, vol. 266(3), pages 950-962.
    3. Maurizio Boccia & Antonio Sforza & Claudio Sterle, 2020. "Simple Pattern Minimality Problems: Integer Linear Programming Formulations and Covering-Based Heuristic Solving Approaches," INFORMS Journal on Computing, INFORMS, vol. 32(4), pages 1049-1060, October.
    4. Caserta, Marco & Voß, Stefan, 2019. "The robust multiple-choice multidimensional knapsack problem," Omega, Elsevier, vol. 86(C), pages 16-27.
    5. Andrade, Carlos E. & Toso, Rodrigo F. & Gonçalves, José F. & Resende, Mauricio G.C., 2021. "The Multi-Parent Biased Random-Key Genetic Algorithm with Implicit Path-Relinking and its real-world applications," European Journal of Operational Research, Elsevier, vol. 289(1), pages 17-30.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lejeune, Miguel & Lozin, Vadim & Lozina, Irina & Ragab, Ahmed & Yacout, Soumaya, 2019. "Recent advances in the theory and practice of Logical Analysis of Data," European Journal of Operational Research, Elsevier, vol. 275(1), pages 1-15.
    2. Maurizio Boccia & Antonio Sforza & Claudio Sterle, 2020. "Simple Pattern Minimality Problems: Integer Linear Programming Formulations and Covering-Based Heuristic Solving Approaches," INFORMS Journal on Computing, INFORMS, vol. 32(4), pages 1049-1060, October.
    3. Fawaz Alsolami & Talha Amin & Igor Chikalov & Mikhail Moshkov, 2018. "Bi-criteria optimization problems for decision rules," Annals of Operations Research, Springer, vol. 271(2), pages 279-295, December.
    4. Bagchi, Prabir & Lejeune, Miguel A. & Alam, A., 2014. "How supply competency affects FDI decisions: Some insights," International Journal of Production Economics, Elsevier, vol. 147(PB), pages 239-251.
    5. Miguel Lejeune & François Margot, 2011. "Optimization for simulation: LAD accelerator," Annals of Operations Research, Springer, vol. 188(1), pages 285-305, August.
    6. Miguel A. Lejeune, 2012. "Pattern-Based Modeling and Solution of Probabilistically Constrained Optimization Problems," Operations Research, INFORMS, vol. 60(6), pages 1356-1372, December.
    7. Wen-Min Lu & Qian Long Kweh & Chung-Wei Wang, 2021. "Integration and application of rough sets and data envelopment analysis for assessments of the investment trusts industry," Annals of Operations Research, Springer, vol. 296(1), pages 163-194, January.
    8. Pang, Jifang & Liang, Jiye, 2012. "Evaluation of the results of multi-attribute group decision-making with linguistic information," Omega, Elsevier, vol. 40(3), pages 294-301.
    9. Eduardo Fernández & José Rui Figueira & Jorge Navarro, 2023. "A theoretical look at ordinal classification methods based on comparing actions with limiting boundaries between adjacent classes," Annals of Operations Research, Springer, vol. 325(2), pages 819-843, June.
    10. Doumpos, M. & Marinakis, Y. & Marinaki, M. & Zopounidis, C., 2009. "An evolutionary approach to construction of outranking models for multicriteria classification: The case of the ELECTRE TRI method," European Journal of Operational Research, Elsevier, vol. 199(2), pages 496-505, December.
    11. Skorupski, Jacek & Uchroński, Piotr, 2017. "A fuzzy model for evaluating metal detection equipment at airport security screening checkpoints," International Journal of Critical Infrastructure Protection, Elsevier, vol. 16(C), pages 39-48.
    12. Pierre Hansen & Christophe Meyer, 2011. "A new column generation algorithm for Logical Analysis of Data," Annals of Operations Research, Springer, vol. 188(1), pages 215-249, August.
    13. Bouyssou, Denis & Marchant, Thierry, 2007. "An axiomatic approach to noncompensatory sorting methods in MCDM, II: More than two categories," European Journal of Operational Research, Elsevier, vol. 178(1), pages 246-276, April.
    14. Sun, Xiaolei & Li, Jianping & Tang, Ling & Wu, Dengsheng, 2012. "Identifying the risk-return tradeoff and exploring the dynamic risk exposure of country portfolio of the FSU's oil economies," Economic Modelling, Elsevier, vol. 29(6), pages 2494-2503.
    15. repec:dau:papers:123456789/4080 is not listed on IDEAS
    16. Becchio, Cristina & Bottero, Marta Carla & Corgnati, Stefano Paolo & Dell’Anna, Federico, 2018. "Decision making for sustainable urban energy planning: an integrated evaluation framework of alternative solutions for a NZED (Net Zero-Energy District) in Turin," Land Use Policy, Elsevier, vol. 78(C), pages 803-817.
    17. Elnaz Gholipour & B'ela Vizv'ari & Zolt'an Lakner, 2020. "Reconstruction Rating Model of Sovereign Debt by Logical Analysis of Data," Papers 2011.14112, arXiv.org.
    18. Fernandez, Eduardo & Navarro, Jorge & Bernal, Sergio, 2010. "Handling multicriteria preferences in cluster analysis," European Journal of Operational Research, Elsevier, vol. 202(3), pages 819-827, May.
    19. Pawel Lezanski & Maria Pilacinska, 2018. "The dominance-based rough set approach to cylindrical plunge grinding process diagnosis," Journal of Intelligent Manufacturing, Springer, vol. 29(5), pages 989-1004, June.
    20. Tsoukias, Alexis, 2008. "From decision theory to decision aiding methodology," European Journal of Operational Research, Elsevier, vol. 187(1), pages 138-161, May.
    21. Choudhary, Devendra & Shankar, Ravi, 2012. "An STEEP-fuzzy AHP-TOPSIS framework for evaluation and selection of thermal power plant location: A case study from India," Energy, Elsevier, vol. 42(1), pages 510-521.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:248:y:2016:i:2:p:593-606. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.