IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v248y2016i2p593-606.html
   My bibliography  Save this article

A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning

Author

Listed:
  • Caserta, Marco
  • Reiners, Torsten

Abstract

In this paper, we address the binary classification problem, in which one is given a set of observations, characterized by a number of (binary and non-binary) attributes and wants to determine which class each observation belongs to. The proposed classification algorithm is based on the Logical Analysis of Data (LAD) technique and belongs to the class of supervised learning algorithms. We introduce a novel metaheuristic-based approach for pattern generation within LAD. The key idea relies on the generation of a pool of patterns for each given observation of the training set. Such a pool is built with one or more criteria in mind (e.g., diversity, homogeneity, coverage, etc.), and is paramount in the achievement of high classification accuracy, as shown by the computational results we obtained. In addition, we address one of the major concerns of many data mining algorithms, i.e., the fine-tuning and calibration of parameters. We employ here a novel technique, called biased Random-Key Genetic Algorithm that allows the calibration of all the parameters of the algorithm in an automatic fashion, hence reducing the fine-tuning effort required and enhancing the performance of the algorithm itself. We tested the proposed approach on 10 benchmark instances from the UCI repository and we proved that the algorithm is competitive, both in terms of classification accuracy and running time.

Suggested Citation

  • Caserta, Marco & Reiners, Torsten, 2016. "A pool-based pattern generation algorithm for logical analysis of data with automatic fine-tuning," European Journal of Operational Research, Elsevier, vol. 248(2), pages 593-606.
  • Handle: RePEc:eee:ejores:v:248:y:2016:i:2:p:593-606
    DOI: 10.1016/j.ejor.2015.05.078
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221715004907
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2015.05.078?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peter Hammer & Tibérius Bonates, 2006. "Logical analysis of data—An overview: From combinatorial optimization to medical applications," Annals of Operations Research, Springer, vol. 148(1), pages 203-225, November.
    2. Pieter-Tjerk de Boer & Dirk Kroese & Shie Mannor & Reuven Rubinstein, 2005. "A Tutorial on the Cross-Entropy Method," Annals of Operations Research, Springer, vol. 134(1), pages 19-67, February.
    3. Greco, Salvatore & Matarazzo, Benedetto & Slowinski, Roman, 2001. "Rough sets theory for multicriteria decision analysis," European Journal of Operational Research, Elsevier, vol. 129(1), pages 1-47, February.
    4. Hammer, P.L. & Kogan, A. & Lejeune, M.A., 2006. "Modeling country risk ratings using partial orders," European Journal of Operational Research, Elsevier, vol. 175(2), pages 836-859, December.
    5. Thiago Noronha & Mauricio Resende & Celso Ribeiro, 2011. "A biased random-key genetic algorithm for routing and wavelength assignment," Journal of Global Optimization, Springer, vol. 50(3), pages 503-518, July.
    6. Endre Boros & Yves Crama & Peter Hammer & Toshihide Ibaraki & Alexander Kogan & Kazuhisa Makino, 2011. "Logical analysis of data: classification with justification," Annals of Operations Research, Springer, vol. 188(1), pages 33-61, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lejeune, Miguel & Lozin, Vadim & Lozina, Irina & Ragab, Ahmed & Yacout, Soumaya, 2019. "Recent advances in the theory and practice of Logical Analysis of Data," European Journal of Operational Research, Elsevier, vol. 275(1), pages 1-15.
    2. Pessoa, Luciana S. & Andrade, Carlos E., 2018. "Heuristics for a flowshop scheduling problem with stepwise job objective function," European Journal of Operational Research, Elsevier, vol. 266(3), pages 950-962.
    3. Maurizio Boccia & Antonio Sforza & Claudio Sterle, 2020. "Simple Pattern Minimality Problems: Integer Linear Programming Formulations and Covering-Based Heuristic Solving Approaches," INFORMS Journal on Computing, INFORMS, vol. 32(4), pages 1049-1060, October.
    4. Caserta, Marco & Voß, Stefan, 2019. "The robust multiple-choice multidimensional knapsack problem," Omega, Elsevier, vol. 86(C), pages 16-27.
    5. Andrade, Carlos E. & Toso, Rodrigo F. & Gonçalves, José F. & Resende, Mauricio G.C., 2021. "The Multi-Parent Biased Random-Key Genetic Algorithm with Implicit Path-Relinking and its real-world applications," European Journal of Operational Research, Elsevier, vol. 289(1), pages 17-30.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lejeune, Miguel & Lozin, Vadim & Lozina, Irina & Ragab, Ahmed & Yacout, Soumaya, 2019. "Recent advances in the theory and practice of Logical Analysis of Data," European Journal of Operational Research, Elsevier, vol. 275(1), pages 1-15.
    2. Fawaz Alsolami & Talha Amin & Igor Chikalov & Mikhail Moshkov, 2018. "Bi-criteria optimization problems for decision rules," Annals of Operations Research, Springer, vol. 271(2), pages 279-295, December.
    3. Maurizio Boccia & Antonio Sforza & Claudio Sterle, 2020. "Simple Pattern Minimality Problems: Integer Linear Programming Formulations and Covering-Based Heuristic Solving Approaches," INFORMS Journal on Computing, INFORMS, vol. 32(4), pages 1049-1060, October.
    4. Bagchi, Prabir & Lejeune, Miguel A. & Alam, A., 2014. "How supply competency affects FDI decisions: Some insights," International Journal of Production Economics, Elsevier, vol. 147(PB), pages 239-251.
    5. Miguel Lejeune & François Margot, 2011. "Optimization for simulation: LAD accelerator," Annals of Operations Research, Springer, vol. 188(1), pages 285-305, August.
    6. Miguel A. Lejeune, 2012. "Pattern-Based Modeling and Solution of Probabilistically Constrained Optimization Problems," Operations Research, INFORMS, vol. 60(6), pages 1356-1372, December.
    7. Eduardo Fernández & José Rui Figueira & Jorge Navarro, 2023. "A theoretical look at ordinal classification methods based on comparing actions with limiting boundaries between adjacent classes," Annals of Operations Research, Springer, vol. 325(2), pages 819-843, June.
    8. Doumpos, M. & Marinakis, Y. & Marinaki, M. & Zopounidis, C., 2009. "An evolutionary approach to construction of outranking models for multicriteria classification: The case of the ELECTRE TRI method," European Journal of Operational Research, Elsevier, vol. 199(2), pages 496-505, December.
    9. Bouyssou, Denis & Marchant, Thierry, 2007. "An axiomatic approach to noncompensatory sorting methods in MCDM, II: More than two categories," European Journal of Operational Research, Elsevier, vol. 178(1), pages 246-276, April.
    10. Fernandez, Eduardo & Navarro, Jorge & Bernal, Sergio, 2010. "Handling multicriteria preferences in cluster analysis," European Journal of Operational Research, Elsevier, vol. 202(3), pages 819-827, May.
    11. Pawel Lezanski & Maria Pilacinska, 2018. "The dominance-based rough set approach to cylindrical plunge grinding process diagnosis," Journal of Intelligent Manufacturing, Springer, vol. 29(5), pages 989-1004, June.
    12. Choudhary, Devendra & Shankar, Ravi, 2012. "An STEEP-fuzzy AHP-TOPSIS framework for evaluation and selection of thermal power plant location: A case study from India," Energy, Elsevier, vol. 42(1), pages 510-521.
    13. García Cáceres, Rafael Guillermo & Aráoz Durand, Julián Arturo & Gómez, Fernando Palacios, 2009. "Integral analysis method - IAM," European Journal of Operational Research, Elsevier, vol. 192(3), pages 891-903, February.
    14. Azam, Nouman & Zhang, Yan & Yao, JingTao, 2017. "Evaluation functions and decision conditions of three-way decisions with game-theoretic rough sets," European Journal of Operational Research, Elsevier, vol. 261(2), pages 704-714.
    15. Fowler, John W. & Mönch, Lars, 2022. "A survey of scheduling with parallel batch (p-batch) processing," European Journal of Operational Research, Elsevier, vol. 298(1), pages 1-24.
    16. San Martín Albizuri, Nerea & Rodríguez Castellanos, Arturo, 2008. "¿Reflejan los índices de riesgo país las variables relevantes en el desencadenamiento de las crisis externas? Un análisis sobre el periodo 1994-2001," Cuadernos de Gestión, Universidad del País Vasco - Instituto de Economía Aplicada a la Empresa (IEAE).
    17. Xi Chen & Enlu Zhou, 2015. "Population model-based optimization," Journal of Global Optimization, Springer, vol. 63(1), pages 125-148, September.
    18. Kadziński, Miłosz & Wójcik, Michał & Ciomek, Krzysztof, 2022. "Review and experimental comparison of ranking and choice procedures for constructing a univocal recommendation in a preference disaggregation setting," Omega, Elsevier, vol. 113(C).
    19. Hu, Qiwei & Chakhar, Salem & Siraj, Sajid & Labib, Ashraf, 2017. "Spare parts classification in industrial manufacturing using the dominance-based rough set approach," European Journal of Operational Research, Elsevier, vol. 262(3), pages 1136-1163.
    20. Leung, Yee & Fischer, Manfred M. & Wu, Wei-Zhi & Mi, Ju-Sheng, 2008. "A rough set approach for the discovery of classification rules in interval-valued information systems," MPRA Paper 77767, University Library of Munich, Germany.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:248:y:2016:i:2:p:593-606. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.