
Correlation Assessment of the Performance of Associative Classifiers on Credit Datasets Based on Data Complexity Measures

Author

Listed:
  • Francisco J. Camacho-Urriolagoitia

    (Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo, Av. Juan de Dios Bátiz s/n, Nueva Industrial Vallejo, GAM, Mexico City 07700, Mexico)

  • Yenny Villuendas-Rey

    (Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo, Av. Juan de Dios Bátiz s/n, Nueva Industrial Vallejo, GAM, Mexico City 07700, Mexico)

  • Itzamá López-Yáñez

    (Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo, Av. Juan de Dios Bátiz s/n, Nueva Industrial Vallejo, GAM, Mexico City 07700, Mexico)

  • Oscar Camacho-Nieto

    (Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo, Av. Juan de Dios Bátiz s/n, Nueva Industrial Vallejo, GAM, Mexico City 07700, Mexico)

  • Cornelio Yáñez-Márquez

    (Instituto Politécnico Nacional, Centro de Investigación en Computación, Av. Juan de Dios Bátiz s/n, Nueva Industrial Vallejo, GAM, Mexico City 07738, Mexico)

Abstract

Pattern classification is one of the four basic machine learning tasks. Selecting the proper learning algorithm for a given problem is a challenging task, formally known as the algorithm selection problem (ASP). In particular, we are interested in the behavior of associative classifiers derived from Alpha-Beta models when applied to the financial field. In this paper, we studied the behavior of four associative classifiers: the One-Hot version of the Hybrid Associative Classifier with Translation (CHAT-OHM), the Extended Gamma (EG), the Naïve Associative Classifier (NAC), and the Assisted Classification for Imbalanced Datasets (ACID). To assess performance, we used the area under the curve (AUC), F-score, and geometric mean measures. The four classifiers were applied to 11 datasets from the financial area. The performance of each classifier was then analyzed in terms of its correlation with data complexity measures from six categories, each based on a specific aspect of the datasets: feature, linearity, neighborhood, network, dimensionality, and class imbalance. The correlations between the complexity measures of the datasets and the performance measures of the associative classifiers are established and expressed with Spearman’s Rho coefficient. The experimental results indicated correlations between data complexity measures and the performance of the associative classifiers.
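
As a rough illustration of the kind of analysis the abstract describes, the sketch below computes Spearman's Rho between one data complexity measure and one performance measure across several datasets. It is not the authors' code: the function names `average_ranks` and `spearman_rho` are hypothetical, and the numbers in the usage note are invented for illustration only, not results from the paper.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the whole run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's Rho: the Pearson correlation of the rank-transformed data."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in rx))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in ry))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("Rho is undefined when one variable is constant")
    return cov / (sd_x * sd_y)


# Invented values: one complexity measure and one classifier's AUC
# on five hypothetical datasets (NOT data from the paper).
complexity = [0.10, 0.40, 0.35, 0.80, 0.60]
auc = [0.95, 0.80, 0.85, 0.60, 0.70]
rho = spearman_rho(complexity, auc)
```

With these invented values the AUC falls strictly as the complexity measure rises, so `rho` comes out at -1 (up to floating-point rounding). In the study's setting, one such coefficient would be computed for each pairing of a complexity measure with a performance measure of a classifier over the 11 datasets.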

Suggested Citation

  • Francisco J. Camacho-Urriolagoitia & Yenny Villuendas-Rey & Itzamá López-Yáñez & Oscar Camacho-Nieto & Cornelio Yáñez-Márquez, 2022. "Correlation Assessment of the Performance of Associative Classifiers on Credit Datasets Based on Data Complexity Measures," Mathematics, MDPI, vol. 10(9), pages 1-16, April.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:9:p:1460-:d:802978

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/9/1460/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/9/1460/
    Download Restriction: no

    Citations

    Cited by:

    1. Francisco J. Camacho-Urriolagoitia & Yenny Villuendas-Rey & Cornelio Yáñez-Márquez & Miltiadis Lytras, 2023. "Novel Features and Neighborhood Complexity Measures for Multiclass Classification of Hybrid Data," Sustainability, MDPI, vol. 15(3), pages 1-18, January.
