IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2603.24215.html

Adapting Altman's bankruptcy prediction model to the compositional data methodology

Author

Listed:
  • Fatemeh Keivani

    (Universitat de Girona)

  • Germ`a Coenders

    (Universitat de Girona)

  • Ge`orgia Escaram'is

    (Universitat de Girona)

Abstract

Using standard financial ratios as variables in statistical analyses has been related to several serious problems, such as extreme outliers, asymmetry, non-normality, and non-linearity. The compositional-data methodology has been successfully applied to solve these problems and has always yielded substantially different results when compared to standard financial ratios. An under-researched area is the use of financial log-ratios computed with the compositional-data methodology to predict bankruptcy or the related terms of business default, insolvency or failure. Another under-researched area is the use of machine learning methods in combination with compositional log-ratios. The present article adapts the classical Altman bankruptcy prediction model and some of its extensions to the compositional methodology with pairwise log-ratios and three common statistical and machine learning tools: logistic regression models, k-nearest neighbours, and random forests, and compares the results with standard financial ratios. Data from the sector in the Spanish economy with the largest number of bankrupt firms according to the first two digits of the NACE code (46XX "wholesale trade, except of motor vehicles and motorcycles") were obtained from the Iberian Balance sheet Analysis System. The sample size (31,131 firms, of which 97 were bankrupt) was divided into a training and a validation dataset. The training data set was downsampled to one healthy firm to each bankrupt firm. No outliers were removed. Focusing on predictive performance, the results show that compositional methods are better than standard ratios in terms of sensitivity, with mixed results regarding specificity, compositional random forests and compositional logistic regression behaving the best.

Suggested Citation

  • Fatemeh Keivani & Germ`a Coenders & Ge`orgia Escaram'is, 2026. "Adapting Altman's bankruptcy prediction model to the compositional data methodology," Papers 2603.24215, arXiv.org.
  • Handle: RePEc:arx:papers:2603.24215
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2603.24215
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Veganzones, David & Séverin, Eric & Ben Jabeur, Sami, 2026. "Forecasting corporate bankruptcy in imbalanced datasets using a new hybrid machine learning approach," Research in International Business and Finance, Elsevier, vol. 81(C).
    2. Juan José Egozcue & Vera Pawlowsky-Glahn, 2019. "Rejoinder on: Compositional data: the sample space and its structure," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 658-663, September.
    3. Pavol Durana & Erika Kovalova & Roman Blazek & Klaudia Bicanovska, 2025. "Business Efficiency: Insights from Visegrad Four Before, During, and After the COVID-19 Pandemic," Economies, MDPI, vol. 13(2), pages 1-36, January.
    4. Masanobu Matsumaru & Hideki Katagiri, 2025. "A Two-Stage Machine Learning Approach to Bankruptcy Prediction: Integrating Full-Feature Modeling and Optimized Feature Selection," JRFM, MDPI, vol. 18(12), pages 1-21, November.
    5. Juan José Egozcue & Vera Pawlowsky-Glahn, 2019. "Compositional data: the sample space and its structure," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 599-638, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Anna Maria Fiori & Francesco Porro, 2023. "A compositional analysis of systemic risk in European financial institutions," Annals of Finance, Springer, vol. 19(3), pages 325-354, September.
    2. Germ`a Coenders & N'uria Arimany Serrat, 2023. "Accounting statement analysis at industry level. A gentle introduction to the compositional approach," Papers 2305.16842, arXiv.org, revised Jun 2025.
    3. Joscha Krause & Jan Pablo Burgard & Domingo Morales, 2022. "Robust prediction of domain compositions from uncertain data using isometric logratio transformations in a penalized multivariate Fay–Herriot model," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 76(1), pages 65-96, February.
    4. Janice L. Scealy, 2021. "Comments on: Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 68-70, March.
    5. Leyla Kodalci & Olivier Thas, 2023. "Simple and flexible sign and rank-based methods for testing for differential abundance in microbiome studies," PLOS ONE, Public Library of Science, vol. 18(9), pages 1-17, September.
    6. Salvador Linares-Mustar'os & Maria `Angels Farreras-Noguer & N'uria Arimany-Serrat & Germ`a Coenders, 2022. "New financial ratios based on the compositional data methodology," Papers 2210.11138, arXiv.org, revised May 2025.
    7. Alvis Cabrera & Lyvia Biagi & Aleix Beneyto & Ernesto Estremera & Iván Contreras & Marga Giménez & Ignacio Conget & Jorge Bondia & Josep Antoni Martín-Fernández & Josep Vehí, 2023. "Validation of a Probabilistic Prediction Model for Patients with Type 1 Diabetes Using Compositional Data Analysis," Mathematics, MDPI, vol. 11(5), pages 1-17, March.
    8. Núria Arimany-Serrat & Andrey Felipe Sgorla, 2024. "Financial and ESG Analysis of the Beer Sector Pre- and Post-COVID-19 in Italy and Spain," Sustainability, MDPI, vol. 16(17), pages 1-17, August.
    9. Pol Jofre-Campuzano & Germà Coenders, 2022. "Compositional Classification of Financial Statement Profiles: The Weighted Case," JRFM, MDPI, vol. 15(12), pages 1-17, November.
    10. Anna Maria Fiori & Emanuela Rosazza Gianin, 2025. "Compositional risk capital allocations," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 34(2), pages 261-290, May.
    11. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2023. "Small area estimation of average compositions under multivariate nested error regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 651-676, June.
    12. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2020. "Small area estimation of proportions under area-level compositional mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 793-818, September.
    13. Nikola Štefelová & Javier Palarea-Albaladejo & Karel Hron & Aleš Gába & Jan Dygrýn, 2024. "Compositional PLS biplot based on pivoting balances: an application to explore the association between 24-h movement behaviours and adiposity," Computational Statistics, Springer, vol. 39(2), pages 835-863, April.
    14. Dominika Gajdosikova & Jakub Michulek, 2025. "Artificial Intelligence Models for Bankruptcy Prediction in Agriculture: Comparing the Performance of Artificial Neural Networks and Decision Trees," Agriculture, MDPI, vol. 15(10), pages 1-29, May.
    15. Vasilii Erokhin & Tianming Gao & Anna Ivolga, 2020. "Structural Variations in the Composition of Land Funds at Regional Scales across Russia," Land, MDPI, vol. 9(6), pages 1-39, June.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2603.24215. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.