IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2401.12652.html
   My bibliography  Save this paper

From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset

Author

Listed:
  • Henri Arno
  • Klaas Mulier
  • Joke Baeck
  • Thomas Demeester

Abstract

In this paper, we present ECL, a novel multi-modal dataset containing the textual and numerical data from corporate 10K filings and associated binary bankruptcy labels. Furthermore, we develop and critically evaluate several classical and neural bankruptcy prediction models using this dataset. Our findings suggest that the information contained in each data modality is complementary for bankruptcy prediction. We also see that the binary bankruptcy prediction target does not enable our models to distinguish next year bankruptcy from an unhealthy financial situation resulting in bankruptcy in later years. Finally, we explore the use of LLMs in the context of our task. We show how GPT-based models can be used to extract meaningful summaries from the textual data but zero-shot bankruptcy prediction results are poor. All resources required to access and update the dataset or replicate our experiments are available on github.com/henriarnoUG/ECL.

Suggested Citation

  • Henri Arno & Klaas Mulier & Joke Baeck & Thomas Demeester, 2024. "From Numbers to Words: Multi-Modal Bankruptcy Prediction Using the ECL Dataset," Papers 2401.12652, arXiv.org.
  • Handle: RePEc:arx:papers:2401.12652
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2401.12652
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    2. Bernanke, Ben S, 1981. "Bankruptcy, Liquidity, and Recession," American Economic Review, American Economic Association, vol. 71(2), pages 155-159, May.
    3. Ohlson, Ja, 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, Wiley Blackwell, vol. 18(1), pages 109-131.
    4. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 123-127.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Henri Arno & Klaas Mulier & Joke Baeck & Thomas Demeester, 2022. "Next-Year Bankruptcy Prediction from Textual Data: Benchmark and Baselines," Papers 2208.11334, arXiv.org.
    2. Zhao, Qi & Xu, Weijun & Ji, Yucheng, 2023. "Predicting financial distress of Chinese listed companies using machine learning: To what extent does textual disclosure matter?," International Review of Financial Analysis, Elsevier, vol. 89(C).
    3. Antonio Davila & George Foster & Xiaobin He & Carlos Shimizu, 2015. "The rise and fall of startups: Creation and destruction of revenue and jobs by young companies," Australian Journal of Management, Australian School of Business, vol. 40(1), pages 6-35, February.
    4. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    5. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    6. Guido Max Mantovani & Gregory Gadzinski, 2022. "How to Rate the Financial Performance of Private Companies? A Tailored Integrated Rating Methodology Applied to North-Eastern Italian Districts," JRFM, MDPI, vol. 15(11), pages 1-18, October.
    7. Enrico Supino & Nicola Piras, 2022. "Le performance dei modelli di credit scoring in contesti di forte instabilit? macroeconomica: il ruolo delle Reti Neurali Artificiali," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2022(2), pages 41-61.
    8. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    9. Haoming Wang & Xiangdong Liu, 2021. "Undersampling bankruptcy prediction: Taiwan bankruptcy data," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-17, July.
    10. Trueck, Stefan & Rachev, Svetlozar T., 2008. "Rating Based Modeling of Credit Risk," Elsevier Monographs, Elsevier, edition 1, number 9780123736833.
    11. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    12. Arati Kale & Devendra Kale & Sriram Villupuram, 2024. "Decomposition of risk for small size and low book-to-market stocks," Journal of Asset Management, Palgrave Macmillan, vol. 25(1), pages 96-112, February.
    13. Ahsan Habib & Mabel D' Costa & Hedy Jiaying Huang & Md. Borhan Uddin Bhuiyan & Li Sun, 2020. "Determinants and consequences of financial distress: review of the empirical literature," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(S1), pages 1023-1075, April.
    14. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    15. Miquel-Flores, Ixart & Reghezza, Alessio & Buchetti, Bruno & Perdichizzi, Salvatore, 2024. "Greening the economy: how public-guaranteed loans influence firm-level resource allocation," Working Paper Series 2916, European Central Bank.
    16. Youssef Zizi & Mohamed Oudgou & Abdeslam El Moudden, 2020. "Determinants and Predictors of SMEs’ Financial Failure: A Logistic Regression Approach," Risks, MDPI, vol. 8(4), pages 1-21, October.
    17. Shoukat Ali & Ramiz ur Rehman & Wang Yuan & Muhammad Ishfaq Ahmad & Rizwan Ali, 2022. "Does foreign institutional ownership mediate the nexus between board diversity and the risk of financial distress? A case of an emerging economy of China," Eurasian Business Review, Springer;Eurasia Business and Economics Society, vol. 12(3), pages 553-581, September.
    18. Barboza, Flavio & Altman, Edward, 2024. "Predicting financial distress in Latin American companies: A comparative analysis of logistic regression and random forest models," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).
    19. Juraini Zainol Abidin & Nur Adiana Hiau Abdullah & Karren Lee-Hwei Khaw, 2020. "Predicting SMEs Failure: Logistic Regression vs Artificial Neural Network Models," Capital Markets Review, Malaysian Finance Association, vol. 28(2), pages 29-41.
    20. Hamid Waqas & Rohani Md-Rus, 2018. "Predicting financial distress: Applicability of O-score model for Pakistani firms," Business and Economic Horizons (BEH), Prague Development Center, vol. 14(2), pages 389-401, April.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2401.12652. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.