IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2211.08405.html
   My bibliography  Save this paper

Multimodal Generative Models for Bankruptcy Prediction Using Textual Data

Author

Listed:
  • Rogelio A. Mancisidor
  • Kjersti Aas

Abstract

Textual data from financial filings, e.g., the Management's Discussion & Analysis (MDA) section in Form 10-K, has been used to improve the prediction accuracy of bankruptcy models. In practice, however, we cannot obtain the MDA section for all public companies, which limits the use of MDA data in traditional bankruptcy models, as they need complete data to make predictions. The two main reasons for the lack of MDA are: (i) not all companies are obliged to submit the MDA and (ii) technical problems arise when crawling and scrapping the MDA section. To solve this limitation, this research introduces the Conditional Multimodal Discriminative (CMMD) model that learns multimodal representations that embed information from accounting, market, and textual data modalities. The CMMD model needs a sample with all data modalities for model training. At test time, the CMMD model only needs access to accounting and market modalities to generate multimodal representations, which are further used to make bankruptcy predictions and to generate words from the missing MDA modality. With this novel methodology, it is realistic to use textual data in bankruptcy prediction models, since accounting and market data are available for all companies, unlike textual data. The empirical results of this research show that if financial regulators, or investors, were to use traditional models using MDA data, they would only be able to make predictions for 60% of the companies. Furthermore, the classification performance of our proposed methodology is superior to that of a large number of traditional classifier models, taking into account all the companies in our sample.

Suggested Citation

  • Rogelio A. Mancisidor & Kjersti Aas, 2022. "Multimodal Generative Models for Bankruptcy Prediction Using Textual Data," Papers 2211.08405, arXiv.org, revised Feb 2024.
  • Handle: RePEc:arx:papers:2211.08405
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2211.08405
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Durnev, Art & Mangen, Claudine, 2020. "The spillover effects of MD&A disclosures for real investment: The role of industry competition," Journal of Accounting and Economics, Elsevier, vol. 70(1).
    2. David M. Blei & Alp Kucukelbir & Jon D. McAuliffe, 2017. "Variational Inference: A Review for Statisticians," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 859-877, April.
    3. J. C. Neves & A. Vieira, 2006. "Improving bankruptcy prediction with Hidden Layer Learning Vector Quantization," European Accounting Review, Taylor & Francis Journals, vol. 15(2), pages 253-271.
    4. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    5. Zhang, Guoqiang & Y. Hu, Michael & Eddy Patuwo, B. & C. Indro, Daniel, 1999. "Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis," European Journal of Operational Research, Elsevier, vol. 116(1), pages 16-32, July.
    6. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    7. Zhang, Linlang & Zhang, Zhe & Zhang, Peng & Wang, Xiongyuan, 2022. "Defend or remain quiet? Tax avoidance and the textual characteristics of the MD&A in annual reports," International Review of Economics & Finance, Elsevier, vol. 79(C), pages 193-204.
    8. Yang, Z. R. & Platt, Marjorie B. & Platt, Harlan D., 1999. "Probabilistic Neural Networks in Bankruptcy Prediction," Journal of Business Research, Elsevier, vol. 44(2), pages 67-74, February.
    9. Kar Yan Tam & Melody Y. Kiang, 1992. "Managerial Applications of Neural Networks: The Case of Bank Failure Predictions," Management Science, INFORMS, vol. 38(7), pages 926-947, July.
    10. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    11. Demyanyk, Yuliya & Hasan, Iftekhar, 2010. "Financial crises and bank failures: A review of prediction methods," Omega, Elsevier, vol. 38(5), pages 315-324, October.
    12. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    13. Pamela K. Coats & L. Franklin Fant, 1993. "Recognizing Financial Distress Patterns Using a Neural Network Tool," Financial Management, Financial Management Association, vol. 22(3), Fall.
    14. Sudheer Chava, 2014. "Environmental Externalities and Cost of Capital," Management Science, INFORMS, vol. 60(9), pages 2223-2247, September.
    15. Shumway, Tyler, 2001. "Forecasting Bankruptcy More Accurately: A Simple Hazard Model," The Journal of Business, University of Chicago Press, vol. 74(1), pages 101-124, January.
    16. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    17. A. Adam Ding & Shaonan Tian & Yan Yu & Hui Guo, 2012. "A Class of Discrete Transformation Survival Models With Application to Default Probability Prediction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 990-1003, September.
    18. Beaver, Wh, 1966. "Financial Ratios As Predictors Of Failure - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 123-127.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    2. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    3. Mohammad Mahdi Mousavi & Jamal Ouenniche & Kaoru Tone, 2023. "A dynamic performance evaluation of distress prediction models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(4), pages 756-784, July.
    4. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2021. "A transformer-based model for default prediction in mid-cap corporate markets," Papers 2111.09902, arXiv.org, revised Apr 2023.
    5. Korangi, Kamesh & Mues, Christophe & Bravo, Cristián, 2023. "A transformer-based model for default prediction in mid-cap corporate markets," European Journal of Operational Research, Elsevier, vol. 308(1), pages 306-320.
    6. Şaban Çelik, 2013. "Micro Credit Risk Metrics: A Comprehensive Review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 20(4), pages 233-272, October.
    7. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    8. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    9. Vladislav V. Afanasev & Yulia A. Tarasova, 2022. "Default Prediction for Housing and Utilities Management Firms Using Non-Financial Data," Finansovyj žhurnal — Financial Journal, Financial Research Institute, Moscow 125375, Russia, issue 6, pages 91-110, December.
    10. fernández, María t. Tascón & gutiérrez, Francisco J. Castaño, 2012. "Variables y Modelos Para La Identificación y Predicción Del Fracaso Empresarial: Revisión de La Investigación Empírica Reciente," Revista de Contabilidad - Spanish Accounting Review, Elsevier, vol. 15(1), pages 7-58.
    11. Zhiyong Li & Chen Feng & Ying Tang, 2022. "Bank efficiency and failure prediction: a nonparametric and dynamic model based on data envelopment analysis," Annals of Operations Research, Springer, vol. 315(1), pages 279-315, August.
    12. Francesco Ciampi & Valentina Cillo & Fabio Fiano, 2020. "Combining Kohonen maps and prior payment behavior for small enterprise default prediction," Small Business Economics, Springer, vol. 54(4), pages 1007-1039, April.
    13. Ben Jabeur, Sami & Serret, Vanessa, 2023. "Bankruptcy prediction using fuzzy convolutional neural networks," Research in International Business and Finance, Elsevier, vol. 64(C).
    14. Nawaf Almaskati & Ron Bird & Yue Lu & Danny Leung, 2019. "The Role of Corporate Governance and Estimation Methods in Predicting Bankruptcy," Working Papers in Economics 19/16, University of Waikato.
    15. Kim, Soo Y. & Upneja, Arun, 2014. "Predicting restaurant financial distress using decision tree and AdaBoosted decision tree models," Economic Modelling, Elsevier, vol. 36(C), pages 354-362.
    16. Mohammad Mahdi Mousavi & Jamal Ouenniche, 2018. "Multi-criteria ranking of corporate distress prediction models: empirical evaluation and methodological contributions," Annals of Operations Research, Springer, vol. 271(2), pages 853-886, December.
    17. Leila Bateni & Farshid Asghari, 2020. "Bankruptcy Prediction Using Logit and Genetic Algorithm Models: A Comparative Analysis," Computational Economics, Springer;Society for Computational Economics, vol. 55(1), pages 335-348, January.
    18. Lin, Fengyi & Yeh, Ching Chiang & Lee, Meng Yuan, 2013. "A Hybrid Business Failure Prediction Model Using Locally Linear Embedding And Support Vector Machines," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(1), pages 82-97, March.
    19. Nasim Nasirpour & Alireza Mazdaki & Esmail Enayati, 2016. "The Investigation and Comparison of the Performance of Heuristic Methods in the Prediction of the Type of Auditor’s Opinion in Firms Accepted in Tehran Stock Exchange," Asian Social Science, Canadian Center of Science and Education, vol. 12(6), pages 148-148, June.
    20. Kumar, Rahul & Deb, Soumya Guha & Mukherjee, Shubhadeep, 2020. "Do words reveal the latent truth? Identifying communication patterns of corporate losers," Journal of Behavioral and Experimental Finance, Elsevier, vol. 26(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2211.08405. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.