IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2306.08165.html
   My bibliography  Save this paper

Machine Learning for Zombie Hunting: Predicting Distress from Firms' Accounts and Missing Values

Author

Listed:
  • Falco J. Bargagli-Stoffi
  • Fabio Incerti
  • Massimo Riccaboni
  • Armando Rungi

Abstract

In this contribution, we propose machine learning techniques to predict zombie firms. First, we derive the risk of failure by training and testing our algorithms on disclosed financial information and non-random missing values of 304,906 firms active in Italy from 2008 to 2017. Then, we spot the highest financial distress conditional on predictions that lies above a threshold for which a combination of false positive rate (false prediction of firm failure) and false negative rate (false prediction of active firms) is minimized. Therefore, we identify zombies as firms that persist in a state of financial distress, i.e., their forecasts fall into the risk category above the threshold for at least three consecutive years. For our purpose, we implement a gradient boosting algorithm (XGBoost) that exploits information about missing values. The inclusion of missing values in our predictive model is crucial because patterns of undisclosed accounts are correlated with firm failure. Finally, we show that our preferred machine learning algorithm outperforms (i) proxy models such as Z-scores and the Distance-to-Default, (ii) traditional econometric methods, and (iii) other widely used machine learning techniques. We provide evidence that zombies are on average less productive and smaller, and that they tend to increase in times of crisis. Finally, we argue that our application can help financial institutions and public authorities design evidence-based policies-e.g., optimal bankruptcy laws and information disclosure policies.

Suggested Citation

  • Falco J. Bargagli-Stoffi & Fabio Incerti & Massimo Riccaboni & Armando Rungi, 2023. "Machine Learning for Zombie Hunting: Predicting Distress from Firms' Accounts and Missing Values," Papers 2306.08165, arXiv.org.
  • Handle: RePEc:arx:papers:2306.08165
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2306.08165
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2020. "lassopack: Model selection and prediction with regularized regression in Stata," Stata Journal, StataCorp LP, vol. 20(1), pages 176-235, March.
    2. Gita Gopinath & Şebnem Kalemli-Özcan & Loukas Karabarbounis & Carolina Villegas-Sanchez, 2017. "Capital Allocation and Productivity in South Europe," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(4), pages 1915-1967.
    3. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    4. Alexandre Belloni & Victor Chernozhukov & Christian Hansen & Damian Kozbur, 2016. "Inference in High-Dimensional Panel Models With an Application to Gun Control," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 590-605, October.
    5. Dan Andrews & Filippos Petroulakis, 2017. "Breaking the Shackles: Zombie Firms, Weak Banks and Depressed Restructuring in Europe," OECD Economics Department Working Papers 1433, OECD Publishing.
    6. T. Kirk White & Jerome P. Reiter & Amil Petrin, 2018. "Imputation in U.S. Manufacturing Data and Its Implications for Productivity Dispersion," The Review of Economics and Statistics, MIT Press, vol. 100(3), pages 502-509, July.
    7. Annalisa Ferrando & Alessandro Ruggieri, 2018. "Financial constraints and productivity: Evidence from euro area companies," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 23(3), pages 257-282, July.
    8. Sara Calligaris & Massimo Del Gatto & Fadi Hassan & Gianmarco I P Ottaviano & Fabiano Schivardi & Tommaso MonacelliManaging Editor, 2018. "The productivity puzzle and misallocation: an Italian perspective," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 33(96), pages 635-684.
    9. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    10. Ryan Niladri Banerjee & Boris Hofmann, 2018. "The rise of zombie firms: causes and consequences," BIS Quarterly Review, Bank for International Settlements, September.
    11. Fabiano Schivardi & Enrico Sette & Guido Tabellini, 2020. "Identifying the Real Effects of Zombie Lending," The Review of Corporate Finance Studies, Society for Financial Studies, vol. 9(3), pages 569-592.
    12. Nickell, Stephen & Nicolitsas, Daphne, 1999. "How does financial pressure affect firms?," European Economic Review, Elsevier, vol. 43(8), pages 1435-1456, August.
    13. Riccaboni, Massimo & Wang, Xu & Zhu, Zhen, 2021. "Firm performance in networks: The interplay between firm centrality and corporate group size," Journal of Business Research, Elsevier, vol. 129(C), pages 641-653.
    14. Merton, Robert C, 1974. "On the Pricing of Corporate Debt: The Risk Structure of Interest Rates," Journal of Finance, American Finance Association, vol. 29(2), pages 449-470, May.
    15. Müge Adalet McGowan & Dan Andrews & Valentine Millot & Thorsten BeckManaging Editor, 2018. "The walking dead? Zombie firms and productivity performance in OECD countries," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 33(96), pages 685-736.
    16. Steven M. Fazzari & R. Glenn Hubbard & Bruce C. Petersen, 1988. "Financing Constraints and Corporate Investment," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 19(1), pages 141-206.
    17. Daniel A. Ackerberg & Kevin Caves & Garth Frazer, 2015. "Identification Properties of Recent Production Function Estimators," Econometrica, Econometric Society, vol. 83, pages 2411-2451, November.
    18. Philippe Aghion & Antonin Bergeaud & Gilbert Cette & Rémy Lecat & Hélène Maghin, 2019. "Coase Lecture ‐ The Inverted‐U Relationship Between Credit Access and Productivity Growth," Economica, London School of Economics and Political Science, vol. 86(341), pages 1-31, January.
    19. Sebnem Kalemli-Ozcan & Bent Sorensen & Carolina Villegas-Sanchez & Vadym Volosovych & Sevcan Yesiltas, 2015. "How to Construct Nationally Representative Firm Level Data from the Orbis Global Database: New Facts and Aggregate Implications," NBER Working Papers 21558, National Bureau of Economic Research, Inc.
    20. Andreas Behr & Jurij Weinblat, 2017. "Default Patterns in Seven EU Countries: A Random Forest Approach," International Journal of the Economics of Business, Taylor & Francis Journals, vol. 24(2), pages 181-222, May.
    21. Peter N. Gal, 2013. "Measuring Total Factor Productivity at the Firm Level using OECD-ORBIS," OECD Economics Department Working Papers 1049, OECD Publishing.
    22. Ricardo J. Caballero & Takeo Hoshi & Anil K. Kashyap, 2008. "Zombie Lending and Depressed Restructuring in Japan," American Economic Review, American Economic Association, vol. 98(5), pages 1943-1977, December.
    23. Javier Cravino & Andrei A. Levchenko, 2017. "Multinational Firms and International Business Cycle Transmission," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 132(2), pages 921-962.
    24. Dan Andrews & Müge Adalet McGowan & Valentine Millot, 2017. "Confronting the zombies: Policies for productivity revival," OECD Economic Policy Papers 21, OECD Publishing.
    25. Takaya Saito & Marc Rehmsmeier, 2015. "The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-21, March.
    26. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ziad Obermeyer, 2015. "Prediction Policy Problems," American Economic Review, American Economic Association, vol. 105(5), pages 491-495, May.
    27. Alexandre Belloni & Victor Chernozhukov & Ying Wei, 2016. "Post-Selection Inference for Generalized Linear Models With Many Controls," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 606-619, October.
    28. Joe Peek & Eric S. Rosengren, 2005. "Unnatural Selection: Perverse Incentives and the Misallocation of Credit in Japan," American Economic Review, American Economic Association, vol. 95(4), pages 1144-1166, September.
    29. Mirko Moscatelli & Simone Narizzano & Fabio Parlapiano & Gianluca Viggiano, 2019. "Corporate default forecasting with machine learning," Temi di discussione (Economic working papers) 1256, Bank of Italy, Economic Research and International Relations Area.
    30. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    31. Rajan, Raghuram G & Zingales, Luigi, 1995. "What Do We Know about Capital Structure? Some Evidence from International Data," Journal of Finance, American Finance Association, vol. 50(5), pages 1421-1460, December.
    32. Ohlson, Ja, 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, Wiley Blackwell, vol. 18(1), pages 109-131.
    33. Charles J. Hadlock & Joshua R. Pierce, 2010. "New Evidence on Measuring Financial Constraints: Moving Beyond the KZ Index," The Review of Financial Studies, Society for Financial Studies, vol. 23(5), pages 1909-1940.
    34. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    35. Sørensen, Bent E & Kalemli-Özcan, Sebnem & Volosovych, Vadym & Villegas-Sanchez, Carolina & Yesiltas, Sevcan, 2015. "How to construct nationally representative firm level data from the ORBIS global database," CEPR Discussion Papers 10829, C.E.P.R. Discussion Papers.
    36. Antonio R. Linero, 2018. "Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 626-636, April.
    37. Black, Fischer & Scholes, Myron S, 1973. "The Pricing of Options and Corporate Liabilities," Journal of Political Economy, University of Chicago Press, vol. 81(3), pages 637-654, May-June.
    38. Falco J. Bargagli Stoffi & Kenneth De Beckker & Joana E. Maldonado & Kristof De Witte, 2021. "Assessing Sensitivity of Machine Learning Predictions.A Novel Toolbox with an Application to Financial Literacy," Papers 2102.04382, arXiv.org.
    39. Lily Davies & Mark Kattenberg & Benedikt Vogt, 2023. "Predicting Firm Exits with Machine Learning: Implications for Selection into COVID-19 Support and Productivity Growth," CPB Discussion Paper 444, CPB Netherlands Bureau for Economic Policy Analysis.
    40. Kapelner, Adam & Bleich, Justin, 2016. "bartMachine: Machine Learning with Bayesian Additive Regression Trees," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(i04).
    41. Antonio R. Linero & Yun Yang, 2018. "Bayesian regression tree ensembles that adapt to smoothness and sparsity," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(5), pages 1087-1110, November.
    42. Laurens Cherchye & Bram De Rock & Annalisa Ferrando & Klaas Mulier & Marijn Verschelde, 2018. "Identifying Financial Constraints from Production Data," Working Papers ECARES 2018-31, ULB -- Universite Libre de Bruxelles.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Falco J. Bargagli-Dtoffi & Massimo Riccaboni & Armando Rungi, 2020. "Machine Learning for Zombie Hunting. Firms Failures and Financial Constraints," Working Papers 01/2020, IMT School for Advanced Studies Lucca, revised Jun 2020.
    2. Francesca Micocci & Armando Rungi, 2021. "Predicting Exporters with Machine Learning," Working Papers 03/2021, IMT School for Advanced Studies Lucca, revised Jul 2021.
    3. Viral V. Acharya & Matteo Crosignani & Tim Eisert & Christian Eufinger, 2020. "Zombie Credit and (Dis-)Inflation: Evidence from Europe," NBER Working Papers 27158, National Bureau of Economic Research, Inc.
    4. Qiao, Lu & Fei, Junjun, 2022. "Government subsidies, enterprise operating efficiency, and “stiff but deathless” zombie firms," Economic Modelling, Elsevier, vol. 107(C).
    5. Mingarelli, Luca & Ravanetti, Beatrice & Shakir, Tamarah & Wendelborn, Jonas, 2022. "Dawn of the (half) dead: the twisted world of zombie identification," Working Paper Series 2743, European Central Bank.
    6. Álvarez, Laura & García-Posada, Miguel & Mayordomo, Sergio, 2023. "Distressed firms, zombie firms and zombie lending: A taxonomy," Journal of Banking & Finance, Elsevier, vol. 149(C).
    7. Ricardo Pinheiro Alves & Nuno Tavares & Gabriel Osório de Barros, 2023. "Revisitar as Empresas Zombie em Portugal (2008-2021)," GEE Papers 178, Gabinete de Estratégia e Estudos, Ministério da Economia, revised Oct 2023.
    8. Maximilian Gobel & Nuno Tavares, 2022. "Zombie-Lending in the United States -- Prevalence versus Relevance," Papers 2201.10524, arXiv.org, revised Jul 2022.
    9. Christian Abele & Agnès Bénassy-Quéré & Lionel Fontagné, 2020. "One Size Does Not Fit All: TFP in the Aftermath of Financial Crises in Three European Countries," PSE Working Papers halshs-02883685, HAL.
    10. Feng, Ling & Lang, Henan & Pei, Tingting, 2022. "Zombie firms and corporate savings: Evidence from Chinese manufacturing firms," International Review of Economics & Finance, Elsevier, vol. 79(C), pages 551-564.
    11. Falco J. Bargagli-Stoffi & Jan Niederreiter & Massimo Riccaboni, 2020. "Supervised learning for the prediction of firm dynamics," Papers 2009.06413, arXiv.org.
    12. Giorgio Brunello & Áron Gereben & Désirée Rückert & Christoph Weiss & Patricia Wruuck, 2022. "Do investments in human and physical capital respond differently to financing constraints?," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 158(1), pages 1-14, December.
    13. Müge Adalet McGowan & Dan Andrews & Valentine Millot, 2017. "Insolvency regimes, zombie firms and capital reallocation," OECD Economics Department Working Papers 1399, OECD Publishing.
    14. Diana Bonfim & Geraldo Cerqueiro & Hans Degryse & Steven Ongena, 2023. "On-Site Inspecting Zombie Lending," Management Science, INFORMS, vol. 69(5), pages 2547-2567, May.
    15. Lily Davies & Mark Kattenberg & Benedikt Vogt, 2023. "Predicting Firm Exits with Machine Learning: Implications for Selection into COVID-19 Support and Productivity Growth," CPB Discussion Paper 444, CPB Netherlands Bureau for Economic Policy Analysis.
    16. Christian Keuschnigg & Michael Kogler & Johannes Matt, 2022. "Banks, Credit Reallocation, and Creative Destruction," Swiss Finance Institute Research Paper Series 22-83, Swiss Finance Institute.
    17. Özlem Dursun-de Neef, H. & Schandlbauer, Alexander, 2021. "COVID-19 and lending responses of European banks," Journal of Banking & Finance, Elsevier, vol. 133(C).
    18. Maurin, Laurent & Wolski, Marcin, 2021. "Aggregate productivity slowdown in Europe: New evidence from corporate balance sheets," EIB Working Papers 2021/04, European Investment Bank (EIB).
    19. Viral Acharya & Sergei A. Davydenko & Ilya A. Strebulaev, 2012. "Cash Holdings and Credit Risk," The Review of Financial Studies, Society for Financial Studies, vol. 25(12), pages 3572-3609.
    20. Simone Lenzu & Francesco Manaresi, 2019. "Sources and implications of resource misallocation: new evidence from firm-level marginal products and user costs," Questioni di Economia e Finanza (Occasional Papers) 485, Bank of Italy, Economic Research and International Relations Area.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2306.08165. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.