IDEAS home Printed from https://ideas.repec.org/a/eee/reecon/v77y2023i2p226-238.html
   My bibliography  Save this article

Machine learning and sentiment analysis: Projecting bank insolvency risk

Author

Listed:
  • de Jesus, Diego Pitta
  • Besarria, Cássio da Nóbrega

Abstract

The main motivation of this paper is to use machine learning techniques to build a new insolvency risk rating metric for banks traded on Brazilian stock exchange. Then, a set of prediction models will be used to project the risk rating of these institutions. Conventionally, the literature analyzes bank insolvency risk from accounting data and macroeconomic variables. In addition to these variables, this paper will construct a series of bank institution manager sentiment, via quarterly reports (ITR), and this will be used to improve the accuracy of bank risk predictions. The results indicate that the bank risk classification, via the k-means algorithm, was able to classify 17% of the sample into the highest risk group (1), while 83% of the sample was in the lowest bankruptcy risk group (0). Using the Z-score metric, we found that 65% of the sample is in the low-risk group, and 35% of the sample is in the high-risk group. Thus, the k-means algorithm is more rigorous in classifying a bank in the highest risk category. Next we used the data already described to project the risk of bank insolvency. The results of this step showed that the decision tree model performed the best for the test sample. In addition, it was found that the inclusion of the bank sentiment variable was able to improve the performance of the prediction models, especially, when bank sentiment is constructed from a time-varying dictionary.

Suggested Citation

  • de Jesus, Diego Pitta & Besarria, Cássio da Nóbrega, 2023. "Machine learning and sentiment analysis: Projecting bank insolvency risk," Research in Economics, Elsevier, vol. 77(2), pages 226-238.
  • Handle: RePEc:eee:reecon:v:77:y:2023:i:2:p:226-238
    DOI: 10.1016/j.rie.2023.03.001
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1090944323000224
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.rie.2023.03.001?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lepetit, Laetitia & Strobel, Frank, 2013. "Bank insolvency risk and time-varying Z-score measures," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 25(C), pages 73-87.
    2. Constantin, Andreea & Peltonen, Tuomas A. & Sarlin, Peter, 2018. "Network linkages to predict bank distress," Journal of Financial Stability, Elsevier, vol. 35(C), pages 226-241.
    3. Francesca Fortuna & Fabrizio Maturo, 2019. "K-means clustering of item characteristic curves and item information curves via functional principal component analysis," Quality & Quantity: International Journal of Methodology, Springer, vol. 53(5), pages 2291-2304, September.
    4. Climent, Francisco & Momparler, Alexandre & Carmona, Pedro, 2019. "Anticipating bank distress in the Eurozone: An Extreme Gradient Boosting approach," Journal of Business Research, Elsevier, vol. 101(C), pages 885-896.
    5. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    6. Shapiro, Adam Hale & Sudhof, Moritz & Wilson, Daniel J., 2022. "Measuring news sentiment," Journal of Econometrics, Elsevier, vol. 228(2), pages 221-243.
    7. Lepetit, Laetitia & Strobel, Frank, 2013. "Bank insolvency risk and time-varying Z-score measures," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 25(C), pages 73-87.
    8. Bill Provencher & Kenneth A. Baerenklau & Richard C. Bishop, 2002. "A Finite Mixture Logit Model of Recreational Angling with Serially Correlated Random Utility," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 84(4), pages 1066-1075.
    9. Rebel Cole & Jeffery Gunther, 1998. "Predicting Bank Failures: A Comparison of On- and Off-Site Monitoring Systems," Journal of Financial Services Research, Springer;Western Finance Association, vol. 13(2), pages 103-117, April.
    10. Suss, Joel & Treitel, Henry, 2019. "Predicting bank distress in the UK with machine learning," Bank of England working papers 831, Bank of England.
    11. Curry, Timothy J. & Elmer, Peter J. & Fissel, Gary S., 2007. "Equity market data, bank failures and market efficiency," Journal of Economics and Business, Elsevier, vol. 59(6), pages 536-559.
    12. P. K. Viswanathan & Suresh Srinivasan & N. Hariharan, 2020. "Predicting Financial Health of Banks for Investor Guidance Using Machine Learning Algorithms," Journal of Emerging Market Finance, Institute for Financial Management and Research, vol. 19(2), pages 226-261, August.
    13. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Papanikolaou, Nikolaos I., 2018. "To be bailed out or to be left to fail? A dynamic competing risks hazard analysis," Journal of Financial Stability, Elsevier, vol. 34(C), pages 61-85.
    2. Mare, Davide Salvatore & Moreira, Fernando & Rossi, Roberto, 2017. "Nonstationary Z-Score measures," European Journal of Operational Research, Elsevier, vol. 260(1), pages 348-358.
    3. Manthoulis, Georgios & Doumpos, Michalis & Zopounidis, Constantin & Galariotis, Emilios, 2020. "An ordinal classification framework for bank failure prediction: Methodology and empirical evidence for US banks," European Journal of Operational Research, Elsevier, vol. 282(2), pages 786-801.
    4. Gogas, Periklis & Papadimitriou, Theophilos & Agrapetidou, Anna, 2018. "Forecasting bank failures and stress testing: A machine learning approach," International Journal of Forecasting, Elsevier, vol. 34(3), pages 440-455.
    5. Paola Bongini & Małgorzata Iwanicz-Drozdowska & Paweł Smaga & Bartosz Witkowski, 2018. "In search of a measure of banking sector distress: empirical study of CESEE banking sectors," Risk Management, Palgrave Macmillan, vol. 20(3), pages 242-257, August.
    6. Risfandy, Tastaftiyan & Tarazi, Amine & Trinugroho, Irwan, 2022. "Competition in dual markets: Implications for banking system stability," Global Finance Journal, Elsevier, vol. 52(C).
    7. Peterson K. Ozili, 2020. "Does competence of central bank governors influence financial stability?," Future Business Journal, Springer, vol. 6(1), pages 1-20, December.
    8. Kanga, Désiré & Murinde, Victor & Soumaré, Issouf, 2020. "Capital, risk and profitability of WAEMU banks: Does bank ownership matter?," Journal of Banking & Finance, Elsevier, vol. 114(C).
    9. Mehmet Balcilar & Riza Demirer, 2014. "The Effect of Global Shocks and Volatility on Herd Behavior in Borsa Istanbul," BIFEC Book of Abstracts & Proceedings, Research and Business Development Department, Borsa Istanbul, vol. 1(2), pages 142-172, March.
    10. Sakarya, Burchan, 2016. "Financial Stability of Islamic (Participation) Banks in Turkey," MPRA Paper 69451, University Library of Munich, Germany.
    11. Hanjo Odendaal & Monique Reid & Johann F. Kirsten, 2020. "Media‐Based Sentiment Indices as an Alternative Measure of Consumer Confidence," South African Journal of Economics, Economic Society of South Africa, vol. 88(4), pages 409-434, December.
    12. Picault, Matthieu & Pinter, Julien & Renault, Thomas, 2022. "Media sentiment on monetary policy: Determinants and relevance for inflation expectations," Journal of International Money and Finance, Elsevier, vol. 124(C).
    13. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    14. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.
    15. Inka Yusgiantoro & Wahyoe Soedarmono & Amine Tarazi, 2017. "Bank consolidation and financial stability revisited: Evidence from Indonesia," Working Papers hal-01577970, HAL.
    16. Sun, Junjie & Wu, Deming & Zhao, Xinlei, 2018. "Systematic risk factors and bank failures," Journal of Economics and Business, Elsevier, vol. 98(C), pages 1-18.
    17. Krause, Thomas & Sondershaus, Talina & Tonzer, Lena, 2017. "Complexity and bank risk during the financial crisis," Economics Letters, Elsevier, vol. 150(C), pages 118-121.
    18. Sarmiento, Miguel & Cely, Jorge & León, Carlos, 2017. "An early warning indicator system to monitor the unsecured interbank funds market," Research in International Business and Finance, Elsevier, vol. 40(C), pages 114-128.
    19. Hassan Belkacem GHASSAN, 2017. "New alternative measuring financial stability," Turkish Economic Review, KSP Journals, vol. 4(3), pages 275-281, September.
    20. repec:prg:jnlpep:v:preprint:id:682:p:1-20 is not listed on IDEAS
    21. Miletic, Marko & Pavic Kramaric, Tomislava & Plazibat, Boze, 2019. "What Determines Financial Soundness Of Croatian Listed Firms?," UTMS Journal of Economics, University of Tourism and Management, Skopje, Macedonia, vol. 10(2), pages 189-200.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reecon:v:77:y:2023:i:2:p:226-238. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622941 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.