IDEAS home Printed from https://ideas.repec.org/a/spr/qualqt/v56y2022i6d10.1007_s11135-021-01313-5.html
   My bibliography  Save this article

Learning from revisions: an algorithm to detect errors in banks’ balance sheet statistical reporting

Author

Listed:
  • Francesco Cusano

    (Bank of Italy)

  • Giuseppe Marinelli

    (Bank of Italy)

  • Stefano Piermattei

    (Bank of Italy)

Abstract

Ensuring and disseminating high-quality data is crucial for central banks to adequately support monetary analysis and the related decision-making processes. In this paper, we develop a new machine learning procedure for the identification of errors in banks’ supervisory reports on loans to the private sector, which are employed in the Bank of Italy’s production of statistics on Monetary Financial Institutions’ (MFIs) Balance Sheet Items (BSI). We model a “Revisions Adjusted–Quantile Regression Random Forest” (RA–QRRF) algorithm in which the predicted acceptance regions of the reported values are calibrated through an individual “imprecision rate” derived from the entire history of each bank’s reporting errors and revisions collected by the Bank of Italy. The analysis shows that our RA-QRRF approach provides very satisfying results in terms of error detection, especially for loans to the households sector, and outperforms well-established alternative outlier detection procedures based on probit and logit models.

Suggested Citation

  • Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2022. "Learning from revisions: an algorithm to detect errors in banks’ balance sheet statistical reporting," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4025-4059, December.
  • Handle: RePEc:spr:qualqt:v:56:y:2022:i:6:d:10.1007_s11135-021-01313-5
    DOI: 10.1007/s11135-021-01313-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11135-021-01313-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11135-021-01313-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Marcello Bofondi & Luisa Carpinelli & Enrico Sette, 2018. "Credit Supply During a Sovereign Debt Crisis," Journal of the European Economic Association, European Economic Association, vol. 16(3), pages 696-729.
    2. Luigi Infante & Stefano Piermattei & Raffaele Santioni & Bianca Sorvillo, 2020. "Diversifying away risks through derivatives: an analysis of the Italian banking system," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 37(2), pages 621-657, July.
    3. Sebastian Doerr & Leonardo Gambacorta & José María Serena Garralda, 2021. "Big data and machine learning in central banking," BIS Working Papers 930, Bank for International Settlements.
    4. Chakraborty, Chiranjit & Joseph, Andreas, 2017. "Machine learning at central banks," Bank of England working papers 674, Bank of England.
    5. Scott Hendry & Alison Madeley, 2010. "Text Mining and the Information Content of Bank of Canada Communications," Staff Working Papers 10-31, Bank of Canada.
    6. D’Amuri, Francesco & Marcucci, Juri, 2017. "The predictive power of Google searches in forecasting US unemployment," International Journal of Forecasting, Elsevier, vol. 33(4), pages 801-816.
    7. Hansen, Stephen & McMahon, Michael, 2016. "Shocking language: Understanding the macroeconomic effects of central bank communication," Journal of International Economics, Elsevier, vol. 99(S1), pages 114-133.
    8. Marta Bernardini & Paolo Massaro & Francesca Pepe & Francesco Tocco, 2021. "The market notices published by the Italian Stock Exchange: a machine learning approach for the selection of the relevant ones," Questioni di Economia e Finanza (Occasional Papers) 632, Bank of Italy, Economic Research and International Relations Area.
    9. Jiménez, Gabriel & Ongena, Steven & Peydró, José-Luis & Saurina, Jesús, 2012. "Credit Supply and Monetary Policy: Identifying the Bank Balance-Sheet Channel with Loan Applications," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 102(5), pages 2301-2326.
    10. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    11. Paolo Massaro & Ilaria Vannini & Oliver Giudice, 2020. "Institutional sector classifier, a machine learning approach," Questioni di Economia e Finanza (Occasional Papers) 548, Bank of Italy, Economic Research and International Relations Area.
    12. Fabio Zambuto, 2021. "Quality checks on granular banking data: an experimental approach based on machine learning," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Micro data for the macro world, volume 53, Bank for International Settlements.
    13. van Damme, E.E.C., 2002. "Economische analyse van politieke processen," Other publications TiSEM 54188f86-501c-4808-b802-1, Tilburg University, School of Economics and Management.
    14. Maria Ludovica Drudi & Stefano Nobili, 2021. "A liquidity risk early warning indicator for Italian banks: a machine learning approach," Temi di discussione (Economic working papers) 1337, Bank of Italy, Economic Research and International Relations Area.
    15. Federico Cingano & Francesco Manaresi & Enrico Sette, 2016. "Does Credit Crunch Investment Down? New Evidence on the Real Effects of the Bank-Lending Channel," Review of Financial Studies, Society for Financial Studies, vol. 29(10), pages 2737-2773.
    16. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2022. "Making text count: Economic forecasting using newspaper text," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 896-919, August.
    17. Bottero, Margherita & Lenzu, Simone & Mezzanotti, Filippo, 2020. "Sovereign debt exposure and the bank lending channel: Impact on credit supply and the real economy," Journal of International Economics, Elsevier, vol. 126(C).
    18. Andrea Carboni & Alessandro Moro, 2018. "Imputation techniques for the nationality of foreign shareholders in Italian firms," IFC Bulletins chapters, in: Bank for International Settlements (ed.), External sector statistics: current issues and new challenges, volume 48, Bank for International Settlements.
    19. Emilia Bonaccorsi di Patti & Enrico Sette, 2012. "Bank balance sheets and the transmission of financial shocks to borrowers: evidence from the 2007-2008 crisis," Temi di discussione (Economic working papers) 848, Bank of Italy, Economic Research and International Relations Area.
    20. Tobias Cagala, 2017. "Improving data quality and closing data gaps with machine learning," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Data needs and Statistics compilation for macroprudential analysis, volume 46, Bank for International Settlements.
    21. Massimiliano Affinito & Giorgio Albareto & Raffaele Santioni, 2016. "Purchases of sovereign debt securities by Italian banks during the crisis: the role of balance-sheet conditions," Questioni di Economia e Finanza (Occasional Papers) 330, Bank of Italy, Economic Research and International Relations Area.
    22. Tanaka, Katsuyuki & Kinkyo, Takuji & Hamori, Shigeyuki, 2016. "Random forests-based early warning system for bank failures," Economics Letters, Elsevier, vol. 148(C), pages 118-121.
    23. Fabio Zambuto & Simona Arcuti & Roberto Sabatini & Daniele Zambuto, 2021. "Application of classification algorithms for the assessment of confirmation to quality remarks," Questioni di Economia e Finanza (Occasional Papers) 631, Bank of Italy, Economic Research and International Relations Area.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2021. "Learning from revisions: a tool for detecting potential errors in banks' balance sheet statistical reporting," Questioni di Economia e Finanza (Occasional Papers) 611, Bank of Italy, Economic Research and International Relations Area.
    2. Lorenzo Burlon & Davide Fantino & Andrea Nobili & Gabriele Sene, 2016. "The quantity of corporate credit rationing with matched bank-firm data," Temi di discussione (Economic working papers) 1058, Bank of Italy, Economic Research and International Relations Area.
    3. Balduzzi, Pierluigi & Brancati, Emanuele & Schiantarelli, Fabio, 2018. "Financial markets, banks’ cost of funding, and firms’ decisions: Lessons from two crises," Journal of Financial Intermediation, Elsevier, vol. 36(C), pages 1-15.
    4. Ricci, Lorenzo & Soggia, Giovanni & Trimarchi, Lorenzo, 2023. "The impact of bank lending standards on credit to firms," Journal of Banking & Finance, Elsevier, vol. 152(C).
    5. Fabio Schiantarelli & Massimiliano Stacchini & Philip E. Strahan, 2020. "Bank Quality, Judicial Efficiency, and Loan Repayment Delays in Italy," Journal of Finance, American Finance Association, vol. 75(4), pages 2139-2178, August.
    6. Gambacorta, Leonardo & Sette, Enrico & Banerjee, Ryan, 2017. "The real effects of relationship lending," CEPR Discussion Papers 12340, C.E.P.R. Discussion Papers.
    7. Banerjee, Ryan N. & Gambacorta, Leonardo & Sette, Enrico, 2021. "The real effects of relationship lending✰," Journal of Financial Intermediation, Elsevier, vol. 48(C).
    8. Sebastian Doerr & Leonardo Gambacorta & José María Serena Garralda, 2021. "Big data and machine learning in central banking," BIS Working Papers 930, Bank for International Settlements.
    9. Francesco Bripi, 2023. "The impact of credit substitution between banks on investment," Temi di discussione (Economic working papers) 1408, Bank of Italy, Economic Research and International Relations Area.
    10. Kurz, Michael & Kleimeier, Stefanie, 2019. "Credit Supply: Are there negative spillovers from banks’ proprietary trading? (RM/19/005-revised-)," Research Memorandum 026, Maastricht University, Graduate School of Business and Economics (GSBE).
    11. Ippolito, Filippo & Peydró, José-Luis & Polo, Andrea & Sette, Enrico, 2016. "Double bank runs and liquidity risk management," Journal of Financial Economics, Elsevier, vol. 122(1), pages 135-154.
    12. Rhys Bidder & John Krainer & Adam Shapiro, 2021. "De-leveraging or de-risking? How banks cope with loss," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 39, pages 100-127, January.
    13. Ugo Albertazzi & Andrea Nobili & Federico M. Signoretti, 2021. "The Bank Lending Channel of Conventional and Unconventional Monetary Policy," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 53(2-3), pages 261-299, March.
    14. Martin Baumgaertner & Johannes Zahner, 2021. "Whatever it takes to understand a central banker - Embedding their words using neural networks," MAGKS Papers on Economics 202130, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    15. Rhys Bidder & John Krainer & Adam Shapiro, 2021. "De-leveraging or de-risking? How banks cope with loss," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 39, pages 100-127, January.
    16. Doerr, Sebastian & Raissi, Mehdi & Weber, Anke, 2018. "Credit-supply shocks and firm productivity in Italy," Journal of International Money and Finance, Elsevier, vol. 87(C), pages 155-171.
    17. Francisco Buera & Sudipto Karmakar, 2022. "Real Effects of Financial Distress: The Role of Heterogeneity," The Economic Journal, Royal Economic Society, vol. 132(644), pages 1309-1348.
    18. Crosignani, Matteo & Faria-e-Castro, Miguel & Fonseca, Luís, 2020. "The (Unintended?) consequences of the largest liquidity injection ever," Journal of Monetary Economics, Elsevier, vol. 112(C), pages 97-112.
    19. Peydró, José-Luis & Polo, Andrea & Sette, Enrico, 2021. "Monetary policy at work: Security and credit application registers evidence," Journal of Financial Economics, Elsevier, vol. 140(3), pages 789-814.
    20. Albertazzi, Ugo & Barbiero, Francesca & Marqués-Ibáñez, David & Popov, Alexander & Rodriguez d’Acri, Costanza & Vlassopoulos, Thomas, 2020. "Monetary policy and bank stability: the analytical toolbox reviewed," Working Paper Series 2377, European Central Bank.

    More about this item

    Keywords

    Banks; Balance sheet items; Outlier detection; Machine learning;
    All these keywords.

    JEL classification:

    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:qualqt:v:56:y:2022:i:6:d:10.1007_s11135-021-01313-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.