IDEAS home Printed from https://ideas.repec.org/p/zbw/cfrwps/323935.html
   My bibliography  Save this paper

Interpretable machine learning for earnings forecasts: Leveraging high-dimensional financial statement data

Author

Listed:
  • Hess, Dieter
  • Simon, Frederik
  • Weibels, Sebastian

Abstract

We predict earnings for forecast horizons of up to five years by using the entire set of Compustat financial statement data as input and providing it to state-of-the-art machine learning models capable of approximating arbitrary functional forms. Our approach improves prediction one year ahead by an average of 11% compared to the traditional linear approach that performs best. This superior performance is consistent across a variety of evaluation metrics as well as different firm subsamples and translates into more profitable investment strategies. Extensive model interpretation reveals that income statement variables, especially different definitions of earnings, are by far the most important predictors. Conversely, we find that while income statement variables decline in relevance, balance sheet information becomes more significant as the forecast horizon extends. Lastly, we show that the influence of interactions and non- linearities on the machine learning forecast is modest, but substantial differences between firm subsamples exist.

Suggested Citation

  • Hess, Dieter & Simon, Frederik & Weibels, Sebastian, 2025. "Interpretable machine learning for earnings forecasts: Leveraging high-dimensional financial statement data," CFR Working Papers 25-06, University of Cologne, Centre for Financial Research (CFR).
  • Handle: RePEc:zbw:cfrwps:323935
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/323935/1/1932922806.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bryan T. Kelly & Semyon Malamud & Kangying Zhou, 2022. "The Virtue of Complexity in Return Prediction," NBER Working Papers 30217, National Bureau of Economic Research, Inc.
    2. Ball, Ray & Watts, Ross, 1972. "Some Time Series Properties of Accounting Income," Journal of Finance, American Finance Association, vol. 27(3), pages 663-681, June.
    3. Hou, Kewei & van Dijk, Mathijs A. & Zhang, Yinglei, 2012. "The implied cost of capital: A new approach," Journal of Accounting and Economics, Elsevier, vol. 53(3), pages 504-526.
    4. Jones, Jj, 1991. "Earnings Management During Import Relief Investigations," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 29(2), pages 193-228.
    5. Kevin K. Li & Partha Mohanram, 2014. "Evaluating cross-sectional forecasting models for implied cost of capital," Review of Accounting Studies, Springer, vol. 19(3), pages 1152-1185, September.
    6. James Claus & Jacob Thomas, 2001. "Equity Premia as Low as Three Percent? Evidence from Analysts' Earnings Forecasts for Domestic and International Stock Markets," Journal of Finance, American Finance Association, vol. 56(5), pages 1629-1666, October.
    7. William R. Gebhardt & Charles M. C. Lee & Bhaskaran Swaminathan, 2001. "Toward an Implied Cost of Capital," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 39(1), pages 135-176, June.
    8. Turan G Bali & Heiner Beckmeyer & Mathis Mörke & Florian Weigert & Stefano Giglio, 2023. "Option Return Predictability with Machine Learning and Big Data," The Review of Financial Studies, Society for Financial Studies, vol. 36(9), pages 3548-3602.
    9. Brown, Lawrence D., 1993. "Reply to commentaries on "Earnings forecasting research: its implications for capital markets research"," International Journal of Forecasting, Elsevier, vol. 9(3), pages 343-344, November.
    10. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    11. Ball, R & Brown, P, 1968. "Empirical Evaluation Of Accounting Income Numbers," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 6(2), pages 159-178.
    12. Whitney Newey & Kenneth West, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    13. Graham, John R. & Harvey, Campbell R. & Rajgopal, Shiva, 2005. "The economic implications of corporate financial reporting," Journal of Accounting and Economics, Elsevier, vol. 40(1-3), pages 3-73, December.
    14. Joseph R. Gordon & Myron J. Gordon, 1997. "The Finite Horizon Expected Return Model," Financial Analysts Journal, Taylor & Francis Journals, vol. 53(3), pages 52-61, May.
    15. Brown, Lawrence D., 1993. "Earnings forecasting research: its implications for capital markets research," International Journal of Forecasting, Elsevier, vol. 9(3), pages 295-320, November.
    16. Watts, Rl & Leftwich, Rw, 1977. "Time-Series Of Annual Accounting Earnings," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 15(2), pages 253-271.
    17. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    18. James A. Ohlson & Beate E. Juettner-Nauroth, 2005. "Expected EPS and EPS Growth as Determinantsof Value," Review of Accounting Studies, Springer, vol. 10(2), pages 349-365, September.
    19. Xi Chen & Yang Ha (Tony) Cho & Yiwei Dou & Baruch Lev, 2022. "Predicting Future Earnings Changes Using Machine Learning and Detailed Financial Data," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 60(2), pages 467-515, May.
    20. Albrecht, Ws & Lookabill, Ll & Mckeown, Jc, 1977. "Time-Series Properties Of Annual Earnings," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 15(2), pages 226-244.
    21. Jules H van Binsbergen & Xiao Han & Alejandro Lopez-Lira, 2023. "Man versus Machine Learning: The Term Structure of Earnings Expectations and Conditional Biases," The Review of Financial Studies, Society for Financial Studies, vol. 36(6), pages 2361-2396.
    22. Brown, Philip, 1993. "Comments on 'Earnings forecasting research: its implications for capital markets research' by L. Brown," International Journal of Forecasting, Elsevier, vol. 9(3), pages 331-335, November.
    23. Harris, Richard D.F. & Wang, Pengguo, 2019. "Model-based earnings forecasts vs. financial analysts' earnings forecasts," The British Accounting Review, Elsevier, vol. 51(4), pages 424-437.
    24. Monahan, Steven J., 2018. "Financial Statement Analysis and Earnings Forecasting," Foundations and Trends(R) in Accounting, now publishers, vol. 12(2), pages 105-215, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mark T. Bradshaw & Michael S. Drake & James N. Myers & Linda A. Myers, 2012. "A re-examination of analysts’ superiority over time-series forecasts of annual earnings," Review of Accounting Studies, Springer, vol. 17(4), pages 944-968, December.
    2. Vitor Azevedo & Patrick Bielstein & Manuel Gerhart, 2021. "Earnings forecasts: the case for combining analysts’ estimates with a cross-sectional model," Review of Quantitative Finance and Accounting, Springer, vol. 56(2), pages 545-579, February.
    3. Higgins, Huong, 2013. "Can securities analysts forecast intangible firms’ earnings?," International Journal of Forecasting, Elsevier, vol. 29(1), pages 155-174.
    4. Echterling, F. & Eierle, B. & Ketterer, S., 2015. "A review of the literature on methods of computing the implied cost of capital," International Review of Financial Analysis, Elsevier, vol. 42(C), pages 235-252.
    5. Stephannie Larocque, 2013. "Analysts’ earnings forecast errors and cost of equity capital estimates," Review of Accounting Studies, Springer, vol. 18(1), pages 135-166, March.
    6. Ramnath, Sundaresh & Rock, Steve & Shane, Philip, 2008. "The financial analyst forecasting literature: A taxonomy with suggestions for further research," International Journal of Forecasting, Elsevier, vol. 24(1), pages 34-75.
    7. Ferreira Savoia, José Roberto & Securato, José Roberto & Bergmann, Daniel Reed & Lopes da Silva, Fabiana, 2019. "Comparing results of the implied cost of capital and capital asset pricing models for infrastructure firms in Brazil," Utilities Policy, Elsevier, vol. 56(C), pages 149-158.
    8. S. P. Kothari & Charles Wasley, 2019. "Commemorating the 50‐Year Anniversary of Ball and Brown (1968): The Evolution of Capital Market Research over the Past 50 Years," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 57(5), pages 1117-1159, December.
    9. Hou, Kewei & van Dijk, Mathijs A. & Zhang, Yinglei, 2012. "The implied cost of capital: A new approach," Journal of Accounting and Economics, Elsevier, vol. 53(3), pages 504-526.
    10. Alexander P. Paton & Damien Cannavan & Stephen Gray & Khoa Hoang, 2020. "Analyst versus model‐based earnings forecasts: implied cost of capital applications," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(4), pages 4061-4092, December.
    11. Lin, Hai & Tao, Xinyuan & Wu, Chunchi, 2022. "Forecasting earnings with combination of analyst forecasts," Journal of Empirical Finance, Elsevier, vol. 68(C), pages 133-159.
    12. Kothari, S. P., 2001. "Capital markets research in accounting," Journal of Accounting and Economics, Elsevier, vol. 31(1-3), pages 105-231, September.
    13. Partha Mohanram & Dan Gode, 2013. "Removing predictable analyst forecast errors to improve implied cost of equity estimates," Review of Accounting Studies, Springer, vol. 18(2), pages 443-478, June.
    14. Sati P. Bandyopadhyay & Alan Guoming Huang & Kevin Jialin Sun & Tony S. Wirjanto, 2017. "The return premiums to accruals quality," Review of Quantitative Finance and Accounting, Springer, vol. 48(1), pages 83-115, January.
    15. Holger Daske & Günther Gebhardt, 2006. "Zukunftsorientierte Bestimmung von Risikoprämien und Eigenkapitalkosten für die Unternehmensbewertung," Schmalenbach Journal of Business Research, Springer, vol. 58(4), pages 530-551, June.
    16. Saeed Fathi & Zahra Mohammadin & Karim Azarbayjani, 2025. "Corporate finance signaling theory: an empirical analysis on the relationship between information asymmetry and the cost of equity capital," International Journal of Disclosure and Governance, Palgrave Macmillan, vol. 22(3), pages 629-643, September.
    17. Chang, Chu-Hsuan & Lin, Hsiou-Wei William, 2018. "Does there prevail momentum in earnings management for seasoned equity offering firms?," International Review of Economics & Finance, Elsevier, vol. 55(C), pages 111-129.
    18. Kjell Bjørn Nordal & Randi Næs, 2009. "Mean reversion in profitability for non-listed firms," Working Paper 2009/29, Norges Bank.
    19. Datta, Sudip & Iskandar-Datta, Mai & Sharma, Vivek, 2011. "Product market pricing power, industry concentration and analysts' earnings forecasts," Journal of Banking & Finance, Elsevier, vol. 35(6), pages 1352-1366, June.
    20. David Ashton & Chau (Ruby) Trinh, 2018. "Evaluating the information content of earnings forecasts," Accounting and Business Research, Taylor & Francis Journals, vol. 48(6), pages 674-699, September.

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • G11 - Financial Economics - - General Financial Markets - - - Portfolio Choice; Investment Decisions
    • G12 - Financial Economics - - General Financial Markets - - - Asset Pricing; Trading Volume; Bond Interest Rates
    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • G31 - Financial Economics - - Corporate Finance and Governance - - - Capital Budgeting; Fixed Investment and Inventory Studies
    • G32 - Financial Economics - - Corporate Finance and Governance - - - Financing Policy; Financial Risk and Risk Management; Capital and Ownership Structure; Value of Firms; Goodwill
    • M40 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Accounting - - - General
    • M41 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Accounting - - - Accounting

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:cfrwps:323935. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/cfkoede.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.