IDEAS home Printed from https://ideas.repec.org/a/eee/empfin/v81y2025ics0927539825000209.html
   My bibliography  Save this article

Is machine learning a necessity? A regression-based approach for stock return prediction

Author

Listed:
  • Cheng, Tingting
  • Jiang, Shan
  • Zhao, Albert Bo
  • Zhao, Junyi

Abstract

We propose a simple, linear-regression-based method for prediction of the time series of stock returns. The method achieves out-of-sample performances comparable to machine learning methods while having ignorable computational costs. The key component of the method is to integrate a straightforward cross-market factor screening into the iterated combination method proposed by Lin et al., (2018). Our empirical results on the U.S. stock market show that the method outperforms many state-of-the-art machine learning methods in certain periods. The method also exhibits greater utility gain and investment profits in most periods after considering transaction costs.

Suggested Citation

  • Cheng, Tingting & Jiang, Shan & Zhao, Albert Bo & Zhao, Junyi, 2025. "Is machine learning a necessity? A regression-based approach for stock return prediction," Journal of Empirical Finance, Elsevier, vol. 81(C).
  • Handle: RePEc:eee:empfin:v:81:y:2025:i:c:s0927539825000209
    DOI: 10.1016/j.jempfin.2025.101598
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0927539825000209
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jempfin.2025.101598?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    2. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    3. Daniele Bianchi & Matthias Büchner & Tobias Hoogteijling & Andrea Tamoni, 2021. "Corrigendum: Bond Risk Premiums with Machine Learning [Bond risk premiums with machine learning]," The Review of Financial Studies, Society for Financial Studies, vol. 34(2), pages 1090-1103.
    4. Xue Gong & Weiguo Zhang & Yuan Zhao & Xin Ye, 2023. "Forecasting stock volatility with a large set of predictors: A new forecast combination method," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1622-1647, November.
    5. John Y. Campbell & Samuel B. Thompson, 2008. "Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average?," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1509-1531, July.
    6. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    7. Leippold, Markus & Wang, Qian & Zhou, Wenyu, 2022. "Machine learning in the Chinese stock market," Journal of Financial Economics, Elsevier, vol. 145(2), pages 64-82.
    8. Guanhao Feng & Stefano Giglio & Dacheng Xiu, 2020. "Taming the Factor Zoo: A Test of New Factors," Journal of Finance, American Finance Association, vol. 75(3), pages 1327-1370, June.
    9. Hutchinson, James M & Lo, Andrew W & Poggio, Tomaso, 1994. "A Nonparametric Approach to Pricing and Hedging Derivative Securities via Learning Networks," Journal of Finance, American Finance Association, vol. 49(3), pages 851-889, July.
    10. Harvey, David I & Leybourne, Stephen J & Newbold, Paul, 1998. "Tests for Forecast Encompassing," Journal of Business & Economic Statistics, American Statistical Association, vol. 16(2), pages 254-259, April.
    11. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    12. Zhao, Albert Bo & Cheng, Tingting, 2022. "Stock return prediction: Stacking a variety of models," Journal of Empirical Finance, Elsevier, vol. 67(C), pages 288-317.
    13. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    14. Fama, Eugene F. & French, Kenneth R., 1989. "Business conditions and expected returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 25(1), pages 23-49, November.
    15. Hongwei Zhang & Qiang He & Ben Jacobsen & Fuwei Jiang, 2020. "Forecasting stock returns with model uncertainty and parameter instability," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(5), pages 629-644, August.
    16. Zhou Lu & Zhuyao Zhuo, 2021. "Modelling of Chinese corporate bond default – A machine learning approach," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 61(5), pages 6147-6191, December.
    17. Hai Lin & Chunchi Wu & Guofu Zhou, 2018. "Forecasting Corporate Bond Returns with a Large Set of Predictors: An Iterated Combination Approach," Management Science, INFORMS, vol. 64(9), pages 4218-4238, September.
    18. Rapach, David & Zhou, Guofu, 2013. "Forecasting Stock Returns," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 328-383, Elsevier.
    19. Daniele Bianchi & Matthias Büchner & Andrea Tamoni, 2021. "Bond Risk Premiums with Machine Learning [Quadratic term structure models: Theory and evidence]," The Review of Financial Studies, Society for Financial Studies, vol. 34(2), pages 1046-1089.
    20. Dangl, Thomas & Halling, Michael, 2012. "Predictive regressions with time-varying coefficients," Journal of Financial Economics, Elsevier, vol. 106(1), pages 157-181.
    21. Elliott, Graham & Gargano, Antonio & Timmermann, Allan, 2013. "Complete subset regressions," Journal of Econometrics, Elsevier, vol. 177(2), pages 357-373.
    22. Clark, Todd E. & West, Kenneth D., 2007. "Approximately normal tests for equal predictive accuracy in nested models," Journal of Econometrics, Elsevier, vol. 138(1), pages 291-311, May.
    23. David E. Rapach & Jack K. Strauss & Guofu Zhou, 2010. "Out-of-Sample Equity Premium Prediction: Combination Forecasts and Links to the Real Economy," The Review of Financial Studies, Society for Financial Studies, vol. 23(2), pages 821-862, February.
    24. Rachidi Kotchoni & Maxime Leroux & Dalibor Stevanovic, 2019. "Macroeconomic forecast accuracy in a data‐rich environment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(7), pages 1050-1072, November.
    25. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.
    26. Bryan Kelly & Semyon Malamud & Kangying Zhou, 2024. "The Virtue of Complexity in Return Prediction," Journal of Finance, American Finance Association, vol. 79(1), pages 459-503, February.
    27. Luiz Renato Lima & Fanning Meng, 2017. "Out‐of‐Sample Return Predictability: A Quantile Combination Approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(4), pages 877-895, June.
    28. Davide Castelvecchi, 2016. "Can we open the black box of AI?," Nature, Nature, vol. 538(7623), pages 20-23, October.
    29. Bruce E. Hansen, 2007. "Least Squares Model Averaging," Econometrica, Econometric Society, vol. 75(4), pages 1175-1189, July.
    30. Masaya Abe & Hideki Nakayama, 2018. "Deep Learning for Forecasting Stock Returns in the Cross-Section," Papers 1801.01777, arXiv.org, revised Jun 2018.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhao, Albert Bo & Cheng, Tingting, 2022. "Stock return prediction: Stacking a variety of models," Journal of Empirical Finance, Elsevier, vol. 67(C), pages 288-317.
    2. Faria, Gonçalo & Verona, Fabio, 2024. "Enhancing forecast accuracy through frequencydomain combination: Applications to financial and economic indicators," Bank of Finland Research Discussion Papers 14/2024, Bank of Finland.
    3. Mykola Babiak & Jozef Barunik, 2020. "Deep Learning, Predictability, and Optimal Portfolio Returns," CERGE-EI Working Papers wp677, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    4. Feng, Guanhao & He, Xin & Wang, Yanchu & Wu, Chunchi, 2025. "Predicting individual corporate bond returns," Journal of Banking & Finance, Elsevier, vol. 171(C).
    5. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    6. Pan, Zhiyuan & Zhong, Hao & Wang, Yudong & Huang, Juan, 2024. "Forecasting oil futures returns with news," Energy Economics, Elsevier, vol. 134(C).
    7. Zhang, Yaojie & Wang, Yudong, 2023. "Forecasting crude oil futures market returns: A principal component analysis combination approach," International Journal of Forecasting, Elsevier, vol. 39(2), pages 659-673.
    8. , & Stein, Tobias, 2021. "Equity premium predictability over the business cycle," CEPR Discussion Papers 16357, C.E.P.R. Discussion Papers.
    9. Wang, Yudong & Hao, Xianfeng & Wu, Chongfeng, 2021. "Forecasting stock returns: A time-dependent weighted least squares approach," Journal of Financial Markets, Elsevier, vol. 53(C).
    10. Niu, Zibo & Demirer, Riza & Suleman, Muhammad Tahir & Zhang, Hongwei & Zhu, Xuehong, 2024. "Do industries predict stock market volatility? Evidence from machine learning models," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 90(C).
    11. Díaz, Juan D. & Hansen, Erwin & Cabrera, Gabriel, 2023. "Gold risk premium estimation with machine learning methods," Journal of Commodity Markets, Elsevier, vol. 31(C).
    12. Xi Dong & Yan Li & David E. Rapach & Guofu Zhou, 2022. "Anomalies and the Expected Market Return," Journal of Finance, American Finance Association, vol. 77(1), pages 639-681, February.
    13. Smith, Simon C., 2021. "International stock return predictability," International Review of Financial Analysis, Elsevier, vol. 78(C).
    14. Gonçalo Faria & Fabio Verona, 2016. "Forecasting the equity risk premium with frequency-decomposed predictors," Working Papers de Economia (Economics Working Papers) 06, Católica Porto Business School, Universidade Católica Portuguesa.
    15. Uddin, Ajim & Tao, Xinyuan & Yu, Dantong, 2023. "Attention based dynamic graph neural network for asset pricing," Global Finance Journal, Elsevier, vol. 58(C).
    16. Daniel Borup & Jonas N. Eriksen & Mads M. Kjær & Martin Thyrsgaard, 2024. "Predicting Bond Return Predictability," Management Science, INFORMS, vol. 70(2), pages 931-951, February.
    17. Hongwei Zhang & Qiang He & Ben Jacobsen & Fuwei Jiang, 2020. "Forecasting stock returns with model uncertainty and parameter instability," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(5), pages 629-644, August.
    18. Gonçalo Faria & Fabio Verona, 2016. "Forecasting the equity risk premium with frequency-decomposed predictors," Working Papers de Economia (Economics Working Papers) 06, Católica Porto Business School, Universidade Católica Portuguesa.
    19. Nusret Cakici & Christian Fieberg & Daniel Metko & Adam Zaremba, 2024. "Do Anomalies Really Predict Market Returns? New Data and New Evidence," Review of Finance, European Finance Association, vol. 28(1), pages 1-44.
    20. Dai, Zhifeng & Zhu, Huan, 2020. "Stock return predictability from a mixed model perspective," Pacific-Basin Finance Journal, Elsevier, vol. 60(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:empfin:v:81:y:2025:i:c:s0927539825000209. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jempfin .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.