
Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500

Author

Listed:
  • Krauss, Christopher
  • Do, Xuan Anh
  • Huck, Nicolas

Abstract

In recent years, machine learning research has gained momentum: New developments in the field of deep learning allow for multiple levels of abstraction and are starting to supersede well-known and powerful tree-based techniques mainly operating on the original feature space. All these methods can be applied to various fields, including finance. This article implements and analyses the effectiveness of deep neural networks (DNN), gradient-boosted trees (GBT), random forests (RAF), and a combination (ENS) of these methods in the context of statistical arbitrage. Each model is trained on lagged returns of all stocks in the S&P 500, after elimination of survivor bias. From 1992 to 2015, daily one-day-ahead trading signals are generated based on the probability forecast of a stock to outperform the general market. The highest k probabilities are converted into long and the lowest k probabilities into short positions, thus censoring the less certain middle part of the ranking. Empirical findings are promising. A simple ensemble consisting of one deep neural network, one gradient-boosted tree, and one random forest produces out-of-sample returns exceeding 0.45 percent per day for k = 10, prior to transaction costs. Irrespective of the fact that profits are declining in recent years, our findings pose a severe challenge to the semi-strong form of market efficiency.
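
The ranking step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the equal-weighted averaging used for the ensemble, and the equal-weighted long-short return are assumptions made for illustration, and the probabilities and returns are made-up toy values.

```python
import numpy as np


def ensemble_probability(p_dnn, p_gbt, p_raf):
    """Average three probability forecasts with equal weights.

    Equal weighting is an assumption; the abstract only says "simple ensemble".
    """
    stacked = np.array([p_dnn, p_gbt, p_raf], dtype=float)
    return stacked.mean(axis=0)


def top_flop_positions(prob, k):
    """Map probabilities to positions: +1 for the k highest, -1 for the k lowest.

    Stocks in the middle of the ranking get 0, i.e. they are not traded.
    """
    prob = np.asarray(prob, dtype=float)
    if not 0 < 2 * k <= prob.size:
        raise ValueError("need 0 < 2k <= number of stocks")
    order = np.argsort(prob, kind="stable")  # indices in ascending order
    positions = np.zeros(prob.size, dtype=int)
    positions[order[:k]] = -1   # lowest k probabilities: short
    positions[order[-k:]] = 1   # highest k probabilities: long
    return positions


def long_short_return(positions, next_day_returns):
    """Equal-weighted long-short return before transaction costs (assumed form)."""
    r = np.asarray(next_day_returns, dtype=float)
    return r[positions == 1].mean() - r[positions == -1].mean()


# Toy example: six hypothetical stocks on one day, k = 2.
p_dnn = [0.60, 0.40, 0.55, 0.30, 0.70, 0.50]
p_gbt = [0.70, 0.35, 0.50, 0.25, 0.65, 0.45]
p_raf = [0.65, 0.45, 0.45, 0.35, 0.75, 0.55]

p_ens = ensemble_probability(p_dnn, p_gbt, p_raf)
positions = top_flop_positions(p_ens, k=2)

# Hypothetical realized next-day returns for the same six stocks.
realized = [0.01, -0.02, 0.00, -0.01, 0.03, 0.005]
daily_return = long_short_return(positions, realized)
```

In this toy setting the two stocks with the highest ensemble probability are bought and the two with the lowest are sold short, while the remaining two are left out; the paper applies the same idea to the S&P 500 constituents with k = 10 in its headline result.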

Suggested Citation

  • Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2016. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," FAU Discussion Papers in Economics 03/2016, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
  • Handle: RePEc:zbw:iwqwdp:032016

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/130166/1/856307327.pdf
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Flori, Andrea & Regoli, Daniele, 2021. "Revealing Pairs-trading opportunities with long short-term memory networks," European Journal of Operational Research, Elsevier, vol. 295(2), pages 772-791.
    2. Matthew Clegg & Christopher Krauss, 2018. "Pairs trading with partial cointegration," Quantitative Finance, Taylor & Francis Journals, vol. 18(1), pages 121-138, January.
    3. Huck, Nicolas, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," European Journal of Operational Research, Elsevier, vol. 278(1), pages 330-342.
    4. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    5. Krauss, Christopher, 2015. "Statistical arbitrage pairs trading strategies: Review and outlook," FAU Discussion Papers in Economics 09/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    6. Fischer, Thomas & Krauss, Christopher, 2017. "Deep learning with long short-term memory networks for financial market predictions," FAU Discussion Papers in Economics 11/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    7. Clegg, Matthew & Krauss, Christopher, 2016. "Pairs trading with partial cointegration," FAU Discussion Papers in Economics 05/2016, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    8. Hossein Rad & Rand Kwong Yew Low & Robert Faff, 2016. "The profitability of pairs trading strategies: distance, cointegration and copula methods," Quantitative Finance, Taylor & Francis Journals, vol. 16(10), pages 1541-1558, October.
    9. Han, Chulwoo & He, Zhaodong & Toh, Alenson Jun Wei, 2023. "Pairs trading via unsupervised learning," European Journal of Operational Research, Elsevier, vol. 307(2), pages 929-947.
    10. Knoll, Julian & Stübinger, Johannes & Grottke, Michael, 2017. "Exploiting social media with higher-order Factorization Machines: Statistical arbitrage on high-frequency data of the S&P 500," FAU Discussion Papers in Economics 13/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    11. Rubesam, Alexandre, 2022. "Machine learning portfolios with equal risk contributions: Evidence from the Brazilian market," Emerging Markets Review, Elsevier, vol. 51(PB).
    12. Jeff Stephenson & Bruce Vanstone & Tobias Hahn, 2021. "A Unifying Model for Statistical Arbitrage: Model Assumptions and Empirical Failure," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 943-964, December.
    13. Krauss, Christopher & Krüger, Tom & Beerstecher, Daniel, 2015. "The Piotroski F-Score: A fundamental value strategy revisited from an investor's perspective," FAU Discussion Papers in Economics 13/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    14. Krauss, Christopher & Beerstecher, Daniel & Krüger, Tom, 2015. "Feasible earnings momentum in the U.S. stock market: An investor's perspective," FAU Discussion Papers in Economics 12/2015, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    15. Fernando Caneo & Werner Kristjanpoller, 2021. "Improving statistical arbitrage investment strategy: Evidence from Latin American stock markets," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(3), pages 4424-4440, July.
    16. Stübinger, Johannes & Endres, Sylvia, 2017. "Pairs trading with a mean-reverting jump-diffusion model on high-frequency data," FAU Discussion Papers in Economics 10/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    17. Marianna Brunetti & Roberta De Luca, 2023. "Pairs trading in the index options market," Eurasian Economic Review, Springer;Eurasia Business and Economics Society, vol. 13(1), pages 145-173, March.
    18. Carlos Henrique Dias Cordeiro de Castro & Fernando Antonio Lucena Aiube, 2023. "Forecasting inflation time series using score‐driven dynamic models and combination methods: The case of Brazil," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 369-401, March.
    19. Endres, Sylvia & Stübinger, Johannes, 2017. "Optimal trading strategies for Lévy-driven Ornstein-Uhlenbeck processes," FAU Discussion Papers in Economics 17/2017, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    20. Jiun-Hua Su, 2021. "No-Regret Forecasting with Egalitarian Committees," Papers 2109.13801, arXiv.org.

    More about this item

    Keywords

    statistical arbitrage; deep learning; gradient-boosting; random forests; ensemble learning
