IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-02311104.html
   My bibliography  Save this paper

Ensemble Learning Applied to Quant Equity: Gradient Boosting in a Multifactor Framework

Author

Listed:
  • Tony Guida

    (EM - EMLyon Business School)

  • Guillaume Coqueret

Abstract

This chapter proposes to benefit from the advantages of machine learning (ML) in general and boosted trees in particular, e.g. non‐linearity, regularization and good generalization results, scaling up well with lots of data. It gives a mildly technical introduction to boosted trees. The chapter introduces the construction of the dataset with the feature and labels engineering, and the calibration of the ML applying rigorous protocol established by the computer science community. It describes the data used and the empirical protocol for the ML model. The chapter also introduces the concept of confusion matrix and all the related metrics in order to precisely assess a ML model's quality. It provides guidance on how to tune, train and test an ML‐based model using traditional financial characteristics such as valuation and profitability metrics, but also price momentum, risk estimates, volume and liquidity characteristic.

Suggested Citation

  • Tony Guida & Guillaume Coqueret, 2019. "Ensemble Learning Applied to Quant Equity: Gradient Boosting in a Multifactor Framework," Post-Print hal-02311104, HAL.
  • Handle: RePEc:hal:journl:hal-02311104
    DOI: 10.1002/9781119522225.ch7
    Note: View the original document on HAL open archive server: https://hal.science/hal-02311104
    as

    Download full text from publisher

    File URL: https://hal.science/hal-02311104/document
    Download Restriction: no

    File URL: https://libkey.io/10.1002/9781119522225.ch7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Krauss, Christopher & Do, Xuan Anh & Huck, Nicolas, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," European Journal of Operational Research, Elsevier, vol. 259(2), pages 689-702.
    2. Bodnar, Taras & Mazur, Stepan & Okhrin, Yarema, 2017. "Bayesian estimation of the global minimum variance portfolio," European Journal of Operational Research, Elsevier, vol. 256(1), pages 292-307.
    3. Michael W. Brandt & Pedro Santa-Clara & Rossen Valkanov, 2009. "Parametric Portfolio Policies: Exploiting Characteristics in the Cross-Section of Equity Returns," The Review of Financial Studies, Society for Financial Studies, vol. 22(9), pages 3411-3447, September.
    4. Ammann, Manuel & Coqueret, Guillaume & Schade, Jan-Philip, 2016. "Characteristics-based portfolio choice with leverage constraints," Journal of Banking & Finance, Elsevier, vol. 70(C), pages 23-37.
    5. Christopher Krauss & Anh Do & Nicolas Huck, 2017. "Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500," Post-Print hal-01768895, HAL.
    6. Banz, Rolf W., 1981. "The relationship between return and market value of common stocks," Journal of Financial Economics, Elsevier, vol. 9(1), pages 3-18, March.
    7. Manuel Ammann & Guillaume Coqueret & Jan-Philip Schade, 2016. "Characteristics-based portfolio choice with leverage constraints," Post-Print hal-02312221, HAL.
    8. R. David Mclean & Jeffrey Pontiff, 2016. "Does Academic Research Destroy Stock Return Predictability?," Journal of Finance, American Finance Association, vol. 71(1), pages 5-32, February.
    9. Avanidhar Subrahmanyam, 2010. "The Cross†Section of Expected Stock Returns: What Have We Learnt from the Past Twenty†Five Years of Research?," European Financial Management, European Financial Management Association, vol. 16(1), pages 27-42, January.
    10. van Dijk, Mathijs A., 2011. "Is size dead? A review of the size effect in equity returns," Journal of Banking & Finance, Elsevier, vol. 35(12), pages 3263-3274.
    11. Ang, Andrew, 2014. "Asset Management: A Systematic Approach to Factor Investing," OUP Catalogue, Oxford University Press, number 9780199959327.
    12. Narasimhan Jegadeesh & Sheridan Titman, 2001. "Profitability of Momentum Strategies: An Evaluation of Alternative Explanations," Journal of Finance, American Finance Association, vol. 56(2), pages 699-720, April.
    13. Daniel, Kent & Titman, Sheridan, 1997. "Evidence on the Characteristics of Cross Sectional Variation in Stock Returns," Journal of Finance, American Finance Association, vol. 52(1), pages 1-33, March.
    14. Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
    15. Fama, Eugene F & French, Kenneth R, 1992. "The Cross-Section of Expected Stock Returns," Journal of Finance, American Finance Association, vol. 47(2), pages 427-465, June.
    16. Manuel Ammann & Guillaume Coqueret & Jan-Philip Schade, 2016. "Characteristics-based portfolio choice with leverage constraints," Post-Print hal-02009129, HAL.
    17. Fischer, Thomas & Krauss, Christopher, 2018. "Deep learning with long short-term memory networks for financial market predictions," European Journal of Operational Research, Elsevier, vol. 270(2), pages 654-669.
    18. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guillaume Chevalier & Guillaume Coqueret & Thomas Raffinot, 2022. "Supervised portfolios," Post-Print hal-04144588, HAL.
    2. Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Post-Print hal-04144665, HAL.
    3. Guillaume Coqueret & Tony Guida, 2020. "Training trees on tails with applications to portfolio choice," Annals of Operations Research, Springer, vol. 288(1), pages 181-221, May.
    4. Eric Andr'e & Guillaume Coqueret, 2020. "Dirichlet policies for reinforced factor portfolios," Papers 2011.05381, arXiv.org, revised Jun 2021.
    5. Stephen A. Gorman & Frank J. Fabozzi, 2021. "The ABC’s of the alternative risk premium: academic roots," Journal of Asset Management, Palgrave Macmillan, vol. 22(6), pages 405-436, October.
    6. Anton Astakhov & Tomas Havranek & Jiri Novak, 2019. "Firm Size And Stock Returns: A Quantitative Survey," Journal of Economic Surveys, Wiley Blackwell, vol. 33(5), pages 1463-1492, December.
    7. Amit Goyal, 2012. "Empirical cross-sectional asset pricing: a survey," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 26(1), pages 3-38, March.
    8. Auer, Benjamin R. & Rottmann, Horst, 2019. "Have capital market anomalies worldwide attenuated in the recent era of high liquidity and trading activity?," Journal of Economics and Business, Elsevier, vol. 103(C), pages 61-79.
    9. Joachim Freyberger & Andreas Neuhierl & Michael Weber, 2020. "Dissecting Characteristics Nonparametrically," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2326-2377.
    10. Zaremba Adam & Konieczka Przemysław, 2017. "Size, Value, and Momentum in Polish Equity Returns: Local or International Factors?," International Journal of Management and Economics, Warsaw School of Economics, Collegium of World Economy, vol. 53(3), pages 26-47, September.
    11. Waszczuk, Antonina, 2013. "A risk-based explanation of return patterns—Evidence from the Polish stock market," Emerging Markets Review, Elsevier, vol. 15(C), pages 186-210.
    12. Guillaume Coqueret, 2022. "Characteristics-driven returns in equilibrium," Papers 2203.07865, arXiv.org.
    13. DeMiguel, Victor & Martin-Utrera, Alberto & Nogales, Francisco J. & Uppal, Raman, 2017. "A Portfolio Perspective on the Multitude of Firm Characteristics," CEPR Discussion Papers 12417, C.E.P.R. Discussion Papers.
    14. Eero Pätäri & Timo Leivo, 2017. "A Closer Look At Value Premium: Literature Review And Synthesis," Journal of Economic Surveys, Wiley Blackwell, vol. 31(1), pages 79-168, February.
    15. Adam Zaremba & Jacob Koby Shemer, 2018. "Price-Based Investment Strategies," Springer Books, Springer, number 978-3-319-91530-2, December.
    16. Sabine Artmann & Philipp Finter & Alexander Kempf & Stefan Koch & Erik Theissen, 2012. "The Cross-Section of German Stock Returns: New Data and New Evidence," Schmalenbach Business Review (sbr), LMU Munich School of Management, vol. 64(1), pages 20-43, January.
    17. Kentaro Imajo & Kentaro Minami & Katsuya Ito & Kei Nakagawa, 2020. "Deep Portfolio Optimization via Distributional Prediction of Residual Factors," Papers 2012.07245, arXiv.org.
    18. John Cotter & Niall McGeever, 2018. "Are equity market anomalies disappearing? Evidence from the U.K," Working Papers 201804, Geary Institute, University College Dublin.
    19. Clarke, Charles, 2022. "The level, slope, and curve factor model for stocks," Journal of Financial Economics, Elsevier, vol. 143(1), pages 159-187.
    20. de Groot, Wilma & Pang, Juan & Swinkels, Laurens, 2012. "The cross-section of stock returns in frontier emerging markets," Journal of Empirical Finance, Elsevier, vol. 19(5), pages 796-818.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-02311104. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.