IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2002.08245.html
   My bibliography  Save this paper

AutoAlpha: an Efficient Hierarchical Evolutionary Algorithm for Mining Alpha Factors in Quantitative Investment

Author

Listed:
  • Tianping Zhang
  • Yuanqi Li
  • Yifei Jin
  • Jian Li

Abstract

The multi-factor model is a widely used model in quantitative investment. The success of a multi-factor model is largely determined by the effectiveness of the alpha factors used in the model. This paper proposes a new evolutionary algorithm called AutoAlpha to automatically generate effective formulaic alphas from massive stock datasets. Specifically, first we discover an inherent pattern of the formulaic alphas and propose a hierarchical structure to quickly locate the promising part of space for search. Then we propose a new Quality Diversity search based on the Principal Component Analysis (PCA-QD) to guide the search away from the well-explored space for more desirable results. Next, we utilize the warm start method and the replacement method to prevent the premature convergence problem. Based on the formulaic alphas we discover, we propose an ensemble learning-to-rank model for generating the portfolio. The backtests in the Chinese stock market and the comparisons with several baselines further demonstrate the effectiveness of AutoAlpha in mining formulaic alphas for quantitative trading.

Suggested Citation

  • Tianping Zhang & Yuanqi Li & Yifei Jin & Jian Li, 2020. "AutoAlpha: an Efficient Hierarchical Evolutionary Algorithm for Mining Alpha Factors in Quantitative Investment," Papers 2002.08245, arXiv.org, revised Apr 2020.
  • Handle: RePEc:arx:papers:2002.08245
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2002.08245
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stephen A. Ross, 2013. "The Arbitrage Theory of Capital Asset Pricing," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 1, pages 11-30, World Scientific Publishing Co. Pte. Ltd..
    2. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    3. William F. Sharpe, 1964. "Capital Asset Prices: A Theory Of Market Equilibrium Under Conditions Of Risk," Journal of Finance, American Finance Association, vol. 19(3), pages 425-442, September.
    4. Jegadeesh, Narasimhan & Titman, Sheridan, 1993. "Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency," Journal of Finance, American Finance Association, vol. 48(1), pages 65-91, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hang Yuan & Saizhuo Wang & Jian Guo, 2024. "Alpha-GPT 2.0: Human-in-the-Loop AI for Quantitative Investment," Papers 2402.09746, arXiv.org.
    2. Saizhuo Wang & Hang Yuan & Leon Zhou & Lionel M. Ni & Heung-Yeung Shum & Jian Guo, 2023. "Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment," Papers 2308.00016, arXiv.org.
    3. Bruno Gav{s}perov & Marko {DJ}urasevi'c & Domagoj Jakobovic, 2024. "Finding Near-Optimal Portfolios With Quality-Diversity," Papers 2402.16118, arXiv.org.
    4. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    5. Tao Ren & Ruihan Zhou & Jinyang Jiang & Jiafeng Liang & Qinghao Wang & Yijie Peng, 2024. "RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search," Papers 2402.07080, arXiv.org, revised Feb 2024.
    6. Chuheng Zhang & Yuanqi Li & Xi Chen & Yifei Jin & Pingzhong Tang & Jian Li, 2020. "DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis," Papers 2010.01265, arXiv.org, revised Jan 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    2. Zura Kakushadze, 2014. "4-Factor Model for Overnight Returns," Papers 1410.5513, arXiv.org, revised Jun 2015.
    3. Jean-Pierre Allegret & Hélène Raymond & Houda Rharrabti, 2014. "The impact of the global and eurozone crises on European banks stocks Some evidence of shift contagion," Working Papers hal-04141339, HAL.
    4. Adam Zaremba & Jacob Koby Shemer, 2018. "Price-Based Investment Strategies," Springer Books, Springer, number 978-3-319-91530-2, December.
    5. repec:gnv:wpaper:unige:76321 is not listed on IDEAS
    6. Lu Zhang, 2017. "The Investment CAPM," European Financial Management, European Financial Management Association, vol. 23(4), pages 545-603, September.
    7. Lu Zhang, 2019. "Q-factors and Investment CAPM," NBER Working Papers 26538, National Bureau of Economic Research, Inc.
    8. Adrian, Tobias & Franzoni, Francesco, 2009. "Learning about beta: Time-varying factor loadings, expected returns, and the conditional CAPM," Journal of Empirical Finance, Elsevier, vol. 16(4), pages 537-556, September.
    9. Stafylas, Dimitrios & Anderson, Keith & Uddin, Moshfique, 2017. "Recent advances in explaining hedge fund returns: Implicit factors and exposures," Global Finance Journal, Elsevier, vol. 33(C), pages 69-87.
    10. Zura Kakushadze & Willie Yu, 2016. "Multifactor Risk Models and Heterotic CAPM," Papers 1602.04902, arXiv.org, revised Mar 2016.
    11. Michael E. Drew & Madhu Veeraraghavan, 2000. "Multifactor Models are Alive and Well," School of Economics and Finance Discussion Papers and Working Papers Series 083, School of Economics and Finance, Queensland University of Technology.
    12. Patrick Gagliardini & Elisa Ossola & Olivier Scaillet, 2016. "Time‐Varying Risk Premium in Large Cross‐Sectional Equity Data Sets," Econometrica, Econometric Society, vol. 84, pages 985-1046, May.
    13. Lubos Pastor & Robert F. Stambaugh, "undated". "Evaluating and Investing in Equity Mutual Funds," CRSP working papers 516, Center for Research in Security Prices, Graduate School of Business, University of Chicago.
    14. Alex R. Horenstein, 2021. "The Unintended Impact of Academic Research on Asset Returns: The Capital Asset Pricing Model Alpha," Management Science, INFORMS, vol. 67(6), pages 3655-3673, June.
    15. repec:wvu:wpaper:10-08 is not listed on IDEAS
    16. Amit Goyal, 2012. "Empirical cross-sectional asset pricing: a survey," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 26(1), pages 3-38, March.
    17. Chang Liu & Bowen Deng, 2023. "Is it really paid for sustainable development? The economic significance of firms' green practice," Sustainable Development, John Wiley & Sons, Ltd., vol. 31(2), pages 908-925, April.
    18. Simin, Timothy, 2008. "The Poor Predictive Performance of Asset Pricing Models," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 43(2), pages 355-380, June.
    19. Sawaliya, Priya & Sinha, Pankaj, 2018. "Behaviour of asset pricing models in pre and post-recession period: an evidence from India," MPRA Paper 93084, University Library of Munich, Germany, revised 22 Jan 2019.
    20. Rocciolo, Francesco & Gheno, Andrea & Brooks, Chris, 2022. "Explaining abnormal returns in stock markets: An alpha-neutral version of the CAPM," International Review of Financial Analysis, Elsevier, vol. 82(C).
    21. Nawar Hashem & Larry Su, 2015. "Industry Concentration and the Cross-Section of Stock Returns: Evidence from the UK," Journal of Business Economics and Management, Taylor & Francis Journals, vol. 16(4), pages 769-785, August.
    22. Kei Nakagawa & Tomoki Ito & Masaya Abe & Kiyoshi Izumi, 2019. "Deep Recurrent Factor Model: Interpretable Non-Linear and Time-Varying Multi-Factor Model," Papers 1901.11493, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2002.08245. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.