IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2306.12964.html
   My bibliography  Save this paper

Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning

Author

Listed:
  • Shuo Yu
  • Hongyan Xue
  • Xiang Ao
  • Feiyang Pan
  • Jia He
  • Dandan Tu
  • Qing He

Abstract

In the field of quantitative trading, it is common practice to transform raw historical stock data into indicative signals for the market trend. Such signals are called alpha factors. Alphas in formula forms are more interpretable and thus favored by practitioners concerned with risk. In practice, a set of formulaic alphas is often used together for better modeling precision, so we need to find synergistic formulaic alpha sets that work well together. However, most traditional alpha generators mine alphas one by one separately, overlooking the fact that the alphas would be combined later. In this paper, we propose a new alpha-mining framework that prioritizes mining a synergistic set of alphas, i.e., it directly uses the performance of the downstream combination model to optimize the alpha generator. Our framework also leverages the strong exploratory capabilities of reinforcement learning~(RL) to better explore the vast search space of formulaic alphas. The contribution to the combination models' performance is assigned to be the return used in the RL process, driving the alpha generator to find better alphas that improve upon the current set. Experimental evaluations on real-world stock market data demonstrate both the effectiveness and the efficiency of our framework for stock trend forecasting. The investment simulation results show that our framework is able to achieve higher returns compared to previous approaches.

Suggested Citation

  • Shuo Yu & Hongyan Xue & Xiang Ao & Feiyang Pan & Jia He & Dandan Tu & Qing He, 2023. "Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning," Papers 2306.12964, arXiv.org.
  • Handle: RePEc:arx:papers:2306.12964
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2306.12964
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Deb, Chirag & Zhang, Fan & Yang, Junjing & Lee, Siew Eang & Shah, Kwok Wei, 2017. "A review on time series forecasting techniques for building energy consumption," Renewable and Sustainable Energy Reviews, Elsevier, vol. 74(C), pages 902-924.
    2. Wentao Xu & Weiqing Liu & Chang Xu & Jiang Bian & Jian Yin & Tie-Yan Liu, 2021. "REST: Relational Event-driven Stock Trend Forecasting," Papers 2102.07372, arXiv.org, revised Feb 2021.
    3. Xiao Yang & Weiqing Liu & Dong Zhou & Jiang Bian & Tie-Yan Liu, 2020. "Qlib: An AI-oriented Quantitative Investment Platform," Papers 2009.11189, arXiv.org.
    4. Zura Kakushadze, 2016. "101 Formulaic Alphas," Papers 1601.00991, arXiv.org, revised Mar 2016.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Saizhuo Wang & Hang Yuan & Leon Zhou & Lionel M. Ni & Heung-Yeung Shum & Jian Guo, 2023. "Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment," Papers 2308.00016, arXiv.org.
    2. Tao Ren & Ruihan Zhou & Jinyang Jiang & Jiafeng Liang & Qinghao Wang & Yijie Peng, 2024. "RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search," Papers 2402.07080, arXiv.org, revised Feb 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    2. Lifan Zhao & Shuming Kong & Yanyan Shen, 2023. "DoubleAdapt: A Meta-learning Approach to Incremental Learning for Stock Trend Forecasting," Papers 2306.09862, arXiv.org, revised Apr 2024.
    3. Wentao Xu & Weiqing Liu & Lewen Wang & Yingce Xia & Jiang Bian & Jian Yin & Tie-Yan Liu, 2021. "HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information," Papers 2110.13716, arXiv.org, revised Jan 2022.
    4. Liang Zeng & Lei Wang & Hui Niu & Ruchen Zhang & Ling Wang & Jian Li, 2021. "Trade When Opportunity Comes: Price Movement Forecasting via Locality-Aware Attention and Iterative Refinement Labeling," Papers 2107.11972, arXiv.org, revised May 2023.
    5. Alexandru Pîrjan & Simona-Vasilica Oprea & George Căruțașu & Dana-Mihaela Petroșanu & Adela Bâra & Cristina Coculescu, 2017. "Devising Hourly Forecasting Solutions Regarding Electricity Consumption in the Case of Commercial Center Type Consumers," Energies, MDPI, vol. 10(11), pages 1-36, October.
    6. Rao, Congjun & Zhang, Yue & Wen, Jianghui & Xiao, Xinping & Goh, Mark, 2023. "Energy demand forecasting in China: A support vector regression-compositional data second exponential smoothing model," Energy, Elsevier, vol. 263(PC).
    7. Jie Fang & Shutao Xia & Jianwu Lin & Yong Jiang, 2019. "Automatic Financial Feature Construction," Papers 1912.06236, arXiv.org, revised Oct 2020.
    8. Jie Fang & Jianwu Lin & Shutao Xia & Yong Jiang & Zhikang Xia & Xiang Liu, 2020. "Neural Network-based Automatic Factor Construction," Papers 2008.06225, arXiv.org, revised Oct 2020.
    9. Nweye, Kingsley & Nagy, Zoltan, 2022. "MARTINI: Smart meter driven estimation of HVAC schedules and energy savings based on Wi-Fi sensing and clustering," Applied Energy, Elsevier, vol. 316(C).
    10. Zura Kakushadze & Willie Yu, 2017. "Dead Alphas as Risk Factors," Papers 1709.06641, arXiv.org.
    11. Deb, C. & Gelder, L.V. & Spiekman, M. & Pandraud, Guillaume & Jack, R. & Fitton, R., 2021. "Measuring the heat transfer coefficient (HTC) in buildings: A stakeholder's survey," Renewable and Sustainable Energy Reviews, Elsevier, vol. 144(C).
    12. Chi Chen & Li Zhao & Wei Cao & Jiang Bian & Chunxiao Xing, 2020. "Trimming the Sail: A Second-order Learning Paradigm for Stock Prediction," Papers 2002.06878, arXiv.org.
    13. Chou, Jui-Sheng & Tran, Duc-Son, 2018. "Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders," Energy, Elsevier, vol. 165(PB), pages 709-726.
    14. Gautham Krishnadas & Aristides Kiprakis, 2020. "A Machine Learning Pipeline for Demand Response Capacity Scheduling," Energies, MDPI, vol. 13(7), pages 1-25, April.
    15. Ewa Chodakowska & Joanicjusz Nazarko & Łukasz Nazarko, 2021. "ARIMA Models in Electrical Load Forecasting and Their Robustness to Noise," Energies, MDPI, vol. 14(23), pages 1-22, November.
    16. Wulfran Fendzi Mbasso & Reagan Jean Jacques Molu & Serge Raoul Dzonde Naoussi & Saatong Kenfack, 2022. "Demand-Supply Forecasting based on Deep Learning for Electricity Balance in Cameroon," International Journal of Energy Economics and Policy, Econjournals, vol. 12(4), pages 99-103, July.
    17. Wang, Qiang & Jiang, Feng, 2019. "Integrating linear and nonlinear forecasting techniques based on grey theory and artificial intelligence to forecast shale gas monthly production in Pennsylvania and Texas of the United States," Energy, Elsevier, vol. 178(C), pages 781-803.
    18. Sarabia Escriva, Emilio José & Hart, Matthew & Acha, Salvador & Soto Francés, Víctor & Shah, Nilay & Markides, Christos N., 2022. "Techno-economic evaluation of integrated energy systems for heat recovery applications in food retail buildings," Applied Energy, Elsevier, vol. 305(C).
    19. Hu, Yuntong & Xiao, Fuyuan, 2022. "A novel method for forecasting time series based on directed visibility graph and improved random walk," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 594(C).
    20. Yoon, Y. & Jung, S. & Im, P. & Salonvaara, M. & Bhandari, M. & Kunwar, N., 2023. "Empirical validation of building energy simulation model input parameter for multizone commercial building during the cooling season," Renewable and Sustainable Energy Reviews, Elsevier, vol. 188(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2306.12964. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.