IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2010.04404.html
   My bibliography  Save this paper

Deep Reinforcement Learning for Asset Allocation in US Equities

Author

Listed:
  • Miquel Noguer i Alonso
  • Sonam Srivastava

Abstract

Reinforcement learning is a machine learning approach concerned with solving dynamic optimization problems in an almost model-free way by maximizing a reward function in state and action spaces. This property makes it an exciting area of research for financial problems. Asset allocation, where the goal is to obtain the weights of the assets that maximize the rewards in a given state of the market considering risk and transaction costs, is a problem easily framed using a reinforcement learning framework. It is first a prediction problem for expected returns and covariance matrix and then an optimization problem for returns, risk, and market impact. Investors and financial researchers have been working with approaches like mean-variance optimization, minimum variance, risk parity, and equally weighted and several methods to make expected returns and covariance matrices' predictions more robust. This paper demonstrates the application of reinforcement learning to create a financial model-free solution to the asset allocation problem, learning to solve the problem using time series and deep neural networks. We demonstrate this on daily data for the top 24 stocks in the US equities universe with daily rebalancing. We use a deep reinforcement model on US stocks using different architectures. We use Long Short Term Memory networks, Convolutional Neural Networks, and Recurrent Neural Networks and compare them with more traditional portfolio management. The Deep Reinforcement Learning approach shows better results than traditional approaches using a simple reward function and only being given the time series of stocks. In Finance, no training to test error generalization results come guaranteed. We can say that the modeling framework can deal with time series prediction and asset allocation, including transaction costs.

Suggested Citation

  • Miquel Noguer i Alonso & Sonam Srivastava, 2020. "Deep Reinforcement Learning for Asset Allocation in US Equities," Papers 2010.04404, arXiv.org.
  • Handle: RePEc:arx:papers:2010.04404
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2010.04404
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Attilio Meucci, 2010. "Fully Flexible Views: Theory and Practice," Papers 1012.2848, arXiv.org.
    2. Zihao Zhang & Stefan Zohren & Stephen Roberts, 2019. "Deep Reinforcement Learning for Trading," Papers 1911.10107, arXiv.org.
    3. Xi Bai & Katya Scheinberg & Reha Tutuncu, 2016. "Least-squares approach to risk parity in portfolio selection," Quantitative Finance, Taylor & Francis Journals, vol. 16(3), pages 357-376, March.
    4. Owen, Joel & Rabinovitch, Ramon, 1983. "On the Class of Elliptical Distributions and Their Applications to the Theory of Portfolio Choice," Journal of Finance, American Finance Association, vol. 38(3), pages 745-752, June.
    5. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    6. Guillaume Coqueret & Tony Guida, 2020. "Machine Learning for Factor Investing : R version," Post-Print hal-03188226, HAL.
    7. Thomas M. Cover, 1991. "Universal Portfolios," Mathematical Finance, Wiley Blackwell, vol. 1(1), pages 1-29, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Uta Pigorsch & Sebastian Schafer, 2021. "High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning," Papers 2112.04755, arXiv.org.
    2. Ricard Durall, 2022. "Asset Allocation: From Markowitz to Deep Reinforcement Learning," Papers 2208.07158, arXiv.org.
    3. Jiwon Kim & Moon-Ju Kang & KangHun Lee & HyungJun Moon & Bo-Kwan Jeon, 2023. "Deep Reinforcement Learning for Asset Allocation: Reward Clipping," Papers 2301.05300, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    2. Adil Rengim Cetingoz & Olivier Gu'eant, 2023. "Factor Risk Budgeting and Beyond," Papers 2312.11132, arXiv.org.
    3. Pier Francesco Procacci & Tomaso Aste, 2022. "Portfolio optimization with sparse multivariate modeling," Journal of Asset Management, Palgrave Macmillan, vol. 23(6), pages 445-465, October.
    4. Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
    5. Jiahua Xu & Daniel Perez & Yebo Feng & Benjamin Livshits, 2023. "Auto.gov: Learning-based On-chain Governance for Decentralized Finance (DeFi)," Papers 2302.09551, arXiv.org, revised May 2023.
    6. Furman, Edward & Landsman, Zinoviy, 2010. "Multivariate Tweedie distributions and some related capital-at-risk analyses," Insurance: Mathematics and Economics, Elsevier, vol. 46(2), pages 351-361, April.
    7. Vaughn Gambeta & Roy Kwon, 2020. "Risk Return Trade-Off in Relaxed Risk Parity Portfolio Optimization," JRFM, MDPI, vol. 13(10), pages 1-28, October.
    8. Bao, Te & Diks, Cees & Li, Hao, 2018. "A generalized CAPM model with asymmetric power distributed errors with an application to portfolio construction," Economic Modelling, Elsevier, vol. 68(C), pages 611-621.
    9. Giacomo di Tollo & Joseph Andria & Gianni Filograsso, 2023. "The Predictive Power of Social Media Sentiment: Evidence from Cryptocurrencies and Stock Markets Using NLP and Stochastic ANNs," Mathematics, MDPI, vol. 11(16), pages 1-18, August.
    10. Balvers, Ronald J. & Mitchell, Douglas W., 2000. "Efficient gradualism in intertemporal portfolios," Journal of Economic Dynamics and Control, Elsevier, vol. 24(1), pages 21-38, January.
    11. Thomas Eichner, 2010. "Slutzky equations and substitution effects of risks in terms of mean-variance preferences," Theory and Decision, Springer, vol. 69(1), pages 17-26, July.
    12. David A. Hennessy, 2004. "Orthogonal Subgroups for Portfolio Choice," Economics Bulletin, AccessEcon, vol. 7(1), pages 1-7.
    13. Seung-Hyun Moon & Yong-Hyuk Kim & Byung-Ro Moon, 2019. "Empirical investigation of state-of-the-art mean reversion strategies for equity markets," Papers 1909.04327, arXiv.org.
    14. Amir Mosavi & Pedram Ghamisi & Yaser Faghan & Puhong Duan, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Papers 2004.01509, arXiv.org.
    15. Ioannis D Vrontos & Loukia Meligkotsidou & Spyridon D Vrontos, 2011. "Performance evaluation of mutual fund investments: The impact of non-normality and time-varying volatility," Journal of Asset Management, Palgrave Macmillan, vol. 12(4), pages 292-307, September.
    16. Hino, Hideitsu & Wakayama, Keigo & Murata, Noboru, 2013. "Entropy-based sliced inverse regression," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 105-114.
    17. Taras Bodnar & Yarema Okhrin & Valdemar Vitlinskyy & Taras Zabolotskyy, 2018. "Determination and estimation of risk aversion coefficients," Computational Management Science, Springer, vol. 15(2), pages 297-317, June.
    18. Ortobelli, Sergio & Rachev, Svetlozar & Schwartz, Eduardo, 2000. "The Problem of Optimal Asset Allocation with Stable Distributed Returns," University of California at Los Angeles, Anderson Graduate School of Management qt3zd6q86c, Anderson Graduate School of Management, UCLA.
    19. Peñaranda, Francisco & Sentana, Enrique, 2012. "Spanning tests in return and stochastic discount factor mean–variance frontiers: A unifying approach," Journal of Econometrics, Elsevier, vol. 170(2), pages 303-324.
    20. Keith Vorkink & Douglas J. Hodgson & Oliver Linton, 2002. "Testing the capital asset pricing model efficiently under elliptical symmetry: a semiparametric approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 17(6), pages 617-639.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2010.04404. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.