Deep Reinforcement Learning for Asset Allocation in US Equities

Author

Listed:

Miquel Noguer i Alonso
Sonam Srivastava

Abstract

Reinforcement learning is a machine learning approach to solving dynamic optimization problems in an almost model-free way by maximizing a reward function over state and action spaces. This property makes it an exciting area of research for financial problems. Asset allocation, where the goal is to find the asset weights that maximize rewards in a given market state while accounting for risk and transaction costs, is easily framed as a reinforcement learning problem. It is first a prediction problem for expected returns and the covariance matrix, and then an optimization problem over returns, risk, and market impact. Investors and financial researchers have worked with approaches such as mean-variance optimization, minimum variance, risk parity, and equal weighting, along with several methods for making predictions of expected returns and covariance matrices more robust. This paper demonstrates the application of reinforcement learning to create a financial model-free solution to the asset allocation problem, learning to solve it using time series and deep neural networks. We demonstrate this on daily data for the top 24 stocks in the US equities universe with daily rebalancing. We apply a deep reinforcement learning model to US stocks using different architectures: Long Short-Term Memory networks, Convolutional Neural Networks, and Recurrent Neural Networks, and compare them with more traditional portfolio management approaches. Given only the time series of stocks and a simple reward function, the deep reinforcement learning approach shows better results than the traditional approaches. In finance, generalization from training error to test error is never guaranteed. We can say that the modeling framework can deal with time series prediction and asset allocation, including transaction costs.
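
The abstract does not state the exact reward function, so the following is only a minimal sketch of how one daily rebalancing step with transaction costs might be scored. The names `softmax` and `step_reward`, the proportional `cost_rate` of 0.001, the long-only softmax weighting, and the log-growth reward are all illustrative assumptions, not the authors' method.

```python
import math

def softmax(scores):
    """Map raw scores (e.g. network outputs) to long-only weights summing to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def step_reward(prev_weights, new_weights, asset_returns, cost_rate=0.001):
    """Log growth of portfolio value over one period, net of proportional costs.

    Assumptions (hypothetical, for illustration only):
    - costs are proportional to turnover, sum(|w_new - w_prev|);
    - weight drift between rebalances is ignored for simplicity.
    """
    turnover = sum(abs(n - p) for n, p in zip(new_weights, prev_weights))
    cost = cost_rate * turnover
    gross = sum(w * (1.0 + r) for w, r in zip(new_weights, asset_returns))
    return math.log(gross * (1.0 - cost))

# Usage: start from equal weights, move to weights implied by made-up scores.
n_assets = 3
equal = [1.0 / n_assets] * n_assets
weights = softmax([0.2, -0.1, 0.4])
reward = step_reward(equal, weights, [0.01, -0.005, 0.002])
```

Under these assumptions, an agent that trades without any change in prices receives a slightly negative reward, which is the mechanism that discourages unnecessary turnover.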

Suggested Citation

Miquel Noguer i Alonso & Sonam Srivastava, 2020. "Deep Reinforcement Learning for Asset Allocation in US Equities," Papers 2010.04404, arXiv.org.

Handle: RePEc:arx:papers:2010.04404

References listed on IDEAS

Attilio Meucci, 2010. "Fully Flexible Views: Theory and Practice," Papers 1012.2848, arXiv.org.
Zihao Zhang & Stefan Zohren & Stephen Roberts, 2019. "Deep Reinforcement Learning for Trading," Papers 1911.10107, arXiv.org.
Xi Bai & Katya Scheinberg & Reha Tutuncu, 2016. "Least-squares approach to risk parity in portfolio selection," Quantitative Finance, Taylor & Francis Journals, vol. 16(3), pages 357-376, March.
Owen, Joel & Rabinovitch, Ramon, 1983. "On the Class of Elliptical Distributions and Their Applications to the Theory of Portfolio Choice," Journal of Finance, American Finance Association, vol. 38(3), pages 745-752, June.
Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
Guillaume Coqueret & Tony Guida, 2020. "Machine Learning for Factor Investing : R version," Post-Print hal-03188226, HAL.
Thomas M. Cover, 1991. "Universal Portfolios," Mathematical Finance, Wiley Blackwell, vol. 1(1), pages 1-29, January.

Citations

Cited by:

Uta Pigorsch & Sebastian Schafer, 2021. "High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning," Papers 2112.04755, arXiv.org.
Ricard Durall, 2022. "Asset Allocation: From Markowitz to Deep Reinforcement Learning," Papers 2208.07158, arXiv.org.
Jiwon Kim & Moon-Ju Kang & KangHun Lee & HyungJun Moon & Bo-Kwan Jeon, 2023. "Deep Reinforcement Learning for Asset Allocation: Reward Clipping," Papers 2301.05300, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Adil Rengim Cetingoz & Olivier Guéant, 2023. "Factor Risk Budgeting and Beyond," Papers 2312.11132, arXiv.org.
Pier Francesco Procacci & Tomaso Aste, 2022. "Portfolio optimization with sparse multivariate modeling," Journal of Asset Management, Palgrave Macmillan, vol. 23(6), pages 445-465, October.
Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
Jiahua Xu & Daniel Perez & Yebo Feng & Benjamin Livshits, 2023. "Auto.gov: Learning-based On-chain Governance for Decentralized Finance (DeFi)," Papers 2302.09551, arXiv.org, revised May 2023.
Furman, Edward & Landsman, Zinoviy, 2010. "Multivariate Tweedie distributions and some related capital-at-risk analyses," Insurance: Mathematics and Economics, Elsevier, vol. 46(2), pages 351-361, April.
Vaughn Gambeta & Roy Kwon, 2020. "Risk Return Trade-Off in Relaxed Risk Parity Portfolio Optimization," JRFM, MDPI, vol. 13(10), pages 1-28, October.
Bao, Te & Diks, Cees & Li, Hao, 2018. "A generalized CAPM model with asymmetric power distributed errors with an application to portfolio construction," Economic Modelling, Elsevier, vol. 68(C), pages 611-621.
Giacomo di Tollo & Joseph Andria & Gianni Filograsso, 2023. "The Predictive Power of Social Media Sentiment: Evidence from Cryptocurrencies and Stock Markets Using NLP and Stochastic ANNs," Mathematics, MDPI, vol. 11(16), pages 1-18, August.
Balvers, Ronald J. & Mitchell, Douglas W., 2000. "Efficient gradualism in intertemporal portfolios," Journal of Economic Dynamics and Control, Elsevier, vol. 24(1), pages 21-38, January.
Thomas Eichner, 2010. "Slutzky equations and substitution effects of risks in terms of mean-variance preferences," Theory and Decision, Springer, vol. 69(1), pages 17-26, July.
David A. Hennessy, 2004. "Orthogonal Subgroups for Portfolio Choice," Economics Bulletin, AccessEcon, vol. 7(1), pages 1-7.
- Hennessy, David A., 2004. "Orthogonal Subgroups for Portfolio Choice," Staff General Research Papers Archive 11993, Iowa State University, Department of Economics.
Seung-Hyun Moon & Yong-Hyuk Kim & Byung-Ro Moon, 2019. "Empirical investigation of state-of-the-art mean reversion strategies for equity markets," Papers 1909.04327, arXiv.org.
Amir Mosavi & Pedram Ghamisi & Yaser Faghan & Puhong Duan, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Papers 2004.01509, arXiv.org.
Ioannis D Vrontos & Loukia Meligkotsidou & Spyridon D Vrontos, 2011. "Performance evaluation of mutual fund investments: The impact of non-normality and time-varying volatility," Journal of Asset Management, Palgrave Macmillan, vol. 12(4), pages 292-307, September.
Hino, Hideitsu & Wakayama, Keigo & Murata, Noboru, 2013. "Entropy-based sliced inverse regression," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 105-114.
Taras Bodnar & Yarema Okhrin & Valdemar Vitlinskyy & Taras Zabolotskyy, 2018. "Determination and estimation of risk aversion coefficients," Computational Management Science, Springer, vol. 15(2), pages 297-317, June.
Ortobelli, Sergio & Rachev, Svetlozar & Schwartz, Eduardo, 2000. "The Problem of Optimal Asset Allocation with Stable Distributed Returns," University of California at Los Angeles, Anderson Graduate School of Management qt3zd6q86c, Anderson Graduate School of Management, UCLA.
Peñaranda, Francisco & Sentana, Enrique, 2012. "Spanning tests in return and stochastic discount factor mean–variance frontiers: A unifying approach," Journal of Econometrics, Elsevier, vol. 170(2), pages 303-324.
- Sentana, Enrique & Peñaranda, Francisco, 2004. "Spanning Tests in Return and Stochastic Discount Factor Mean Variance Frontiers: A Unifying Approach," CEPR Discussion Papers 4422, C.E.P.R. Discussion Papers.
- Francisco Peñaranda & Enrique Sentana, 2004. "Spanning Tests in Return and Stochastic Discount Factor Mean-Variance Frontiers: A Unifying Approach," Working Papers wp2004_0410, CEMFI.
- Enrique Sentana & Francisco Penaranda, 2004. "Spanning Tests in Return and Stochastic Discount Factor Mean-Variance Frontiers: A Unifying Approach," FMG Discussion Papers dp497, Financial Markets Group.
- Francisco Peñaranda & Enrique Sentana, 2008. "Spanning tests in return and stochastic discount factor mean-variance frontiers: A unifying approach," Economics Working Papers 1101, Department of Economics and Business, Universitat Pompeu Fabra, revised Sep 2010.
Keith Vorkink & Douglas J. Hodgson & Oliver Linton, 2002. "Testing the capital asset pricing model efficiently under elliptical symmetry: a semiparametric approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 17(6), pages 617-639.
- Douglas J. Hodgson & Oliver Linton & Keith Vorkink, 2002. "Testing the capital asset pricing model efficiently under elliptical symmetry: a semiparametric approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 17(6), pages 617-639, December.
- Douglas J Hodgson & Oliver Linton & Keith Vorkink, 2000. "Testing the Capital Asset Pricing Model Efficiently under Elliptical Symmetry: A Semiparametric Approach," STICERD - Econometrics Paper Series 398, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Douglas J. Hodgson & Oliver Linton & Keith Vorkink, 2001. "Testing the Capital Asset Pricing Model Efficiently Under Elliptical Symmetry: A Semiparametric Approach," Cahiers de recherche CREFE / CREFE Working Papers 143, CREFE, Université du Québec à Montréal.
- Hodgson, Douglas J & Linton, Oliver & Vorkink, Keith, 2000. "Testing the capital asset pricing model efficiently under elliptical symmetry : a semiparametric approach," LSE Research Online Documents on Economics 2197, London School of Economics and Political Science, LSE Library.
- Oliver Linton & Douglas J.Hodgson & Keith Vorkink, 2001. "Testing the Capital Asset Pricing Model Efficiently Under Elliptical Symmetry: A Semiparametric Approach," FMG Discussion Papers dp382, Financial Markets Group.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2020-11-09 (Big Data)
NEP-CMP-2020-11-09 (Computational Economics)
NEP-FMK-2020-11-09 (Financial Markets)

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.
