IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2508.20103.html

Deep Reinforcement Learning for Optimal Asset Allocation Using DDPG with TiDE

Author

Listed:
  • Rongwei Liu
  • Jin Zheng
  • John Cartlidge

Abstract

The optimal asset allocation between risky and risk-free assets is a persistent challenge due to the inherent volatility in financial markets. Conventional methods rely on strict distributional assumptions or non-additive reward ratios, which limit their robustness and applicability to investment goals. To overcome these constraints, this study formulates the optimal two-asset allocation problem as a sequential decision-making task within a Markov Decision Process (MDP). This framework enables the application of reinforcement learning (RL) mechanisms to develop dynamic policies based on simulated financial scenarios, regardless of prerequisites. We use the Kelly criterion to balance immediate reward signals against long-term investment objectives, and we take the novel step of integrating the Time-series Dense Encoder (TiDE) into the Deep Deterministic Policy Gradient (DDPG) RL framework for continuous decision-making. We compare DDPG-TiDE with a simple discrete-action Q-learning RL framework and a passive buy-and-hold investment strategy. Empirical results show that DDPG-TiDE outperforms Q-learning and generates higher risk adjusted returns than buy-and-hold. These findings suggest that tackling the optimal asset allocation problem by integrating TiDE within a DDPG reinforcement learning framework is a fruitful avenue for further exploration.

Suggested Citation

  • Rongwei Liu & Jin Zheng & John Cartlidge, 2025. "Deep Reinforcement Learning for Optimal Asset Allocation Using DDPG with TiDE," Papers 2508.20103, arXiv.org.
  • Handle: RePEc:arx:papers:2508.20103
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2508.20103
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chao Zhang & Zihao Zhang & Mihai Cucuringu & Stefan Zohren, 2021. "A Universal End-to-End Approach to Portfolio Optimization via Deep Learning," Papers 2111.09170, arXiv.org.
    2. Kan, Raymond & Zhou, Guofu, 2007. "Optimal Portfolio Choice with Parameter Uncertainty," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 42(3), pages 621-656, September.
    3. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    4. Merton, Robert C., 1971. "Optimum consumption and portfolio rules in a continuous-time model," Journal of Economic Theory, Elsevier, vol. 3(4), pages 373-413, December.
    5. Amit Goyal & Ivo Welch & Athanasse Zafirov, 2024. "A Comprehensive 2022 Look at the Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 37(11), pages 3490-3557.
    6. R. Jiang & D. Saunders & C. Weng, 2022. "The reinforcement learning Kelly strategy," Quantitative Finance, Taylor & Francis Journals, vol. 22(8), pages 1445-1464, August.
    7. Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
    8. Merton, Robert C, 1969. "Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case," The Review of Economics and Statistics, MIT Press, vol. 51(3), pages 247-257, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiang, Yifu & Olmo, Jose & Atwi, Majed, 2025. "High-dimensional multi-period portfolio allocation using deep reinforcement learning," International Review of Economics & Finance, Elsevier, vol. 98(C).
    2. Constantinos Kardaras & Hyeng Keun Koo & Johannes Ruf, 2022. "Estimation of growth in fund models," Papers 2208.02573, arXiv.org.
    3. Ruili Sun & Tiefeng Ma & Shuangzhe Liu & Milind Sathye, 2019. "Improved Covariance Matrix Estimation for Portfolio Risk Measurement: A Review," JRFM, MDPI, vol. 12(1), pages 1-34, March.
    4. Ahmad Aghapour & Erhan Bayraktar & Fengyi Yuan, 2025. "Solving dynamic portfolio selection problems via score-based diffusion models," Papers 2507.09916, arXiv.org, revised Aug 2025.
    5. Owadally, Iqbal & Landsman, Zinoviy, 2013. "A characterization of optimal portfolios under the tail mean–variance criterion," Insurance: Mathematics and Economics, Elsevier, vol. 52(2), pages 213-221.
    6. Minshuo Chen & Renyuan Xu & Yumin Xu & Ruixun Zhang, 2025. "Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure," Papers 2504.06566, arXiv.org, revised Jan 2026.
    7. Yuanyuan Zhang & Xiang Li & Sini Guo, 2018. "Portfolio selection problems with Markowitz’s mean–variance framework: a review of literature," Fuzzy Optimization and Decision Making, Springer, vol. 17(2), pages 125-158, June.
    8. An, Jongbong & Jeon, Junkee & Kim, Takwon, 2025. "Optimal portfolio and retirement decisions with costly job switching options," Applied Mathematics and Computation, Elsevier, vol. 491(C).
    9. Auffret, Philippe, 2001. "An alternative unifying measure of welfare gains from risk-sharing," Policy Research Working Paper Series 2676, The World Bank.
    10. Chen, An & Hieber, Peter & Sureth, Caren, 2022. "Pay for tax certainty? Advance tax rulings for risky investment under multi-dimensional tax uncertainty," arqus Discussion Papers in Quantitative Tax Research 273, arqus - Arbeitskreis Quantitative Steuerlehre.
    11. Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2020. "Heterogeneity and Persistence in Returns to Wealth," Econometrica, Econometric Society, vol. 88(1), pages 115-170, January.
    12. John H. Cochrane, 1999. "New facts in finance," Economic Perspectives, Federal Reserve Bank of Chicago, vol. 23(Q III), pages 36-58.
    13. John Y. Campbell & Luis M. Viceira & Joshua S. White, 2003. "Foreign Currency for Long-Term Investors," Economic Journal, Royal Economic Society, vol. 113(486), pages 1-25, March.
    14. Stephen Satchell & Susan Thorp, 2007. "Scenario Analysis with Recursive Utility: Dynamic Consumption Plans for Charitable Endowments," Research Paper Series 209, Quantitative Finance Research Centre, University of Technology, Sydney.
    15. Hong‐Chih Huang, 2010. "Optimal Multiperiod Asset Allocation: Matching Assets to Liabilities in a Discrete Model," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 77(2), pages 451-472, June.
    16. Orszag, J. Michael & Yang, Hong, 1995. "Portfolio choice with Knightian uncertainty," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 873-900.
    17. Bjork, Tomas, 2009. "Arbitrage Theory in Continuous Time," OUP Catalogue, Oxford University Press, edition 3, number 9780199574742.
    18. E. Nasakkala & J. Keppo, 2008. "Hydropower with Financial Information," Applied Mathematical Finance, Taylor & Francis Journals, vol. 15(5-6), pages 503-529.
    19. Jan Kallsen & Johannes Muhle-Karbe, 2013. "The General Structure of Optimal Investment and Consumption with Small Transaction Costs," Papers 1303.3148, arXiv.org, revised May 2015.
    20. Bouyaddou, Youssef & Jebabli, Ikram, 2025. "Integration of investor behavioral perspective and climate change in reinforcement learning for portfolio optimization," Research in International Business and Finance, Elsevier, vol. 73(PB).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2508.20103. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.