Regime-Based Portfolio Allocation Using Hidden Markov Models and Reinforcement Learning

Authors

Ajay Kumar Verma
Nunik Srikandi Putri
Neo Paul Lesupi

Abstract

This study develops a regime-aware portfolio allocation framework that integrates Markov switching models with Reinforcement Learning (RL) to dynamically allocate across equities (SPY), long-term Treasuries (TLT), and gold (GLD). Using daily ETF data from 2004 to 2025, we first characterize market behavior through a discrete Markov chain and then estimate a three-state Gaussian Hidden Markov Model (HMM), with the number of states selected by the Bayesian Information Criterion (BIC). The estimated regimes (low-volatility, transitional, and high-volatility) exhibit strong persistence and state-dependent return dynamics consistent with recent findings on nonlinear market states (Ardia et al., 2024; Gupta & Pierdzioch, 2023). State-conditional analysis shows that SPY dominates in stable regimes, while TLT and GLD provide protection during stressed periods, motivating regime-conditioned allocation rules. We evaluate rule-based rotation and RL-driven strategies using a 30% out-of-sample test window with a one-day execution lag to avoid look-ahead bias. Both HMM-based allocations outperform a passive SPY benchmark. The RL policy achieves the highest risk-adjusted performance, delivering the strongest Sharpe ratio and materially lower drawdowns, yet it remains fully interpretable through discrete regime-dependent actions. Sensitivity analysis confirms the robustness of the three-state specification relative to two-state alternatives. Overall, the results demonstrate that RL can systematically enhance HMM-based regime detection, providing a transparent, adaptive, and empirically grounded framework for tactical asset allocation that improves risk-adjusted performance relative to standard benchmark strategies.
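The two mechanical ingredients mentioned in the abstract, a discrete Markov chain over regime labels and a one-day execution lag, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the regime labels and return figures are toy data, the `REGIME_TO_ASSET` map is a hypothetical rotation rule, and the function names are invented here. The sketch only shows how a transition matrix can be estimated from observed regime labels and how lagging the signal by one day keeps the allocation on day t from using information observed on day t.

```python
from collections import Counter

# Hypothetical regime-to-asset rule (illustrative only; not the paper's rule).
# 0 = low-volatility, 1 = transitional, 2 = high-volatility.
REGIME_TO_ASSET = {0: "SPY", 1: "TLT", 2: "GLD"}


def transition_matrix(states, n_states):
    """Estimate a Markov transition matrix as row-normalised transition counts."""
    counts = Counter(zip(states[:-1], states[1:]))
    matrix = []
    for i in range(n_states):
        row_total = sum(counts[(i, j)] for j in range(n_states))
        matrix.append([
            counts[(i, j)] / row_total if row_total else 0.0
            for j in range(n_states)
        ])
    return matrix


def lagged_strategy_returns(states, asset_returns, lag=1):
    """Return earned on day t is chosen from the regime seen on day t - lag."""
    realised = []
    for t in range(lag, len(states)):
        asset = REGIME_TO_ASSET[states[t - lag]]
        realised.append(asset_returns[asset][t])
    return realised


# Toy inputs (assumed, not real market data).
states = [0, 0, 1, 2, 2, 0]
asset_returns = {
    "SPY": [0.01, 0.02, -0.03, -0.02, 0.01, 0.02],
    "TLT": [0.00, 0.01, 0.01, 0.02, 0.00, -0.01],
    "GLD": [0.00, 0.00, 0.02, 0.01, 0.01, 0.00],
}

P = transition_matrix(states, 3)
strategy = lagged_strategy_returns(states, asset_returns)
print(P)
print(strategy)
```

With the lag in place, the first day of the sample earns no strategy return, because no earlier regime label exists to act on. Setting `lag=0` would instead trade on the same day's label, which is the look-ahead bias the one-day lag is meant to avoid.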

Suggested Citation

Ajay Kumar Verma & Nunik Srikandi Putri & Neo Paul Lesupi, 2026. "Regime-Based Portfolio Allocation Using Hidden Markov Models and Reinforcement Learning," Papers 2605.27848, arXiv.org.

Handle: RePEc:arx:papers:2605.27848

References listed on IDEAS

Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
Baur, Dirk G. & McDermott, Thomas K., 2010. "Is gold a safe haven? International evidence," Journal of Banking & Finance, Elsevier, vol. 34(8), pages 1886-1898, August.
- Dirk G. Baur & Thomas K. McDermott, "undated". "Is gold a safe haven? International evidence," The Institute for International Integration Studies Discussion Paper Series iiisdp310, IIIS.
R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
GIOT, Pierre, 2005. "Implied volatility indexes and daily Value at Risk models," LIDAM Reprints CORE 1840, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Hamilton, James D., 1990. "Analysis of time series subject to changes in regime," Journal of Econometrics, Elsevier, vol. 45(1-2), pages 39-70.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Bariviera, Aurelio F. & Font-Ferrer, Alejandro & Sorrosal-Forradellas, M. Teresa & Rosso, Osvaldo A., 2019. "An information theory perspective on the informational efficiency of gold price," The North American Journal of Economics and Finance, Elsevier, vol. 50(C).
Brian M. Lucey & Fergal A. O’Connor, 2013. "Do bubbles occur in the gold price? An investigation of gold lease rates and Markov Switching models," Borsa Istanbul Review, Research and Business Development Department, Borsa Istanbul, vol. 13(3), pages 53-63, September.
Takashi Miyazaki, 2019. "Clarifying the Response of Gold Return to Financial Indicators: An Empirical Comparative Analysis Using Ordinary Least Squares, Robust and Quantile Regressions," JRFM, MDPI, vol. 12(1), pages 1-18, February.
Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
Alessio Brini & Daniele Tantari, 2021. "Deep Reinforcement Trading with Predictable Returns," Papers 2104.14683, arXiv.org, revised May 2023.
Salam Rabindrajit Luwang & Buddha Nath Sharma & Kundan Mukhia & Md. Nurujjaman & Anish Rai & Filippo Petroni & Luis E. C. Rocha, 2026. "Regime Discovery and Intra-Regime Return Dynamics in Global Equity Markets," Papers 2601.08571, arXiv.org.
Aliyu, Shehu Usman Rano, 2020. "What have we learnt from modelling stock returns in Nigeria: Higgledy-piggledy?," MPRA Paper 110382, University Library of Munich, Germany, revised 06 Jun 2021.
Mohammad Rezoanul Hoque & Md Meftahul Ferdaus & M. Kabir Hassan, 2025. "Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies," Papers 2512.10913, arXiv.org.
Benjamin R. Auer & Benjamin Mögel, 2016. "How Accurate are Modern Value-at-Risk Estimators Derived from Extreme Value Theory?," CESifo Working Paper Series 6288, CESifo.
Chattopadhyay, Dhriti & Saha, Bidipta & Saha, Dikshita & Saha, Madhurima & Chakrabarti, Gagari, 2025. "Adding precious metals to a risk avert Investor's portfolio – Is gold alone?," Resources Policy, Elsevier, vol. 106(C).
Semei Coronado & Rebeca Jiménez-Rodríguez & Omar Rojas, 2015. "An empirical analysis of the relationships between crude oil, gold and stock markets," Papers 1510.07599, arXiv.org, revised May 2016.
Liu Ziyin & Kentaro Minami & Kentaro Imajo, 2021. "Theoretically Motivated Data Augmentation and Regularization for Portfolio Construction," Papers 2106.04114, arXiv.org, revised Dec 2022.
Yue Peng & Wing Ng, 2012. "Analysing financial contagion and asymmetric market dependence with volatility indices via copulas," Annals of Finance, Springer, vol. 8(1), pages 49-74, February.
Focardi, Sergio M. & Fabozzi, Frank J. & Mazza, Davide, 2019. "Modeling local trends with regime shifting models with time-varying probabilities," International Review of Financial Analysis, Elsevier, vol. 66(C).
Carlo Confalonieri & Paola De Vincentiis, 2026. "Forecasting the worst: is implied volatility forward-looking enough?," Journal of Banking Regulation, Palgrave Macmillan, vol. 27(1), pages 1-20, March.
Benjamin Mögel & Benjamin R. Auer, 2018. "How accurate are modern Value-at-Risk estimators derived from extreme value theory?," Review of Quantitative Finance and Accounting, Springer, vol. 50(4), pages 979-1030, May.
Chen, Louisa & Verousis, Thanos & Wang, Kai & Zhou, Zhiping, 2023. "Financial stress and commodity price volatility," Energy Economics, Elsevier, vol. 125(C).
Nedved, Martin & Kristoufek, Ladislav, 2023. "Safe havens for Bitcoin," Finance Research Letters, Elsevier, vol. 51(C).
Brini, Alessio & Tantari, Daniele, 2023. "Deep reinforcement trading with predictable returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 622(C).
Brian Lucey & Fergal A. O'Connor, 2012. "Do Bubbles occur in Gold Prices? Evidence from Gold Lease Rates and Markov Switching Models," The Institute for International Integration Studies Discussion Paper Series iiisdp418, IIIS.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-MIN-2026-06-15 (Mining)
