G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

Author

Listed:

Matthew Dixon
Igor Halperin

Abstract

We present a reinforcement learning approach to goal based wealth management problems such as optimization of retirement plans or target dated funds. In such problems, an investor seeks to achieve a financial goal by making periodic investments in the portfolio while being employed, and periodically draws from the account when in retirement, in addition to the ability to re-balance the portfolio by selling and buying different assets (e.g. stocks). Instead of relying on a utility of consumption, we present G-Learner: a reinforcement learning algorithm that operates with explicitly defined one-step rewards, does not assume a data generation process, and is suitable for noisy data. Our approach is based on G-learning - a probabilistic extension of the Q-learning method of reinforcement learning. In this paper, we demonstrate how G-learning, when applied to a quadratic reward and Gaussian reference policy, gives an entropy-regulated Linear Quadratic Regulator (LQR). This critical insight provides a novel and computationally tractable tool for wealth management tasks which scales to high dimensional portfolios. In addition to the solution of the direct problem of G-learning, we also present a new algorithm, GIRL, that extends our goal-based G-learning approach to the setting of Inverse Reinforcement Learning (IRL) where rewards collected by the agent are not observed, and should instead be inferred. We demonstrate that GIRL can successfully learn the reward parameters of a G-Learner agent and thus imitate its behavior. Finally, we discuss potential applications of the G-Learner and GIRL algorithms for wealth management and robo-advising.

Suggested Citation

Matthew Dixon & Igor Halperin, 2020. "G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning," Papers 2002.10990, arXiv.org.

Handle: RePEc:arx:papers:2002.10990

Download full text from publisher

References listed on IDEAS

Igor Halperin & Ilya Feldshteyn, 2018. "Market Self-Learning of Signals, Impact and Optimal Trading: Invisible Hand Inference with Free Energy," Papers 1805.06126, arXiv.org.
Merton, Robert C., 1971. "Optimum consumption and portfolio rules in a continuous-time model," Journal of Economic Theory, Elsevier, vol. 3(4), pages 373-413, December.
- R. C. Merton, 1970. "Optimum Consumption and Portfolio Rules in a Continuous-time Model," Working papers 58, Massachusetts Institute of Technology (MIT), Department of Economics.
Browne, S., 1996. "Reaching Goals by a Deadline: Digital Options and Continuous-Time Active Portfolio Management," Papers 96-16, Columbia - Graduate School of Business.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
Chung I Lu, 2023. "Evaluation of Deep Reinforcement Learning Algorithms for Portfolio Optimisation," Papers 2307.07694, arXiv.org, revised Aug 2025.
Tessa Bauman & Bruno Gav{s}perov & Stjepan Beguv{s}i'c & Zvonko Kostanjv{c}ar, 2023. "Deep Reinforcement Learning for Robust Goal-Based Wealth Management," Papers 2307.13501, arXiv.org.
Alejandro Rodriguez Dominguez, 2025. "Causal PDE-Control Models for Dynamic Portfolio Optimization with Latent Drivers," Papers 2509.09585, arXiv.org, revised Apr 2026.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

An, Jongbong & Jeon, Junkee & Kim, Takwon, 2025. "Optimal portfolio and retirement decisions with costly job switching options," Applied Mathematics and Computation, Elsevier, vol. 491(C).
Auffret, Philippe, 2001. "An alternative unifying measure of welfare gains from risk-sharing," Policy Research Working Paper Series 2676, The World Bank.
Chen, An & Hieber, Peter & Sureth, Caren, 2022. "Pay for tax certainty? Advance tax rulings for risky investment under multi-dimensional tax uncertainty," arqus Discussion Papers in Quantitative Tax Research 273, arqus - Arbeitskreis Quantitative Steuerlehre.
Sanchez-Romero, Miguel, 2006. "“Demand for Private Annuities and Social Security: Consequences to Individual Wealth”," Working Papers in Economic Theory 2006/07, Universidad Autónoma de Madrid (Spain), Department of Economic Analysis (Economic Theory and Economic History).
Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2020. "Heterogeneity and Persistence in Returns to Wealth," Econometrica, Econometric Society, vol. 88(1), pages 115-170, January.
- Guiso, Luigi & Pistaferri, Luigi & Fagereng, Andreas & Malacrino, Davide, 2016. "Heterogeneity and Persistence in Returns to Wealth," CEPR Discussion Papers 11635, Centre for Economic Policy Research.
- Andreas Fagereng & Luigi Guiso & Luigi Pistaferri & Davide Malacrino, 2019. "Heterogeneity and persistence in returns to wealth," Discussion Papers 912, Statistics Norway, Research Department.
- Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2016. "Heterogeneity and Persistence in Returns to Wealth," NBER Working Papers 22822, National Bureau of Economic Research, Inc.
- Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2016. "Heterogeneity and Persistence in Returns to Wealth," EIEF Working Papers Series 1615, Einaudi Institute for Economics and Finance (EIEF), revised Nov 2016.
- Andreas Fagereng & Luigi Guiso & Mr. Davide Malacrino & Luigi Pistaferri, 2018. "Heterogeneity and Persistence in Returns to Wealth," IMF Working Papers 2018/171, International Monetary Fund.
- Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2018. "Heterogeneity and Persistence in Returns to Wealth," CESifo Working Paper Series 7107, CESifo.
Luca Di Persio & Luca Prezioso & Kai Wallbaum, 2019. "Closed-End Formula for options linked to Target Volatility Strategies," Papers 1902.08821, arXiv.org.
John H. Cochrane, 1999. "New facts in finance," Economic Perspectives, Federal Reserve Bank of Chicago, vol. 23(Q III), pages 36-58.
- John H. Cochrane, 1999. "New Facts in Finance," CRSP working papers 490, Center for Research in Security Prices, Graduate School of Business, University of Chicago.
- John H. Cochrane, 1999. "New Facts in Finance," NBER Working Papers 7169, National Bureau of Economic Research, Inc.
Larrain, Borja, 2011. "World betas, consumption growth, and financial integration," Journal of International Money and Finance, Elsevier, vol. 30(6), pages 999-1018, October.
Song, Dandan & Wang, Huamao & Yang, Zhaojun, 2014. "Learning, pricing, timing and hedging of the option to invest for perpetual cash flows with idiosyncratic risk," Journal of Mathematical Economics, Elsevier, vol. 51(C), pages 1-11.
Devereux, Michael B. & Saito, Makoto, 1997. "Growth and risk-sharing with incomplete international assets markets," Journal of International Economics, Elsevier, vol. 42(3-4), pages 453-481, May.
John Y. Campbell & Luis M. Viceira & Joshua S. White, 2003. "Foreign Currency for Long-Term Investors," Economic Journal, Royal Economic Society, vol. 113(486), pages 1-25, March.
- John Y. Campbell & Luis M. Viceira & Joshua S. White, 2002. "Foreign Currency for Long-Term Investors," NBER Working Papers 9075, National Bureau of Economic Research, Inc.
- Campbell, John Y & Viceira, Luis & White, Josh S., 2002. "Foreign Currency for Long-Term Investors," CEPR Discussion Papers 3463, Centre for Economic Policy Research.
- Viceira, Luis & Campbell, John & White, Joshua, 2003. "Foreign Currency for Long-Term Investors," Scholarly Articles 3128708, Harvard University Department of Economics.
repec:dau:papers:123456789/56 is not listed on IDEAS
Stephen Satchell & Susan Thorp, 2007. "Scenario Analysis with Recursive Utility: Dynamic Consumption Plans for Charitable Endowments," Research Paper Series 209, Quantitative Finance Research Centre, University of Technology, Sydney.
- Stephen Satchell & Susan Thorp, 2008. "Scenario Analysis with Recursive Utility: Dynamic Consumption Plans for Charitable Endowments," CAMA Working Papers 2008-03, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
Cuoco, Domenico & Liu, Hong, 2000. "Optimal consumption of a divisible durable good," Journal of Economic Dynamics and Control, Elsevier, vol. 24(4), pages 561-613, April.
- Domenico Cuoco & Hong Liu, "undated". "Optimal Consumption of a Divisible Durable Good," Rodney L. White Center for Financial Research Working Papers 20-98, Wharton School Rodney L. White Center for Financial Research.
Hong‐Chih Huang, 2010. "Optimal Multiperiod Asset Allocation: Matching Assets to Liabilities in a Discrete Model," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 77(2), pages 451-472, June.
Carlos Garriga & Mark P. Keightley, 2007. "A general equilibrium theory of college with education subsidies, in-school labor supply, and borrowing constraints," Working Papers 2007-051, Federal Reserve Bank of St. Louis.
- Carlos Garriga & Mark P. Keightley, 2013. "A General Equilibrium Theory of College with Education Subsidies, In-School Labor Supply, and Borrowing Constraints," Working Papers 2013-002, Human Capital and Economic Opportunity Working Group.
- Mark P. Keightley & Carlos Garriga, 2009. "A General Equilibrium Theory of College with Education Subsidies, In-School Labor Supply, and Borrowing Constraints," 2009 Meeting Papers 1180, Society for Economic Dynamics.
Orszag, J. Michael & Yang, Hong, 1995. "Portfolio choice with Knightian uncertainty," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 873-900.
Bjork, Tomas, 2009. "Arbitrage Theory in Continuous Time," OUP Catalogue, Oxford University Press, edition 3, number 9780199574742.
Andrew Papanicolaou, 2018. "Backward SDEs for Control with Partial Information," Papers 1807.08222, arXiv.org.
E. Nasakkala & J. Keppo, 2008. "Hydropower with Financial Information," Applied Mathematical Finance, Taylor & Francis Journals, vol. 15(5-6), pages 503-529.
Jan Kallsen & Johannes Muhle-Karbe, 2013. "The General Structure of Optimal Investment and Consumption with Small Transaction Costs," Papers 1303.3148, arXiv.org, revised May 2015.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2020-03-09 (Big Data)
NEP-CMP-2020-03-09 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2002.10990. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: https://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data