IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2602.13544.html

Merton's Problem with Recursive Perturbed Utility

Author

Listed:
  • Min Dai
  • Yuchao Dong
  • Yanwei Jia
  • Xun Yu Zhou

Abstract

The classical Merton investment problem predicts deterministic, state-dependent portfolio rules; however, laboratory and field evidence suggests that individuals often prefer randomized decisions leading to stochastic and noisy choices. Fudenberg et al. (2015) develop the additive perturbed utility theory to explain the preference for randomization in the static setting, which, however, becomes ill-posed or intractable in the dynamic setting. We introduce the recursive perturbed utility (RPU), a special stochastic differential utility that incorporates an entropy-based preference for randomization into a recursive aggregator. RPU endogenizes the intertemporal trade-off between utilities from randomization and bequest via a discounting term dependent on past accumulated randomization, thereby avoiding excessive randomization and yielding a well-posed problem. In a general Markovian incomplete market with CRRA preferences, we prove that the RPU-optimal portfolio policy (in terms of the risk exposure ratio) is Gaussian and can be expressed in closed form, independent of wealth. Its variance is inversely proportional to risk aversion and stock volatility, while its mean is based on the solution to a partial differential equation. Moreover, the mean is the sum of a myopic term and an intertemporal hedging term (against market incompleteness) that intertwines with policy randomization. Finally, we carry out an asymptotic expansion in terms of the perturbed utility weight to show that the optimal mean policy deviates from the classical Merton policy at first order, while the associated relative wealth loss is of a higher order, quantifying the financial cost of the preference for randomization.

Suggested Citation

  • Min Dai & Yuchao Dong & Yanwei Jia & Xun Yu Zhou, 2026. "Merton's Problem with Recursive Perturbed Utility," Papers 2602.13544, arXiv.org.
  • Handle: RePEc:arx:papers:2602.13544
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2602.13544
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Marina Agranov & Pietro Ortoleva, 2025. "Ranges of Randomization," The Review of Economics and Statistics, MIT Press, vol. 107(6), pages 1702-1713, November.
    2. Duffie, Darrel & Lions, Pierre-Louis, 1992. "PDE solutions of stochastic differential utility," Journal of Mathematical Economics, Elsevier, vol. 21(6), pages 577-606.
    3. Nicholas Barberis, 2012. "A Model of Casino Gambling," Management Science, INFORMS, vol. 58(1), pages 35-51, January.
    4. Marina Agranov & Pietro Ortoleva, 2017. "Stochastic Choice and Preferences for Randomization," Journal of Political Economy, University of Chicago Press, vol. 125(1), pages 40-68.
    5. Guiyun Feng & Xiaobo Li & Zizhuo Wang, 2017. "Technical Note—On the Relation Between Several Discrete Choice Models," Operations Research, INFORMS, vol. 65(6), pages 1516-1525, December.
    6. Min Dai & Yu Sun & Zuo Quan Xu & Xun Yu Zhou, 2024. "Learning to Optimally Stop Diffusion Processes, with Financial Applications," Papers 2408.09242, arXiv.org, revised Aug 2025.
    7. V. Joseph Hotz & Robert A. Miller & Seth Sanders & Jeffrey Smith, 1994. "A Simulation Estimator for Dynamic Models of Discrete Choice," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(2), pages 265-289.
    8. Yanwei Jia, 2024. "Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty," Papers 2404.12598, arXiv.org, revised Mar 2026.
    9. Dianetti, Jodi & Dumitrescu, Roxana & Ferrari, Giorgio & Xu, Renyuan, 2025. "Entropy Regularization in Mean-Field Games of Optimal Stopping," Center for Mathematical Economics Working Papers 755, Center for Mathematical Economics, Bielefeld University.
    10. Min Dai & Yuchao Dong & Yanwei Jia, 2023. "Learning equilibrium mean‐variance strategy," Mathematical Finance, Wiley Blackwell, vol. 33(4), pages 1166-1212, October.
    11. Mattsson, Lars-Goran & Weibull, Jorgen W., 2002. "Probabilistic choice and procedurally bounded rationality," Games and Economic Behavior, Elsevier, vol. 41(1), pages 61-78, October.
    12. Duffie, Darrell & Epstein, Larry G, 1992. "Stochastic Differential Utility," Econometrica, Econometric Society, vol. 60(2), pages 353-394, March.
    13. V. Joseph Hotz & Robert A. Miller, 1993. "Conditional Choice Probabilities and the Estimation of Dynamic Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 60(3), pages 497-529.
    14. Drew Fudenberg & Ryota Iijima & Tomasz Strzalecki, 2015. "Stochastic Choice and Revealed Perturbed Utility," Econometrica, Econometric Society, vol. 83, pages 2371-2409, November.
    15. Bin Miao & Songfa Zhong, 2018. "Probabilistic social preference: how Machina’s Mom randomizes her choice," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 65(1), pages 1-24, January.
    16. Ying Hu & Peter Imkeller & Matthias Muller, 2005. "Utility maximization in incomplete markets," Papers math/0508448, arXiv.org.
    17. Min Dai & Hanqing Jin & Steven Kou & Yuhong Xu, 2021. "A Dynamic Mean-Variance Analysis for Log Returns," Management Science, INFORMS, vol. 67(2), pages 1093-1108, February.
    18. Kydland, Finn E & Prescott, Edward C, 1982. "Time to Build and Aggregate Fluctuations," Econometrica, Econometric Society, vol. 50(6), pages 1345-1370, November.
    19. Sang Hu & Jan Obłój & Xun Yu Zhou, 2023. "A Casino Gambling Model Under Cumulative Prospect Theory: Analysis and Algorithm," Management Science, INFORMS, vol. 69(4), pages 2474-2496, April.
    20. Duffie, Darrell & Epstein, Larry G, 1992. "Asset Pricing with Stochastic Differential Utility," The Review of Financial Studies, Society for Financial Studies, vol. 5(3), pages 411-436.
    21. Bender, Christian & Thuan, Nguyen Tran, 2026. "Continuous time reinforcement learning: A random measure approach," Stochastic Processes and their Applications, Elsevier, vol. 194(C).
    22. Xin Guo & Renyuan Xu & Thaleia Zariphopoulou, 2022. "Entropy Regularization for Mean Field Games with Learning," Mathematics of Operations Research, INFORMS, vol. 47(4), pages 3239-3260, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Min Dai & Yuchao Dong & Yanwei Jia & Xun Yu Zhou, 2023. "Data-Driven Merton's Strategies via Policy Randomization," Papers 2312.11797, arXiv.org, revised Feb 2026.
    2. Zixin Feng & Dejian Tian, 2021. "Optimal consumption and portfolio selection with Epstein-Zin utility under general constraints," Papers 2111.09032, arXiv.org, revised May 2023.
    3. Knut K. Aase, 2016. "Recursive utility using the stochastic maximum principle," Quantitative Economics, Econometric Society, vol. 7(3), pages 859-887, November.
    4. Joshua Aurand & Yu-Jui Huang, 2019. "Epstein-Zin Utility Maximization on a Random Horizon," Papers 1903.08782, arXiv.org, revised May 2023.
    5. Yilie Huang & Yanwei Jia & Xun Yu Zhou, 2024. "Mean--Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study," Papers 2412.16175, arXiv.org, revised Mar 2026.
    6. Joshua Aurand & Yu‐Jui Huang, 2023. "Epstein‐Zin utility maximization on a random horizon," Mathematical Finance, Wiley Blackwell, vol. 33(4), pages 1370-1411, October.
    7. Huiwen Yan & Gechun Liang & Zhou Yang, 2015. "Indifference Pricing and Hedging in a Multiple-Priors Model with Trading Constraints," Papers 1503.08969, arXiv.org.
    8. Anis Matoussi & Hao Xing, 2016. "Convex duality for stochastic differential utility," Papers 1601.03562, arXiv.org.
    9. Guanxing Fu & Ulrich Horst, 2025. "Mean Field Portfolio Games with Epstein-Zin Preferences," Papers 2505.07231, arXiv.org.
    10. Ruan, Xinfeng & Zhang, Jin E., 2018. "Equilibrium variance risk premium in a cost-free production economy," Journal of Economic Dynamics and Control, Elsevier, vol. 96(C), pages 42-60.
    11. Sanjiv Ranjan Das & Rangarajan K. Sundaram, 2002. "An approximation algorithm for optimal consumption/investment problems," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 11(2), pages 55-69, April.
    12. Hao Xing, 2015. "Consumption investment optimization with Epstein-Zin utility in incomplete markets," Papers 1501.04747, arXiv.org, revised Nov 2015.
    13. Feng, Zixin & Tian, Dejian & Zheng, Harry, 2026. "Consumption–investment optimization with Epstein–Zin utility in unbounded non-Markovian markets," Stochastic Processes and their Applications, Elsevier, vol. 192(C).
    14. Guanxing Fu & Ulrich Horst, 2025. "Mean Field Portfolio Games with Epstein-Zin Preferences," Rationality and Competition Discussion Paper Series 540, CRC TRR 190 Rationality and Competition.
    15. Gregor Heyne & Michael Kupper & Ludovic Tangpi, 2016. "Portfolio Optimization Under Nonlinear Utility," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 19(05), pages 1-37, August.
    16. Casassus, Jaime & Collin-Dufresne, Pierre & Routledge, Bryan R., 2018. "Equilibrium commodity prices with irreversible investment and non-linear technologies," Journal of Banking & Finance, Elsevier, vol. 95(C), pages 128-147.
    17. Hao Xing, 2017. "Consumption–investment optimization with Epstein–Zin utility in incomplete markets," Finance and Stochastics, Springer, vol. 21(1), pages 227-262, January.
    18. Aase, Knut K., 2014. "Recursive utility and jump-diffusions," Discussion Papers 2014/9, Norwegian School of Economics, Department of Business and Management Science.
    19. Aase, Knut K., 2014. "Heterogeneity and limited stock market Participation," Discussion Papers 2014/5, Norwegian School of Economics, Department of Business and Management Science, revised 25 Mar 2015.
    20. Fahrenwaldt, Matthias Albrecht & Jensen, Ninna Reitzel & Steffensen, Mogens, 2020. "Nonrecursive separation of risk and time preferences," Journal of Mathematical Economics, Elsevier, vol. 90(C), pages 95-108.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2602.13544. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.