IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v328y2026i3p877-893.html

Markov decision processes: Monotonicity of optimal policy in exponential and quasi-hyperbolic discounting parameters

Author

Listed:
  • Kılıç, Hakan
  • Canbolat, Pelin Gülşah
  • Güneş, Evrim Didem

Abstract

Intertemporal preferences of decision makers, i.e., the way they discount delayed utilities, impact their decisions. Empirical evidence suggests that individuals commonly have hyperbolic discounting preferences. This can result in time-inconsistent behavior, e.g., procrastination, which may be a barrier to adopting preventive behavior such as machine maintenance and patient adherence to treatment. In this paper, we theoretically compare the actions of individuals based on their discounting characteristics. We consider the Hyperbolic Discounting (HD) model, which is more representative of individual behavior than Exponential Discounting (ED). We formulate a discrete-time finite-horizon Markov decision process with Quasi-Hyperbolic Discounting (QHD), an analytically tractable function representing HD and present sufficient conditions that ensure the monotonicity of the optimal policy in the discounting parameters. We consider submodular maximization or supermodular maximization problems. Our paper is the first to investigate the monotonicity of the optimal policy in QHD parameters for these problems. Moreover, we compare the optimal actions under ED and QHD. We apply our results to the settings of machine maintenance, individual health behavior and inventory control. We provide numerical examples that show there might not be monotonicity if our sufficient conditions are not met. Also, we explore the discrepancy between the expected total exponentially-discounted rewards of the actions obtained from QHD and of the actions that are optimal under ED, and observe that this discrepancy is affected mainly by the present bias.

Suggested Citation

  • Kılıç, Hakan & Canbolat, Pelin Gülşah & Güneş, Evrim Didem, 2026. "Markov decision processes: Monotonicity of optimal policy in exponential and quasi-hyperbolic discounting parameters," European Journal of Operational Research, Elsevier, vol. 328(3), pages 877-893.
  • Handle: RePEc:eee:ejores:v:328:y:2026:i:3:p:877-893
    DOI: 10.1016/j.ejor.2025.09.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221725007301
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2025.09.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Erica L. Plambeck & Qiong Wang, 2013. "Implications of Hyperbolic Discounting for Optimal Pricing and Scheduling of Unpleasant Services That Generate Future Benefits," Management Science, INFORMS, vol. 59(8), pages 1927-1946, August.
    2. Diego Nocetti & Elyès Jouini & Clotilde Napp, 2008. "Properties of the Social Discount Rate in a Benthamite Framework with Heterogeneous Degrees of Impatience," Management Science, INFORMS, vol. 54(10), pages 1822-1826, October.
    3. Dror Zuckerman, 1986. "Optimal maintenance policy for stochastically failing equipment: A diffusion approximation," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 33(3), pages 469-477, August.
    4. Arvaniti, Maria & Krishnamurthy, Chandra Kiran B. & Crépin, Anne-Sophie, 2023. "Time-consistent renewable resource management with present bias and regime shifts," Journal of Economic Behavior & Organization, Elsevier, vol. 207(C), pages 479-495.
    5. Tomak, Kerem & Keskin, Tayfun, 2008. "Exploring the trade-off between immediate gratification and delayed network externalities in the consumption of information goods," European Journal of Operational Research, Elsevier, vol. 187(3), pages 887-902, June.
    6. Hopenhayn, Hugo A & Prescott, Edward C, 1992. "Stochastic Monotonicity and Stationary Distributions for Dynamic Economies," Econometrica, Econometric Society, vol. 60(6), pages 1387-1406, November.
    7. Balbus, Łukasz & Reffett, Kevin & Woźny, Łukasz, 2018. "On uniqueness of time-consistent Markov policies for quasi-hyperbolic consumers under uncertainty," Journal of Economic Theory, Elsevier, vol. 176(C), pages 293-310.
    8. Lydia Lawless & Andreas Drichoutis & Rodolfo Nayga, 2013. "Time preferences and health behaviour: a review," Demography, Springer;Population Association of America (PAA), vol. 1(1), pages 1-19, December.
    9. repec:dau:papers:123456789/260 is not listed on IDEAS
    10. Rabah Amir, 2002. "Complementarity and Diagonal Dominance in Discounted Stochastic Games," Annals of Operations Research, Springer, vol. 114(1), pages 39-56, August.
    11. Matthew Rabin & Ted O'Donoghue, 1999. "Doing It Now or Later," American Economic Review, American Economic Association, vol. 89(1), pages 103-124, March.
    12. Nicholas G. Hall & Zhixin Liu, 2023. "Scheduling with present bias," Production and Operations Management, Production and Operations Management Society, vol. 32(6), pages 1743-1759, June.
    13. Katherine L. Milkman & Todd Rogers & Max H. Bazerman, 2009. "Highbrow Films Gather Dust: Time-Inconsistent Preferences and Online DVD Rentals," Management Science, INFORMS, vol. 55(6), pages 1047-1059, June.
    14. Sezer Ülkü & Claudiu V. Dimofte & Glen M. Schmidt, 2012. "Consumer Valuation of Modularly Upgradeable Products," Management Science, INFORMS, vol. 58(9), pages 1761-1776, September.
    15. S. D. Chikte & S. D. Deshmukh, 1981. "Preventive maintenance and replacement under additive damage," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 28(1), pages 33-46, March.
    16. Tomas Björk & Agatha Murgoci, 2014. "A theory of Markovian time-inconsistent stochastic control in discrete time," Finance and Stochastics, Springer, vol. 18(3), pages 545-592, July.
    17. Jonathan Cohen & Keith Marzilli Ericson & David Laibson & John Myles White, 2020. "Measuring Time Preferences," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 299-347, June.
    18. Zhu, Jinxia & Siu, Tak Kuen & Yang, Hailiang, 2020. "Singular dividend optimization for a linear diffusion model with time-inconsistent preferences," European Journal of Operational Research, Elsevier, vol. 285(1), pages 66-80.
    19. Xiaobo Zhao & Yun Zhou & Jinxing Xie, 2017. "An inventory system with quasi-hyperbolic discounting rate," IISE Transactions, Taylor & Francis Journals, vol. 49(6), pages 593-602, June.
    20. Shane Frederick & George Loewenstein & Ted O'Donoghue, 2002. "Time Discounting and Time Preference: A Critical Review," Journal of Economic Literature, American Economic Association, vol. 40(2), pages 351-401, June.
    21. Steven M. Shechter & Matthew D. Bailey & Andrew J. Schaefer & Mark S. Roberts, 2008. "The Optimal Time to Initiate HIV Therapy Under Ordered Health States," Operations Research, INFORMS, vol. 56(1), pages 20-33, February.
    22. Yang, Li & Ye, Zhi-sheng & Lee, Chi-Guhn & Yang, Su-fen & Peng, Rui, 2019. "A two-phase preventive maintenance policy considering imperfect repair and postponed replacement," European Journal of Operational Research, Elsevier, vol. 274(3), pages 966-977.
    23. Ted O'Donoghue & Matthew Rabin, 2001. "Choice and Procrastination," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 116(1), pages 121-160.
    24. Anna Jaśkiewicz & Andrzej S. Nowak, 2021. "Markov decision processes with quasi-hyperbolic discounting," Finance and Stochastics, Springer, vol. 25(2), pages 189-229, April.
    25. George Loewenstein & Drazen Prelec, 1992. "Anomalies in Intertemporal Choice: Evidence and an Interpretation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 107(2), pages 573-597.
    26. Bar Light, 2021. "Stochastic Comparative Statics in Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 797-810, May.
    27. Balbus, Łukasz & Reffett, Kevin & Woźny, Łukasz, 2022. "Time-consistent equilibria in dynamic models with recursive payoffs and behavioral discounting," Journal of Economic Theory, Elsevier, vol. 204(C).
    28. Anett John, 2020. "When Commitment Fails: Evidence from a Field Experiment," Management Science, INFORMS, vol. 66(2), pages 503-529, February.
    29. Curtat, Laurent O., 1996. "Markov Equilibria of Stochastic Games with Complementarities," Games and Economic Behavior, Elsevier, vol. 17(2), pages 177-199, December.
    30. Francesca Gino & Gary Pisano, 2008. "Toward a Theory of Behavioral Operations," Manufacturing & Service Operations Management, INFORMS, vol. 10(4), pages 676-691, March.
    31. Taisuke Imai & Tom A Rutter & Colin F Camerer, 2021. "Meta-Analysis of Present-Bias Estimation using Convex Time Budgets," The Economic Journal, Royal Economic Society, vol. 131(636), pages 1788-1814.
    32. David Laibson, 1997. "Golden Eggs and Hyperbolic Discounting," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 112(2), pages 443-478.
    33. Norman Henderson & Ian Langford, 1998. "Cross-Disciplinary Evidence for Hyperbolic Social Discount Rates," Management Science, INFORMS, vol. 44(11-Part-1), pages 1493-1500, November.
    34. E. S. Phelps & R. A. Pollak, 1968. "On Second-Best National Saving and Game-Equilibrium Growth," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 35(2), pages 185-199.
    35. Andrzej Nowak, 2007. "On stochastic games in economics," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 66(3), pages 513-530, December.
    36. Stefano DellaVigna & Ulrike Malmendier, 2006. "Paying Not to Go to the Gym," American Economic Review, American Economic Association, vol. 96(3), pages 694-719, June.
    37. Chen‐Nan Liao & Ying‐Ju Chen, 2021. "Design of Long‐Term Conditional Cash Transfer Program to Encourage Healthy Habits," Production and Operations Management, Production and Operations Management Society, vol. 30(11), pages 3987-4003, November.
    38. Łukasz Balbus & Kevin Reffett & Łukasz Woźny, 2015. "Time consistent Markov policies in dynamic economies with quasi-hyperbolic consumers," International Journal of Game Theory, Springer;Game Theory Society, vol. 44(1), pages 83-112, February.
    39. Loch, Christoph H. & Wu, Yaozhong, 2007. "Behavioral Operations Management," Foundations and Trends(R) in Technology, Information and Operations Management, now publishers, vol. 1(3), pages 121-232, December.
    40. Mariana Carrera & Heather Royer & Mark Stehr & Justin Sydnor, 2020. "The Structure of Health Incentives: Evidence from a Field Experiment," Management Science, INFORMS, vol. 66(5), pages 1890-1908, May.
    41. Li Li & Li Jiang, 2022. "How should firms adapt pricing strategies when consumers are time‐inconsistent?," Production and Operations Management, Production and Operations Management Society, vol. 31(9), pages 3457-3473, September.
    42. Larry G. Epstein & Stephen M. Tanny, 1980. "Increasing Generalized Correlation: A Definition and Some Economic Consequences," Canadian Journal of Economics, Canadian Economics Association, vol. 13(1), pages 16-34, February.
    43. Doug J. Chung & Byungyeon Kim & Byoung G. Park, 2021. "The Comprehensive Effects of Sales Force Management: A Dynamic Structural Analysis of Selection, Compensation, and Training," Management Science, INFORMS, vol. 67(11), pages 7046-7074, November.
    44. Łukasz Balbus & Anna Jaśkiewicz & Andrzej S. Nowak, 2015. "Existence of Stationary Markov Perfect Equilibria in Stochastic Altruistic Growth Economies," Journal of Optimization Theory and Applications, Springer, vol. 165(1), pages 295-315, April.
    45. Samuel Vercraene & Jean-Philippe Gayon & Fikri Karaesmen, 2018. "Effects of System Parameters on the Optimal Cost and Policy in a Class of Multidimensional Queueing Control Problems," Operations Research, INFORMS, vol. 66(1), pages 150-162, January.
    46. John K.-H. Quah & Bruno Strulovici, 2013. "Discounting, Values, and Decisions," Journal of Political Economy, University of Chicago Press, vol. 121(5), pages 896-939.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cheung, Stephen L. & Tymula, Agnieszka & Wang, Xueting, 2021. "Quasi-Hyperbolic Present Bias: A Meta-Analysis," IZA Discussion Papers 14625, IZA Network @ LISER.
    2. Akin, Zafer & Yavas, Abdullah, 2023. "Elicited Time Preferences and Behavior in Long-Run Projects," MPRA Paper 117133, University Library of Munich, Germany.
    3. Balbus, Łukasz & Reffett, Kevin & Woźny, Łukasz, 2022. "Time-consistent equilibria in dynamic models with recursive payoffs and behavioral discounting," Journal of Economic Theory, Elsevier, vol. 204(C).
    4. Stephen L. Cheung & Agnieszka Tymula & Xueting Wang, 2022. "Present bias for monetary and dietary rewards," Experimental Economics, Springer;Economic Science Association, vol. 25(4), pages 1202-1233, September.
    5. Stefano DellaVigna, 2009. "Psychology and Economics: Evidence from the Field," Journal of Economic Literature, American Economic Association, vol. 47(2), pages 315-372, June.
    6. Chen, Shumin & Luo, Dan & Yao, Haixiang, 2024. "Optimal investor life cycle decisions with time-inconsistent preferences," Journal of Banking & Finance, Elsevier, vol. 161(C).
    7. O'Donoghue, Ted & Rabin, Matthew, 2008. "Procrastination on long-term projects," Journal of Economic Behavior & Organization, Elsevier, vol. 66(2), pages 161-175, May.
    8. Taisuke Imai & Tom A Rutter & Colin F Camerer, 2021. "Meta-Analysis of Present-Bias Estimation using Convex Time Budgets," The Economic Journal, Royal Economic Society, vol. 131(636), pages 1788-1814.
    9. Méder, Zsombor Z. & Flesch, János & Peeters, Ronald, 2017. "Naiveté and sophistication in dynamic inconsistency," Mathematical Social Sciences, Elsevier, vol. 87(C), pages 40-54.
    10. Laureti, Carolina & Szafarz, Ariane, 2023. "Banking regulation and costless commitment contracts for time-inconsistent agents," Economic Modelling, Elsevier, vol. 129(C).
    11. Bart Cockx & Corinna Ghirelli & Bruno Van der Linden, 2013. "Monitoring Job Search Effort with Hyperbolic Time Preferences and Non-Compliance: A Welfare Analysis," CESifo Working Paper Series 4187, CESifo.
    12. Drouhin, Nicolas, 2020. "Non-stationary additive utility and time consistency," Journal of Mathematical Economics, Elsevier, vol. 86(C), pages 1-14.
    13. Frikk Nesje & Paolo G. Piacquadio & Paolo Giovanni Piacquadio, 2025. "Intergenerational Discounting and Inequality," CESifo Working Paper Series 11630, CESifo.
    14. Manzini Paola & Mariotti Marco, 2006. "A Vague Theory of Choice over Time," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 6(1), pages 1-29, October.
    15. Kang, Jingoo & Kang, Minwook, 2022. "Durable goods as commitment devices under quasi-hyperbolic discounting," Journal of Mathematical Economics, Elsevier, vol. 99(C).
    16. Altınok, Ahmet & Yılmaz, Murat, 2018. "Dynamic voluntary contribution to a public project under time inconsistency," Journal of Economic Behavior & Organization, Elsevier, vol. 145(C), pages 114-140.
    17. Hammond, Peter J & Zank, Horst, 2013. "Rationality and Dynamic Consistency under Risk and Uncertainty," The Warwick Economics Research Paper Series (TWERPS) 1033, University of Warwick, Department of Economics.
    18. Yılmaz, Murat, 2015. "Contracting with a naïve time-inconsistent agent: To exploit or not to exploit?," Mathematical Social Sciences, Elsevier, vol. 77(C), pages 46-51.
    19. Danzer, Alexander M. & Zeidler, Helen, 2024. "Present Bias in Choices over Food and Money," IZA Discussion Papers 17415, IZA Network @ LISER.
    20. Marco Casari, 2009. "Pre-commitment and flexibility in a time decision experiment," Journal of Risk and Uncertainty, Springer, vol. 38(2), pages 117-141, April.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:328:y:2026:i:3:p:877-893. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.