Constrained Markov decision processes with first passage criteria

My bibliography Save this article

Constrained Markov decision processes with first passage criteria

Author

Listed:

Yonghui Huang
Qingda Wei
Xianping Guo

Registered:

Abstract

This paper deals with constrained Markov decision processes (MDPs) with first passage criteria. The objective is to maximize the expected reward obtained during a first passage time to some target set, and a constraint is imposed on the associated expected cost over this first passage time. The state space is denumerable, and the rewards/costs are possibly unbounded. In addition, the discount factor is state-action dependent and is allowed to be equal to one. We develop suitable conditions for the existence of a constrained optimal policy, which are generalizations of those for constrained MDPs with the standard discount criteria. Moreover, it is revealed that the constrained optimal policy randomizes between two stationary policies differing in at most one state. Finally, we use a controlled queueing system to illustrate our results, which exhibits some advantage of our optimality conditions. Copyright Springer Science+Business Media New York 2013

Suggested Citation

Yonghui Huang & Qingda Wei & Xianping Guo, 2013. "Constrained Markov decision processes with first passage criteria," Annals of Operations Research, Springer, vol. 206(1), pages 197-219, July.

Handle: RePEc:spr:annopr:v:206:y:2013:i:1:p:197-219:10.1007/s10479-012-1292-1
DOI: 10.1007/s10479-012-1292-1

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Berument, Hakan & Kilinc, Zubeyir & Ozlale, Umit, 2004. "The effects of different inflation risk premiums on interest rate spreads," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 333(C), pages 317-324.
Jorge Alvarez-Mena & Onésimo Hernández-Lerma, 2002. "Convergence of the optimal values of constrained Markov control processes," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 55(3), pages 461-484, June.
Lanlan Zhang & Xianping Guo, 2008. "Constrained continuous-time Markov decision processes with average criteria," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 67(2), pages 323-340, April.
Newell, Richard G. & Pizer, William A., 2003. "Discounting the distant future: how much do uncertain rates increase valuations?," Journal of Environmental Economics and Management, Elsevier, vol. 46(1), pages 52-71, July.
- Pizer, William & Newell, Richard, 2000. "Discounting the Distant Future: How Much Do Uncertain Rates Increase Valuations?," RFF Working Paper Series dp-00-45, Resources for the Future.
- Newell, Richard G. & Pizer, William A., 2001. "Discounting the Distant Future: How Much Do Uncertain Rates Increase Valuations?," Discussion Papers 10743, Resources for the Future.
Sack, Brian & Wieland, Volker, 2000. "Interest-rate smoothing and optimal monetary policy: a review of recent empirical evidence," Journal of Economics and Business, Elsevier, vol. 52(1-2), pages 205-228.
- Brian P. Sack & Volker W. Wieland, 1999. "Interest-rate smoothing and optimal monetary policy: a review of recent empirical evidence," Finance and Economics Discussion Series 1999-39, Board of Governors of the Federal Reserve System (U.S.).
Jorge Alvarez-Mena & Onésimo Hernández-Lerma, 2002. "Convergence of the optimal values of constrained Markov control processes," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 55(3), pages 461-484, June.
Haberman, Steven & Sung, Joo-Ho, 2005. "Optimal pension funding dynamics over infinite control horizon when stochastic rates of return are stationary," Insurance: Mathematics and Economics, Elsevier, vol. 36(1), pages 103-116, February.
Lee, Pei-Ting & Rosenfield, Donald B., 2005. "When to refinance a mortgage: A dynamic programming approach," European Journal of Operational Research, Elsevier, vol. 166(1), pages 266-277, October.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

David González-Sánchez & Fernando Luque-Vásquez & J. Adolfo Minjárez-Sosa, 2019. "Zero-Sum Markov Games with Random State-Actions-Dependent Discount Factors: Existence of Optimal Strategies," Dynamic Games and Applications, Springer, vol. 9(1), pages 103-121, March.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Juan González-Hernández & Raquiel López-Martínez & J. Pérez-Hernández, 2007. "Markov control processes with randomized discounted cost," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 65(1), pages 27-44, February.
Wenzhao Zhang, 2019. "Discrete-Time Constrained Average Stochastic Games with Independent State Processes," Mathematics, MDPI, vol. 7(11), pages 1-18, November.
Héctor Jasso-Fuentes & Tomás Prieto-Rumeau, 2024. "Constrained Markov Decision Processes with Non-constant Discount Factor," Journal of Optimization Theory and Applications, Springer, vol. 202(2), pages 897-931, August.
Guo, Xianping & Zhang, Wenzhao, 2014. "Convergence of controlled models and finite-state approximation for discounted continuous-time Markov decision processes with constraints," European Journal of Operational Research, Elsevier, vol. 238(2), pages 486-496.
Steve Newbold & Charles Griffiths & Christopher C. Moore & Ann Wolverton & Elizabeth Kopits, 2010. "The "Social Cost of Carbon" Made Simple," NCEE Working Paper Series 201007, National Center for Environmental Economics, U.S. Environmental Protection Agency, revised Aug 2010.
Luboš Komárek & Filip Rozsypal, 2009. "Vymezení a vyhodnocení agresivity centrálních bank [Definition and Evaluation of the Central Bank agresivity]," Politická ekonomie, Prague University of Economics and Business, vol. 2009(3), pages 383-404.
Frederick H. Wallace & Gary L. Shelley & Luis F. Cabrera Castellanos, 2004. "Pruebas de la neutralidad monetaria a largo plazo: el caso de Nicaragua," Monetaria, CEMLA, vol. 0(4), pages 407-418, octubre-d.
- Wallace, Frederick H. & Shelley, Gary L. & Cabrera Castellanos, Luis Fernando, 2004. "Pruebas de la neutralidad monetaria a largo plazo. El caso de Nicaragua," El Trimestre Económico, Fondo de Cultura Económica, vol. 71(283), pages 613-624, julio-sep.
Jinho Bae & Chang-Jin Kim & Dong Kim, 2012. "The evolution of the monetary policy regimes in the U.S," Empirical Economics, Springer, vol. 43(2), pages 617-649, October.
- Jinho Bae & Chang-Jin Kim & Dong Heon Kim, 2011. "The Evolution of the Monetary Policy Regimes in the U.S," Discussion Paper Series 1102, Institute of Economic Research, Korea University.
Michael D. Bauer & Eric T. Swanson, 2023. "An Alternative Explanation for the "Fed Information Effect"," American Economic Review, American Economic Association, vol. 113(3), pages 664-700, March.
- Michael D. Bauer & Eric T. Swanson, 2020. "An Alternative Explanation for the “Fed Information Effect”," NBER Working Papers 27013, National Bureau of Economic Research, Inc.
Coenen, Gunter & Wieland, Volker, 2005. "A small estimated euro area model with rational expectations and nominal rigidities," European Economic Review, Elsevier, vol. 49(5), pages 1081-1104, July.
- Gunter Coenen & Volker Wieland, 2000. "A Small Estimated Euro-Area Model with Rational Expectations and Nominal Rigidities," Econometric Society World Congress 2000 Contributed Papers 1284, Econometric Society.
- Wieland, Volker & Coenen, Günter, 2000. "A small estimated euro area model with rational expectations and nominal rigidities," Working Paper Series 30, European Central Bank.
- Coenen, GÃ¼nter & Wieland, Volker, 2002. "A Small Estimated Euro Area Model with Rational Expectations and Nominal Rigidities," CEPR Discussion Papers 3574, C.E.P.R. Discussion Papers.
- Coenen, Guenter & Wieland, Volker, 2003. "A Small Estimated Euro Area Model with Rational Expectations and Nominal Rigidities," CFS Working Paper Series 2003/08, Center for Financial Studies (CFS).
Hansen, Anders Chr., 2006. "Do declining discount rates lead to time inconsistent economic advice?," Ecological Economics, Elsevier, vol. 60(1), pages 138-144, November.
J. Doyne Farmer & John Geanakoplos & Matteo G. Richiardi & Miquel Montero & Josep Perelló & Jaume Masoliver, 2024. "Discounting the Distant Future: What Do Historical Bond Prices Imply about the Long-Term Discount Rate?," Mathematics, MDPI, vol. 12(5), pages 1-25, February.
- Matteo Richiardi & J. Doyne Farmer & John Geanakoplos & Jaume Masoliver & Miquel Montero & Josep Perellò, 2017. "Discounting the distant future: What do historical bond prices imply about the long term discount rate?," LABORatorio R. Revelli Working Papers Series 156, LABORatorio R. Revelli, Centre for Employment Studies.
- J. Doyne Farmer & John Geanakoplos & Matteo G. Richiardi & Miquel Montero & Josep Perell'o & Jaume Masoliver, 2023. "Discounting the distant future: What do historical bond prices imply about the long term discount rate?," Papers 2312.17157, arXiv.org.
Marc-Alexandre Sénégas, 2002. "La politique monétaire face à l'incertitude : un survol méthodologique des contributions relatives à la zone euro," Revue d'Économie Financière, Programme National Persée, vol. 65(1), pages 177-200.
Bernard Lapeyre & Emile Quinet, 2017. "A Simple GDP-based Model for Public Investments at Risk," Post-Print hal-01666574, HAL.
Romain Ranciere & Philippe Bacchetta & Philippe Aghion & Kenneth Rogoff, 2005. "Productivity Growth and the Exchange Rate Regime: The Role of Financial Development," Working Papers 214, Barcelona School of Economics.
Clémentine Florens & Eric Jondeau & Hervé Le Bihan, 2001. "Assessing GMM Estimates of the Federal Reserve Reaction Function," Econometrics 0111003, University Library of Munich, Germany.
- Clémentine Florens & Eric Jondeau & Hervé Le Bihan, 2001. "Assessing GMM Estimates of the Federal Reserve Reaction Function," Working papers 83, Banque de France.
Paulo R. Mota & Abel L. C. Fernandes, 2019. "The Dynamic Adjustment Of Central Banks’ Target Interest Rate: The Case Of The Ecb," FEP Working Papers 613, Universidade do Porto, Faculdade de Economia do Porto.
Gonzalo Edwards, 2002. "La Tasa de Descuento en Proyectos de Largo Plazo," Documentos de Trabajo 231, Instituto de Economia. Pontificia Universidad Católica de Chile..
Lim, Terence & Lo, Andrew W. & Merton, Robert C. & Scholes, Myron S., 2006. "The Derivatives Sourcebook," Foundations and Trends(R) in Finance, now publishers, vol. 1(5–6), pages 365-572, April.
Mala Raghavan & Mardi Dungey, 2015. "Should ASEAN-5 monetary policy-makers act pre-emptively against stock market bubbles?," Applied Economics, Taylor & Francis Journals, vol. 47(11), pages 1086-1105, March.
- Raghavan, Mala & Dungey, Mardi, 2014. "Should ASEAN-5 Monetary Policymakers Act Pre-emptively Against Stock Market Bubbles?," Working Papers 2014-04, University of Tasmania, Tasmanian School of Business and Economics, revised 2014.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:206:y:2013:i:1:p:197-219:10.1007/s10479-012-1292-1. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Constrained Markov decision processes with first passage criteria

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data