On Dynamic Programming with Unbounded Rewards

My bibliography Save this article

On Dynamic Programming with Unbounded Rewards

Author

Listed:

Steven A. Lippman
(University of California, Los Angeles)

Registered:

Abstract

Using the technique employed by the author in an earlier paper, the existence of an optimal stationary policy that can be obtained from the usual functional equation is again established in the presence of a bound (not necessarily polynomial) on the one-period reward of a semi-Markov decision process. This is done for both the discounted and the average cost case. In addition to allowing an uncountable state space, the law of motion of the system is rather general in that we permit any state to be reached in a single transition. There is, however, a bound on a weighted moment of the next state reached. Finally, we indicate the applicability of these results.

Suggested Citation

Steven A. Lippman, 1975. "On Dynamic Programming with Unbounded Rewards," Management Science, INFORMS, vol. 21(11), pages 1225-1233, July.

Handle: RePEc:inm:ormnsc:v:21:y:1975:i:11:p:1225-1233
DOI: 10.1287/mnsc.21.11.1225

Download full text from publisher

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Ilbin Lee & Marina A. Epelman & H. Edwin Romeijn & Robert L. Smith, 2017. "Simplex Algorithm for Countable-State Discounted Markov Decision Processes," Operations Research, INFORMS, vol. 65(4), pages 1029-1042, August.
Hyun-Soo Ahn & Mehmet Gümüc{s} & Philip Kaminsky, 2009. "Inventory, Discounts, and the Timing Effect," Manufacturing & Service Operations Management, INFORMS, vol. 11(4), pages 613-629, September.
Andriy Norets, 2010. "Continuity and differentiability of expected value functions in dynamic discrete choice models," Quantitative Economics, Econometric Society, vol. 1(2), pages 305-322, November.
Alexis Akira Toda, 2024. "Unbounded Markov dynamic programming with weighted supremum norm Perov contractions," Economic Theory Bulletin, Springer;Society for the Advancement of Economic Theory (SAET), vol. 12(2), pages 141-156, December.
- Alexis Akira Toda, 2023. "Unbounded Markov Dynamic Programming with Weighted Supremum Norm Perov Contractions," Papers 2310.04593, arXiv.org.
C. Drent & S. Kapodistria & J. A. C. Resing, 2019. "Condition-based maintenance policies under imperfect maintenance at scheduled and unscheduled opportunities," Queueing Systems: Theory and Applications, Springer, vol. 93(3), pages 269-308, December.
James E. Smith & Kevin F. McCardle, 1998. "Valuing Oil Properties: Integrating Option Pricing and Decision Analysis Approaches," Operations Research, INFORMS, vol. 46(2), pages 198-217, April.
José Niño-Mora, 2020. "A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits," Mathematics of Operations Research, INFORMS, vol. 45(2), pages 465-496, May.
Otero, Karina V., 2016. "Nonparametric identification of dynamic multinomial choice games: unknown payoffs and shocks without interchangeability," MPRA Paper 86784, University Library of Munich, Germany.
Sturm, Roland, 1995. "Why does nuclear power performance differ across Europe?," European Economic Review, Elsevier, vol. 39(6), pages 1197-1214, June.
James E. Smith & Canan Ulu, 2012. "Technology Adoption with Uncertain Future Costs and Quality," Operations Research, INFORMS, vol. 60(2), pages 262-274, April.
Hong Chen & Murray Zed Frank, 2022. "Equilibrium Defaultable Corporate Debt and Investment," Papers 2202.05885, arXiv.org.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:21:y:1975:i:11:p:1225-1233. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

On Dynamic Programming with Unbounded Rewards

Author

Abstract

Suggested Citation

Download full text from publisher

Citations

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data