Printed from https://ideas.repec.org/a/inm/oropre/v63y2015i2p428-434.html

Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results

Authors

Listed:
  • Vikram Krishnamurthy

    (Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada)

  • Udit Pareek

    (Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada)

Abstract

This paper provides a relaxation of the sufficient conditions and an extension of the structural results for partially observed Markov decision processes (POMDPs) obtained by Lovejoy in 1987. Sufficient conditions are provided so that the optimal policy can be upper and lower bounded by judiciously chosen myopic policies. These myopic policy bounds are constructed to maximize the volume of belief states where they coincide with the optimal policy. Numerical examples illustrate these myopic bounds for both continuous and discrete observation sets.
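As a rough illustration of the objects the abstract refers to, the sketch below shows a belief-state update and a plain myopic policy for a finite POMDP. It is not the construction used in the paper, whose bounds rely on specially chosen myopic policies. The function names, the cost-minimization convention, and the matrix layouts (`P[i][j]` for transitions, `B[j][y]` for observation likelihoods, `costs[a][x]` for one-step costs) are all assumptions made for this example.

```python
def belief_update(belief, P, B, obs):
    """Bayesian (HMM filter) update of a belief state.

    belief: list of state probabilities (hypothetical layout)
    P[i][j]: probability of moving from state i to state j
    B[j][y]: probability of observing y when the new state is j
    obs: index of the observation actually received
    """
    n = len(belief)
    # Predict: push the belief through the transition matrix.
    predicted = [sum(belief[i] * P[i][j] for i in range(n)) for j in range(n)]
    # Correct: weight each state by the likelihood of the observation.
    unnormalized = [B[j][obs] * predicted[j] for j in range(n)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("observation has zero probability under this belief")
    return [u / total for u in unnormalized]


def myopic_action(belief, costs):
    """Return the action with the smallest expected one-step cost.

    costs[a][x]: assumed one-step cost of action a in state x.
    Ties are broken in favor of the lowest action index.
    """
    expected = [sum(c * b for c, b in zip(row, belief)) for row in costs]
    return min(range(len(costs)), key=expected.__getitem__)


if __name__ == "__main__":
    # Hypothetical two-state, two-observation, two-action example.
    P = [[0.9, 0.1], [0.2, 0.8]]
    B = [[0.8, 0.2], [0.3, 0.7]]
    costs = [[0.0, 2.0], [1.0, 1.0]]
    belief = belief_update([0.5, 0.5], P, B, obs=0)
    print(belief, myopic_action(belief, costs))
```

A myopic policy of this kind ignores future costs entirely, which is why it is cheap to compute; the paper's contribution, per the abstract, is to give conditions under which suitably chosen myopic policies bound the optimal policy from above and below.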

Suggested Citation

  • Vikram Krishnamurthy & Udit Pareek, 2015. "Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results," Operations Research, INFORMS, vol. 63(2), pages 428-434, April.
  • Handle: RePEc:inm:oropre:v:63:y:2015:i:2:p:428-434
    DOI: 10.1287/opre.2014.1332

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.2014.1332
    Download Restriction: no


    References listed on IDEAS

    1. Paul R. Milgrom, 1981. "Good News and Bad News: Representation Theorems and Applications," Bell Journal of Economics, The RAND Corporation, vol. 12(2), pages 380-391, Autumn.
    2. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities II. Multivariate reverse rule distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 499-516, December.
    3. William S. Lovejoy, 1987. "Some Monotonicity Results for Partially Observed Markov Decision Processes," Operations Research, INFORMS, vol. 35(5), pages 736-743, October.
    4. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 467-498, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Junbo Son & Yeongin Kim & Shiyu Zhou, 2022. "Alerting patients via health information system considering trust-dependent patient adherence," Information Technology and Management, Springer, vol. 23(4), pages 245-269, December.
    2. Saghafian, Soroush, 2018. "Ambiguous partially observable Markov decision processes: Structural results and applications," Journal of Economic Theory, Elsevier, vol. 178(C), pages 1-35.
    3. Miehling, Erik & Teneketzis, Demosthenis, 2020. "Monotonicity properties for two-action partially observable Markov decision processes on partially ordered spaces," European Journal of Operational Research, Elsevier, vol. 282(3), pages 936-944.
    4. Chi, Chang Koo & Murto, Pauli & Valimaki, Juuso, 2017. "All-Pay Auctions with Affiliated Values," MPRA Paper 80799, University Library of Munich, Germany.
    5. Arnaud Costinot & Jonathan Vogel, 2010. "Matching and Inequality in the World Economy," Journal of Political Economy, University of Chicago Press, vol. 118(4), pages 747-786, August.
    6. Müller, Alfred & Scarsini, Marco, 2005. "Archimedean copulæ and positive dependence," Journal of Multivariate Analysis, Elsevier, vol. 93(2), pages 434-445, April.
    7. Arnaud Costinot, 2009. "An Elementary Theory of Comparative Advantage," Econometrica, Econometric Society, vol. 77(4), pages 1165-1192, July.
    8. Barmalzan, Ghobad & Akrami, Abbas & Balakrishnan, Narayanaswamy, 2020. "Stochastic comparisons of the smallest and largest claim amounts with location-scale claim severities," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 341-352.
    9. Jian Yang, 2023. "A Partial Order for Strictly Positive Coalitional Games and a Link from Risk Aversion to Cooperation," Papers 2304.10652, arXiv.org.
    10. Battey, H.S. & Cox, D.R., 2022. "Some aspects of non-standard multivariate analysis," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    11. Ligtvoet, R., 2015. "A test for using the sum score to obtain a stochastic ordering of subjects," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 136-139.
    12. Huang, Wen-Tao & Xu, Bing, 2002. "Some maximal inequalities and complete convergences of negatively associated random sequences," Statistics & Probability Letters, Elsevier, vol. 57(2), pages 183-191, April.
    13. Francesco Bartolucci, 2002. "A recursive algorithm for Markov random fields," Biometrika, Biometrika Trust, vol. 89(3), pages 724-730, August.
    14. Li, Benchong & Li, Yang, 2017. "A note on faithfulness and total positivity," Statistics & Probability Letters, Elsevier, vol. 122(C), pages 168-172.
    15. Burkett, Justin, 2015. "Endogenous budget constraints in auctions," Journal of Economic Theory, Elsevier, vol. 158(PA), pages 1-20.
    16. Ori Davidov & Amir Herman, 2011. "Multivariate Stochastic Orders Induced by Case-Control Sampling," Methodology and Computing in Applied Probability, Springer, vol. 13(1), pages 139-154, March.
    17. Rudy Ligtvoet, 2015. "Remarks and a Correction of Ligtvoet’s Treatment of the Isotonic Partial Credit Model," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 514-515, June.
    18. Fosgerau, Mogens & Lindberg, Per Olov & Mattsson, Lars-Göran & Weibull, Jörgen, 2015. "Invariance of the distribution of the maximum," MPRA Paper 63529, University Library of Munich, Germany.
    19. Colangelo, Antonio & Scarsini, Marco & Shaked, Moshe, 2006. "Some positive dependence stochastic orders," Journal of Multivariate Analysis, Elsevier, vol. 97(1), pages 46-78, January.
    20. Bezgina, E. & Burkschat, M., 2019. "On total positivity of exchangeable random variables obtained by symmetrization, with applications to failure-dependent lifetimes," Journal of Multivariate Analysis, Elsevier, vol. 169(C), pages 95-109.
