IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v63y2015i2p428-434.html
   My bibliography  Save this article

Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results

Author

Listed:
  • Vikram Krishnamurthy

    (Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada)

  • Udit Pareek

    (Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada)

Abstract

This paper provides a relaxation of the sufficient conditions and an extension of the structural results for partially observed Markov decision processes (POMDPs) obtained by Lovejoy in 1987. Sufficient conditions are provided so that the optimal policy can be upper and lower bounded by judiciously chosen myopic policies. These myopic policy bounds are constructed to maximize the volume of belief states where they coincide with the optimal policy. Numerical examples illustrate these myopic bounds for both continuous and discrete observation sets.

Suggested Citation

  • Vikram Krishnamurthy & Udit Pareek, 2015. "Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results," Operations Research, INFORMS, vol. 63(2), pages 428-434, April.
  • Handle: RePEc:inm:oropre:v:63:y:2015:i:2:p:428-434
    DOI: 10.1287/opre.2014.1332
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.2014.1332
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.2014.1332?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities II. Multivariate reverse rule distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 499-516, December.
    2. William S. Lovejoy, 1987. "Some Monotonicity Results for Partially Observed Markov Decision Processes," Operations Research, INFORMS, vol. 35(5), pages 736-743, October.
    3. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 467-498, December.
    4. Paul R. Milgrom, 1981. "Good News and Bad News: Representation Theorems and Applications," Bell Journal of Economics, The RAND Corporation, vol. 12(2), pages 380-391, Autumn.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Junbo Son & Yeongin Kim & Shiyu Zhou, 2022. "Alerting patients via health information system considering trust-dependent patient adherence," Information Technology and Management, Springer, vol. 23(4), pages 245-269, December.
    2. Saghafian, Soroush, 2018. "Ambiguous partially observable Markov decision processes: Structural results and applications," Journal of Economic Theory, Elsevier, vol. 178(C), pages 1-35.
    3. Miehling, Erik & Teneketzis, Demosthenis, 2020. "Monotonicity properties for two-action partially observable Markov decision processes on partially ordered spaces," European Journal of Operational Research, Elsevier, vol. 282(3), pages 936-944.
    4. Nowak, Piotr Bolesław, 2016. "The MLE of the mean of the exponential distribution based on grouped data is stochastically increasing," Statistics & Probability Letters, Elsevier, vol. 111(C), pages 49-54.
    5. Chi, Chang Koo & Murto, Pauli & Valimaki, Juuso, 2017. "All-Pay Auctions with Affiliated Values," MPRA Paper 80799, University Library of Munich, Germany.
    6. Arnaud Costinot & Jonathan Vogel, 2010. "Matching and Inequality in the World Economy," Journal of Political Economy, University of Chicago Press, vol. 118(4), pages 747-786, August.
    7. Badía, F.G. & Sangüesa, C. & Cha, J.H., 2014. "Stochastic comparison of multivariate conditionally dependent mixtures," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 82-94.
    8. Prokopovych, Pavlo & Yannelis, Nicholas C., 2019. "On monotone approximate and exact equilibria of an asymmetric first-price auction with affiliated private information," Journal of Economic Theory, Elsevier, vol. 184(C).
    9. Miravete, Eugenio J., 2011. "Convolution and composition of totally positive random variables in economics," Journal of Mathematical Economics, Elsevier, vol. 47(4-5), pages 479-490.
    10. Patricio S. Dalton & Sayantan Ghosal & Anandi Mani, 2016. "Poverty and Aspirations Failure," Economic Journal, Royal Economic Society, vol. 126(590), pages 165-188, February.
    11. Bhattacharya, Bhaskar, 2012. "Covariance selection and multivariate dependence," Journal of Multivariate Analysis, Elsevier, vol. 106(C), pages 212-228.
    12. Castaño-Martínez, A. & Pigueiras, G. & Sordo, M.A., 2019. "On a family of risk measures based on largest claims," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 92-97.
    13. Elina Robeva & Bernd Sturmfels & Ngoc Tran & Caroline Uhler, 2021. "Maximum likelihood estimation for totally positive log‐concave densities," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(3), pages 817-844, September.
    14. Tsukuma, Hisayuki & Kubokawa, Tatsuya, 2008. "Stein's phenomenon in estimation of means restricted to a polyhedral convex cone," Journal of Multivariate Analysis, Elsevier, vol. 99(1), pages 141-164, January.
    15. H. Finner & M. Roters & K. Strassburger, 2017. "On the Simes test under dependence," Statistical Papers, Springer, vol. 58(3), pages 775-789, September.
    16. Müller, Alfred & Scarsini, Marco, 2005. "Archimedean copulæ and positive dependence," Journal of Multivariate Analysis, Elsevier, vol. 93(2), pages 434-445, April.
    17. Arnaud Costinot, 2009. "An Elementary Theory of Comparative Advantage," Econometrica, Econometric Society, vol. 77(4), pages 1165-1192, July.
    18. Barmalzan, Ghobad & Akrami, Abbas & Balakrishnan, Narayanaswamy, 2020. "Stochastic comparisons of the smallest and largest claim amounts with location-scale claim severities," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 341-352.
    19. Lu, I-Li & Richards, Donald, 1996. "Total positivity properties of the bivariate diagonal natural exponential families," Statistics & Probability Letters, Elsevier, vol. 26(2), pages 119-124, February.
    20. Jian Yang, 2023. "A Partial Order for Strictly Positive Coalitional Games and a Link from Risk Aversion to Cooperation," Papers 2304.10652, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:63:y:2015:i:2:p:428-434. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.