IDEAS home Printed from https://ideas.repec.org/a/wsi/igtrxx/v22y2020i02ns0219198920400083.html
   My bibliography  Save this article

A Policy Improvement Algorithm for Solving a Mixture Class of Perfect Information and AR-AT Semi-Markov Games

Author

Listed:
  • P. Mondal

    (Mathematics Department, Government General Degree College, Ranibandh, Bankura 722135, India)

  • S. K. Neogy

    (Indian Statistical Institute, Delhi Centre, New Delhi 110016, India)

  • A. Gupta

    (Indian Statistical Institute, Kolkata Centre, Kolkata 700108, India)

  • D. Ghorui

    (Mathematics Department, Jadavpur University, Kolkata 700032, India)

Abstract

Zero-sum two-person discounted semi-Markov games with finite state and action spaces are studied where a collection of states having Perfect Information (PI) property is mixed with another collection of states having Additive Reward–Additive Transition and Action Independent Transition Time (AR-AT-AITT) property. For such a PI/AR-AT-AITT mixture class of games, we prove the existence of an optimal pure stationary strategy for each player. We develop a policy improvement algorithm for solving discounted semi-Markov decision processes (one player version of semi-Markov games) and using it we obtain a policy-improvement type algorithm for computing an optimal strategy pair of a PI/AR-AT-AITT mixture semi-Markov game. Finally, we extend our results when the states having PI property are replaced by a subclass of Switching Control (SC) states.

Suggested Citation

  • P. Mondal & S. K. Neogy & A. Gupta & D. Ghorui, 2020. "A Policy Improvement Algorithm for Solving a Mixture Class of Perfect Information and AR-AT Semi-Markov Games," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 22(02), pages 1-19, June.
  • Handle: RePEc:wsi:igtrxx:v:22:y:2020:i:02:n:s0219198920400083
    DOI: 10.1142/S0219198920400083
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219198920400083
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219198920400083?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sina Nayeri & Mahdieh Tavakoli & Mehrab Tanhaeean & Fariborz Jolai, 2022. "A robust fuzzy stochastic model for the responsive-resilient inventory-location problem: comparison of metaheuristic algorithms," Annals of Operations Research, Springer, vol. 315(2), pages 1895-1935, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:igtrxx:v:22:y:2020:i:02:n:s0219198920400083. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/igtr/igtr.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.