IDEAS home Printed from https://ideas.repec.org/a/spr/queues/v104y2023i1d10.1007_s11134-023-09875-x.html
   My bibliography  Save this article

Exponential asymptotic optimality of Whittle index policy

Author

Listed:
  • Nicolas Gast

    (Univ. Grenoble Alpes)

  • Bruno Gaujal

    (Univ. Grenoble Alpes)

  • Chen Yan

    (Univ. Grenoble Alpes)

Abstract

We evaluate the performance of Whittle index policy for restless Markovian bandit. It is shown in Weber and Weiss (J Appl Probab 27(3):637–648, 1990) that if the bandit is indexable and the associated deterministic system has a global attractor fixed point, then the Whittle index policy is asymptotically optimal in the regime where the arm population grows proportionally with the number of activation arms. In this paper, we show that, under the same conditions, this convergence rate is exponential in the arm population, unless the fixed point is singular (to be defined later), which almost never happens in practice. Our result holds for the continuous-time model of Weber and Weiss (1990) and for a discrete-time model in which all bandits make synchronous transitions. Our proof is based on the nature of the deterministic equation governing the stochastic system: We show that it is a piecewise affine continuous dynamical system inside the simplex of the empirical measure of the arms. Using simulations and numerical solvers, we also investigate the singular cases, as well as how the level of singularity influences the (exponential) convergence rate. We illustrate our theorem on a Markovian fading channel model.

Suggested Citation

  • Nicolas Gast & Bruno Gaujal & Chen Yan, 2023. "Exponential asymptotic optimality of Whittle index policy," Queueing Systems: Theory and Applications, Springer, vol. 104(1), pages 107-150, June.
  • Handle: RePEc:spr:queues:v:104:y:2023:i:1:d:10.1007_s11134-023-09875-x
    DOI: 10.1007/s11134-023-09875-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11134-023-09875-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11134-023-09875-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kurtz, Thomas G., 1978. "Strong approximation theorems for density dependent Markov chains," Stochastic Processes and their Applications, Elsevier, vol. 6(3), pages 223-240, February.
    2. David B. Brown & James E. Smith, 2020. "Index Policies and Performance Bounds for Dynamic Selection Problems," Management Science, INFORMS, vol. 66(7), pages 3029-3050, July.
    3. P. S. Ansell & K. D. Glazebrook & J. Niño-Mora & M. O'Keeffe, 2003. "Whittle's index policy for a multi-class queueing system with convex holding costs," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 57(1), pages 21-39, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. José Niño-Mora, 2023. "Markovian Restless Bandits and Index Policies: A Review," Mathematics, MDPI, vol. 11(7), pages 1-27, March.
    2. José Niño-Mora, 2006. "Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues," Mathematics of Operations Research, INFORMS, vol. 31(1), pages 50-84, February.
    3. Santiago R. Balseiro & David B. Brown & Chen Chen, 2021. "Dynamic Pricing of Relocating Resources in Large Networks," Management Science, INFORMS, vol. 67(7), pages 4075-4094, July.
    4. Achal Bassamboo & J. Michael Harrison & Assaf Zeevi, 2006. "Design and Control of a Large Call Center: Asymptotic Analysis of an LP-Based Method," Operations Research, INFORMS, vol. 54(3), pages 419-435, June.
    5. Urtzi Ayesta & Manu K. Gupta & Ina Maria Verloop, 2021. "On the computation of Whittle’s index for Markovian restless bandits," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 93(1), pages 179-208, February.
    6. Samuli Aalto & Ziv Scully, 2023. "Minimizing the mean slowdown in the M/G/1 queue," Queueing Systems: Theory and Applications, Springer, vol. 104(3), pages 187-210, August.
    7. Davide Crapis & Bar Ifrach & Costis Maglaras & Marco Scarsini, 2017. "Monopoly Pricing in the Presence of Social Learning," Management Science, INFORMS, vol. 63(11), pages 3586-3608, November.
    8. Ankit Gupta & Mustafa Khammash, 2022. "Frequency spectra and the color of cellular noise," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    9. Deligiannis, Michalis & Liberopoulos, George, 2023. "Dynamic ordering and buyer selection policies when service affects future demand," Omega, Elsevier, vol. 118(C).
    10. Ephraim M. Hanks, 2017. "Modeling Spatial Covariance Using the Limiting Distribution of Spatio-Temporal Random Walks," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 497-507, April.
    11. José Niño-Mora, 2020. "A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits," Mathematics of Operations Research, INFORMS, vol. 45(2), pages 465-496, May.
    12. Keliger, Dániel & Horváth, Illés, 2023. "Accuracy criterion for mean field approximations of Markov processes on hypergraphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
    13. Jamaal Ahmad & Mogens Bladt, 2022. "Phase-type representations of stochastic interest rates with applications to life insurance," Papers 2207.11292, arXiv.org, revised Nov 2022.
    14. He, Yuheng & Xue, Xiaofeng, 2023. "Moderate deviations of hitting times of a family of density-dependent Markov chains," Statistics & Probability Letters, Elsevier, vol. 195(C).
    15. Horst, Ulrich, 2010. "Dynamic systems of social interactions," Journal of Economic Behavior & Organization, Elsevier, vol. 73(2), pages 158-170, February.
    16. Natiello, Mario A. & Solari, Hernán G., 2020. "Modelling population dynamics based on experimental trials with genetically modified (RIDL) mosquitoes," Ecological Modelling, Elsevier, vol. 424(C).
    17. Ramandeep S. Randhawa & Sunil Kumar, 2009. "Multiserver Loss Systems with Subscribers," Mathematics of Operations Research, INFORMS, vol. 34(1), pages 142-179, February.
    18. Nicolas Gast & Bruno Gaujal & Kimang Khun, 2023. "Testing indexability and computing Whittle and Gittins index in subcubic time," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 97(3), pages 391-436, June.
    19. Ya‐Tang Chuang & Manaf Zargoush & Somayeh Ghazalbash & Saied Samiedaluie & Kerry Kuluski & Sara Guilcher, 2023. "From prediction to decision: Optimizing long‐term care placements among older delayed discharge patients," Production and Operations Management, Production and Operations Management Society, vol. 32(4), pages 1041-1058, April.
    20. S. Duran & U. Ayesta & I. M. Verloop, 2022. "On the Whittle index of Markov modulated restless bandits," Queueing Systems: Theory and Applications, Springer, vol. 102(3), pages 373-430, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:queues:v:104:y:2023:i:1:d:10.1007_s11134-023-09875-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.