IDEAS home Printed from https://ideas.repec.org/a/ecm/emetrp/v60y1992i5p1071-96.html
   My bibliography  Save this article

Denumerable-Armed Bandits

Author

Listed:
  • Banks, Jeffrey S
  • Sundaram, Rangarajan K

Abstract

This paper studies the class of denumerable-armed (i.e., finite- or countably infinite-armed) Bandit problems with independent arms and geometric discounting over an infinite horizon in which each arm generates rewards according to one of a finite number of distributions. The authors derive certain continuity and curvature properties of the Gittins Index, and provide necessary and sufficient conditions under which this index characterizes the optimal strategies. They then show that at each point in time the arm selected by an optimal strategy will, with positive probability, remain an optimal selection forever. Copyright 1992 by The Econometric Society.

Suggested Citation

  • Banks, Jeffrey S & Sundaram, Rangarajan K, 1992. "Denumerable-Armed Bandits," Econometrica, Econometric Society, vol. 60(5), pages 1071-1096, September.
  • Handle: RePEc:ecm:emetrp:v:60:y:1992:i:5:p:1071-96
    as

    Download full text from publisher

    File URL: http://links.jstor.org/sici?sici=0012-9682%28199209%2960%3A5%3C1071%3ADB%3E2.0.CO%3B2-O&origin=repec
    File Function: full text
    Download Restriction: Access to full text is restricted to JSTOR subscribers. See http://www.jstor.org for details.

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gale, Douglas & Rosenthal, Robert W., 1999. "Experimentation, Imitation, and Stochastic Stability," Journal of Economic Theory, Elsevier, vol. 84(1), pages 1-40, January.
    2. Araujo, Luis & Camargo, Braz, 2006. "Information, learning, and the stability of fiat money," Journal of Monetary Economics, Elsevier, vol. 53(7), pages 1571-1591, October.
    3. Araujo, Luis & Camargo, Braz, 2008. "Endogenous supply of fiat money," Journal of Economic Theory, Elsevier, vol. 142(1), pages 48-72, September.
    4. Benkert, Jean-Michel & Letina, Igor & Nöldeke, Georg, 2018. "Optimal search from multiple distributions with infinite horizon," Economics Letters, Elsevier, vol. 164(C), pages 15-18.
    5. Elena Pastorino, 2004. "Optimal Job Design and Career Dynamics in the Presence of Uncertainty," Econometric Society 2004 North American Summer Meetings 292, Econometric Society.
    6. Kung-Yu Chen & Chien-Tai Lin, 2005. "A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions," Statistical Papers, Springer, vol. 46(1), pages 129-140, January.
    7. Luis Araujo & Braz Camargo, 2005. "Monetary Equilibrium with Decentralized Trade and Learning," UWO Department of Economics Working Papers 20051, University of Western Ontario, Department of Economics.
    8. Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
    9. Cripps, Martin W., 2013. "Optimal learning of a set: Or how to edit a journal if you must," Economics Letters, Elsevier, vol. 120(3), pages 384-388.
    10. Bergemann, Dirk & Valimaki, Juuso, 1996. "Learning and Strategic Pricing," Econometrica, Econometric Society, vol. 64(5), pages 1125-1149, September.
    11. Epstein, Gil S., 1996. "The extraction of natural resources from two sites under uncertainty," Economics Letters, Elsevier, vol. 51(3), pages 309-313, June.
    12. Klimenko, Mikhail M., 2004. "Industrial targeting, experimentation and long-run specialization," Journal of Development Economics, Elsevier, vol. 73(1), pages 75-105, February.
    13. Bergemann, Dirk & Valimaki, Juuso, 2001. "Stationary multi-choice bandit problems," Journal of Economic Dynamics and Control, Elsevier, vol. 25(10), pages 1585-1594, October.
    14. Camargo, Braz, 2014. "Learning in society," Games and Economic Behavior, Elsevier, vol. 87(C), pages 381-396.
    15. Keller, Godfrey & Oldale, Alison, 2003. "Branching bandits: a sequential search process with correlated pay-offs," Journal of Economic Theory, Elsevier, vol. 113(2), pages 302-315, December.
    16. Chien-Tai Lin & C. Shiau, 2000. "Some Optimal Strategies for Bandit Problems with Beta Prior Distributions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(2), pages 397-405, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ecm:emetrp:v:60:y:1992:i:5:p:1071-96. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Wiley-Blackwell Digital Licensing) or (Christopher F. Baum). General contact details of provider: http://edirc.repec.org/data/essssea.html .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.