Denumerable-Armed Bandits

My bibliography Save this article

Denumerable-Armed Bandits

Author

Listed:

Banks, Jeffrey S
Sundaram, Rangarajan K

Registered:

Jeffrey Scot Banks †

Abstract

This paper studies the class of denumerable-armed (i.e., finite- or countably infinite-armed) Bandit problems with independent arms and geometric discounting over an infinite horizon in which each arm generates rewards according to one of a finite number of distributions. The authors derive certain continuity and curvature properties of the Gittins Index, and provide necessary and sufficient conditions under which this index characterizes the optimal strategies. They then show that at each point in time the arm selected by an optimal strategy will, with positive probability, remain an optimal selection forever. Copyright 1992 by The Econometric Society.

Suggested Citation

Banks, Jeffrey S & Sundaram, Rangarajan K, 1992. "Denumerable-Armed Bandits," Econometrica, Econometric Society, vol. 60(5), pages 1071-1096, September.

Handle: RePEc:ecm:emetrp:v:60:y:1992:i:5:p:1071-96

Download full text from publisher

As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

Other versions of this item:

Banks, J.s. & Sunderam, R.K., 1991. "Denumerable-Armed Bandits," RCER Working Papers 277, University of Rochester - Center for Economic Research (RCER).

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Gale, Douglas & Rosenthal, Robert W., 1999. "Experimentation, Imitation, and Stochastic Stability," Journal of Economic Theory, Elsevier, vol. 84(1), pages 1-40, January.
- Gale, D. & Rosental, R.W., 1996. "Experimentation, Imitation, and Stochastic Stability," Papers 65, Boston University - Industry Studies Programme.
- Douglas Gale & Robert W. Rosenthal, 1996. "Experimentation, Imitation, and Stochastic Stability," Papers 0065, Boston University - Industry Studies Programme.
Araujo, Luis & Camargo, Braz, 2006. "Information, learning, and the stability of fiat money," Journal of Monetary Economics, Elsevier, vol. 53(7), pages 1571-1591, October.
Araujo, Luis & Camargo, Braz, 2008. "Endogenous supply of fiat money," Journal of Economic Theory, Elsevier, vol. 142(1), pages 48-72, September.
Kung-Yu Chen & Chien-Tai Lin, 2005. "A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions," Statistical Papers, Springer, vol. 46(1), pages 129-140, January.
Benkert, Jean-Michel & Letina, Igor & Nöldeke, Georg, 2018. "Optimal search from multiple distributions with infinite horizon," Economics Letters, Elsevier, vol. 164(C), pages 15-18.
- Jean-Michel Benkert & Igor Letina & Georg Nöldeke, 2017. "Optimal search from multiple distributions with infinite horizon," ECON - Working Papers 262, Department of Economics - University of Zurich, revised Dec 2017.
Epstein, Gil S., 1996. "The extraction of natural resources from two sites under uncertainty," Economics Letters, Elsevier, vol. 51(3), pages 309-313, June.
Luis Araujo & Braz Camargo, 2005. "Monetary Equilibrium with Decentralized Trade and Learning," University of Western Ontario, Departmental Research Report Series 20051, University of Western Ontario, Department of Economics.
- Araujo, Luis Fernando Oliveira de & Camargo, Bráz Ministério de, 2010. "Monetary equilibrium with decentralized trade and learning," Textos para discussão 222, FGV EESP - Escola de Economia de São Paulo, Fundação Getulio Vargas (Brazil).
Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
- Jean Guillaume Forand, 2010. "Keeping Your Options Open," RCER Working Papers 557, University of Rochester - Center for Economic Research (RCER).
- Jean Guillaume Forand, 2011. "Keeping Your Options Open," 2011 Meeting Papers 82, Society for Economic Dynamics.
- Jean Guillaume Forand, 2013. "Keeping Your options Open," Working Papers 1301, University of Waterloo, Department of Economics, revised Feb 2015.
Camargo, Braz, 2014. "Learning in society," Games and Economic Behavior, Elsevier, vol. 87(C), pages 381-396.
- Braz Camargo, 2006. "Learning in Society," 2006 Meeting Papers 435, Society for Economic Dynamics.
Chien-Tai Lin & C. Shiau, 2000. "Some Optimal Strategies for Bandit Problems with Beta Prior Distributions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(2), pages 397-405, June.
Seok‐ju Cho, 2009. "Retrospective Voting and Political Representation," American Journal of Political Science, John Wiley & Sons, vol. 53(2), pages 276-291, April.
Joseph Deutsch & Gil S. Epstein, 1998. "Changing a Decision Taken under Uncertainty: The Case of the Criminal's Location Choice," Urban Studies, Urban Studies Journal Limited, vol. 35(8), pages 1335-1343, July.
Elena Pastorino, 2004. "Optimal Job Design and Career Dynamics in the Presence of Uncertainty," Econometric Society 2004 North American Summer Meetings 292, Econometric Society.
Roland Fryer & Philipp Harms, 2018. "Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability," Mathematics of Operations Research, INFORMS, vol. 43(2), pages 399-427, May.
Cripps, Martin W., 2013. "Optimal learning of a set: Or how to edit a journal if you must," Economics Letters, Elsevier, vol. 120(3), pages 384-388.
Bergemann, Dirk & Valimaki, Juuso, 1996. "Learning and Strategic Pricing," Econometrica, Econometric Society, vol. 64(5), pages 1125-1149, September.
- Dirk Bergemann & Juuso Valimaki, 1996. "Learning and Strategic Pricing," Cowles Foundation Discussion Papers 1113, Cowles Foundation for Research in Economics, Yale University.
Klimenko, Mikhail M., 2004. "Industrial targeting, experimentation and long-run specialization," Journal of Development Economics, Elsevier, vol. 73(1), pages 75-105, February.
Bergemann, Dirk & Valimaki, Juuso, 2001. "Stationary multi-choice bandit problems," Journal of Economic Dynamics and Control, Elsevier, vol. 25(10), pages 1585-1594, October.
- Dirk Bergemann & Juuso Vaimaki, 1999. "Stationary Multi Choice Bandit Problems," Cowles Foundation Discussion Papers 1240, Cowles Foundation for Research in Economics, Yale University.
Keller, Godfrey & Oldale, Alison, 2003. "Branching bandits: a sequential search process with correlated pay-offs," Journal of Economic Theory, Elsevier, vol. 113(2), pages 302-315, December.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ecm:emetrp:v:60:y:1992:i:5:p:1071-96. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/essssea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Denumerable-Armed Bandits

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

Citations

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data