This paper studies the class of denumerable-armed (i.e., finite- or countably infinite-armed) Bandit problems with independent arms and geometric discounting over an infinite horizon in which each arm generates rewards according to one of a finite number of distributions. The authors derive certain continuity and curvature properties of the Gittins Index, and provide necessary and sufficient conditions under which this index characterizes the optimal strategies. They then show that at each point in time the arm selected by an optimal strategy will, with positive probability, remain an optimal selection forever. Copyright 1992 by The Econometric Society.
If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.
As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.
Volume (Year): 60 (1992)
Issue (Month): 5 (September)
|Contact details of provider:|| Phone: 1 212 998 3820|
Fax: 1 212 995 4487
Web page: http://www.econometricsociety.org/
More information through EDIRC
|Order Information:|| Web: https://www.econometricsociety.org/publications/econometrica/access/ordering-back-issues Email: |