Bandit Problems

Author

Listed:

Dirk Bergemann
(Department of Economics, Yale University)
Juuso Valimaki
(Department of Economics, Helsinki School of Economics and University of Southampton)

Abstract

We survey the literature on multi-armed bandit models and their applications in economics. The multi-armed bandit problem is a statistical decision model of an agent trying to optimize his decisions while improving his information at the same time. This classic problem has received much attention in economics as it concisely models the trade-off between exploration (trying out each arm to find the best one) and exploitation (playing the arm believed to give the best payoff).
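The exploration/exploitation trade-off described in the abstract can be illustrated with a small simulation. The sketch below uses an epsilon-greedy rule on Bernoulli arms. This is a simple textbook heuristic chosen here for illustration, not a method taken from the paper, and the function name, parameters, and payoff probabilities are all assumptions.

```python
import random


def epsilon_greedy(true_means, n_rounds=5000, epsilon=0.1, seed=0):
    """Play Bernoulli arms with an epsilon-greedy rule (illustrative sketch).

    true_means: hypothetical success probability of each arm (unknown to the agent).
    Returns (pull counts per arm, estimated mean per arm, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was played
    estimates = [0.0] * n_arms   # running average payoff of each arm
    total_reward = 0

    for _ in range(n_rounds):
        if rng.random() < epsilon:
            # Explore: try an arm at random to improve information.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: play the arm currently believed to be best
            # (ties go to the lowest-numbered arm).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = 1 if rng.random() < true_means[arm] else 0
        counts[arm] += 1
        # Incremental update of the sample mean for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


if __name__ == "__main__":
    counts, estimates, total = epsilon_greedy([0.2, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(e, 3) for e in estimates])
    print("total reward:", total)
```

Setting epsilon to 0 turns the rule into pure exploitation: if the first arm never pays off and the second always does, the agent still never leaves the first arm, because it never gathers the information that would change its belief. That failure is the reason some exploration is valuable.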

Suggested Citation

Dirk Bergemann & Juuso Valimaki, 2006. "Bandit Problems," Cowles Foundation Discussion Papers 1551, Cowles Foundation for Research in Economics, Yale University.

Handle: RePEc:cwl:cwldpp:1551
Note: CFP 1292

Citations

Cited by:

Ufuk Akcigit & Qingmin Liu, 2011. "The Role of Information in Competitive Experimentation," Levine's Working Paper Archive 786969000000000321, David K. Levine.
- Ufuk Akcigit & Qingmin Liu, 2011. "The Role of Information in Competitive Experimentation," PIER Working Paper Archive 11-038, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
- Ufuk Akcigit & Qingmin Liu, 2011. "The Role of Information in Competitive Experimentation," NBER Working Papers 17602, National Bureau of Economic Research, Inc.
Patrick Warren & Tom Wilkening, 2010. "Regulatory Fog: The Informational Origins of Regulatory Persistence," Department of Economics - Working Papers Series 1113, The University of Melbourne.
Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
Lori Beaman & Raghabendra Chattopadhyay & Esther Duflo & Rohini Pande & Petia Topalova, 2009. "Powerful Women: Does Exposure Reduce Bias?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(4), pages 1497-1540.
- Pande, Rohini & Duflo, Esther & Beaman, Lori & Topalova, Petia & Chattopadhyay, Raghabendra, 2008. "Powerful Women: Does Exposure Reduce Bias?," CEPR Discussion Papers 6922, C.E.P.R. Discussion Papers.
- Lori A. Beaman & Raghabendra Chattopadhyay & Esther Duflo & Rohini Pande & Petia Topalova, 2008. "Powerful Women: Does Exposure Reduce Bias?," NBER Working Papers 14198, National Bureau of Economic Research, Inc.
- Lori Beaman, 2008. "Powerful Women: Does Exposure Reduce Bias?," Working Papers id:1617, eSocialSciences.
- Raghabendra Chattopadhyay & Esther Duflo & Rohini Pande & Petia Topalova, 2008. "Powerful Women: Does Exposure Reduce Bias?," CID Working Papers 175, Center for International Development at Harvard University.
- Beaman, Lori & Chattopadhyay, Raghabendra & Duflo, Esther & Pande, Rohini & Topalova, Petia, 2008. "Powerful Women: Does Exposure Reduce Bias?," Working Paper Series rwp08-037, Harvard University, John F. Kennedy School of Government.
Eitan Altman, 2007. "Comments on: Dynamic priority allocation via restless bandit marginal productivity indices," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 15(2), pages 202-207, December.
Warren, Patrick L. & Wilkening, Tom S., 2012. "Regulatory fog: The role of information in regulatory persistence," Journal of Economic Behavior & Organization, Elsevier, vol. 84(3), pages 840-856.
Nicolas Klein & Sven Rady, 2011. "Negatively Correlated Bandits," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(2), pages 693-732.
- Rady, Sven & Klein, Nicolas, 2008. "Negatively Correlated Bandits," CEPR Discussion Papers 6983, C.E.P.R. Discussion Papers.
- Nicolas Klein & Sven Rady, 2008. "Negatively Correlated Bandits," Working Papers 040, Bavarian Graduate Program in Economics (BGPE).
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Papers in Economics 5332, University of Munich, Department of Economics.
- Sven Rady & Nicolas Klein, 2008. "Negatively Correlated Bandits," 2008 Meeting Papers 136, Society for Economic Dynamics.
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 243, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
Keller, Godfrey & Rady, Sven, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
- Sven Rady & Godfrey Keller, 2007. "Strategic Experimentation with Poisson Bandits," 2007 Meeting Papers 332, Society for Economic Dynamics.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 260, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Papers in Economics 10575, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2009. "Strategic Experimentation with Poisson Bandits," CEPR Discussion Papers 7270, C.E.P.R. Discussion Papers.
Piermont, Evan & Takeoka, Norio & Teper, Roee, 2016. "Learning the Krepsian state: Exploration through consumption," Games and Economic Behavior, Elsevier, vol. 100(C), pages 69-94.
Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
- Dinah Rosenberg & Antoine Salomon & Nicolas Vieille, 2010. "On Games of Strategic Experimentation," Working Papers hal-00579613, HAL.
- Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On Games of Strategic Experimentation," HEC Research Papers Series 1008, HEC Paris.
Ramana Nanda & Matthew Rhodes-Kropf, 2012. "Innovation Policies," Harvard Business School Working Papers 13-038, Harvard Business School, revised Mar 2017.
Berndt, Ernst R. & Gibbons, Robert S. & Kolotilin, Anton & Taub, Anna Levine, 2015. "The heterogeneity of concentrated prescribing behavior: Theory and evidence from antipsychotics," Journal of Health Economics, Elsevier, vol. 40(C), pages 26-39.
Cripps, Martin W., 2013. "Optimal learning of a set: Or how to edit a journal if you must," Economics Letters, Elsevier, vol. 120(3), pages 384-388.
Deb, Rahul, 2008. "Optimal Contracting Of New Experience Goods," MPRA Paper 9880, University Library of Munich, Germany.
Springborn, Michael R., 2014. "Risk aversion and adaptive management: Insights from a multi-armed bandit model of invasive species risk," Journal of Environmental Economics and Management, Elsevier, vol. 68(2), pages 226-242.
Roee Teper, 2016. "Learning the Krepsian State: Exploration Through Consumption," Working Paper 5860, Department of Economics, University of Pittsburgh.

More about this item

Keywords

One-Armed Bandit; Multi-Armed Bandit; Bayesian Learning; Experimentation; Index Policy; Matching; Experience Goods;

JEL classification:

C72 - Mathematical and Quantitative Methods - Game Theory and Bargaining Theory - Noncooperative Games
C73 - Mathematical and Quantitative Methods - Game Theory and Bargaining Theory - Stochastic and Dynamic Games; Evolutionary Games
D43 - Microeconomics - Market Structure, Pricing, and Design - Oligopoly and Other Forms of Market Imperfection
D83 - Microeconomics - Information, Knowledge, and Uncertainty - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
