Strategic Experimentation with Exponential Bandits

My bibliography Save this paper

Strategic Experimentation with Exponential Bandits

Author

Listed:

Rady, Sven
Cripps, Martin William
Keller, R Godfrey

Registered:

Abstract

This Paper studies a game of strategic experimentation with two-armed bandits whose risky arm might yield a pay-off only after some exponentially distributed random time. Because of free-riding, there is an inefficiently low level of experimentation in any equilibrium where the players use stationary Markovian strategies with posterior beliefs as the state variable. After characterizing the unique symmetric Markovian equilibrium of the game, which is in mixed strategies, we construct a variety of pure-strategy equilibria. There is no equilibrium where all players use simple cut-off strategies. Equilibria where players switch finitely often between the roles of experimenter and free-rider all lead to the same pattern of information acquisition; the efficiency of these equilibria depends on the way players share the burden of experimentation among them. In equilibria where players switch roles infinitely often, they can acquire an approximately efficient amount of information, but the rate at which it is acquired still remains inefficient; moreover, the expected pay-off of an experimenter exhibits the novel feature that it rises as players become more pessimistic. Finally, over the range of beliefs where players use both arms a positive fraction of the time, the symmetric equilibrium is dominated by any asymmetric one in terms of aggregate pay-offs.

Suggested Citation

Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.

Handle: RePEc:cpr:ceprdp:3814

Download full text from publisher

As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

Other versions of this item:

Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.

Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
Godfrey Keller & Martin Cripps & Olin School of Business & Washington University & Sven Rady & Department of Economics & University of Munich, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.

References listed on IDEAS

Leslie M. Marx & Steven A. Matthews, 2000. "Dynamic Voluntary Contribution to a Public Project," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 67(2), pages 327-358.
- Leslie M. Marx & Steven A. Matthews, "undated". "Dynamic Voluntary Contribution to a Public Project," Penn CARESS Working Papers 6f8dbf67d492ff8a10975496b, Penn Economics Department.
- Leslie M. Marx & Steven A. Matthews, "undated". ""Dynamic Voluntary Contribution to a Public Project''," CARESS Working Papres 99-01, University of Pennsylvania Center for Analytic Research and Economics in the Social Sciences.
- Leslie M. Marx & Steven A. Matthews, 1997. "Dynamic Voluntary Contribution to a Public Project," Discussion Papers 1188, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
David A. Malueg & Shunichi O. Tsutsui, 1997. "Dynamic R&D Competition with Learning," RAND Journal of Economics, The RAND Corporation, vol. 28(4), pages 751-772, Winter.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Dirk Bergemann & Ulrigh Hege, 2005. "The Financing of Innovation: Learning and Stopping," RAND Journal of Economics, The RAND Corporation, vol. 36(4), pages 719-752, Winter.
- Dirk Bergemann & Ulrich Hege, 2001. "The Financing of Innovation: Learning and Stopping," Cowles Foundation Discussion Papers 1292, Cowles Foundation for Research in Economics, Yale University.
- Bergemann, D. & Hege, U., 2001. "The Financing of Innovation : Learning and Stopping," Other publications TiSEM 85bb8c47-af02-4c41-88b4-0, Tilburg University, School of Economics and Management.
- Ulrich Hege & Dirk Bergemann, 2005. "The Financing of Innovation: Learning and Stopping," Post-Print hal-00459926, HAL.
- Hege, Ulrich & Bergemann, Dirk, 2001. "The Financing of Innovation: Learning and Stopping," CEPR Discussion Papers 2763, C.E.P.R. Discussion Papers.
- Dirk Bergemann & Ulrich Hege, 2001. "The Financing of Innovation: Learning and Stopping," Cowles Foundation Discussion Papers 1292R, Cowles Foundation for Research in Economics, Yale University, revised Oct 2004.
- Bergemann, D. & Hege, U., 2001. "The Financing of Innovation : Learning and Stopping," Discussion Paper 2001-16, Tilburg University, Center for Economic Research.
- Ulrich Hege & D. Bergemann, 2012. "The Financing of Innovation: Learning and Stopping," Working Papers hal-00759793, HAL.
Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
Ben Lockwood & Jonathan P. Thomas, 2002. "Gradualism and Irreversibility," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 69(2), pages 339-356.
- Ben Lockwood & Jonathan P. Thomas, 1999. "Gradualism and Irreversibility," CSGR Working papers series 28/99, Centre for the Study of Globalisation and Regionalisation (CSGR), University of Warwick.
- Lockwood, Ben & Thomas, Jonathan P., 1999. "Gradualism and Irreversibility," Economic Research Papers 269301, University of Warwick - Department of Economics.
- Lockwood, B. & Thomas, J.P., 1999. "Gradualism and Irreversibility," The Warwick Economics Research Paper Series (TWERPS) 550, University of Warwick, Department of Economics.
- Lockwood, Ben & Thomas, Jonathan, 1999. "Gradualism And Irreversibility," Economic Research Papers 269245, University of Warwick - Department of Economics.
- Lockwood, B. & Thomas, J.P., 1999. "Gradualism and Irreversibility," The Warwick Economics Research Paper Series (TWERPS) 525, University of Warwick, Department of Economics.
Anat R. Admati & Motty Perry, 1991. "Joint Projects without Commitment," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(2), pages 259-276.
Bergemann, Dirk & Hege, Ulrich, 1998. "Venture capital financing, moral hazard, and learning," Journal of Banking & Finance, Elsevier, vol. 22(6-8), pages 703-735, August.
- Bergemann, Dirk & Hege, Ulrich, 1997. "Venture Capital Financing, Moral Hazard and Learning," CEPR Discussion Papers 1738, C.E.P.R. Discussion Papers.
- Ulrich Hege & Dirk Bergemann, 1998. "Venture capital financing, moral hazard, and learning," Post-Print hal-00481696, HAL.
- Bergemann, D. & Hege, U., 1997. "Venture Capital Financing, Moral Hazard and Learning," Other publications TiSEM d70119dd-1d85-4dde-9d59-1, Tilburg University, School of Economics and Management.
Christopher Harris, 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 0044, Boston University - Industry Studies Programme.
- Harris, C., 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 44, Boston University - Industry Studies Programme.
Keller, Godfrey & Rady, Sven, 2003. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," RAND Journal of Economics, The RAND Corporation, vol. 34(1), pages 138-165, Spring.
- Keller, Godfrey & Rady, Sven, 2001. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," Discussion Papers in Economics 21, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2001. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," CEPR Discussion Papers 2919, C.E.P.R. Discussion Papers.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Martin Cripps & Godfrey Keller & Sven Rady, 2000. "Strategic Experimentation: The Case of the Poisson Bandits," Econometric Society World Congress 2000 Contributed Papers 0878, Econometric Society.
- Martin W. Cripps & Godfrey Keller & Sven Rady, 2002. "Strategic Experimentation: The Case of Poisson Bandits," CESifo Working Paper Series 737, CESifo.
Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, C.E.P.R. Discussion Papers.
Chen, Yi, 2020. "A revision game of experimentation on a common threshold," Journal of Economic Theory, Elsevier, vol. 186(C).
Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R, Cowles Foundation for Research in Economics, Yale University, revised Feb 2012.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R3, Cowles Foundation for Research in Economics, Yale University, revised Jun 2013.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726, Cowles Foundation for Research in Economics, Yale University.
- Johannes Horner & Larry Samuelson, 2012. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000418, David K. Levine.
- Johannes Horner & Larry Samuelson, 2013. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000671, David K. Levine.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R2, Cowles Foundation for Research in Economics, Yale University, revised Mar 2013.
Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
Alessandro Bonatti & Johannes Horner, 2011. "Collaborating," American Economic Review, American Economic Association, vol. 101(2), pages 632-663, April.
- Johannes Horner & Alessandro Bonatti, 2009. "Collaborating," 2009 Meeting Papers 1019, Society for Economic Dynamics.
- Alessandro Bonatti & Johannes Horner, 2009. "Collaborating," Cowles Foundation Discussion Papers 1695, Cowles Foundation for Research in Economics, Yale University, revised Nov 2009.
Külpmann, Philipp, 2015. "Procrastination and projects," Center for Mathematical Economics Working Papers 544, Center for Mathematical Economics, Bielefeld University.
Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
- Chia-Hui Chen & Junichiro Ishida, 2015. "Hierarchical Experimentation," ISER Discussion Paper 0949, Institute of Social and Economic Research, Osaka University.
Matros, Alexander & Smirnov, Vladimir, 2011. "Treasure game," Working Papers 2011-10, University of Sydney, School of Economics, revised May 2014.
, & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
- Sven Rady & Godfrey Keller, 2007. "Strategic Experimentation with Poisson Bandits," 2007 Meeting Papers 332, Society for Economic Dynamics.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 260, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Papers in Economics 10575, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2009. "Strategic Experimentation with Poisson Bandits," CEPR Discussion Papers 7270, C.E.P.R. Discussion Papers.
Nicolas Klein & Tymofiy Mylovanov, 2011. "Should the Flatterers be Avoided?," 2011 Meeting Papers 1273, Society for Economic Dynamics.
Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," Working Papers 2021-34, Princeton University. Economics Department..
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," NBER Working Papers 29008, National Bureau of Economic Research, Inc.
- Doruk Cetemen & Can Urgun & Leeat Yariv, 2021. "Collective Progress: Dynamics of Exit Waves," Papers 2107.00406, arXiv.org.
- Yariv, Leeat & Cetemen, Doruk & Urgun, Can, 2021. "Collective Progress: Dynamics of Exit Waves," CEPR Discussion Papers 16341, C.E.P.R. Discussion Papers.
Besanko, David & Tong, Jian & Wu, Jianjun, 2016. "Subsidizing research programs with "if" and "when" uncertainty in the face of severe informational constraints," Discussion Paper Series In Economics And Econometrics 1605, Economics Division, School of Social Sciences, University of Southampton.
Matros, Alexander & Smirnov, Vladimir, 2016. "Duplicative search," Games and Economic Behavior, Elsevier, vol. 99(C), pages 1-22.
- Matros, Alexander & Smirnov, Vladimir, 2016. "Duplicative Search," Working Papers 2016-02, University of Sydney, School of Economics.
May Elsayyad & Florian Morath, 2016. "Technology Transfers For Climate Change," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 57(3), pages 1057-1084, August.
- May Elsayyad & Florian Morath, 2013. "Technology Transfers for Climate Change," CESifo Working Paper Series 4521, CESifo.
- Morath, Florian & Elsayyad, May, 2014. "Technology transfers for climate change," VfS Annual Conference 2014 (Hamburg): Evidence-based Economic Policy 100396, Verein für Socialpolitik / German Economic Association.
Aghamolla, Cyrus & Hashimoto, Tadashi, 2020. "Information arrival, delay, and clustering in financial markets with dynamic freeriding," Journal of Financial Economics, Elsevier, vol. 138(1), pages 27-52.
Georgiadis, George, 2017. "Deadlines and infrequent monitoring in the dynamic provision of public goods," Journal of Public Economics, Elsevier, vol. 152(C), pages 1-12.
Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
- Fahad Khalil & Jacques Lawarree & Alexander Rodivilov, 2018. "Learning from Failures: Optimal Contract for Experimentation and Production," CESifo Working Paper Series 7310, CESifo.
Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
Keller, Godfrey & Rady, Sven, 2020. "Undiscounted bandit games," Games and Economic Behavior, Elsevier, vol. 124(C), pages 43-61.
- Keller, Godfrey & Rady, Sven, 2015. "Undiscounted Bandit Games," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 520, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Economics Series Working Papers 882, University of Oxford, Department of Economics.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2019_130, University of Bonn and University of Mannheim, Germany.
- Rady, Sven & Keller, R Godfrey, 2019. "Undiscounted Bandit Games," CEPR Discussion Papers 14046, C.E.P.R. Discussion Papers.
- Godfrey Keller & Sven Rady, 2020. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2020_130v2, University of Bonn and University of Mannheim, Germany.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Papers 1909.13323, arXiv.org, revised Aug 2020.

More about this item

Keywords

Strategic experimentation; Two-armed bandits; Exponential distribution; Bayesian learning; Markov perfect equilibrium; Public goods;
All these keywords.

JEL classification:

C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
H41 - Public Economics - - Publicly Provided Goods - - - Public Goods
O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D

NEP fields

This paper has been announced in the following NEP Reports:

NEP-GTH-2003-07-13 (Game Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:3814. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Strategic Experimentation with Exponential Bandits

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

Keywords

JEL classification:

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data