Why Imitate, and if so, How? A Bounded Rational Approach to Multi-Armed Bandits

Author

Karl H. Schlag

Abstract

We consider the situation in which individuals in a finite population must repeatedly choose an action yielding an uncertain payoff. Between choices, each individual may observe the performance of one other individual. We search for rules of behavior with limited memory that increase expected payoffs for any underlying payoff distribution. It is shown that the rule that outperforms all other rules with this property is the one that specifies imitation of the action of an individual that performed better with a probability proportional to how much better she performed. When each individual uses this best rule, the aggregate population behavior can be approximated by the replicator dynamic.
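The rule described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper. It assumes payoffs lie in a known bounded interval `[low, high]`, so that the payoff difference can be scaled into a probability, and it uses two hypothetical actions, `"A"` and `"B"`, with Bernoulli payoffs. The function names, population size, and success probabilities are illustrative assumptions only.

```python
import random


def proportional_imitation(own_action, own_payoff, other_action, other_payoff,
                           low, high, rng=random):
    """Return the next action under a proportional imitation rule.

    Assumption: payoffs are bounded in [low, high], so the scaled payoff
    difference is a valid probability.
    """
    if other_payoff <= own_payoff:
        # Never imitate an individual who did not perform better.
        return own_action
    # Switch with probability proportional to how much better the other did.
    switch_prob = (other_payoff - own_payoff) / (high - low)
    return other_action if rng.random() < switch_prob else own_action


def simulate(n=200, rounds=300, seed=0):
    """Simulate a finite population and return the final share playing "B".

    Hypothetical setup: action "A" pays 1 with probability 0.4 and
    action "B" pays 1 with probability 0.6; otherwise the payoff is 0.
    """
    rng = random.Random(seed)
    success_prob = {"A": 0.4, "B": 0.6}
    actions = [rng.choice("AB") for _ in range(n)]
    for _ in range(rounds):
        # Each individual draws an uncertain payoff from the chosen action.
        payoffs = [1.0 if rng.random() < success_prob[a] else 0.0
                   for a in actions]
        next_actions = []
        for i in range(n):
            # Observe the action and payoff of one other individual.
            j = rng.randrange(n - 1)
            if j >= i:
                j += 1
            next_actions.append(
                proportional_imitation(actions[i], payoffs[i],
                                       actions[j], payoffs[j],
                                       0.0, 1.0, rng))
        actions = next_actions
    return actions.count("B") / n


if __name__ == "__main__":
    print("final share playing B:", simulate())
```

Each individual here remembers only the current action and payoff, which is one simple reading of the limited-memory requirement. Running `simulate` with different seeds gives a rough feel for how the share of each action evolves in a finite population.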

Suggested Citation

Karl H. Schlag, "undated". "Why Imitate, and if so, How? A Bounded Rational Approach to Multi-Armed Bandits," ELSE working papers 028, ESRC Centre on Economics Learning and Social Evolution.

Handle: RePEc:els:esrcls:028

Other versions of this item:

Schlag, Karl H., 1998. "Why Imitate, and If So, How?: A Boundedly Rational Approach to Multi-armed Bandits," Journal of Economic Theory, Elsevier, vol. 78(1), pages 130-156, January.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Weibull, Jörgen W., 1997. "What have we learned from Evolutionary Game Theory so far?," Working Paper Series 487, Research Institute of Industrial Economics, revised 26 Oct 1998.
Jörg Oechssler & Karl H Schlag, 1997. "Loss of Commitment? An Evolutionary Analysis of Bagwell’s Example," Levine's Working Paper Archive 598, David K. Levine.
- Oechssler, Jörg & Schlag, Karl H., 1997. "Loss of commitment? An evolutionary analysis of Bagwell's example," SFB 373 Discussion Papers 1997,39, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.
Squintani, Francesco & Valimaki, Juuso, 2002. "Imitation and Experimentation in Changing Contests," Journal of Economic Theory, Elsevier, vol. 104(2), pages 376-404, June.
Cressman, R., 1997. "Local stability of smooth selection dynamics for normal form games," Mathematical Social Sciences, Elsevier, vol. 34(1), pages 1-19, August.
Schlag, Karl H., 1999. "Which one should I imitate?," Journal of Mathematical Economics, Elsevier, vol. 31(4), pages 493-522, May.
Jorg Oechssler & Karl Schlag, 1997. "An Evolutionary Analysis of Bagwell's Example," Game Theory and Information 9704001, University Library of Munich, Germany, revised 11 Apr 1997.
Viossat, Yannick, 2008. "Evolutionary dynamics may eliminate all strategies used in correlated equilibrium," Mathematical Social Sciences, Elsevier, vol. 56(1), pages 27-43, July.
- Viossat, Yannick, 2006. "Evolutionary dynamics may eliminate all strategies used in correlated equilibrium," SSE/EFI Working Paper Series in Economics and Finance 629, Stockholm School of Economics, revised 21 Jun 2006.
- Yannick Viossat, 2008. "Evolutionary Dynamics May Eliminate All Strategies Used in Correlated Equilibria," Post-Print hal-00360756, HAL.
Demichelis, Stefano & Ritzberger, Klaus, 2003. "From evolutionary to strategic stability," Journal of Economic Theory, Elsevier, vol. 113(1), pages 51-75, November.
- DEMICHELIS, Stefano & RITZBERGER, Klaus, 2000. "From evolutionary to strategic stability," LIDAM Discussion Papers CORE 2000059, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Antonio Cabrales & Giovanni Ponti, 2000. "Implementation, Elimination of Weakly Dominated Strategies and Evolutionary Dynamics," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 3(2), pages 247-282, April.
Cressman, R. & Schlag, K. H., 1998. "The Dynamic (In)Stability of Backwards Induction," Journal of Economic Theory, Elsevier, vol. 83(2), pages 260-285, December.
- R. Cressman & K.H. Schlag, "undated". "The Dynamic (In)Stability of Backwards Induction," ELSE working papers 027, ESRC Centre on Economics Learning and Social Evolution.
Ken Binmore & Larry Samuelson, "undated". "Evolutionary Drift and Equilibrium Selection," ELSE working papers 011, ESRC Centre on Economics Learning and Social Evolution.
Hopkins, Ed, 1999. "Learning, Matching, and Aggregation," Games and Economic Behavior, Elsevier, vol. 26(1), pages 79-110, January.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," Discussion Papers 1996-2, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, 1995. "Learning, Matching and Aggregation," Game Theory and Information 9512001, University Library of Munich, Germany.
- Ed Hopkins, 1995. "Learning, Matching and Aggregation," Edinburgh School of Economics Discussion Paper Series 2, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," ELSE working papers 033, ESRC Centre on Economics Learning and Social Evolution.
- Ed Hopkins, "undated". "Learning, Matching and Aggregation," Department of Economics 1996 : II, Edinburgh School of Economics, University of Edinburgh.
- Hopkins, E., 1995. "Learning, Matching and Aggregation," G.R.E.Q.A.M. 95a20, Universite Aix-Marseille III.
Dawid, Herbert, 1999. "On the stability of monotone discrete selection dynamics with inertia," Mathematical Social Sciences, Elsevier, vol. 37(3), pages 265-280, May.
Sandholm, William H., 2001. "Potential Games with Continuous Player Sets," Journal of Economic Theory, Elsevier, vol. 97(1), pages 81-108, March.
- Sandholm,W.H., 1999. "Potential games with continuous player sets," Working papers 23, Wisconsin Madison - Social Systems.
Sandholm, William H., 2005. "Excess payoff dynamics and other well-behaved evolutionary dynamics," Journal of Economic Theory, Elsevier, vol. 124(2), pages 149-170, October.
Gerard van der Laan & A.F. Tieman, 1996. "Evolutionary Game Theory and the Modelling of Economic Behavior," Tinbergen Institute Discussion Papers 96-172/8, Tinbergen Institute.
Sobel, Joel, 2000. "Economists' Models of Learning," Journal of Economic Theory, Elsevier, vol. 94(2), pages 241-261, October.
Nick Feltovich & John Duffy, 1999. "Does observation of others affect learning in strategic environments? An experimental study," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 131-152.
- John Duffy & Nick Feltovich, 1997. "Does Observation of Others Affect Learning in Strategic Environments? An Experimental Study," Levine's Working Paper Archive 592, David K. Levine.
Bill Sandholm, 2003. "Excess Payoff Dynamics, Potential Dynamics, and Stable Games," Theory workshop papers 505798000000000042, UCLA Department of Economics.
- Sandholm,W.H., 2003. "Excess payoff dynamics, potential dynamics, and stable games," Working papers 5, Wisconsin Madison - Social Systems.
Huck, Steffen & Oechssler, Jorg, 1999. "The Indirect Evolutionary Approach to Explaining Fair Allocations," Games and Economic Behavior, Elsevier, vol. 28(1), pages 13-24, July.
- Steffen Huck & Joerg Oechssler, 1995. "The Indirect Evolutionary Approach to Explaining Fair Allocations," Game Theory and Information 9507001, University Library of Munich, Germany, revised 27 Aug 1998.
- Huck, S. & Oechssler, J., 1996. "The Indirect Evolutionary Approach To Explaining Fair Allocations," SFB 373 Discussion Papers 1996,13, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.

More about this item

JEL classification:

C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
C79 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Other
