Learning to play games in extensive form by valuation

My bibliography Save this paper

Learning to play games in extensive form by valuation

Author

Listed:

Philippe Jehiel
Dov Samet

Registered:

Abstract

A valuation for a board game is an assignment of numeric values to different states of the board. The valuation reflects the desirability of the states for the player. It can be used by a player to decide on her next move during the play. We assume a myopic player, who chooses a move with the highest valuation. Valuations can also be revised, and hopefully improved, after each play of the game. Here, a very simple valuation revision is considered, in which the states of the board visited in a play are assigned the payoff obtained in the play. We show that by adopting such a learning process a player who has a winning strategy in a win-lose game can almost surely guarantee a win in a repeated game. When a player has more than two payoffs, a more elaborate learning procedure is required. We consider one that associates with each state the average payoff in the rounds in which this node was reached. When all players adopt this learning procedure, with some perturbations, then, with probability 1, strategies that are close to subgame perfect equilibrium are played after some time. A single player who adopts this procedure can guarantee only her individually rational payoff.

Suggested Citation

Philippe Jehiel & Dov Samet, 2001. "Learning to play games in extensive form by valuation," Game Theory and Information 0012001, University Library of Munich, Germany.

Handle: RePEc:wpa:wuwpga:0012001
Note: Type of Document - ; pages: 18

Download full text from publisher

Other versions of this item:

Jehiel, Philippe & Samet, Dov, 2005. "Learning to play games in extensive form by valuation," Journal of Economic Theory, Elsevier, vol. 124(2), pages 129-148, October.

Philippe Jehiel & Dov Samet, 2010. "Learning to play games in extensive form by valuation," Levine's Working Paper Archive 391749000000000040, David K. Levine.
Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," NajEcon Working Paper Reviews 391749000000000010, www.najecon.org.
Philippe Jehiel & Dov Samet, 2010. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000034, David K. Levine.
Philippe Jehiel & Dov Samet, 2005. "Learning to play games in extensive form by valuation," Post-Print halshs-00754057, HAL.
Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000010, David K. Levine.

References listed on IDEAS

Karandikar, Rajeeva & Mookherjee, Dilip & Ray, Debraj & Vega-Redondo, Fernando, 1998. "Evolving Aspirations and Cooperation," Journal of Economic Theory, Elsevier, vol. 80(2), pages 292-331, June.
- Debraj Ray & Dilip Mookherjee & Fernando Vega Redondo & Rajeeva L. Karandikar, 1996. "Evolving aspirations and cooperation," Working Papers. Serie AD 1996-06, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Itzhak Gilboa & David Schmeidler, 1995. "Case-Based Decision Theory," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 110(3), pages 605-639.
- Itzhak Gilboa & David Schmeidler, 1992. "Case-Based Decision Theory," Discussion Papers 994, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Itzhak Gilboa & David Schmeidler, 1995. "Case-Based Decision Theory," Post-Print hal-00753144, HAL.
, & ,, 2007. "Valuation equilibrium," Theoretical Economics, Econometric Society, vol. 2(2), June.
- Philippe Jehiel & Dov Samet, 2003. "Valuation Equilibria," Game Theory and Information 0310003, University Library of Munich, Germany.
- Philippe Jehiel & Dov Samet, 2007. "Valuation Equilibrium," Post-Print halshs-00754229, HAL.
- Philippe Jehiel & Dov Samet, 2006. "Valuation Equilibria," Levine's Bibliography 784828000000000111, UCLA Department of Economics.
- Philippe Jehiel & Dov Samet, 2003. "Valuation Equilibria," Levine's Bibliography 666156000000000046, UCLA Department of Economics.
- Philippe Jehiel & Dov Samet, 2007. "Valuation Equilibrium," PSE-Ecole d'économie de Paris (Postprint) halshs-00754229, HAL.
Hendon, Ebbe & Jacobsen, Hans Jorgen & Sloth, Birgitte, 1996. "Fictitious Play in Extensive Form Games," Games and Economic Behavior, Elsevier, vol. 15(2), pages 177-202, August.
- Ebbe Hendon & Hans Jørgen Jacobsen & Birgitte Sloth, "undated". "Fictitious Play in Extensive Form Games," Discussion Papers 94-06, University of Copenhagen. Department of Economics.
Fudenberg, Drew & Levine, David K, 1993. "Self-Confirming Equilibrium," Econometrica, Econometric Society, vol. 61(3), pages 523-545, May.
- Fudenberg, D. & Levine, D.K., 1991. "Self-Confirming Equilibrium ," Working papers 581, Massachusetts Institute of Technology (MIT), Department of Economics.
- Drew Fudenberg & David K. Levine, 1993. "Self-Confirming Equilibrium," Levine's Working Paper Archive 2147, David K. Levine.
Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
- Fudenberg, Drew & Levine, David, 1995. "Consistency and Cautious Fictitious Play," Scholarly Articles 3198694, Harvard University Department of Economics.
- Drew Fudenberg & David K. Levine, 1996. "Consistency and Cautious Fictitious Play," Levine's Working Paper Archive 470, David K. Levine.
Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
Ross Cressman, 2003. "Evolutionary Dynamics and Extensive Form Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262033054, December.
Noldeke Georg & Samuelson Larry, 1993. "An Evolutionary Analysis of Backward and Forward Induction," Games and Economic Behavior, Elsevier, vol. 5(3), pages 425-454, July.
- G. Noldeke & L. Samuelson, 2010. "An Evolutionary Analysis of Backward and Forward Induction," Levine's Working Paper Archive 538, David K. Levine.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Cho, In-Koo & Matsui, Akihiko, 2005. "Learning aspiration in repeated games," Journal of Economic Theory, Elsevier, vol. 124(2), pages 171-201, October.
Hart, Sergiu, 2002. "Evolutionary dynamics and backward induction," Games and Economic Behavior, Elsevier, vol. 41(2), pages 227-264, November.
- Sergiu Hart, 1999. "Evolutionary Dynamics and Backward Induction," Game Theory and Information 9905002, University Library of Munich, Germany, revised 23 Mar 2000.
Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

, & ,, 2007. "Valuation equilibrium," Theoretical Economics, Econometric Society, vol. 2(2), June.
- Philippe Jehiel & Dov Samet, 2003. "Valuation Equilibria," Game Theory and Information 0310003, University Library of Munich, Germany.
- Philippe Jehiel & Dov Samet, 2007. "Valuation Equilibrium," Post-Print halshs-00754229, HAL.
- Philippe Jehiel & Dov Samet, 2006. "Valuation Equilibria," Levine's Bibliography 784828000000000111, UCLA Department of Economics.
- Philippe Jehiel & Dov Samet, 2003. "Valuation Equilibria," Levine's Bibliography 666156000000000046, UCLA Department of Economics.
- Philippe Jehiel & Dov Samet, 2007. "Valuation Equilibrium," PSE-Ecole d'économie de Paris (Postprint) halshs-00754229, HAL.
Ran Spiegler, 2016. "Bayesian Networks and Boundedly Rational Expectations," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(3), pages 1243-1290.
- Spiegler, Ran, 2014. "Bayesian Networks and Boundedly Rational Expectations," CEPR Discussion Papers 10062, C.E.P.R. Discussion Papers.
- Ran Spiegler, 2014. "Bayesian Networks and Boundedly Rational Expectations," Discussion Papers 1417, Centre for Macroeconomics (CFM).
- Spiegler, Ran, 2014. "Bayesian Networks and Boundedly Rational Expectations," Foerder Institute for Economic Research Working Papers 275828, Tel-Aviv University > Foerder Institute for Economic Research.
- Spiegler, Ran, 2014. "Bayesian networks and boundedly rational expectations," LSE Research Online Documents on Economics 57994, London School of Economics and Political Science, LSE Library.
Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
- Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," PSE Working Papers halshs-03735680, HAL.
Drew Fudenberg & Kevin He, 2018. "Learning and Type Compatibility in Signaling Games," Econometrica, Econometric Society, vol. 86(4), pages 1215-1255, July.
- Drew Fudenberg & Kevin He, 2017. "Learning and Type Compatibility in Signaling Games," Papers 1702.01819, arXiv.org, revised Jun 2018.
Lambson, Val & van den Berghe, John, 2015. "Skill, complexity, and strategic interaction," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 516-530.
Jehiel, Philippe & Singh, Juni, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Games and Economic Behavior, Elsevier, vol. 130(C), pages 1-24.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE Working Papers halshs-02183444, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Post-Print halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE-Ecole d'économie de Paris (Postprint) halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Working Papers halshs-02183444, HAL.
Florian Herold, 2012. "Carrot or Stick? The Evolution of Reciprocal Preferences in a Haystack Model," American Economic Review, American Economic Association, vol. 102(2), pages 914-940, April.
Drew Fudenberg & David K. Levine, 2006. "Superstition and Rational Learning," American Economic Review, American Economic Association, vol. 96(3), pages 630-651, June.
- Drew Fudenberg & David K Levine, 2005. "Superstition and Rational Learning," Levine's Working Paper Archive 618897000000000731, David K. Levine.
- Levine, David & Fudenberg, Drew, 2006. "Superstition and Rational Learning," Scholarly Articles 3196330, Harvard University Department of Economics.
- Drew Fudenberg & David K. Levine, 2006. "Superstition and Rational Learning," Harvard Institute of Economic Research Working Papers 2114, Harvard - Institute of Economic Research.
Naoki Funai, 2019. "Convergence results on stochastic adaptive learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 68(4), pages 907-934, November.
Drew Fudenberg & David K Levine, 2006. "An Economists Perspective on Multi-Agent Learning," Levine's Working Paper Archive 784828000000000683, David K. Levine.
- Fudenberg, Drew & Levine, David, 2007. "An Economist's Perspective on Multi-Agent Learning," Scholarly Articles 3200613, Harvard University Department of Economics.
Wichardt, Philipp C., 2012. "Existence of valuation equilibria when equilibrium strategies cannot differentiate between equal ties," Games and Economic Behavior, Elsevier, vol. 74(2), pages 709-713.
Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
Wichardt, Philipp C., 2010. "Modelling equilibrium play as governed by analogy and limited foresight," Games and Economic Behavior, Elsevier, vol. 70(2), pages 472-487, November.
Yoav Shoham & Rob Powers & Trond Grenager, 2006. "If multi-agent learning is the answer, what is the question?," Levine's Working Paper Archive 122247000000001156, David K. Levine.
Norman, Thomas W.L., 2023. "Pigouvian algorithmic platform design," Journal of Economic Behavior & Organization, Elsevier, vol. 212(C), pages 322-332.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Tassos Patokos, 2014. "Introducing Disappointment Dynamics and Comparing Behaviors in Evolutionary Games: Some Simulation Results," Games, MDPI, vol. 5(1), pages 1-25, January.
Block, Juan I. & Fudenberg, Drew & Levine, David K., 2019. "Learning dynamics with social comparisons and limited memory," Theoretical Economics, Econometric Society, vol. 14(1), January.
Dean P Foster & Peyton Young, 2006. "Regret Testing Leads to Nash Equilibrium," Levine's Working Paper Archive 784828000000000676, David K. Levine.
Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
Xie, Erhao, 2021. "Empirical properties and identification of adaptive learning models in behavioral game theory," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 798-821.
Beggs, A.W., 2005. "On the convergence of reinforcement learning," Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
Jehiel, Philippe & Singh, Juni, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Games and Economic Behavior, Elsevier, vol. 130(C), pages 1-24.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE Working Papers halshs-02183444, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Post-Print halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE-Ecole d'économie de Paris (Postprint) halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Working Papers halshs-02183444, HAL.
Daskalova, Vessela & Vriend, Nicolaas J., 2021. "Learning frames," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 78-96.
- Vessela Daskalova & Nicolaas J. Vriend, 2021. "Learning Frames," Working Papers 202118, School of Economics, University College Dublin.
- Vessela Daskalova & Nicolaas J.Vriend, 2021. "Learning frames," Working Papers 929, Queen Mary University of London, School of Economics and Finance.
Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.
Mario Bravo & Mathieu Faure, 2013. "Reinforcement Learning with Restrictions on the Action Set," AMSE Working Papers 1335, Aix-Marseille School of Economics, France, revised 01 Jul 2013.
- Mario Bravo & Mathieu Faure, 2015. "Reinforcement Learning with Restrictions on the Action Set," Post-Print hal-01457301, HAL.
Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
- Fabrizio Germano & Gábor Lugosi, 2004. "Global Nash convergence of Foster and Young's regret testing," Economics Working Papers 788, Department of Economics and Business, Universitat Pompeu Fabra.
Juan I Block & Drew Fudenberg & David K Levine, 2017. "Learning Dynamics Based on Social Comparisons," Levine's Working Paper Archive 786969000000001375, David K. Levine.
Mario Bravo, 2016. "An Adjusted Payoff-Based Procedure for Normal Form Games," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1469-1483, November.
Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
Yoav Shoham & Rob Powers & Trond Grenager, 2006. "If multi-agent learning is the answer, what is the question?," Levine's Working Paper Archive 122247000000001156, David K. Levine.
Georgios Chasparis & Jeff Shamma, 2012. "Distributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation," Dynamic Games and Applications, Springer, vol. 2(1), pages 18-50, March.
Dridi, Slimane & Lehmann, Laurent, 2014. "On learning dynamics underlying the evolution of learning rules," Theoretical Population Biology, Elsevier, vol. 91(C), pages 20-36.

More about this item

Keywords

reinforcement learning;

JEL classification:

C7 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory
D8 - Microeconomics - - Information, Knowledge, and Uncertainty

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wpa:wuwpga:0012001. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: EconWPA (email available below). General contact details of provider: https://econwpa.ub.uni-muenchen.de .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Learning to play games in extensive form by valuation

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data