Convergence results on stochastic adaptive learning
Author
Abstract
Suggested Citation
DOI: 10.1007/s00199-018-1150-8
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Jehiel, Philippe & Samet, Dov, 2005.
"Learning to play games in extensive form by valuation,"
Journal of Economic Theory, Elsevier, vol. 124(2), pages 129-148, October.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000010, David K. Levine.
- Philippe Jehiel & Dov Samet, 2010. "Learning to play games in extensive form by valuation," Levine's Working Paper Archive 391749000000000040, David K. Levine.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," NajEcon Working Paper Reviews 391749000000000010, www.najecon.org.
- Philippe Jehiel & Dov Samet, 2010. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000034, David K. Levine.
- Philippe Jehiel & Dov Samet, 2005. "Learning to play games in extensive form by valuation," Post-Print halshs-00754057, HAL.
- Philippe Jehiel & Dov Samet, 2001. "Learning to play games in extensive form by valuation," Game Theory and Information 0012001, University Library of Munich, Germany.
- Fudenberg Drew & Kreps David M., 1993.
"Learning Mixed Equilibria,"
Games and Economic Behavior, Elsevier, vol. 5(3), pages 320-367, July.
- Fudenberg, D. & Kreps, D.M., 1992. "Learning Mixed Equilibria," Working papers 92-13, Massachusetts Institute of Technology (MIT), Department of Economics.
- Drew Fudenberg & David Kreps, 2010. "Learning Mixed Equilibria," Levine's Working Paper Archive 415, David K. Levine.
- Sarin, Rajiv & Vahid, Farshid, 2001.
"Predicting How People Play Games: A Simple Dynamic Model of Choice,"
Games and Economic Behavior, Elsevier, vol. 34(1), pages 104-122, January.
- Sarin, R. & Vahid, F., 1999. "Predicting how People Play Games: a Simple Dynamic Model of Choice," Monash Econometrics and Business Statistics Working Papers 12/99, Monash University, Department of Econometrics and Business Statistics.
- Nick Feltovich & John Duffy, 1999.
"Does observation of others affect learning in strategic environments? An experimental study,"
International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 131-152.
- John Duffy & Nick Feltovich, 1997. "Does Observation of Others Affect Learning in Strategic Environments? An Experimental Study," Levine's Working Paper Archive 592, David K. Levine.
- Chen, Yan & Khoroshilov, Yuri, 2003. "Learning under limited information," Games and Economic Behavior, Elsevier, vol. 44(1), pages 1-25, July.
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
- Laslier, Jean-Francois & Topol, Richard & Walliser, Bernard, 2001.
"A Behavioral Learning Process in Games,"
Games and Economic Behavior, Elsevier, vol. 37(2), pages 340-366, November.
- Laslier, J.-F. & Topol, R. & Walliser, B., 1999. "A Behavioral Learning Process in Games," Papers 99-03, Paris X - Nanterre, U.F.R. de Sc. Ec. Gest. Maths Infor..
- J.-F. Laslier & R. Topol & B. Walliser, 1999. "A behavioral learning process in games," THEMA Working Papers 99-03, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
- Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
- Wu, Hang & Bayer, Ralph-C, 2015.
"Learning from inferred foregone payoffs,"
Journal of Economic Dynamics and Control, Elsevier, vol. 51(C), pages 445-458.
- Ralph-C. Bayer & Hang Wu, 2013. "Learning from Inferred Foregone Payoffs," School of Economics and Public Policy Working Papers 2013-22, University of Adelaide, School of Economics and Public Policy.
- Ed Hopkins, 2002.
"Two Competing Models of How People Learn in Games,"
Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Hopkins, Ed & Posch, Martin, 2005.
"Attainability of boundary points under reinforcement learning,"
Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Edinburgh School of Economics Discussion Paper Series 79, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Levine's Working Paper Archive 506439000000000350, David K. Levine.
- Rustichini, Aldo, 1999. "Optimal Properties of Stimulus--Response Learning Models," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 244-273, October.
- Fudenberg, Drew & Takahashi, Satoru, 2011.
"Heterogeneous beliefs and local information in stochastic fictitious play,"
Games and Economic Behavior, Elsevier, vol. 71(1), pages 100-120, January.
- Drew Fudenberg & Satoru Takahashi, 2008. "Heterogeneous Beliefs and Local Information in Stochastic Fictitious Play," Levine's Working Paper Archive 122247000000001695, David K. Levine.
- Takahashi, Satoru & Fudenberg, Drew, 2011. "Heterogeneous beliefs and local information in stochastic fictitious play," Scholarly Articles 27755310, Harvard University Department of Economics.
- Beggs, A.W., 2005.
"On the convergence of reinforcement learning,"
Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
- Timothy G. Conley & Christopher R. Udry, 2010.
"Learning about a New Technology: Pineapple in Ghana,"
American Economic Review, American Economic Association, vol. 100(1), pages 35-69, March.
- Timothy G. Conley & Christopher R. Udry, 2005. "Learning about a new technology: pineapple in Ghana," Proceedings, Federal Reserve Bank of San Francisco.
- Conley, Timothy G. & Udry, Christopher R., 2000. "Learning About a New Technology: Pineapple In Ghana," Center Discussion Papers 28400, Yale University, Economic Growth Center.
- Conley, T.G. & Udry, C.R., 2000. "Learning about a New Technology: Pineapple in Ghana," Papers 817, Yale - Economic Growth Center.
- Timothy G. Conley & Christopher R. Udry, 2000. "Learning About a New Technology: Pineapple in Ghana," Working Papers 817, Economic Growth Center, Yale University, revised May 2004.
- Sergiu Hart & Andreu Mas-Colell, 2013.
"A Simple Adaptive Procedure Leading To Correlated Equilibrium,"
World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46,
World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- McKelvey Richard D. & Palfrey Thomas R., 1995.
"Quantal Response Equilibria for Normal Form Games,"
Games and Economic Behavior, Elsevier, vol. 10(1), pages 6-38, July.
- McKelvey, Richard D. & Palfrey, Thomas R., 1994. "Quantal Response Equilibria For Normal Form Games," Working Papers 883, California Institute of Technology, Division of the Humanities and Social Sciences.
- R. McKelvey & T. Palfrey, 2010. "Quantal Response Equilibria for Normal Form Games," Levine's Working Paper Archive 510, David K. Levine.
- Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
- Hofbauer, Josef & Hopkins, Ed, 2005.
"Learning in perturbed asymmetric games,"
Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
- Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
- Ianni, Antonella, 2014.
"Learning strict Nash equilibria through reinforcement,"
Journal of Mathematical Economics, Elsevier, vol. 50(C), pages 148-155.
- Ianni, Antonella, 2011. "Learning Strict Nash Equilibria through Reinforcement," MPRA Paper 33936, University Library of Munich, Germany.
- Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
- Benaim, Michel & Hirsch, Morris W., 1999. "Mixed Equilibria and Dynamical Systems Arising from Fictitious Play in Perturbed Games," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 36-72, October.
- Brit Grosskopf & Ido Erev & Eldad Yechiam, 2006. "Foregone with the Wind: Indirect Payoff Information and its Implications for Choice," International Journal of Game Theory, Springer;Game Theory Society, vol. 34(2), pages 285-302, August.
- Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Sawa, Ryoji, 2021. "A prospect theory Nash bargaining solution and its stochastic stability," Journal of Economic Behavior & Organization, Elsevier, vol. 184(C), pages 692-711.
- Pablo S. Castro & Ajit Desai & Han Du & Rodney Garratt & Francisco Rivadeneyra, 2021. "Estimating Policy Functions in Payments Systems Using Reinforcement Learning," Staff Working Papers 21-7, Bank of Canada.
- Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
- Ed Hopkins, 2002.
"Two Competing Models of How People Learn in Games,"
Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
- Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
- Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009.
"Learning in games with unstable equilibria,"
Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
- Funai Naoki, 2014. "An Adaptive Learning Model with Foregone Payoff Information," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 14(1), pages 149-176, January.
- Hopkins, Ed & Posch, Martin, 2005.
"Attainability of boundary points under reinforcement learning,"
Games and Economic Behavior, Elsevier, vol. 53(1), pages 110-125, October.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Edinburgh School of Economics Discussion Paper Series 79, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins & Martin Posch, 2003. "Attainability of Boundary Points under Reinforcement Learning," Levine's Working Paper Archive 506439000000000350, David K. Levine.
- Naoki Funai, 2013. "An Adaptive Learning Model in Coordination Games," Games, MDPI, vol. 4(4), pages 1-22, November.
- Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
- Hofbauer, Josef & Hopkins, Ed, 2005.
"Learning in perturbed asymmetric games,"
Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
- Josef Hofbauer & Ed Hopkins, 2000. "Learning in Perturbed Asymmetric Games," Edinburgh School of Economics Discussion Paper Series 53, Edinburgh School of Economics, University of Edinburgh.
- Oyarzun, Carlos & Sarin, Rajiv, 2013.
"Learning and risk aversion,"
Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
- Duffy, John & Hopkins, Ed, 2005.
"Learning, information, and sorting in market entry games: theory and evidence,"
Games and Economic Behavior, Elsevier, vol. 51(1), pages 31-62, April.
- John Duffy & Ed Hopkins, 2001. "Learning, Information and Sorting in Market Entry Games: Theory and Evidence," Edinburgh School of Economics Discussion Paper Series 78, Edinburgh School of Economics, University of Edinburgh.
- John Duffy & Ed Hopkins, 2010. "Learning, Information and Sorting in Market Entry Games: Theory and Evidence," Levine's Working Paper Archive 506439000000000355, David K. Levine.
- Schuster, Stephan, 2012. "Applications in Agent-Based Computational Economics," MPRA Paper 47201, University Library of Munich, Germany.
- Schuster, Stephan, 2010. "Network Formation with Adaptive Agents," MPRA Paper 27388, University Library of Munich, Germany.
- Jakub Bielawski & Thiparat Chotibut & Fryderyk Falniowski & Michal Misiurewicz & Georgios Piliouras, 2022. "Unpredictable dynamics in congestion games: memory loss can prevent chaos," Papers 2201.10992, arXiv.org, revised Jan 2022.
- Ianni, Antonella, 2014.
"Learning strict Nash equilibria through reinforcement,"
Journal of Mathematical Economics, Elsevier, vol. 50(C), pages 148-155.
- Ianni, Antonella, 2011. "Learning Strict Nash Equilibria through Reinforcement," MPRA Paper 33936, University Library of Munich, Germany.
- Kets, W., 2008. "Networks and learning in game theory," Other publications TiSEM 7713fce1-3131-498c-8c6f-3, Tilburg University, School of Economics and Management.
- Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed, 2010.
"Testing the TASP: An experimental investigation of learning in games with unstable equilibria,"
Journal of Economic Theory, Elsevier, vol. 145(6), pages 2309-2331, November.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 188, Edinburgh School of Economics, University of Edinburgh.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed H, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Santa Cruz Department of Economics, Working Paper Series qt8kp6c049, Department of Economics, UC Santa Cruz.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2010. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Purdue University Economics Working Papers 1233, Purdue University, Department of Economics.
- Cason, Timothy N. & Friedman, Daniel UC & Hopkins, Ed, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," SIRE Discussion Papers 2009-15, Scottish Institute for Research in Economics (SIRE).
- Fudenberg, Drew & Takahashi, Satoru, 2011.
"Heterogeneous beliefs and local information in stochastic fictitious play,"
Games and Economic Behavior, Elsevier, vol. 71(1), pages 100-120, January.
- Drew Fudenberg & Satoru Takahashi, 2008. "Heterogeneous Beliefs and Local Information in Stochastic Fictitious Play," Levine's Working Paper Archive 122247000000001695, David K. Levine.
- Takahashi, Satoru & Fudenberg, Drew, 2011. "Heterogeneous beliefs and local information in stochastic fictitious play," Scholarly Articles 27755310, Harvard University Department of Economics.
- Beggs, A.W., 2005.
"On the convergence of reinforcement learning,"
Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
More about this item
Keywords
Adaptive learning; Normal form games; Asynchronous stochastic approximation; Quantal response equilibrium;All these keywords.
JEL classification:
- C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
- D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joecth:v:68:y:2019:i:4:d:10.1007_s00199-018-1150-8. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.