Reinforcement learning and stochastic fictitious play are apparent rivals as models of human learning. They embody quite different assumptions about the processing of information and optimisation. This paper compares their properties and finds that they are far more similar than was thought. In particular, exponential fictitious play and a suitably perturbed reinforcement model have the same expected motion and therefore will have the same asymptotic behaviour. It is also shown that more general models of stochastic fictitious play and perturbed reinforcement learning have identical local stability properties. The main identifiable difference between the two models is speed: stochastic fictitious play gives rise to faster learning.
Download Info
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below under "Related research" whether another version of this item is available online.
2. Check on the provider's web page
whether it is in fact available.
3. Perform a search for a similarly titled item that would be
available.
Publisher Info
Paper provided by Edinburgh School of Economics, University of Edinburgh in its series ESE Discussion Papers with number
42.
Find related papers by JEL classification: C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search, Learning, and Information