A Simple Adaptive Procedure Leading to Correlated Equilibrium
We propose a new and simple adaptive procedure for playing a game: "regret-matching." In this procedure, players depart from their current play with probabilities that are proportional to measures of regret for not having used other strategies in the past. It is shown that our adaptive procedure guarantees that, with probability one, the empirical distributions of play converge to the set of correlated equilibria of the game. To compute these regret measures, a player needs to know his payoff function and the history of play. We also offer a variation where every player knows only his own realized payoff history (but not his payoff function).
|Date of creation:||24 Mar 1997|
|Date of revision:||24 Mar 1997|
|Note:||January 1997. Revised: October 1997. Paper + 3 figures (postscript). Also available at URL below|
|Contact details of provider:|| Web page: http://econwpa.repec.org|
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Drew Fudenberg & David K. Levine, 1998.
"Learning in Games,"
Levine's Working Paper Archive
2222, David K. Levine.
- Fudenberg, Drew & Levine, David K., 1999.
"Conditional Universal Consistency,"
Games and Economic Behavior,
Elsevier, vol. 29(1-2), pages 104-130, October.
- Sergiu Hart & Andreu Mas-Colell, 1999.
"A General Class of Adaptive Strategies,"
Game Theory and Information
9904001, EconWPA, revised 23 Mar 2000.
- Fudenberg, Drew & Levine, David, 1995.
"Consistency and Cautious Fictitious Play,"
3198694, Harvard University Department of Economics.
- Nimrod Megiddo, 1979. "On Repeated Games with Incomplete Information Played by Non-Bayesian Players," Discussion Papers 373, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Roger B. Myerson, 1995.
"Dual Reduction and Elementary Games,"
1133, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Nau, Robert F. & McCardle, Kevin F., 1990. "Coherent behavior in noncooperative games," Journal of Economic Theory, Elsevier, vol. 50(2), pages 424-444, April.
- R. Aumann, 2010.
"Correlated Equilibrium as an expression of Bayesian Rationality,"
513, UCLA Department of Economics.
- Aumann, Robert J, 1987. "Correlated Equilibrium as an Expression of Bayesian Rationality," Econometrica, Econometric Society, vol. 55(1), pages 1-18, January.
- Robert J. Aumann, 2010. "Correlated Equilibrium as an expression of Bayesian Rationality," Levine's Working Paper Archive 661465000000000377, David K. Levine.
- Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
- AUMANN, Robert J., .
"Subjectivity and correlation in randomized strategies,"
CORE Discussion Papers RP
-167, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Aumann, Robert J., 1974. "Subjectivity and correlation in randomized strategies," Journal of Mathematical Economics, Elsevier, vol. 1(1), pages 67-96, March.
- R. Aumann, 2010. "Subjectivity and Correlation in Randomized Strategies," Levine's Working Paper Archive 389, David K. Levine.
- Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
- Mertens, J.-F., 1986. "Repeated games," CORE Discussion Papers 1986024, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-81, September.
- Sanchirico, Chris William, 1996.
"A Probabilistic Model of Learning in Games,"
Econometric Society, vol. 64(6), pages 1375-93, November.
- Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
When requesting a correction, please mention this item's handle: RePEc:wpa:wuwpga:9703006. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (EconWPA)
If references are entirely missing, you can add them using this form.