This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

A Theoretical Analysis of Cooperative Behavior in Multi-Agent Q-learning

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Waltman, L.R.
Kaymak, U. (Erasmus Research Institute of Management (ERIM), RSM Erasmus University)

Additional information is available for the following registered author(s):

Abstract

A number of experimental studies have investigated whether cooperative behavior may emerge in multi-agent Q-learning. In some studies cooperative behavior did emerge, in others it did not. This report provides a theoretical analysis of this issue. The analysis focuses on multi-agent Q-learning in iterated prisoner’s dilemmas. It is shown that under certain assumptions cooperative behavior may emerge when multi-agent Q-learning is applied in an iterated prisoner’s dilemma. An important consequence of the analysis is that multi-agent Q-learning may result in non-Nash behavior. It is found experimentally that the theoretical results derived in this report are quite robust to violations of the underlying assumptions.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://hdl.handle.net/1765/7323
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam. in its series Research Paper with number ERS-2006-006-LIS Revision_Date: 2009-07-29.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 01 Feb 2006
Date of revision:
Handle: RePEc:dgr:eureri:30007962

Contact details of provider:
Web page: http://www.erim.eur.nl/

For technical questions regarding this item, or to correct its listing, contact: (ERIM Series Handler at the ERIM Office).

Related research
Keywords: Prisoner’s Dilemma; Cooperation; Nash Equilibrium; Multi-Agent Reinforcement Learning; Multi-Agent Q-Learning;

This paper has been announced in the following NEP Reports:

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
  1. Kandori Michihiro & Rob Rafael, 1995. "Evolution of Equilibria in the Long Run: A General Theory and Applications," Journal of Economic Theory, Elsevier, vol. 65(2), pages 383-414, April. [Downloadable!] (restricted)
  2. Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January. [Downloadable!] (restricted)
Full references

Statistics
Access and download statistics

Did you know? A tutorial is available.

This page was last updated on 2009-12-2.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.