Multiagent Learning For Black Box System Reward Functions

My bibliography Save this article

Multiagent Learning For Black Box System Reward Functions

Author

Listed:

KAGAN TUMER
(Oregon State University, 204 Rogers Hall, Corvallis, Oregon 97331, USA)
ADRIAN AGOGINO
(UCSC, NASA Ames Research Center, Mailstop 269-3, Moffett Field, California 94035, USA)

Registered:

Abstract

In large, distributed systems composed of adaptive and interactive components (agents), ensuring the coordination among the agents so that the system achieves certain performance objectives is a challenging proposition. The key difficulty to overcome in such systems is one of credit assignment: How to apportion credit (or blame) to a particular agent based on the performance of the entire system. In this paper, we show how this problem can be solved in general for a large class of reward functions whose analytical form may be unknown (hence "black box" reward). This method combines the salient features of global solutions (e.g. "team games") which are broadly applicable but provide poor solutions in large problems with those of local solutions (e.g. "difference rewards") which learn quickly, but can be computationally burdensome. We introduce two estimates for local rewards for a class of problems where the mapping from the agent actions to system reward functions can be decomposed into a linear combination of nonlinear functions of the agents' actions. We test our method's performance on a distributed marketing problem and an air traffic flow management problem and show a 44% performance improvement over team games and a speedup of ordernfor difference rewards (for annagent system).

Suggested Citation

Kagan Tumer & Adrian Agogino, 2009. "Multiagent Learning For Black Box System Reward Functions," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 12(04n05), pages 475-492.

Handle: RePEc:wsi:acsxxx:v:12:y:2009:i:04n05:n:s0219525909002295
DOI: 10.1142/S0219525909002295

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Johnson, N.F. & Jarvis, S. & Jonson, R. & Cheung, P. & Kwong, Y.R. & Hui, P.M., 1998. "Volatility and agent adaptability in a self-organizing market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 258(1), pages 230-236.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Thorsten Chmura & Thomas Pitz, 2007. "An Extended Reinforcement Algorithm for Estimation of Human Behaviour in Experimental Congestion Games," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 10(2), pages 1-1.
Chmura, Thorsten & Pitz, Thomas, 2004. "Minority Game: Experiments and Simulations of Traffic Scenarios," Bonn Econ Discussion Papers 23/2004, University of Bonn, Bonn Graduate School of Economics (BGSE).
Matteo Marsili & Damien Challet, 2001. "Trading Behavior And Excess Volatility In Toy Markets," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 4(01), pages 3-17.
- M. Marsili & D. Challet, 2000. "Trading behavior and excess volatility in toy markets," Papers cond-mat/0004376, arXiv.org, revised Jun 2000.
Li-Xin Zhong & Wen-Juan Xu & Fei Ren & Yong-Dong Shi, 2012. "Coupled effects of market impact and asymmetric sensitivity in financial markets," Papers 1209.3399, arXiv.org, revised Jan 2013.
Willemien Kets, 2007. "The minority game: An economics perspective," Papers 0706.4432, arXiv.org.
- Kets, W., 2007. "The Minority Game : An Economics Perspective," Discussion Paper 2007-53, Tilburg University, Center for Economic Research.
- Kets, W., 2007. "The Minority Game : An Economics Perspective," Other publications TiSEM 65d52a6a-b27d-45a9-93a7-e, Tilburg University, School of Economics and Management.
Zhong, Li-Xin & Xu, Wen-Juan & Ren, Fei & Shi, Yong-Dong, 2013. "Coupled effects of market impact and asymmetric sensitivity in financial markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(9), pages 2139-2149.
Marsili, Matteo & Challet, Damien & Zecchina, Riccardo, 2000. "Exact solution of a modified El Farol's bar problem: Efficiency and the role of market impact," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 280(3), pages 522-553.
Li-Xin Zhong & Wen-Juan Xu & Ping Huang & Chen-Yang Zhong & Tian Qiu, 2013. "Self-organization and phase transition in financial markets with multiple choices," Papers 1312.0690, arXiv.org, revised Jun 2014.
Zhong, Li-Xin & Xu, Wen-Juan & Chen, Rong-Da & Zhong, Chen-Yang & Qiu, Tian & Ren, Fei & He, Yun-Xing, 2018. "Self-reinforcing feedback loop in financial markets with coupling of market impact and momentum traders," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 493(C), pages 301-310.
Mansilla, R, 2000. "From naive to sophisticated behavior in multiagents-based financial market models," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 284(1), pages 478-488.
Epstein, Daniel & Bazzan, Ana L.C., 2013. "The value of less connected agents in Boolean networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(21), pages 5387-5398.
Aki-Hiro Sato & Hideki Takayasu, 2001. "Derivation of ARCH(1) process from market price changes based on deterministic microscopic multi-agent," Papers cond-mat/0104313, arXiv.org.

More about this item

Keywords

Multiagent learning; black box reward functions; multiagent coordination;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:acsxxx:v:12:y:2009:i:04n05:n:s0219525909002295. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/acs/acs.shtml .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Multiagent Learning For Black Box System Reward Functions

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data