Single-leader-multiple-follower games with boundedly rational agents
This paper studies a class of hierarchical games called single-leader-multiple-follower games (SLMFGs) that have important applications in economics and engineering. We consider such games in the context of boundedly rational agents that are limited in the information and computational power they may possess. Agents in our SLMFG are modeled as adaptive learners that use simple reinforcement learning schemes to learn their optimal behavior. The proposed learning approach is illustrated using a well-studied problem in economics. It is shown that, with a patiently learning leader, repeated play of the game results in approximate equilibrium outcomes.
Handle: RePEc:eee:dyncon:v:33:y:2009:i:8:p:1593-1603