Dynamic learning in behavioral games: A hidden Markov mixture of experts approach

Dynamic learning in behavioral games: A hidden Markov mixture of experts approach

Author

Listed:

Asim Ansari
Ricardo Montoya
Oded Netzer

Registered:

Ricardo Montoya

Abstract

Over the course of a repeated game, players often exhibit learning in selecting their best response. Research in economics and marketing has identified two key types of learning rules: belief and reinforcement. It has been shown that players use either one of these learning rules or a combination of them, as in the Experience-Weighted Attraction (EWA) model. Accounting for such learning may help in understanding and predicting the outcomes of games. In this research, we demonstrate that players not only employ learning rules to determine what actions to choose based on past choices and outcomes, but also change their learning rules over the course of the game. We investigate the degree of state dependence in learning and uncover the latent learning rules and learning paths used by the players. We build a non-homogeneous hidden Markov mixture of experts model which captures shifts between different learning rules over the course of a repeated game. The transition between the learning rule states can be affected by the players’ experiences in the previous round of the game. We empirically validate our model using data from six games that have been previously used in the literature. We demonstrate that one can obtain a richer understanding of how different learning rules impact the observed strategy choices of players by accounting for the latent dynamics in the learning rules. In addition, we show that such an approach can improve our ability to predict observed choices in games. Copyright Springer Science+Business Media, LLC 2012

Suggested Citation

Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.

Handle: RePEc:kap:qmktec:v:10:y:2012:i:4:p:475-503
DOI: 10.1007/s11129-012-9125-8

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Oded Netzer & James M. Lattin & V. Srinivasan, 2008. "A Hidden Markov Model of Customer Relationship Dynamics," Marketing Science, INFORMS, vol. 27(2), pages 185-204, 03-04.
Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
Van Huyck, John B & Battalio, Raymond C & Beil, Richard O, 1990. "Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure," American Economic Review, American Economic Association, vol. 80(1), pages 234-248, March.
- John B Van Huyck & Raymond C Battalio & Richard O Beil, 1997. "Tacit coordination games, strategic uncertainty, and coordination failure," Levine's Working Paper Archive 1225, David K. Levine.
- J. B. Van Huyck & R. C. Battalio & R. O. Beil, 2010. "Tacit coordination games, strategic uncertainty, and coordination failure," Levine's Working Paper Archive 661465000000000393, David K. Levine.
Keane, Michael P, 1997. "Modeling Heterogeneity and State Dependence in Consumer Choice Behavior," Journal of Business & Economic Statistics, American Statistical Association, vol. 15(3), pages 310-327, July.
Crawford, Vincent P, 1995. "Adaptive Dynamics in Coordination Games," Econometrica, Econometric Society, vol. 63(1), pages 103-143, January.
- V. Crawford, 2010. "Adaptive Dynamics in Coordination Games," Levine's Working Paper Archive 404, David K. Levine.
Mookherjee, Dilip & Sopher, Barry, 1997. "Learning and Decision Costs in Experimental Constant Sum Games," Games and Economic Behavior, Elsevier, vol. 19(1), pages 97-132, April.
- Barry Sopher & Dilip Mookherjee, 1997. "Learning and Decision Costs in Experimental Constant Sum Games," Departmental Working Papers 199527, Rutgers University, Department of Economics.
- Barry Sopher & Dilip Mookherjee, 2000. "Learning and Decision Costs in Experimental Constant Sum Games," Departmental Working Papers 199625, Rutgers University, Department of Economics.
Teck H. Ho & Xin Wang & Colin F. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
Ricardo Montoya & Oded Netzer & Kamel Jedidi, 2010. "Dynamic Allocation of Pharmaceutical Detailing and Sampling for Long-Term Profitability," Marketing Science, INFORMS, vol. 29(5), pages 909-924, 09-10.
Ho, Teck-Hua & Camerer, Colin & Weigelt, Keith, 1998. "Iterated Dominance and Iterated Best Response in Experimental "p-Beauty Contests."," American Economic Review, American Economic Association, vol. 88(4), pages 947-969, September.
- Ho, Teck Hua & Weigelt, Keith & Camerer, Colin, 1996. "Iterated Dominance and Iterated Best-Response in Experimental P-Beauty Contests," Working Papers 974, California Institute of Technology, Division of the Humanities and Social Sciences.
Timothy C. Salmon, 2001. "An Evaluation of Econometric Models of Adaptive Learning," Econometrica, Econometric Society, vol. 69(6), pages 1597-1628, November.
Stahl, Dale O., 2001. "Population rule learning in symmetric normal-form games: theory and evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 45(1), pages 19-35, May.
Van Huyck, John B. & Cook, Joseph P. & Battalio, Raymond C., 1997. "Adaptive behavior and coordination failure," Journal of Economic Behavior & Organization, Elsevier, vol. 32(4), pages 483-503, April.
Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
Asim Ansari & Raghuram Iyengar, 2006. "Semiparametric Thurstonian Models for Recurrent Choices: A Bayesian Analysis," Psychometrika, Springer;The Psychometric Society, vol. 71(4), pages 631-657, December.
Dale O. Stahl, 1999. "Evidence based rules and learning in symmetric normal-form games," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 111-130.
Nathaniel T Wilcox, 2006. "Theories of Learning in Games and Heterogeneity Bias," Econometrica, Econometric Society, vol. 74(5), pages 1271-1292, September.
Rapoport, Amnon & Amaldoss, Wilfred, 2000. "Mixed strategies and iterative elimination of strongly dominated strategies: an experimental investigation of states of knowledge," Journal of Economic Behavior & Organization, Elsevier, vol. 42(4), pages 483-521, August.
Huck, Steffen & Normann, Hans-Theo & Oechssler, Jorg, 1999. "Learning in Cournot Oligopoly--An Experiment," Economic Journal, Royal Economic Society, vol. 109(454), pages 80-95, March.
- Steffen Huck & Hans-Theo Normann & Joerg Oechssler, 1997. "Learning in Cournot Oligopoly - An Experiment," Game Theory and Information 9707009, University Library of Munich, Germany, revised 22 Jul 1997.
TeckH. Ho & Xin Wang & ColinF. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Colin Camerer & Teck Ho & Kuan Chong, 2003. "Models of Thinking, Learning, and Teaching in Games," American Economic Review, American Economic Association, vol. 93(2), pages 192-195, May.
John Huyck & Raymond Battalio & Frederick Rankin, 2007. "Selection dynamics and adaptive behavior without much information," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 33(1), pages 53-65, October.
Selten, Reinhard, 1991. "Evolution, learning, and economic behavior," Games and Economic Behavior, Elsevier, vol. 3(1), pages 3-24, February.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Stahl, Dale O., 2000. "Rule Learning in Symmetric Normal-Form Games: Theory and Evidence," Games and Economic Behavior, Elsevier, vol. 32(1), pages 105-138, July.
- Dale O. Stahl, 1997. "Rule Learning in Symmetric Normal-Form Games: Theory and Evidence," CARE Working Papers 9710, The University of Texas at Austin, Center for Applied Research in Economics.
Howard Kunreuther & Gabriel Silvasi & Eric T. Bradlow & Dylan Small, 2009. "Bayesian analysis of deterministic and stochastic prisoner's dilemma games," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 4(5), pages 363-384, August.
repec:cup:judgdm:v:4:y:2009:i:5:p:363-384 is not listed on IDEAS
Timothy Salmon, 2004. "Evidence for Learning to Learn Behavior in Normal Form Games," Theory and Decision, Springer, vol. 56(4), pages 367-404, April.
Teck H Ho & Colin Camerer & Juin-Kuan Chong, 2003. "Functional EWA: A one-parameter theory of learning in games," Levine's Working Paper Archive 506439000000000514, David K. Levine.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Kappe, Eelco & Stadler Blank, Ashley & DeSarbo, Wayne S., 2018. "A random coefficients mixture hidden Markov model for marketing research," International Journal of Research in Marketing, Elsevier, vol. 35(3), pages 415-431.
Peter Ebbes & Oded Netzer, 2022. "Using Social Network Activity Data to Identify and Target Job Seekers," Management Science, INFORMS, vol. 68(4), pages 3026-3046, April.
Alina Ferecatu & Arnaud De Bruyn, 2022. "Understanding Managers’ Trade-Offs Between Exploration and Exploitation," Marketing Science, INFORMS, vol. 41(1), pages 139-165, January.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Camerer, Colin F. & Ho, Teck-Hua, 2015. "Behavioral Game Theory Experiments and Modeling," Handbook of Game Theory with Economic Applications,, Elsevier.
Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
Teck H Ho & Colin Camerer & Juin-Kuan Chong, 2003. "Functional EWA: A one-parameter theory of learning in games," Levine's Working Paper Archive 506439000000000514, David K. Levine.
Xie, Erhao, 2021. "Empirical properties and identification of adaptive learning models in behavioral game theory," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 798-821.
Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
Teck H. Ho & Xin Wang & Colin F. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
repec:wyi:journl:002151 is not listed on IDEAS
Shachat, Jason & Swarthout, J. Todd, 2012. "Learning about learning in games through experimental control of strategic interdependence," Journal of Economic Dynamics and Control, Elsevier, vol. 36(3), pages 383-402.
- Jason Shachat & J. Todd Swarthout, 2002. "Learning about Learning in Games through Experimental Control of Strategic Interdependence," Experimental Economics Center Working Paper Series 2006-17, Experimental Economics Center, Andrew Young School of Policy Studies, Georgia State University, revised Aug 2008.
- Jason Shachat & J. Todd Swarthout, 2013. "Learning about learning in games through experimental control of strategic interdependence," Working Papers 2013-10-14, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
- Jason Shachat & J. Todd Swarthout, 2003. "Learning about Learning in Games through Experimental Control of Strategic Interdependence," Experimental 0310003, University Library of Munich, Germany.
- Jason Shachat & J. Todd Swarthout, 2011. "Learning about learning in games through experimental control of strategic interdependence," Working Papers 1103, Xiamen Unversity, The Wang Yanan Institute for Studies in Economics, Finance and Economics Experimental Laboratory, revised 28 Apr 2011.
Haruvy, Ernan & Stahl, Dale O., 2012. "Between-game rule learning in dissimilar symmetric normal-form games," Games and Economic Behavior, Elsevier, vol. 74(1), pages 208-221.
Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
Teck-Hua Ho & So-Eun Park & Xuanming Su, 2021. "A Bayesian Level- k Model in n -Person Games," Management Science, INFORMS, vol. 67(3), pages 1622-1638, March.
Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
Andreas Nicklisch, 2011. "Learning strategic environments: an experimental study of strategy formation and transfer," Theory and Decision, Springer, vol. 71(4), pages 539-558, October.
Atanasios Mitropoulos, 2001. "Learning Under Little Information: An Experiment on Mutual Fate Control," Game Theory and Information 0110003, University Library of Munich, Germany.
Wen, Yuanji, 2018. "Voluntary information acquisition in an asymmetric-Information game:comparing learning theories in the laboratory," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 202-219.
Mohlin, Erik & Östling, Robert & Wang, Joseph Tao-yi, 2020. "Learning by similarity-weighted imitation in winner-takes-all games," Games and Economic Behavior, Elsevier, vol. 120(C), pages 225-245.
Battalio,R. & Samuelson,L. & Huyck,J. van, 1998. "Risk dominance, payoff dominance and probabilistic choice learning," Working papers 2, Wisconsin Madison - Social Systems.
- Raymond Battalio & Larry Samuelson & John Van Huyck, 2010. "Risk Dominance, Payoff Dominance and Probabilistic Choice Learning," Levine's Working Paper Archive 50, David K. Levine.
Hanaki, Nobuyuki & Sethi, Rajiv & Erev, Ido & Peterhansl, Alexander, 2005. "Learning strategies," Journal of Economic Behavior & Organization, Elsevier, vol. 56(4), pages 523-542, April.
- Nobuyuki Hanaki & Rajiv Sethi & Ido Erev & Alexander Peterhansl, 2002. "Learning Strategies," Game Theory and Information 0211004, University Library of Munich, Germany.
Rapoport, Amnon & Stein, William E. & Parco, James E. & Nicholas, Thomas E., 2003. "Equilibrium play and adaptive learning in a three-person centipede game," Games and Economic Behavior, Elsevier, vol. 43(2), pages 239-265, May.
repec:ehu:ikerla:9171 is not listed on IDEAS
Terracol, Antoine & Vaksmann, Jonathan, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 54-71, May.
- Antoine Terracol & Jonathan Vaksmann, 2007. "Dumbing down rational players: Learning and teaching in an experimental game," Documents de travail du Centre d'Economie de la Sorbonne bla07017, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
- Antoine Terracol & Jonathan Vaksmann, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-00607223, HAL.
- Antoine Terracol & Jonathan Vaksmann, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Post-Print hal-00607223, HAL.
- Antoine Terracol & Jonathan Vaksmann, 2007. "Dumbing down rational players: learning and teaching in an experimental game," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-00145436, HAL.
- Antoine Terracol & Jonathan Vaksmann, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," PSE-Ecole d'économie de Paris (Postprint) hal-00607223, HAL.
- Antoine Terracol & Jonathan Vaksmann, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Post-Print hal-00672292, HAL.
- Antoine Terracol & Jonathan Vaksmann, 2007. "Dumbing down rational players: learning and teaching in an experimental game," Post-Print halshs-00145436, HAL.
Bigoni, Maria & Fort, Margherita, 2013. "Information and learning in oligopoly: An experiment," Games and Economic Behavior, Elsevier, vol. 81(C), pages 192-214.
- Maria Bigoni, 2008. "Information and Learning in Oligopoly: an Experiment," "Marco Fanno" Working Papers 0072, Dipartimento di Scienze Economiche "Marco Fanno".
- M. Bigoni & M. Fort, 2013. "Information and Learning in Oligopoly: an Experiment," Working Papers wp860, Dipartimento Scienze Economiche, Universita' di Bologna.
- Bigoni, Maria & Fort, Margherita, 2013. "Information and Learning in Oligopoly: An Experiment," IZA Discussion Papers 7125, Institute of Labor Economics (IZA).

More about this item

Keywords

; ; ; ; ; ; ; ; ;

JEL classification:

D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:qmktec:v:10:y:2012:i:4:p:475-503. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Dynamic learning in behavioral games: A hidden Markov mixture of experts approach

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data