Understanding Managers’ Trade-Offs Between Exploration and Exploitation

My bibliography Save this article

Understanding Managers’ Trade-Offs Between Exploration and Exploitation

Author

Listed:

Alina Ferecatu
(Department of Marketing Management, Rotterdam School of Management, Erasmus University, 3062 PA Rotterdam, Netherlands)
Arnaud De Bruyn
(Department of Marketing, ESSEC Business School, 95000 Cergy, France)

Registered:

Abstract

Managers frequently explore new strategies, and exploit familiar ones, when making decisions on new product development, pricing, or advertising. Exploring for too long, or exploiting too soon, will generate inferior financial returns. Our research describes decision makers’ exploration/exploitation trade-offs and their link to psychometric traits. We conduct an incentive-aligned study in which subjects play a multiarmed bandit experiment and evaluate how subjects balance exploration and exploitation, linked to psychometric traits. To formally describe exploration/exploitation trade-offs, we develop a behavioral model that captures latent dynamics in learning behavior. Subjects transition between three unobserved states—exploration, exploitation, and inertia—updating their beliefs about expected payoffs. Our analysis suggests that decision makers overexplore low-performing options, forgoing over 30% of potential revenue. They heavily rely on recent experiences. Risk-averse decision makers spend more time exploring. Maximizers are more sensitive to payoffs than satisficers. Our research builds the groundwork needed to devise remedial actions aimed at helping managers find an optimal balance between exploration and exploitation. One way to achieve this goal is by carefully designing the learning environment. In two additional studies, we analyze the evolution of exploration/exploitation trade-offs across different learning environments. Offering decision makers repeated opportunities to learn and increasing the planning horizon appears beneficial.

Suggested Citation

Alina Ferecatu & Arnaud De Bruyn, 2022. "Understanding Managers’ Trade-Offs Between Exploration and Exploitation," Marketing Science, INFORMS, vol. 41(1), pages 139-165, January.

Handle: RePEc:inm:ormksc:v:41:y:2022:i:1:p:139-165
DOI: 10.1287/mksc.2021.1304

Download full text from publisher

References listed on IDEAS

Nathaniel D. Daw & John P. O'Doherty & Peter Dayan & Ben Seymour & Raymond J. Dolan, 2006. "Cortical substrates for exploratory decisions in humans," Nature, Nature, vol. 441(7095), pages 876-879, June.
Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
Colin F. Camerer & Teck-Hua Ho & Juin-Kuan Chong, 2004. "A Cognitive Hierarchy Model of Games," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(3), pages 861-898.
Paul S. Adler & Barbara Goldoftas & David I. Levine, 1999. "Flexibility Versus Efficiency? A Case Study of Model Changeovers in the Toyota Production System," Organization Science, INFORMS, vol. 10(1), pages 43-68, February.
Tversky, Amos & Kahneman, Daniel, 1992. "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty, Springer, vol. 5(4), pages 297-323, October.
repec:cup:judgdm:v:3:y:2008:i::p:371-388 is not listed on IDEAS
Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
Kanishka Misra & Eric M. Schwartz & Jacob Abernethy, 2019. "Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments," Marketing Science, INFORMS, vol. 38(2), pages 226-252, March.
Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
James G. March, 1991. "Exploration and Exploitation in Organizational Learning," Organization Science, INFORMS, vol. 2(1), pages 71-87, February.
Rapoport, Amnon & Amaldoss, Wilfred, 2000. "Mixed strategies and iterative elimination of strongly dominated strategies: an experimental investigation of states of knowledge," Journal of Economic Behavior & Organization, Elsevier, vol. 42(4), pages 483-521, August.
Eric M. Schwartz & Eric T. Bradlow & Peter S. Fader, 2017. "Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments," Marketing Science, INFORMS, vol. 36(4), pages 500-522, July.
Avi Goldfarb & Teck-Hua Ho & Wilfred Amaldoss & Alexander Brown & Yan Chen & Tony Cui & Alberto Galasso & Tanjim Hossain & Ming Hsu & Noah Lim & Mo Xiao & Botao Yang, 2012. "Behavioral models of managerial decision-making," Marketing Letters, Springer, vol. 23(2), pages 405-421, June.
Eva Ascarza & Bruce G. S. Hardie, 2013. "A Joint Model of Usage and Churn in Contractual Settings," Marketing Science, INFORMS, vol. 32(4), pages 570-590, July.
Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.
Steven L. Scott, 2010. "A modern Bayesian look at the multi‐armed bandit," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 26(6), pages 639-658, November.
Avi Goldfarb & Mo Xiao, 2011. "Who Thinks about the Competition? Managerial Ability and Strategic Entry in US Local Telephone Markets," American Economic Review, American Economic Association, vol. 101(7), pages 3130-3161, December.
- Avi Goldfarb & Mo Xiao, 2008. "Who thinks about the competition? Managerial ability and strategic entry in US local telephone markets," Working Papers 08-21, NET Institute, revised Oct 2008.
Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.
Eva Ascarza & Oded Netzer & Bruce G. S. Hardie, 2018. "Some Customers Would Rather Leave Without Saying Goodbye," Marketing Science, INFORMS, vol. 37(1), pages 54-77, January.
Song Lin & Juanjuan Zhang & John R. Hauser, 2015. "Learning from Experience, Simply," Marketing Science, INFORMS, vol. 34(1), pages 1-19, January.
Gergana Y. Nenkov & Maureen Morrin & Andrew Ward & Barry Schwartz & John Hulland, 2008. "A short form of the Maximization Scale: Factor structure, reliability and validity studies," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 3, pages 371-388, June.
Erev, I. & Roth, Alvin E., 2014. "Maximization, learning, and economic behavior," Scholarly Articles 30831199, Harvard University Department of Economics.
Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 10(1), pages 55-77.
Jerker Denrell & James G. March, 2001. "Adaptation as Information Restriction: The Hot Stove Effect," Organization Science, INFORMS, vol. 12(5), pages 523-538, October.
Thomas P. Novak & Donna L. Hoffman, 2009. "The Fit of Thinking Style and Situation: New Measures of Situation-Specific Experiential and Rational Cognition," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 36(1), pages 56-72, June.
John R. Hauser & Guilherme (Gui) Liberali & Glen L. Urban, 2014. "Website Morphing 2.0: Switching Costs, Partial Exposure, Random Exit, and When to Morph," Management Science, INFORMS, vol. 60(6), pages 1594-1616, June.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Runhui Lin & Yalin Li & Wenchang Li & Ze Ji & Biting Li, 2025. "AI-enabled individual learning strategies and scientific innovation: a case from the field of computer science," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(7), pages 3651-3677, July.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Gui Liberali & Alina Ferecatu, 2022. "Morphing for Consumer Dynamics: Bandits Meet Hidden Markov Models," Marketing Science, INFORMS, vol. 41(4), pages 769-794, July.
Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
- Yingyao Hu & Yutaka Kayaba & Matthew Shum, 2010. "Nonparametric learning rules from bandit experiments: the eyes have it!," CeMMAP working papers CWP15/10, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Yingyao Hu & Yutaka Kayaba & Matt Shum, 2010. "Nonparametric Learning Rules from Bandit Experiments: The Eyes have it!," Economics Working Paper Archive 560, The Johns Hopkins University,Department of Economics.
Victor Aguirregabiria & Jihye Jeon, 2020. "Firms’ Beliefs and Learning: Models, Identification, and Empirical Evidence," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(2), pages 203-235, March.
- Victor Aguirregabiria & Jihye Jeon, 2018. "Firms' Beliefs and Learning: Models, Identification, and Empirical Evidence," Working Papers tecipa-620, University of Toronto, Department of Economics.
- Aguirregabiria, Victor & Jeon, Jihye, 2018. "Firms' Beliefs and Learning: Models, Identification, and Empirical Evidence," CEPR Discussion Papers 13255, C.E.P.R. Discussion Papers.
Yechiam, Eldad & Busemeyer, Jerome R., 2008. "Evaluating generalizability and parameter consistency in learning models," Games and Economic Behavior, Elsevier, vol. 63(1), pages 370-394, May.
Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
Daniel E Acuña & Paul Schrater, 2010. "Structure Learning in Human Sequential Decision-Making," PLOS Computational Biology, Public Library of Science, vol. 6(12), pages 1-12, December.
Phanish Puranam & Murali Swamy, 2016. "How Initial Representations Shape Coupled Learning Processes," Organization Science, INFORMS, vol. 27(2), pages 323-335, April.
Avi Goldfarb & Teck-Hua Ho & Wilfred Amaldoss & Alexander Brown & Yan Chen & Tony Cui & Alberto Galasso & Tanjim Hossain & Ming Hsu & Noah Lim & Mo Xiao & Botao Yang, 2012. "Behavioral models of managerial decision-making," Marketing Letters, Springer, vol. 23(2), pages 405-421, June.
Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.
Oyarzun, Carlos & Sarin, Rajiv, 2012. "Mean and variance responsive learning," Games and Economic Behavior, Elsevier, vol. 75(2), pages 855-866.
Yilmaz Kocer, 2010. "Endogenous Learning with Bounded Memory," Working Papers 1290, Princeton University, Department of Economics, Econometric Research Program..
Nobuyuki Hanaki & Alan Kirman & Paul Pezanis-Christou, 2016. "Counter Intuitive Learning: An Exploratory Study," School of Economics and Public Policy Working Papers 2016-12, University of Adelaide, School of Economics and Public Policy.
- Nobuyuki Hanaki & Alan Kirman & Paul Pezanis-Christou, 2016. "Counter intuitive learning: An exploratory study," Working Papers hal-01358716, HAL.
- Nobuyuki Hanaki & Alan P. Kirman & Paul Pezanis-Christou, 2016. "Counter Intuitive Learning: An Exploratory Study," CESifo Working Paper Series 6029, CESifo.
Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2017. "Meaningful learning in weighted voting games: an experiment," Theory and Decision, Springer, vol. 83(1), pages 131-153, June.
- Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2015. "Meaningful Learning in Weighted Voting Games: An Experiment," GREDEG Working Papers 2015-40, Groupe de REcherche en Droit, Economie, Gestion (GREDEG CNRS), Université Côte d'Azur, France.
- Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2017. "Meaningful Learning in Weighted Voting Games: An Experiment," Post-Print halshs-01216244, HAL.
Wu, Hang & Bayer, Ralph-C, 2015. "Learning from inferred foregone payoffs," Journal of Economic Dynamics and Control, Elsevier, vol. 51(C), pages 445-458.
- Ralph-C. Bayer & Hang Wu, 2013. "Learning from Inferred Foregone Payoffs," School of Economics and Public Policy Working Papers 2013-22, University of Adelaide, School of Economics and Public Policy.
Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
Sarin, Rajiv & Vahid, Farshid, 2001. "Predicting How People Play Games: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 34(1), pages 104-122, January.
- Sarin, R. & Vahid, F., 1999. "Predicting how People Play Games: a Simple Dynamic Model of Choice," Monash Econometrics and Business Statistics Working Papers 12/99, Monash University, Department of Econometrics and Business Statistics.
Hart E. Posen & Dirk Martignoni & Daniel A. Levinthal, 2013. "E Pluribus Unum: Organizational Size and the Efficacy of Learning," DRUID Working Papers 13-09, DRUID, Copenhagen Business School, Department of Industrial Economics and Strategy/Aalborg University, Department of Business Studies.
Di Guida, Sibilla & Erev, Ido & Marchiori, Davide, 2015. "Cross cultural differences in decisions from experience: Evidence from Denmark, Israel, and Taiwan," Journal of Economic Psychology, Elsevier, vol. 49(C), pages 47-58.
Christina Fang & Daniel Levinthal, 2009. "Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems," Organization Science, INFORMS, vol. 20(3), pages 538-551, June.

More about this item

Keywords

; ; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormksc:v:41:y:2022:i:1:p:139-165. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Understanding Managers’ Trade-Offs Between Exploration and Exploitation

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data