IDEAS home Printed from https://ideas.repec.org/a/inm/ormksc/v41y2022i1p139-165.html
   My bibliography  Save this article

Understanding Managers’ Trade-Offs Between Exploration and Exploitation

Author

Listed:
  • Alina Ferecatu

    (Department of Marketing Management, Rotterdam School of Management, Erasmus University, 3062 PA Rotterdam, Netherlands)

  • Arnaud De Bruyn

    (Department of Marketing, ESSEC Business School, 95000 Cergy, France)

Abstract

Managers frequently explore new strategies, and exploit familiar ones, when making decisions on new product development, pricing, or advertising. Exploring for too long, or exploiting too soon, will generate inferior financial returns. Our research describes decision makers’ exploration/exploitation trade-offs and their link to psychometric traits. We conduct an incentive-aligned study in which subjects play a multiarmed bandit experiment and evaluate how subjects balance exploration and exploitation, linked to psychometric traits. To formally describe exploration/exploitation trade-offs, we develop a behavioral model that captures latent dynamics in learning behavior. Subjects transition between three unobserved states—exploration, exploitation, and inertia—updating their beliefs about expected payoffs. Our analysis suggests that decision makers overexplore low-performing options, forgoing over 30% of potential revenue. They heavily rely on recent experiences. Risk-averse decision makers spend more time exploring. Maximizers are more sensitive to payoffs than satisficers. Our research builds the groundwork needed to devise remedial actions aimed at helping managers find an optimal balance between exploration and exploitation. One way to achieve this goal is by carefully designing the learning environment. In two additional studies, we analyze the evolution of exploration/exploitation trade-offs across different learning environments. Offering decision makers repeated opportunities to learn and increasing the planning horizon appears beneficial.

Suggested Citation

  • Alina Ferecatu & Arnaud De Bruyn, 2022. "Understanding Managers’ Trade-Offs Between Exploration and Exploitation," Marketing Science, INFORMS, vol. 41(1), pages 139-165, January.
  • Handle: RePEc:inm:ormksc:v:41:y:2022:i:1:p:139-165
    DOI: 10.1287/mksc.2021.1304
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mksc.2021.1304
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mksc.2021.1304?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Colin F. Camerer & Teck-Hua Ho & Juin-Kuan Chong, 2004. "A Cognitive Hierarchy Model of Games," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(3), pages 861-898.
    2. Paul S. Adler & Barbara Goldoftas & David I. Levine, 1999. "Flexibility Versus Efficiency? A Case Study of Model Changeovers in the Toyota Production System," Organization Science, INFORMS, vol. 10(1), pages 43-68, February.
    3. Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
    4. Kanishka Misra & Eric M. Schwartz & Jacob Abernethy, 2019. "Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments," Marketing Science, INFORMS, vol. 38(2), pages 226-252, March.
    5. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    6. James G. March, 1991. "Exploration and Exploitation in Organizational Learning," Organization Science, INFORMS, vol. 2(1), pages 71-87, February.
    7. Eric M. Schwartz & Eric T. Bradlow & Peter S. Fader, 2017. "Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments," Marketing Science, INFORMS, vol. 36(4), pages 500-522, July.
    8. Avi Goldfarb & Teck-Hua Ho & Wilfred Amaldoss & Alexander Brown & Yan Chen & Tony Cui & Alberto Galasso & Tanjim Hossain & Ming Hsu & Noah Lim & Mo Xiao & Botao Yang, 2012. "Behavioral models of managerial decision-making," Marketing Letters, Springer, vol. 23(2), pages 405-421, June.
    9. Eva Ascarza & Bruce G. S. Hardie, 2013. "A Joint Model of Usage and Churn in Contractual Settings," Marketing Science, INFORMS, vol. 32(4), pages 570-590, July.
    10. Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.
    11. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    12. Avi Goldfarb & Mo Xiao, 2011. "Who Thinks about the Competition? Managerial Ability and Strategic Entry in US Local Telephone Markets," American Economic Review, American Economic Association, vol. 101(7), pages 3130-3161, December.
    13. Song Lin & Juanjuan Zhang & John R. Hauser, 2015. "Learning from Experience, Simply," Marketing Science, INFORMS, vol. 34(1), pages 1-19, January.
    14. Erev, I. & Roth, Alvin E., 2014. "Maximization, learning, and economic behavior," Scholarly Articles 30831199, Harvard University Department of Economics.
    15. Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 10(1), pages 55-77.
    16. Jerker Denrell & James G. March, 2001. "Adaptation as Information Restriction: The Hot Stove Effect," Organization Science, INFORMS, vol. 12(5), pages 523-538, October.
    17. John R. Hauser & Guilherme (Gui) Liberali & Glen L. Urban, 2014. "Website Morphing 2.0: Switching Costs, Partial Exposure, Random Exit, and When to Morph," Management Science, INFORMS, vol. 60(6), pages 1594-1616, June.
    18. Nathaniel D. Daw & John P. O'Doherty & Peter Dayan & Ben Seymour & Raymond J. Dolan, 2006. "Cortical substrates for exploratory decisions in humans," Nature, Nature, vol. 441(7095), pages 876-879, June.
    19. Tversky, Amos & Kahneman, Daniel, 1992. "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty, Springer, vol. 5(4), pages 297-323, October.
    20. repec:cup:judgdm:v:3:y:2008:i::p:371-388 is not listed on IDEAS
    21. Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
    22. Rapoport, Amnon & Amaldoss, Wilfred, 2000. "Mixed strategies and iterative elimination of strongly dominated strategies: an experimental investigation of states of knowledge," Journal of Economic Behavior & Organization, Elsevier, vol. 42(4), pages 483-521, August.
    23. Steven L. Scott, 2010. "A modern Bayesian look at the multi‐armed bandit," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 26(6), pages 639-658, November.
    24. Sarin, Rajiv & Vahid, Farshid, 1999. "Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice," Games and Economic Behavior, Elsevier, vol. 28(2), pages 294-309, August.
    25. Eva Ascarza & Oded Netzer & Bruce G. S. Hardie, 2018. "Some Customers Would Rather Leave Without Saying Goodbye," Marketing Science, INFORMS, vol. 37(1), pages 54-77, January.
    26. Thomas P. Novak & Donna L. Hoffman, 2009. "The Fit of Thinking Style and Situation: New Measures of Situation-Specific Experiential and Rational Cognition," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 36(1), pages 56-72, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gui Liberali & Alina Ferecatu, 2022. "Morphing for Consumer Dynamics: Bandits Meet Hidden Markov Models," Marketing Science, INFORMS, vol. 41(4), pages 769-794, July.
    2. Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
    3. Victor Aguirregabiria & Jihye Jeon, 2020. "Firms’ Beliefs and Learning: Models, Identification, and Empirical Evidence," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(2), pages 203-235, March.
    4. Daniel E Acuña & Paul Schrater, 2010. "Structure Learning in Human Sequential Decision-Making," PLOS Computational Biology, Public Library of Science, vol. 6(12), pages 1-12, December.
    5. Phanish Puranam & Murali Swamy, 2016. "How Initial Representations Shape Coupled Learning Processes," Organization Science, INFORMS, vol. 27(2), pages 323-335, April.
    6. Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
    7. Yilmaz Kocer, 2010. "Endogenous Learning with Bounded Memory," Working Papers 1290, Princeton University, Department of Economics, Econometric Research Program..
    8. Nobuyuki Hanaki & Alan P. Kirman & Paul Pezanis-Christou, 2016. "Counter Intuitive Learning: An Exploratory Study," CESifo Working Paper Series 6029, CESifo.
    9. Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2017. "Meaningful learning in weighted voting games: an experiment," Theory and Decision, Springer, vol. 83(1), pages 131-153, June.
    10. Wu, Hang & Bayer, Ralph-C, 2015. "Learning from inferred foregone payoffs," Journal of Economic Dynamics and Control, Elsevier, vol. 51(C), pages 445-458.
    11. Hart E. Posen & Dirk Martignoni & Daniel A. Levinthal, 2013. "E Pluribus Unum: Organizational Size and the Efficacy of Learning," DRUID Working Papers 13-09, DRUID, Copenhagen Business School, Department of Industrial Economics and Strategy/Aalborg University, Department of Business Studies.
    12. Di Guida, Sibilla & Erev, Ido & Marchiori, Davide, 2015. "Cross cultural differences in decisions from experience: Evidence from Denmark, Israel, and Taiwan," Journal of Economic Psychology, Elsevier, vol. 49(C), pages 47-58.
    13. Christina Fang & Daniel Levinthal, 2009. "Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems," Organization Science, INFORMS, vol. 20(3), pages 538-551, June.
    14. repec:cup:judgdm:v:12:y:2017:i:2:p:104-117 is not listed on IDEAS
    15. Hart E. Posen & Daniel A. Levinthal, 2012. "Chasing a Moving Target: Exploitation and Exploration in Dynamic Environments," Management Science, INFORMS, vol. 58(3), pages 587-601, March.
    16. Ori Plonsky & Yefim Roth & Ido Erev, 2021. "Underweighting of rare events in social interactions and its implications to the design of voluntary health applications," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 16(2), pages 267-289, March.
    17. Yechiam, Eldad & Busemeyer, Jerome R., 2008. "Evaluating generalizability and parameter consistency in learning models," Games and Economic Behavior, Elsevier, vol. 63(1), pages 370-394, May.
    18. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    19. Teck-Hua Ho & So-Eun Park & Xuanming Su, 2021. "A Bayesian Level- k Model in n -Person Games," Management Science, INFORMS, vol. 67(3), pages 1622-1638, March.
    20. Gars, Jared & Ward, Patrick S., 2019. "Can differences in individual learning explain patterns of technology adoption? Evidence on heterogeneous learning patterns and hybrid rice adoption in Bihar, India," World Development, Elsevier, vol. 115(C), pages 178-189.
    21. Hanaki, Nobuyuki & Kirman, Alan & Pezanis-Christou, Paul, 2018. "Observational and reinforcement pattern-learning: An exploratory study," European Economic Review, Elsevier, vol. 104(C), pages 1-21.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormksc:v:41:y:2022:i:1:p:139-165. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.