IDEAS home Printed from https://ideas.repec.org/a/eee/gamebe/v148y2024icp415-426.html
   My bibliography  Save this article

Risk preferences of learning algorithms

Author

Listed:
  • Haupt, Andreas
  • Narayanan, Aroon

Abstract

Many economic decision-makers today rely on learning algorithms for important decisions. This paper shows that a widely used learning algorithm—ε-Greedy—exhibits emergent risk aversion, favoring actions with lower payoff variance. When presented with actions of the same expectated payoff, under a wide range of conditions, ε-Greedy chooses the lower-variance action with probability approaching one. This emergent preference can have wide-ranging consequences, from inequity to homogenization, and holds transiently even when the higher-variance action has a strictly higher expected payoff. We discuss two methods to restore risk neutrality. The first method reweights data as a function of how likely an action is chosen. The second method employs optimistic payoff estimates for actions that have not been taken often.

Suggested Citation

  • Haupt, Andreas & Narayanan, Aroon, 2024. "Risk preferences of learning algorithms," Games and Economic Behavior, Elsevier, vol. 148(C), pages 415-426.
  • Handle: RePEc:eee:gamebe:v:148:y:2024:i:c:p:415-426
    DOI: 10.1016/j.geb.2024.09.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S089982562400143X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.geb.2024.09.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Joseph G. Altonji & Charles R. Pierret, 2001. "Employer Learning and Statistical Discrimination," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 116(1), pages 313-350.
    2. Henry S. Farber & Robert Gibbons, 1996. "Learning and Wage Dynamics," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 111(4), pages 1007-1047.
    3. Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2020. "Artificial Intelligence, Algorithmic Pricing, and Collusion," American Economic Review, American Economic Association, vol. 110(10), pages 3267-3297, October.
    4. Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
    5. Zach Y. Brown & Alexander MacKay, 2023. "Competition in Pricing Algorithms," American Economic Journal: Microeconomics, American Economic Association, vol. 15(2), pages 109-156, May.
    6. Gregory S. Crawford & Matthew Shum, 2005. "Uncertainty and Learning in Pharmaceutical Demand," Econometrica, Econometric Society, vol. 73(4), pages 1137-1173, July.
    7. Nicolas Klein & Sven Rady, 2011. "Negatively Correlated Bandits," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(2), pages 693-732.
    8. Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaustav Das, 2014. "Strategic Experimentation with Competition and Private Arrival of Information," Discussion Papers 1404, University of Exeter, Department of Economics.
    2. , & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
    3. Kohei Kawaguchi, 2021. "When Will Workers Follow an Algorithm? A Field Experiment with a Retail Business," Management Science, INFORMS, vol. 67(3), pages 1670-1695, March.
    4. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    5. Deimen, Inga & Wirtz, Julia, 2022. "Control, cost, and confidence: Perseverance and procrastination in the face of failure," Games and Economic Behavior, Elsevier, vol. 134(C), pages 52-74.
    6. Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
    7. Kaustav Das, 2017. "The Role of Heterogeneity in a model of Strategic Experimentation," Discussion Papers 1703, University of Exeter, Department of Economics.
    8. Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
    9. Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
    10. Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
    11. Kaustav Das & Nicolas Klein & Katharina Schmid, 2020. "Strategic experimentation with asymmetric players," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(4), pages 1147-1175, June.
    12. Tarantino, Emanuele & Simcoe, Timothy S. & Ganglmair, Bernhard, 2018. "Learning When to Quit: An Empirical Model of Experimentation," CEPR Discussion Papers 12733, C.E.P.R. Discussion Papers.
    13. Fudenberg, Drew & He, Kevin, 2021. "Player-compatible learning and player-compatible equilibrium," Journal of Economic Theory, Elsevier, vol. 194(C).
    14. Klein, Nicolas, 2013. "Strategic learning in teams," Games and Economic Behavior, Elsevier, vol. 82(C), pages 636-657.
    15. Rao, Neel, 2016. "Social effects in employer learning: An analysis of siblings," Labour Economics, Elsevier, vol. 38(C), pages 24-36.
    16. Lepage, Louis Pierre, 2020. "Endogenous learning and the persistence of employer biases in the labor market," CLEF Working Paper Series 24, Canadian Labour Economics Forum (CLEF), University of Waterloo.
    17. Kostas Bimpikis & Shayan Ehsani & Mohamed Mostagir, 2019. "Designing Dynamic Contests," Operations Research, INFORMS, vol. 67(2), pages 339-356, March.
    18. Das, Kaustav & Klein, Nicolas & Schmid, Katharina, 2024. "Strategic experimentation with asymmetric safe options," Economics Letters, Elsevier, vol. 239(C).
    19. Kaustav Das, 2015. "The Role of Heterogeneity in a Model of Strategic Experimentation," Discussion Papers 1507, University of Exeter, Department of Economics.
    20. Külpmann, Philipp, 2015. "Procrastination and projects," Center for Mathematical Economics Working Papers 544, Center for Mathematical Economics, Bielefeld University.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:148:y:2024:i:c:p:415-426. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622836 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.