Risk preferences of learning algorithms

My bibliography Save this article

Risk preferences of learning algorithms

Author

Listed:

Haupt, Andreas
Narayanan, Aroon

Registered:

Abstract

Many economic decision-makers today rely on learning algorithms for important decisions. This paper shows that a widely used learning algorithm—ε-Greedy—exhibits emergent risk aversion, favoring actions with lower payoff variance. When presented with actions of the same expectated payoff, under a wide range of conditions, ε-Greedy chooses the lower-variance action with probability approaching one. This emergent preference can have wide-ranging consequences, from inequity to homogenization, and holds transiently even when the higher-variance action has a strictly higher expected payoff. We discuss two methods to restore risk neutrality. The first method reweights data as a function of how likely an action is chosen. The second method employs optimistic payoff estimates for actions that have not been taken often.

Suggested Citation

Haupt, Andreas & Narayanan, Aroon, 2024. "Risk preferences of learning algorithms," Games and Economic Behavior, Elsevier, vol. 148(C), pages 415-426.

Handle: RePEc:eee:gamebe:v:148:y:2024:i:c:p:415-426
DOI: 10.1016/j.geb.2024.09.013

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Nicolas Klein & Sven Rady, 2011. "Negatively Correlated Bandits," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(2), pages 693-732.
- Nicolas Klein & Sven Rady, 2008. "Negatively Correlated Bandits," Working Papers 040, Bavarian Graduate Program in Economics (BGPE).
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 243, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Klein, Nicolas, 2008. "Negatively Correlated Bandits," CEPR Discussion Papers 6983, C.E.P.R. Discussion Papers.
- Klein, Nicolas & Rady, Sven, 2008. "Negatively Correlated Bandits," Discussion Papers in Economics 5332, University of Munich, Department of Economics.
- Sven Rady & Nicolas Klein, 2008. "Negatively Correlated Bandits," 2008 Meeting Papers 136, Society for Economic Dynamics.
Joseph G. Altonji & Charles R. Pierret, 2001. "Employer Learning and Statistical Discrimination," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 116(1), pages 313-350.
- Joseph G. Altonji & Charles R. Pierret, "undated". "Employer Learning and Statistical Discrimination," IPR working papers 97-18, Institute for Policy Resarch at Northwestern University.
- Joseph G. Altonji & Charles R. Pierret, 1997. "Employer Learning and Statistical Discrimination," NBER Working Papers 6279, National Bureau of Economic Research, Inc.
- Joseph Altonji & Charles R. Pierret, 1997. "Employer learning and statistical discrimination," Working Paper Series, Macroeconomic Issues WP-97-11, Federal Reserve Bank of Chicago.
Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
- Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Godfrey Keller & Martin Cripps & Olin School of Business & Washington University & Sven Rady & Department of Economics & University of Munich, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Henry S. Farber & Robert Gibbons, 1996. "Learning and Wage Dynamics," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 111(4), pages 1007-1047.
- Henry S. Farber & Robert Gibbons, 1991. "Learning and Wage Dynamics," NBER Working Papers 3764, National Bureau of Economic Research, Inc.
- Henry S. Farber & Robert Gibbons, 1994. "Learning and Wage Dynamics," Working Papers 707, Princeton University, Department of Economics, Industrial Relations Section..
Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2020. "Artificial Intelligence, Algorithmic Pricing, and Collusion," American Economic Review, American Economic Association, vol. 110(10), pages 3267-3297, October.
- Calzolari, Giacomo & Calvano, Emilio & Denicolo, Vincenzo & Pastorello, Sergio, 2018. "Artificial intelligence, algorithmic pricing and collusion," CEPR Discussion Papers 13405, C.E.P.R. Discussion Papers.
Zach Y. Brown & Alexander MacKay, 2023. "Competition in Pricing Algorithms," American Economic Journal: Microeconomics, American Economic Association, vol. 15(2), pages 109-156, May.
- Zach Y. Brown & Alexander MacKay, 2021. "Competition in Pricing Algorithms," NBER Working Papers 28860, National Bureau of Economic Research, Inc.
Gregory S. Crawford & Matthew Shum, 2005. "Uncertainty and Learning in Pharmaceutical Demand," Econometrica, Econometric Society, vol. 73(4), pages 1137-1173, July.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

, & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
- Sven Rady & Godfrey Keller, 2007. "Strategic Experimentation with Poisson Bandits," 2007 Meeting Papers 332, Society for Economic Dynamics.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 260, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Papers in Economics 10575, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2009. "Strategic Experimentation with Poisson Bandits," CEPR Discussion Papers 7270, C.E.P.R. Discussion Papers.
Kohei Kawaguchi, 2021. "When Will Workers Follow an Algorithm? A Field Experiment with a Retail Business," Management Science, INFORMS, vol. 67(3), pages 1670-1695, March.
Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
Tarantino, Emanuele & Simcoe, Timothy S. & Ganglmair, Bernhard, 2018. "Learning When to Quit: An Empirical Model of Experimentation," CEPR Discussion Papers 12733, C.E.P.R. Discussion Papers.
- Bernhard Ganglmair & Timothy Simcoe & Emanuele Tarantino, 2018. "Learning When to Quit: An Empirical Model of Experimentation," Working Papers id:12569, eSocialSciences.
- Bernhard Ganglmair & Timothy Simcoe & Emanuele Tarantino, 2018. "Learning When to Quit: An Empirical Model of Experimentation," NBER Working Papers 24358, National Bureau of Economic Research, Inc.
Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
- Jean Guillaume Forand, 2010. "Keeping Your Options Open," RCER Working Papers 557, University of Rochester - Center for Economic Research (RCER).
- Jean Guillaume Forand, 2013. "Keeping Your options Open," Working Papers 1301, University of Waterloo, Department of Economics, revised Feb 2015.
- Jean Guillaume Forand, 2011. "Keeping Your Options Open," 2011 Meeting Papers 82, Society for Economic Dynamics.
Fudenberg, Drew & He, Kevin, 2021. "Player-compatible learning and player-compatible equilibrium," Journal of Economic Theory, Elsevier, vol. 194(C).
- Drew Fudenberg & Kevin He, 2017. "Player-Compatible Learning and Player-Compatible Equilibrium," Papers 1712.08954, arXiv.org, revised May 2020.
Kostas Bimpikis & Shayan Ehsani & Mohamed Mostagir, 2019. "Designing Dynamic Contests," Operations Research, INFORMS, vol. 67(2), pages 339-356, March.
Boyarchenko, Svetlana, 2021. "Inefficiency of sponsored research," Journal of Mathematical Economics, Elsevier, vol. 95(C).
Deimen, Inga & Wirtz, Julia, 2022. "Control, cost, and confidence: Perseverance and procrastination in the face of failure," Games and Economic Behavior, Elsevier, vol. 134(C), pages 52-74.
- Inga Deimen & Julia Wirtz, 2021. "Control, Cost, and Confidence:Perseverance and Procrastination in the Face of Failure," Bristol Economics Discussion Papers 21/738, School of Economics, University of Bristol, UK.
Simina Br^anzei & Yuval Peres, 2019. "Multiplayer Bandit Learning, from Competition to Cooperation," Papers 1908.01135, arXiv.org, revised Jan 2024.
Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
- Dinah Rosenberg & Antoine Salomon & Nicolas Vieille, 2010. "On Games of Strategic Experimentation," Working Papers hal-00579613, HAL.
- Rosenberg, Dinah & Salomon , Antoine & Vieille , Nicolas, 2013. "On Games of Strategic Experimentation," HEC Research Papers Series 1008, HEC Paris.
Svetlana Boyarchenko, 2020. "Super- and submodularity of stopping games with random observations," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(4), pages 983-1022, November.
Lepage, Louis Pierre, 2021. "Endogenous learning, persistent employer biases, and discrimination," CLEF Working Paper Series 34, Canadian Labour Economics Forum (CLEF), University of Waterloo.
Kaustav Das, 2013. "Strategic Experimentation with Heterogeneous Agents and Payoff Externalities," Discussion Papers 1315, University of Exeter, Department of Economics.
Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
- Chia-Hui Chen & Junichiro Ishida, 2015. "Hierarchical Experimentation," ISER Discussion Paper 0949, Institute of Social and Economic Research, The University of Osaka.
Weng, Xi, 2015. "Dynamic pricing in the presence of individual learning," Journal of Economic Theory, Elsevier, vol. 155(C), pages 262-299.
Kaustav Das, 2014. "Strategic Experimentation with Competition and Private Arrival of Information," Discussion Papers 1404, University of Exeter, Department of Economics.
Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, C.E.P.R. Discussion Papers.
Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
Kaustav Das, 2017. "The Role of Heterogeneity in a model of Strategic Experimentation," Discussion Papers 1703, University of Exeter, Department of Economics.

More about this item

Keywords

; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:148:y:2024:i:c:p:415-426. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622836 .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Risk preferences of learning algorithms

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data