A model of discrete choice based on reinforcement learning under short-term memory

Author

Misha Perepelitsa

Abstract

A family of models of individual discrete choice is constructed by statistically averaging the choices made by a subject in a reinforcement learning process, where the subject has a short, k-term memory span. The choice probabilities in these models combine, in a non-trivial and non-linear way, the initial learning bias and the experience gained through learning. The properties of such models are discussed and, in particular, it is shown that the probabilities deviate from Luce's Choice Axiom, even if the initial bias adheres to it. Moreover, we show that adherence to the axiom is recovered as the memory span becomes large. Two applications in utility theory are considered. In the first, we use the discrete choice model to generate a binary preference relation on simple lotteries. We show that the preferences violate the transitivity and independence axioms of expected utility theory. Furthermore, we establish the dependence of the preferences on frames, with risk aversion for gains and risk seeking for losses. Based on these findings, we next propose a parametric model of choice based on the probability maximization principle, as a model for deviations from the expected utility principle. To illustrate the approach, we apply it to the classical problem of demand for insurance.
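
For readers unfamiliar with the terms in the abstract, the following Python sketch may help. It shows two things: the standard Luce rule, under which the odds between two alternatives do not depend on the choice set, and a toy learner with a k-step memory whose choice frequencies are obtained by averaging over a simulated run. The toy update rule, the function names, and the numbers in `bias` and `payoff` are all illustrative assumptions; the abstract does not specify the paper's actual learning dynamics, so this should not be read as the author's model.

```python
import random
from collections import deque


def luce_probabilities(weights, choice_set):
    """Luce-rule probabilities: P(i | S) = w_i / sum of w_j over j in S."""
    total = sum(weights[a] for a in choice_set)
    return {a: weights[a] / total for a in choice_set}


def simulate_short_memory_learner(bias, payoff, k, steps, seed=0):
    """Toy learner (an assumption for illustration, not the paper's model).

    At each step the subject picks alternative a with probability
    proportional to bias[a] plus the payoffs it received from a during
    its last k choices. Returns the empirical choice frequencies.
    """
    rng = random.Random(seed)
    alternatives = list(bias)
    memory = deque(maxlen=k)  # holds the last k (choice, payoff) pairs
    counts = {a: 0 for a in alternatives}
    for _ in range(steps):
        # Start from the initial bias and add remembered payoffs.
        propensity = dict(bias)
        for chosen, reward in memory:
            propensity[chosen] += reward
        picked = rng.choices(
            alternatives, weights=[propensity[a] for a in alternatives]
        )[0]
        counts[picked] += 1
        memory.append((picked, payoff[picked]))
    return {a: counts[a] / steps for a in alternatives}


# Hypothetical initial bias over three alternatives.
bias = {"a": 1.0, "b": 2.0, "c": 3.0}

# Under the Luce rule, the odds of a against b are the same
# in the full set and in the pair {a, b}.
full = luce_probabilities(bias, ["a", "b", "c"])
pair = luce_probabilities(bias, ["a", "b"])
odds_full = full["a"] / full["b"]
odds_pair = pair["a"] / pair["b"]

# Averaged choice frequencies of the toy learner with memory span k = 2.
payoff = {"a": 1.0, "b": 1.0, "c": 1.0}
freq = simulate_short_memory_learner(bias, payoff, k=2, steps=10_000)
```

Running the learner on the full set and on a two-element subset, and comparing the resulting odds, is one way to probe numerically whether averaged frequencies of this kind respect the Luce property for a given `k`.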

Suggested Citation

Misha Perepelitsa, 2019. "A model of discrete choice based on reinforcement learning under short-term memory," Papers 1908.06133, arXiv.org.

Handle: RePEc:arx:papers:1908.06133

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Misha Perepelitsa, 2019. "RPS(1) Preferences," Papers 1901.04995, arXiv.org, revised Feb 2019.
Belianin, A., 2017. "Face to Face to Human Being: Achievements and Challenges of Behavioral Economics," Journal of the New Economic Association, New Economic Association, vol. 34(2), pages 166-175.
Upravitelev, A., 2023. "Neoclassical roots of behavioral economics," Journal of the New Economic Association, New Economic Association, vol. 58(1), pages 110-140.
Rania HENTATI & Jean-Luc PRIGENT, 2010. "Structured Portfolio Analysis under SharpeOmega Ratio," EcoMod2010 259600073, EcoMod.
- Rania Hentati-KAFFEL & Jean-Luc Prigent, 2014. "Structured portfolio analysis under SharpeOmega ratio," Working Papers 2014-425, Department of Research, Ipag Business School.
- Rania Hentati & Jean-Luc Prigent, 2012. "Structured portfolio analysis under SharpeOmega ratio," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-00657327, HAL.
- Rania Hentati & Jean-Luc Prigent, 2012. "Structured portfolio analysis under SharpeOmega ratio," Working Papers hal-00657327, HAL.
- Rania Hentati-Kaffel & Jean-Luc Prigent, 2012. "Structured portfolio analysis under SharpeOmega ratio," Documents de travail du Centre d'Economie de la Sorbonne 12002, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
Border, Kim C. & Segal, Uzi, 1997. "Coherent Odds and Subjective Probability," University of Western Ontario, Departmental Research Report Series 9717, University of Western Ontario, Department of Economics.
- Kim C. Border & Uzi Segal, 2001. "Coherent Odds and Subjective Probability," Boston College Working Papers in Economics 513, Boston College Department of Economics.
Michal Skořepa, 2007. "Zpochybnění deskriptivnosti teorie očekávaného užitku [Doubts about the descriptive validity of the expected utility theory]," Politická ekonomie, Prague University of Economics and Business, vol. 2007(1), pages 106-120.
Zvi Safra & Uzi Segal, 2005. "Are Universal Preferences Possible? Calibration Results for Non-Expected Utility Theories," Boston College Working Papers in Economics 633, Boston College Department of Economics.
Trabelsi, Mohamed Ali, 2006. "Les nouveaux modèles de décision dans le risque et l’incertain : quel apport ? [The new models of decision under risk or uncertainty : What approach?]," MPRA Paper 25442, University Library of Munich, Germany.
- Trabelsi, Mohamed Ali, 2008. "Les nouveaux modèles de décision dans le risque et l’incertain : quel apport ? [The new models of decision under risk or uncertainty: What approach?]," MPRA Paper 83347, University Library of Munich, Germany, revised 2008.
Charles-Cadogan, G., 2016. "Expected utility theory and inner and outer measures of loss aversion," Journal of Mathematical Economics, Elsevier, vol. 63(C), pages 10-20.
David B. BROWN & Enrico G. DE GIORGI & Melvyn SIM, 2009. "A Satisficing Alternative to Prospect Theory," Swiss Finance Institute Research Paper Series 09-19, Swiss Finance Institute.
- David B. Brown & Enrico G. De Giorgi & Melvyn Sim, 2009. "A Satisficing Alternative to Prospect Theory," University of St. Gallen Department of Economics working paper series 2009 2009-09, Department of Economics, University of St. Gallen.
Haim Levy, 2008. "First Degree Stochastic Dominance Violations: Decision Weights and Bounded Rationality," Economic Journal, Royal Economic Society, vol. 118(528), pages 759-774, April.
Laurent Denant-Boemont & Olivier L’Haridon, 2013. "La rationalité à l'épreuve de l'économie comportementale," Revue française d'économie, Presses de Sciences-Po, vol. 0(2), pages 35-89.
- Laurent Denant-Boèmont & Olivier l'Haridon, 2013. "La rationalité à l’épreuve de l’économie comportementale," Economics Working Paper Archive (University of Rennes & University of Caen) 201323, Center for Research in Economics and Management (CREM), University of Rennes, University of Caen and CNRS.
- Laurent Denant-Boèmont & Olivier L’haridon, 2013. "La rationalité à l'épreuve de l'économie comportementale," Post-Print halshs-00921070, HAL.
Levy, Haim & Levy, Moshe, 2002. "Experimental test of the prospect theory value function: A stochastic dominance approach," Organizational Behavior and Human Decision Processes, Elsevier, vol. 89(2), pages 1058-1081, November.
Delli Gatti,Domenico & Fagiolo,Giorgio & Gallegati,Mauro & Richiardi,Matteo & Russo,Alberto (ed.), 2018. "Agent-Based Models in Economics," Cambridge Books, Cambridge University Press, number 9781108400046, January.
Rapoport, Amnon & Chung Lo, Alison King & Zwick, Rami, 2002. "Choice of Prizes Allocated by Multiple Lotteries with Endogenously Determined Probabilities," Organizational Behavior and Human Decision Processes, Elsevier, vol. 87(1), pages 180-206, January.
- Amnon Rapoport & Alison King Chung Lo & Rami Zwick, 2001. "Choice of Prizes Allocated by Multiple Lotteries with Endogenously Determined Probabilities," Experimental 0110003, University Library of Munich, Germany.
Trabelsi, Mohamed Ali, 2019. "The new models of decision in risk: A review of the critical literature," MPRA Paper 92693, University Library of Munich, Germany, revised 2019.
Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
James Cox & Vjollca Sadiraj & Ulrich Schmidt, 2015. "Paradoxes and mechanisms for choice under risk," Experimental Economics, Springer;Economic Science Association, vol. 18(2), pages 215-250, June.
- Cox, James C. & Sadiraj, Vjollca & Schmidt, Ulrich, 2011. "Paradoxes and mechanisms for choice under risk," Kiel Working Papers 1712, Kiel Institute for the World Economy (IfW Kiel).
- James C. Cox & Vjollca Sadiraj & Ulrich Schmidt, 2011. "Paradoxes and Mechanisms for Choice under Risk," Experimental Economics Center Working Paper Series 2011-07, Experimental Economics Center, Andrew Young School of Policy Studies, Georgia State University, revised Mar 2014.
Marco LiCalzi, 2005. "A language for the construction of preferences under uncertainty," Game Theory and Information 0509002, University Library of Munich, Germany.
Phillips Peter J. & Pohl Gabriela, 2018. "The Deferral of Attacks: SP/A Theory as a Model of Terrorist Choice when Losses Are Inevitable," Open Economics, De Gruyter, vol. 1(1), pages 71-85, February.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-DCM-2019-08-26 (Discrete Choice Models)
NEP-IAS-2019-08-26 (Insurance Economics)
NEP-MIC-2019-08-26 (Microeconomics)
NEP-UPT-2019-08-26 (Utility Models and Prospect Theory)
