Printed from https://ideas.repec.org/p/arx/papers/1908.06133.html

A model of discrete choice based on reinforcement learning under short-term memory

Author

Listed:
  • Misha Perepelitsa

Abstract

A family of models of individual discrete choice is constructed by statistically averaging the choices made by a subject in a reinforcement learning process, where the subject has a short, k-term memory span. The choice probabilities in these models combine, in a non-trivial and non-linear way, the initial learning bias and the experience gained through learning. The properties of such models are discussed and, in particular, it is shown that the probabilities deviate from Luce's Choice Axiom even if the initial bias adheres to it. Moreover, we show that adherence to the axiom is recovered as the memory span becomes large. Two applications in utility theory are considered. In the first, we use the discrete choice model to generate a binary preference relation on simple lotteries. We show that the preferences violate the transitivity and independence axioms of expected utility theory. Furthermore, we establish the dependence of the preferences on frames, with risk aversion for gains and risk seeking for losses. Based on these findings, we next propose a parametric model of choice based on the probability maximization principle, as a model for deviations from the expected utility principle. To illustrate the approach, we apply it to the classical problem of demand for insurance.
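
For readers unfamiliar with Luce's Choice Axiom, which the abstract uses as its benchmark, the following minimal sketch may help. It illustrates only the axiom itself, not the paper's k-term memory model, whose formulas the abstract does not give. The function name `luce_probabilities` and the weights are hypothetical and chosen purely for illustration. Under the axiom, each alternative is chosen with probability proportional to a fixed positive weight, so the odds between two alternatives do not change when a third is added to or removed from the choice set.

```python
from fractions import Fraction


def luce_probabilities(weights, choice_set):
    """Luce's rule: P(i | S) = w_i / (sum of w_j over j in S)."""
    total = sum(weights[j] for j in choice_set)
    return {i: Fraction(weights[i], total) for i in choice_set}


# Hypothetical positive weights for three alternatives (illustrative only).
weights = {"a": 1, "b": 2, "c": 3}

p_full = luce_probabilities(weights, ["a", "b", "c"])  # all three offered
p_pair = luce_probabilities(weights, ["a", "b"])       # c removed

# The odds of a versus b are the same in both choice sets.
ratio_full = p_full["a"] / p_full["b"]
ratio_pair = p_pair["a"] / p_pair["b"]
print(ratio_full, ratio_pair)
```

A model that deviates from the axiom, as the abstract reports for short memory spans, would yield different values for these two ratios.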

Suggested Citation

  • Misha Perepelitsa, 2019. "A model of discrete choice based on reinforcement learning under short-term memory," Papers 1908.06133, arXiv.org.
  • Handle: RePEc:arx:papers:1908.06133

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1908.06133
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Machina, Mark J, 1982. ""Expected Utility" Analysis without the Independence Axiom," Econometrica, Econometric Society, vol. 50(2), pages 277-323, March.
    4. Yaari, Menahem E, 1987. "The Dual Theory of Choice under Risk," Econometrica, Econometric Society, vol. 55(1), pages 95-115, January.
    5. Daniel Kahneman & Amos Tversky, 2013. "Prospect Theory: An Analysis of Decision Under Risk," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), Handbook of the Fundamentals of Financial Decision Making Part I, chapter 6, pages 99-127, World Scientific Publishing Co. Pte. Ltd.
    6. Tversky, Amos & Kahneman, Daniel, 1992. "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty, Springer, vol. 5(4), pages 297-323, October.
    7. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    8. Quiggin, John, 1982. "A theory of anticipated utility," Journal of Economic Behavior & Organization, Elsevier, vol. 3(4), pages 323-343, December.
    9. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Misha Perepelitsa, 2019. "RPS(1) Preferences," Papers 1901.04995, arXiv.org, revised Feb 2019.
    2. Belianin, A., 2017. "Face to Face to Human Being: Achievements and Challenges of Behavioral Economics," Journal of the New Economic Association, New Economic Association, vol. 34(2), pages 166-175.
    3. Upravitelev, A., 2023. "Neoclassical roots of behavioral economics," Journal of the New Economic Association, New Economic Association, vol. 58(1), pages 110-140.
    4. Rania HENTATI & Jean-Luc PRIGENT, 2010. "Structured Portfolio Analysis under SharpeOmega Ratio," EcoMod2010 259600073, EcoMod.
    5. Border, Kim C. & Segal, Uzi, 1997. "Coherent Odds and Subjective Probability," University of Western Ontario, Departmental Research Report Series 9717, University of Western Ontario, Department of Economics.
    6. Michal Skořepa, 2007. "Zpochybnění deskriptivnosti teorie očekávaného užitku [Doubts about the descriptive validity of the expected utility theory]," Politická ekonomie, Prague University of Economics and Business, vol. 2007(1), pages 106-120.
    7. Zvi Safra & Uzi Segal, 2005. "Are Universal Preferences Possible? Calibration Results for Non-Expected Utility Theories," Boston College Working Papers in Economics 633, Boston College Department of Economics.
    8. repec:cup:judgdm:v:16:y:2021:i:6:p:1324-1369 is not listed on IDEAS
    9. Charles-Cadogan, G., 2016. "Expected utility theory and inner and outer measures of loss aversion," Journal of Mathematical Economics, Elsevier, vol. 63(C), pages 10-20.
    10. David B. BROWN & Enrico G. DE GIORGI & Melvyn SIM, 2009. "A Satisficing Alternative to Prospect Theory," Swiss Finance Institute Research Paper Series 09-19, Swiss Finance Institute.
    11. Haim Levy, 2008. "First Degree Stochastic Dominance Violations: Decision Weights and Bounded Rationality," Economic Journal, Royal Economic Society, vol. 118(528), pages 759-774, April.
    12. Laurent Denant-Boemont & Olivier L’Haridon, 2013. "La rationalité à l'épreuve de l'économie comportementale," Revue française d'économie, Presses de Sciences-Po, vol. 0(2), pages 35-89.
    13. Levy, Haim & Levy, Moshe, 2002. "Experimental test of the prospect theory value function: A stochastic dominance approach," Organizational Behavior and Human Decision Processes, Elsevier, vol. 89(2), pages 1058-1081, November.
    14. Delli Gatti,Domenico & Fagiolo,Giorgio & Gallegati,Mauro & Richiardi,Matteo & Russo,Alberto (ed.), 2018. "Agent-Based Models in Economics," Cambridge Books, Cambridge University Press, number 9781108400046.
    15. Rapoport, Amnon & Chung Lo, Alison King & Zwick, Rami, 2002. "Choice of Prizes Allocated by Multiple Lotteries with Endogenously Determined Probabilities," Organizational Behavior and Human Decision Processes, Elsevier, vol. 87(1), pages 180-206, January.
    16. Trabelsi, Mohamed Ali, 2006. "Les nouveaux modèles de décision dans le risque et l’incertain : quel apport ? [The new models of decision under risk or uncertainty : What approach?]," MPRA Paper 25442, University Library of Munich, Germany.
    17. Trabelsi, Mohamed Ali, 2019. "The new models of decision in risk: A review of the critical literature," MPRA Paper 92693, University Library of Munich, Germany, revised 2019.
    18. Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
    19. James Cox & Vjollca Sadiraj & Ulrich Schmidt, 2015. "Paradoxes and mechanisms for choice under risk," Experimental Economics, Springer;Economic Science Association, vol. 18(2), pages 215-250, June.
    20. Marco LiCalzi, 2005. "A language for the construction of preferences under uncertainty," Game Theory and Information 0509002, University Library of Munich, Germany.
    21. Phillips Peter J. & Pohl Gabriela, 2018. "The Deferral of Attacks: SP/A Theory as a Model of Terrorist Choice when Losses Are Inevitable," Open Economics, De Gruyter, vol. 1(1), pages 71-85, February.
