IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2504.03743.html

Modelling bounded rational decision-making through Wasserstein constraints

Author

Listed:
  • Benjamin Patrick Evans
  • Leo Ardon
  • Sumitra Ganesh

Abstract

Modelling bounded rational decision-making through information constrained processing provides a principled approach for representing departures from rationality within a reinforcement learning framework, while still treating decision-making as an optimization process. However, existing approaches are generally based on Entropy, Kullback-Leibler divergence, or Mutual Information. In this work, we highlight issues with these approaches when dealing with ordinal action spaces. Specifically, entropy assumes uniform prior beliefs, missing the impact of a priori biases on decision-makings. KL-Divergence addresses this, however, has no notion of "nearness" of actions, and additionally, has several well known potentially undesirable properties such as the lack of symmetry, and furthermore, requires the distributions to have the same support (e.g. positive probability for all actions). Mutual information is often difficult to estimate. Here, we propose an alternative approach for modeling bounded rational RL agents utilising Wasserstein distances. This approach overcomes the aforementioned issues. Crucially, this approach accounts for the nearness of ordinal actions, modeling "stickiness" in agent decisions and unlikeliness of rapidly switching to far away actions, while also supporting low probability actions, zero-support prior distributions, and is simple to calculate directly.

Suggested Citation

  • Benjamin Patrick Evans & Leo Ardon & Sumitra Ganesh, 2025. "Modelling bounded rational decision-making through Wasserstein constraints," Papers 2504.03743, arXiv.org, revised May 2025.
  • Handle: RePEc:arx:papers:2504.03743
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2504.03743
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sims, Christopher A., 2003. "Implications of rational inattention," Journal of Monetary Economics, Elsevier, vol. 50(3), pages 665-690, April.
    2. Benjamin Patrick Evans & Mikhail Prokopenko, 2021. "A maximum entropy model of bounded rational decision-making with prior beliefs and market feedback," Papers 2102.09180, arXiv.org, revised May 2021.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin Patrick Evans & Mikhail Prokopenko, 2021. "Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-making," Papers 2106.15844, arXiv.org, revised Mar 2023.
    2. Benjamin Patrick Evans & Sumitra Ganesh, 2024. "Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning," Papers 2402.00787, arXiv.org.
    3. Benjamin Patrick Evans & Mikhail Prokopenko, 2024. "Bounded rationality for relaxing best response and mutual consistency: the quantal hierarchy model of decision making," Theory and Decision, Springer, vol. 96(1), pages 71-111, February.
    4. Persson, Petra, 2018. "Attention manipulation and information overload," Behavioural Public Policy, Cambridge University Press, vol. 2(1), pages 78-106, May.
    5. Kurt Lewis, 2009. "The Two-Period Rational Inattention Model: Accelerations and Analyses," Computational Economics, Springer;Society for Computational Economics, vol. 33(1), pages 79-97, February.
    6. Karun Adusumilli, 2022. "How to sample and when to stop sampling: The generalized Wald problem and minimax policies," Papers 2210.15841, arXiv.org, revised May 2025.
    7. Weijie Zhong, 2018. "The Indirect Cost of Information," Papers 1809.00697, arXiv.org, revised Apr 2020.
    8. Johannes Johnen, 2019. "Automatic‐renewal contracts with heterogeneous consumer inertia," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 28(4), pages 765-786, November.
    9. Bonomo, Marco Antônio Cesar & Terra, Maria Cristina T., 2005. "Special interests and political business cycles," FGV EPGE Economics Working Papers (Ensaios Economicos da EPGE) 597, EPGE Brazilian School of Economics and Finance - FGV EPGE (Brazil).
    10. Kim, Duk Gyoo & Yoon, Yeochang, 2019. "A theory of FAQs: Public announcements with rational ignorance," Journal of Economic Behavior & Organization, Elsevier, vol. 158(C), pages 560-574.
    11. Goethner, Maximilian & Hornuf, Lars & Regner, Tobias, 2021. "Protecting investors in equity crowdfunding: An empirical analysis of the small investor protection act," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    12. Philippe Jehiel & Jakub Steiner, 2020. "Selective Sampling with Information-Storage Constraints [On interim rationality, belief formation and learning in decision problems with bounded memory]," The Economic Journal, Royal Economic Society, vol. 130(630), pages 1753-1781.
    13. Sebastian Eichfelder & Mona Lau, 2015. "Capitalization of capital gains taxes: (In)attention and turn-of-the-year returns," FEMM Working Papers 150019, Otto-von-Guericke University Magdeburg, Faculty of Economics and Management.
    14. Sergeyev, Dmitriy & Iovino, Luigi, 2018. "Central Bank Balance Sheet Policies Without Rational Expectations," CEPR Discussion Papers 13100, C.E.P.R. Discussion Papers.
    15. Jonas D. M. Fisher & Leonardo Melosi & Sebastian Rast, 2025. "Long-Run Inflation Expectations," Working Papers 829, DNB.
    16. Atahan Afsar; José Elías Gallegos; Richard Jaimes; Edgar Silgado Gómez & Jos� El�as Gallegos & Richard Jaimes & Edgar Silgado G�mez, 2020. "Reconciling Empirics and Theory: The Behavioral Hybrid New Keynesian Model," Vniversitas Económica, Universidad Javeriana - Bogotá, vol. 0(0), pages 1-41.
    17. Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
    18. Fatemeh Mokhtarzadeh & Luba Petersen, 2021. "Coordinating expectations through central bank projections," Experimental Economics, Springer;Economic Science Association, vol. 24(3), pages 883-918, September.
    19. Benchimol, Jonathan & Bounader, Lahcen & Dotta, Mario, 2025. "Estimating Behavioral Inattention," Journal of Economic Behavior & Organization, Elsevier, vol. 236(C).
    20. repec:esx:essedp:770 is not listed on IDEAS
    21. Han, Leyla Jianyu & Kasa, Kenneth & Luo, Yulei, 2024. "Ambiguity, information processing, and financial intermediation," Journal of Economic Theory, Elsevier, vol. 222(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2504.03743. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.