Printed from https://ideas.repec.org/p/arx/papers/2504.03743.html

Modelling bounded rational decision-making through Wasserstein constraints

Author

Listed:
  • Benjamin Patrick Evans
  • Leo Ardon
  • Sumitra Ganesh

Abstract

Modelling bounded rational decision-making through information-constrained processing provides a principled approach for representing departures from rationality within a reinforcement learning framework, while still treating decision-making as an optimization process. However, existing approaches are generally based on entropy, Kullback-Leibler (KL) divergence, or mutual information. In this work, we highlight issues with these approaches when dealing with ordinal action spaces. Specifically, entropy assumes uniform prior beliefs, missing the impact of a priori biases on decision-making. KL divergence addresses this but has no notion of "nearness" of actions; it also has several well-known, potentially undesirable properties, such as a lack of symmetry, and it requires the distributions to share the same support (e.g. positive probability for all actions). Mutual information is often difficult to estimate. Here, we propose an alternative approach for modelling bounded rational RL agents using Wasserstein distances, which overcomes the aforementioned issues. Crucially, this approach accounts for the nearness of ordinal actions, modelling "stickiness" in agent decisions and the unlikeliness of rapidly switching to far-away actions, while also supporting low-probability actions and zero-support prior distributions, and it is simple to calculate directly.
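
The contrast the abstract draws between KL divergence and Wasserstein distance on ordinal actions can be illustrated with a small sketch. This is not the authors' implementation: the five-action space, the example distributions, the unit spacing between adjacent actions, and the helper names `kl_divergence` and `wasserstein_1` are all assumptions made for illustration. For one-dimensional ordinal actions with unit spacing, the 1-Wasserstein distance reduces to the sum of absolute differences between the two cumulative distributions.

```python
import math
from itertools import accumulate


def kl_divergence(p, q):
    """KL(p || q) in nats; infinite when p puts mass where q has none."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # 0 * log(0 / q) is taken as 0
        if qi == 0.0:
            return math.inf  # support mismatch
        total += pi * math.log(pi / qi)
    return total


def wasserstein_1(p, q):
    """W1 on ordinal actions 0..n-1 with unit spacing (an assumption):
    the sum of absolute differences between the two CDFs."""
    return sum(abs(a - b) for a, b in zip(accumulate(p), accumulate(q)))


# Hypothetical prior concentrated on action 0, with full support.
prior = [0.6, 0.1, 0.1, 0.1, 0.1]
near = [0.1, 0.6, 0.1, 0.1, 0.1]  # mass moved to the adjacent action
far = [0.1, 0.1, 0.1, 0.1, 0.6]   # mass moved to the farthest action

# Hypothetical point-mass prior: zero probability on actions 1..4.
prior_det = [1.0, 0.0, 0.0, 0.0, 0.0]
near_det = [0.0, 1.0, 0.0, 0.0, 0.0]
far_det = [0.0, 0.0, 0.0, 0.0, 1.0]

if __name__ == "__main__":
    print("KL near/far:", kl_divergence(near, prior), kl_divergence(far, prior))
    print("W1 near/far:", wasserstein_1(near, prior), wasserstein_1(far, prior))
    print("KL near/far (point-mass prior):",
          kl_divergence(near_det, prior_det), kl_divergence(far_det, prior_det))
    print("W1 near/far (point-mass prior):",
          wasserstein_1(near_det, prior_det), wasserstein_1(far_det, prior_det))
```

In this sketch, KL divergence assigns the same cost to the near and far policies under the full-support prior and becomes infinite under the point-mass prior, whereas the Wasserstein distance stays finite in both cases and grows with how far the probability mass has to travel.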

Suggested Citation

  • Benjamin Patrick Evans & Leo Ardon & Sumitra Ganesh, 2025. "Modelling bounded rational decision-making through Wasserstein constraints," Papers 2504.03743, arXiv.org, revised May 2025.
  • Handle: RePEc:arx:papers:2504.03743

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2504.03743
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Sims, Christopher A., 2003. "Implications of rational inattention," Journal of Monetary Economics, Elsevier, vol. 50(3), pages 665-690, April.
    2. Benjamin Patrick Evans & Mikhail Prokopenko, 2021. "A maximum entropy model of bounded rational decision-making with prior beliefs and market feedback," Papers 2102.09180, arXiv.org, revised May 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin Patrick Evans & Mikhail Prokopenko, 2021. "Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-making," Papers 2106.15844, arXiv.org, revised Mar 2023.
    2. Benjamin Patrick Evans & Sumitra Ganesh, 2024. "Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning," Papers 2402.00787, arXiv.org.
    3. Benjamin Patrick Evans & Mikhail Prokopenko, 2024. "Bounded rationality for relaxing best response and mutual consistency: the quantal hierarchy model of decision making," Theory and Decision, Springer, vol. 96(1), pages 71-111, February.
    4. Persson, Petra, 2018. "Attention manipulation and information overload," Behavioural Public Policy, Cambridge University Press, vol. 2(1), pages 78-106, May.
    5. Kurt Lewis, 2009. "The Two-Period Rational Inattention Model: Accelerations and Analyses," Computational Economics, Springer;Society for Computational Economics, vol. 33(1), pages 79-97, February.
    6. Karun Adusumilli, 2022. "How to sample and when to stop sampling: The generalized Wald problem and minimax policies," Papers 2210.15841, arXiv.org, revised May 2025.
    7. Weijie Zhong, 2018. "The Indirect Cost of Information," Papers 1809.00697, arXiv.org, revised Apr 2020.
    8. Johannes Johnen, 2019. "Automatic‐renewal contracts with heterogeneous consumer inertia," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 28(4), pages 765-786, November.
    9. Bonomo, Marco Antônio Cesar & Terra, Maria Cristina T., 2005. "Special interests and political business cycles," FGV EPGE Economics Working Papers (Ensaios Economicos da EPGE) 597, EPGE Brazilian School of Economics and Finance - FGV EPGE (Brazil).
    10. George-Marios Angeletos & Chen Lian, 2018. "Forward Guidance without Common Knowledge," American Economic Review, American Economic Association, vol. 108(9), pages 2477-2512, September.
    11. Kim, Duk Gyoo & Yoon, Yeochang, 2019. "A theory of FAQs: Public announcements with rational ignorance," Journal of Economic Behavior & Organization, Elsevier, vol. 158(C), pages 560-574.
    12. Pierre Fleckinger & Matthieu Glachant & Gabrielle Moineville, 2017. "Incentives for Quality in Friendly and Hostile Informational Environments," American Economic Journal: Microeconomics, American Economic Association, vol. 9(1), pages 242-274, February.
    13. Goethner, Maximilian & Hornuf, Lars & Regner, Tobias, 2021. "Protecting investors in equity crowdfunding: An empirical analysis of the small investor protection act," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    14. Sushant Acharya & Shu Lin Wee, 2020. "Rational Inattention in Hiring Decisions," American Economic Journal: Macroeconomics, American Economic Association, vol. 12(1), pages 1-40, January.
    15. Philippe Jehiel & Jakub Steiner, 2020. "Selective Sampling with Information-Storage Constraints [On interim rationality, belief formation and learning in decision problems with bounded memory]," The Economic Journal, Royal Economic Society, vol. 130(630), pages 1753-1781.
    16. Sebastian Eichfelder & Mona Lau, 2015. "Capitalization of capital gains taxes: (In)attention and turn-of-the-year returns," FEMM Working Papers 150019, Otto-von-Guericke University Magdeburg, Faculty of Economics and Management.
    17. Ichiro Fukunaga, 2007. "Imperfect Common Knowledge, Staggered Price Setting, and the Effects of Monetary Policy," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 39(7), pages 1711-1739, October.
    18. Luminita Stevens, 2020. "Coarse Pricing Policies," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 87(1), pages 420-453.
    19. Bennani, Hamza, 2018. "Media coverage and ECB policy-making: Evidence from an augmented Taylor rule," Journal of Macroeconomics, Elsevier, vol. 57(C), pages 26-38.
    20. Etienne Gagnon & David López-Salido & Nicolas Vincent, 2013. "Individual Price Adjustment along the Extensive Margin," NBER Macroeconomics Annual, University of Chicago Press, vol. 27(1), pages 235-281.
