IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2503.05828.html
   My bibliography  Save this paper

Market-based Architectures in RL and Beyond

Author

Listed:
  • Abhimanyu Pallavi Sudhir
  • Long Tran-Thanh

Abstract

Market-based agents refer to reinforcement learning agents which determine their actions based on an internal market of sub-agents. We introduce a new type of market-based algorithm where the state itself is factored into several axes called ``goods'', which allows for greater specialization and parallelism than existing market-based RL algorithms. Furthermore, we argue that market-based algorithms have the potential to address many current challenges in AI, such as search, dynamic scaling and complete feedback, and demonstrate that they may be seen to generalize neural networks; finally, we list some novel ways that market algorithms may be applied in conjunction with Large Language Models for immediate practical applicability.

Suggested Citation

  • Abhimanyu Pallavi Sudhir & Long Tran-Thanh, 2025. "Market-based Architectures in RL and Beyond," Papers 2503.05828, arXiv.org.
  • Handle: RePEc:arx:papers:2503.05828
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2503.05828
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Karim Jamal & Michael Maier & Shyam Sunder, 2017. "Simple Agents, Intelligent Markets," Computational Economics, Springer;Society for Computational Economics, vol. 49(4), pages 653-675, April.
    2. Gul, Faruk & Stacchetti, Ennio, 1999. "Walrasian Equilibrium with Gross Substitutes," Journal of Economic Theory, Elsevier, vol. 87(1), pages 95-124, July.
    3. Gode, Dhananjay K & Sunder, Shyam, 1993. "Allocative Efficiency of Markets with Zero-Intelligence Traders: Market as a Partial Substitute for Individual Rationality," Journal of Political Economy, University of Chicago Press, vol. 101(1), pages 119-137, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lu, Dong & Zhan, Yaosong, 2022. "Over-the-counter versus double auction in asset markets with near-zero-intelligence traders," Journal of Economic Dynamics and Control, Elsevier, vol. 143(C).
    2. Jamal, Karim & Maier, Michael & Sunder, Shyam, 2024. "Emergence of information aggregation to rational expectations equilibria in markets populated by biased heuristic traders," Journal of Economic Behavior & Organization, Elsevier, vol. 228(C).
    3. Berg, Joyce E. & Rietz, Thomas A., 2019. "Longshots, overconfidence and efficiency on the Iowa Electronic Market," International Journal of Forecasting, Elsevier, vol. 35(1), pages 271-287.
    4. Daniel Sutter & Daniel J. Smith, 2017. "Coordination in disaster: Nonprice learning and the allocation of resources after natural disasters," The Review of Austrian Economics, Springer;Society for the Development of Austrian Economics, vol. 30(4), pages 469-492, December.
    5. Erlanson, Albin & Szwagrzak, Karol, 2013. "Strategy-Proof Package Assignment," Working Papers 2013:43, Lund University, Department of Economics.
    6. Simon, Herbert A., 2000. "Barriers and bounds to Rationality," Structural Change and Economic Dynamics, Elsevier, vol. 11(1-2), pages 243-253, July.
    7. G. A. Koshevoy, 2016. "Stability of rejections and Stable Many-to-Many Matchings," Documents de recherche 16-02, Centre d'Études des Politiques Économiques (EPEE), Université d'Evry Val d'Essonne.
    8. Lovric, M. & Kaymak, U. & Spronk, J., 2008. "A Conceptual Model of Investor Behavior," ERIM Report Series Research in Management ERS-2008-030-F&A, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    9. Makarewicz, Tomasz, 2021. "Traders, forecasters and financial instability: A model of individual learning of anchor-and-adjustment heuristics," Journal of Economic Behavior & Organization, Elsevier, vol. 190(C), pages 626-673.
    10. Yokote, Koji, 2021. "Consistency of the doctor-optimal equilibrium price vector in job-matching markets," Journal of Economic Theory, Elsevier, vol. 197(C).
    11. Chao Huang, 2021. "Stable matching: an integer programming approach," Papers 2103.03418, arXiv.org, revised Apr 2022.
    12. Robin Nicole & Aleksandra Alori'c & Peter Sollich, 2020. "Fragmentation in trader preferences among multiple markets: Market coexistence versus single market dominance," Papers 2012.04103, arXiv.org, revised Aug 2021.
    13. Kazuo Murota & Akiyoshi Shioura & Zaifu Yang, 2014. "Time Bounds for Iterative Auctions: A Unified Approach by Discrete Convex Analysis," Discussion Papers 14/27, Department of Economics, University of York.
    14. Lange, Andreas & Ross, Johannes, 2024. "Internalizing match-dependent externalities," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 356-378.
    15. Juan Manuel Larrosa, 2016. "Agentes computacionales y análisis económico," Revista de Economía Institucional, Universidad Externado de Colombia - Facultad de Economía, vol. 18(34), pages 87-113, January-J.
    16. Richard B. Freeman, 2007. "Labor Market Institutions Around the World," NBER Working Papers 13242, National Bureau of Economic Research, Inc.
    17. Gaël Giraud & Céline Rochon, 2010. "Transition to Equilibrium in International Trades," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-00657038, HAL.
    18. Tesfatsion, Leigh, 1998. "Ex Ante Capacity Effects in Evolutionary Labor Markets with Adaptive Search," ISU General Staff Papers 199810010700001046, Iowa State University, Department of Economics.
    19. Nicolas C. Bedard & Jacob K. Goeree & Philippos Louis & Jingjing Zhang, 2020. "The Favored but Flawed Simultaneous Multiple-Round Auction," Working Paper Series 2020/03, Economics Discipline Group, UTS Business School, University of Technology, Sydney.
    20. Yeh, Chia-Hsuan & Yang, Chun-Yi, 2010. "Examining the effectiveness of price limits in an artificial stock market," Journal of Economic Dynamics and Control, Elsevier, vol. 34(10), pages 2089-2108, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2503.05828. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.