IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.16472.html

Training Language Models for Bilateral Trade with Private Information

Author

Listed:
  • Dirk Bergemann
  • Soheil Ghili
  • Xinyang Hu
  • Chuanhao Li
  • Zhuoran Yang

Abstract

Bilateral bargaining under incomplete information provides a controlled testbed for evaluating large language model (LLM) agent capabilities. Bilateral trade demands individual rationality, strategic surplus maximization, and cooperation to realize gains from trade. We develop a structured bargaining environment where LLMs negotiate via tool calls within an event-driven simulator, separating binding offers from natural-language messages to enable automated evaluation. The environment serves two purposes: as a benchmark for frontier models and as a training environment for open-weight models via reinforcement learning. In benchmark experiments, a round-robin tournament among five frontier models (15,000 negotiations) reveals that effective strategies implement price discrimination through sequential offers. Aggressive anchoring, calibrated concession, and temporal patience correlate with the highest surplus share and deal rate. Accommodating strategies that concede quickly disable price discrimination in the buyer role, yielding the lowest surplus capture and deal completion. Stronger models scale their behavior proportionally to item value, maintaining performance across price tiers; weaker models perform well only when wide zones of possible agreement offset suboptimal strategies. In training experiments, we fine-tune Qwen3 (8B, 14B) via supervised fine-tuning (SFT) followed by Group Relative Policy Optimization (GRPO) against a fixed frontier opponent. These stages optimize competing objectives: SFT approximately doubles surplus share but reduces deal rates, while RL recovers deal rates but erodes surplus gains, reflecting the reward structure. SFT also compresses surplus variation across price tiers, which generalizes to unseen opponents, suggesting that behavioral cloning instills proportional strategies rather than memorized price points.

Suggested Citation

  • Dirk Bergemann & Soheil Ghili & Xinyang Hu & Chuanhao Li & Zhuoran Yang, 2026. "Training Language Models for Bilateral Trade with Private Information," Papers 2604.16472, arXiv.org.
  • Handle: RePEc:arx:papers:2604.16472
    as

    Download full text from publisher

    File URL: https://arxiv.org/pdf/2604.16472
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rubinstein, Ariel, 1982. "Perfect Equilibrium in a Bargaining Model," Econometrica, Econometric Society, vol. 50(1), pages 97-109, January.
    2. N.R. Jennings & P. Faratin & A.R. Lomuscio & S. Parsons & M.J. Wooldridge & C. Sierra, 2001. "Automated Negotiation: Prospects, Methods and Challenges," Group Decision and Negotiation, Springer, vol. 10(2), pages 199-215, March.
    3. John Riley & Richard Zeckhauser, 1983. "Optimal Selling Strategies: When to Haggle, When to Hold Firm," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 98(2), pages 267-289.
    4. Roger B. Myerson, 1981. "Optimal Auction Design," Mathematics of Operations Research, INFORMS, vol. 6(1), pages 58-73, February.
    5. Matthew Backus & Thomas Blakee & Brad Larsen & Steven Tadelis, 2020. "Sequential Bargaining in the Field: Evidence from Millions of Online Bargaining Interactions," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 135(3), pages 1319-1361.
    6. Joel Sobel & Ichiro Takahashi, 1983. "A Multistage Model of Bargaining," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 50(3), pages 411-426.
    7. Kalyan Chatterjee & William Samuelson, 1983. "Bargaining under Incomplete Information," Operations Research, INFORMS, vol. 31(5), pages 835-851, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David Bounies & Antoine Dubus & Patrick Waelbroeck, 2020. "Market for Information and Selling Mechanisms," Working Papers ECARES 2020-07, ULB -- Universite Libre de Bruxelles.
    2. Preyas Desai & Pranav Jindal, 2024. "Getting a Break in Bargaining: An Upside of Time Delays," Marketing Science, INFORMS, vol. 43(6), pages 1260-1278, November.
    3. Walter Beckert, 2004. "Dynamic Monopolies with Stochastic Demand," Birkbeck Working Papers in Economics and Finance 0404, Birkbeck, Department of Economics, Mathematics & Statistics.
    4. Correia-da-Silva, João, 2021. "Optimal priority pricing by a durable goods monopolist," Games and Economic Behavior, Elsevier, vol. 129(C), pages 310-328.
    5. Jean Tirole, 2016. "From Bottom of the Barrel to Cream of the Crop: Sequential Screening With Positive Selection," Econometrica, Econometric Society, vol. 84(4), pages 1291-1343, July.
    6. Olivier Bochet & Manshu Khanna & Simon Siegenthaler, 2024. "Beyond Dividing the Pie: Multi-Issue Bargaining in the Laboratory," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 91(1), pages 163-191.
    7. Milam, Garrett, 2006. "A laboratory study of haggling with deadlines," International Journal of Industrial Organization, Elsevier, vol. 24(3), pages 505-520, May.
    8. P. Ding & M. D. Gerst & G. Bang & M. E. Borsuk, 2015. "An Application of Automated Mediation to International Climate Treaty Negotiation," Group Decision and Negotiation, Springer, vol. 24(5), pages 885-903, September.
    9. Kjell Hausken, 1997. "Game-theoretic and Behavioral Negotiation Theory," Group Decision and Negotiation, Springer, vol. 6(6), pages 511-528, December.
    10. Kalyan Chatterjee & Gary L. Lilien, 1984. "Efficiency of Alternative Bargaining Procedures," Journal of Conflict Resolution, Peace Science Society (International), vol. 28(2), pages 270-295, June.
    11. David Bounie & Antoine Dubus & Patrick Waelbroeck, 2022. "Collecting and Selling Consumer Information: Selling Mechanisms Matter," Working Papers hal-02288708, HAL.
    12. Devanur, Nikhil R. & Peres, Yuval & Sivan, Balasubramanian, 2019. "Perfect Bayesian Equilibria in repeated sales," Games and Economic Behavior, Elsevier, vol. 118(C), pages 570-588.
    13. Roger B. Myerson, 1984. "An Introduction to Game Theory," Discussion Papers 623, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
    14. Sahoo, Satya & Cariou, Pierre, 2024. "Unveiling the influence of bargaining power in shipping: An empirical study on iron ore freight market," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 192(C).
    15. Peter C. Cramton, 1984. "Bargaining with Incomplete Information: An Infinite-Horizon Model with Two-Sided Uncertainty," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 51(4), pages 579-593.
    16. Etro, Federico, 2017. "Research in economics and game theory. A 70th anniversary," Research in Economics, Elsevier, vol. 71(1), pages 1-7.
    17. Kyung nok Chun & Zachary Schaller & Stergios Skaperdas, 2020. "Why Are There Strikes?," Revue d'économie politique, Dalloz, vol. 130(6), pages 929-956.
    18. Peter C. Cramton, 1992. "Strategic Delay in Bargaining with Two-Sided Uncertainty," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 59(1), pages 205-225.
    19. Sandro Shelegia & Joshua Sherman, 2022. "Bargaining at Retail Stores: Evidence from Vienna," Management Science, INFORMS, vol. 68(1), pages 27-36, January.
    20. Bradley J Larsen, 2021. "The Efficiency of Real-World Bargaining: Evidence from Wholesale Used-Auto Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(2), pages 851-882.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.16472. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: https://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.