
Generalized Principal-Agent Problem with a Learning Agent

Authors

  • Tao Lin
  • Yiling Chen

Abstract

In classic principal-agent problems such as Stackelberg games, contract design, and Bayesian persuasion, the agent best responds to the principal's committed strategy. We study repeated generalized principal-agent problems under the assumption that the principal does not have commitment power and the agent uses algorithms to learn to respond to the principal. We reduce this problem to a one-shot problem where the agent approximately best responds, and prove that: (1) If the agent uses contextual no-regret learning algorithms with regret $\mathrm{Reg}(T)$, then the principal can guarantee utility at least $U^* - \Theta\big(\sqrt{\tfrac{\mathrm{Reg}(T)}{T}}\big)$, where $U^*$ is the principal's optimal utility in the classic model with a best-responding agent. (2) If the agent uses contextual no-swap-regret learning algorithms with swap-regret $\mathrm{SReg}(T)$, then the principal cannot obtain utility more than $U^* + O\big(\tfrac{\mathrm{SReg}(T)}{T}\big)$. (3) In addition, if the agent uses mean-based learning algorithms (which can be no-regret but not no-swap-regret), then the principal can sometimes do significantly better than $U^*$. These results not only refine previous works on Stackelberg games and contract design, but also lead to new results for Bayesian persuasion with a learning agent and all generalized principal-agent problems where the agent does not have private information.
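
As a rough illustration of how the two bounds in (1) and (2) scale, suppose (purely as an assumption for this sketch, not a claim taken from the abstract) that both regret measures grow like $c\sqrt{T}$ for some constant $c > 0$. Substituting into the two expressions gives:

$$
\sqrt{\frac{\mathrm{Reg}(T)}{T}} = \sqrt{\frac{c\sqrt{T}}{T}} = \sqrt{c}\,T^{-1/4},
\qquad
\frac{\mathrm{SReg}(T)}{T} = \frac{c\sqrt{T}}{T} = c\,T^{-1/2}.
$$

Under this assumed rate, the guaranteed utility in (1) would be $U^* - \Theta(T^{-1/4})$ and the upper bound in (2) would be $U^* + O(T^{-1/2})$. Both gaps vanish as $T \to \infty$, but the gap in the lower bound shrinks at the slower rate.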

Suggested Citation

  • Tao Lin & Yiling Chen, 2024. "Generalized Principal-Agent Problem with a Learning Agent," Papers 2402.09721, arXiv.org, revised Oct 2025.
  • Handle: RePEc:arx:papers:2402.09721

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2402.09721
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Piotr Dworczak & Alessandro Pavan, 2022. "Preparing for the Worst but Hoping for the Best: Robust (Bayesian) Persuasion," Econometrica, Econometric Society, vol. 90(5), pages 2017-2051, September.
    2. Colin Camerer, 1998. "Bounded Rationality in Individual Decision Making," Experimental Economics, Springer;Economic Science Association, vol. 1(2), pages 163-183, September.
    3. Crawford, Vincent P & Sobel, Joel, 1982. "Strategic Information Transmission," Econometrica, Econometric Society, vol. 50(6), pages 1431-1451, November.
    4. Jiarui Gan & Minbiao Han & Jibang Wu & Haifeng Xu, 2022. "Generalized Principal-Agency: Contracts, Information, Games and Beyond," Papers 2209.01146, arXiv.org, revised Feb 2024.

    Citations


    Cited by:

    1. Ce Li & Tao Lin, 2024. "Information Design with Unknown Prior," Papers 2410.05533, arXiv.org, revised Sep 2025.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Semyon Malamud & Andreas Schrimpf, 2021. "Persuasion by Dimension Reduction," Swiss Finance Institute Research Paper Series 21-69, Swiss Finance Institute.
    2. Dirk Bergemann & Tan Gan & Yingkai Li, 2023. "Managing Persuasion Robustly: The Optimality of Quota Rules," Papers 2310.10024, arXiv.org, revised Dec 2025.
    3. , & Frechette, Guillaume & Perego, Jacopo, 2019. "Rules and Commitment in Communication," CEPR Discussion Papers 14085, C.E.P.R. Discussion Papers.
    4. Persson, Petra, 2018. "Attention manipulation and information overload," Behavioural Public Policy, Cambridge University Press, vol. 2(1), pages 78-106, May.
    5. Johannes Abeler & Armin Falk & Fabian Kosse, 2025. "Malleability of Preferences for Honesty," The Economic Journal, Royal Economic Society, vol. 135(667), pages 982-998.
    6. Mechtenberg, Lydia & Münster, Johannes, 2012. "A strategic mediator who is biased in the same direction as the expert can improve information transmission," Economics Letters, Elsevier, vol. 117(2), pages 490-492.
    7. Thomas de Haan & Theo Offerman & Randolph Sloof, 2015. "Money Talks? An Experimental Investigation Of Cheap Talk And Burned Money," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56(4), pages 1385-1426, November.
    8. Tim Besley & Rohini Pande, 1998. "Read my lips: the political economy of information transmission," IFS Working Papers W98/13, Institute for Fiscal Studies.
    9. Feltenstein, Andrew & Lagunoff, Roger, 2005. "International versus domestic auditing of bank solvency," Journal of International Economics, Elsevier, vol. 67(1), pages 73-96, September.
    10. Ronelle Burger & Canh Thien Dang & Trudy Owens, 2017. "Better performing NGOs do report more accurately: Evidence from investigating Ugandan NGO financial accounts," Discussion Papers 2017-10, University of Nottingham, CREDIT.
    11. David Laibson, 1997. "Golden Eggs and Hyperbolic Discounting," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 112(2), pages 443-478.
    12. Jan Marc Berk & Beata Bierut, 2009. "Communication in a monetary policy committee: a note," DNB Working Papers 226, Netherlands Central Bank, Research Department.
    13. Nasimeh Heydaribeni & Achilleas Anastasopoulos, 2019. "Linear Equilibria for Dynamic LQG Games with Asymmetric Information and Dependent Types," Papers 1909.04834, arXiv.org.
    14. Aleksei Smirnov & Egor Starkov, 2019. "Timing of predictions in dynamic cheap talk: experts vs. quacks," ECON - Working Papers 334, Department of Economics - University of Zurich.
    15. Shuo Liu & Dimitri Migrow, 2019. "Designing organizations in volatile markets," ECON - Working Papers 319, Department of Economics - University of Zurich.
    16. Francisco Silva, 2016. "Should the Government Provide Public Goods if it Cannot Commit?," Documentos de Trabajo 477, Instituto de Economia. Pontificia Universidad Católica de Chile..
    17. Fleckinger, Pierre, 2008. "Bayesian improvement of the phantom voters rule: An example of dichotomic communication," Mathematical Social Sciences, Elsevier, vol. 55(1), pages 1-13, January.
    18. Gehrig, Thomas & Güth, Werner & Levínský, René & Popova, Vera, 2010. "On the evolution of professional consulting," Journal of Economic Behavior & Organization, Elsevier, vol. 76(1), pages 113-126, October.
    19. Zhang, Pengfei & Li, Ji, 2025. "Notice-and-takedown as dispute resolution: An empirical analysis of GitHub notices," International Review of Law and Economics, Elsevier, vol. 83(C).
    20. Xiao-Jun Zhang, 2012. "Information relevance, reliability and disclosure," Review of Accounting Studies, Springer, vol. 17(1), pages 189-226, March.
