Generalized Principal-Agent Problem with a Learning Agent

Author

Listed:

Tao Lin
Yiling Chen

Abstract

In classic principal-agent problems such as Stackelberg games, contract design, and Bayesian persuasion, the agent best responds to the principal's committed strategy. We study repeated generalized principal-agent problems under the assumption that the principal does not have commitment power and the agent uses algorithms to learn to respond to the principal. We reduce this problem to a one-shot problem where the agent approximately best responds, and prove that: (1) If the agent uses contextual no-regret learning algorithms with regret $\mathrm{Reg}(T)$, then the principal can guarantee utility at least $U^* - \Theta\big(\sqrt{\tfrac{\mathrm{Reg}(T)}{T}}\big)$, where $U^*$ is the principal's optimal utility in the classic model with a best-responding agent. (2) If the agent uses contextual no-swap-regret learning algorithms with swap-regret $\mathrm{SReg}(T)$, then the principal cannot obtain utility more than $U^* + O\big(\tfrac{\mathrm{SReg}(T)}{T}\big)$. (3) In addition, if the agent uses mean-based learning algorithms (which can be no-regret but not no-swap-regret), then the principal can sometimes do significantly better than $U^*$. These results not only refine previous works on Stackelberg games and contract design, but also lead to new results for Bayesian persuasion with a learning agent and all generalized principal-agent problems where the agent does not have private information.
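
As a rough numerical illustration of bounds (1) and (2), the sketch below assumes the agent's algorithm has $\mathrm{Reg}(T) = c_{\mathrm{reg}}\sqrt{T}$ and $\mathrm{SReg}(T) = c_{\mathrm{sreg}}\sqrt{T}$. Those growth rates, the constants `c_reg` and `c_sreg`, and the function name `principal_utility_gaps` are assumptions made here for illustration and are not taken from the paper; the constants hidden inside $\Theta(\cdot)$ and $O(\cdot)$ are set to 1. Under these assumptions the lower-bound gap $\sqrt{\mathrm{Reg}(T)/T}$ shrinks like $T^{-1/4}$, while the upper-bound gap $\mathrm{SReg}(T)/T$ shrinks like $T^{-1/2}$.

```python
import math


def principal_utility_gaps(T: int, c_reg: float = 1.0, c_sreg: float = 1.0):
    """Return (lower_gap, upper_gap) for a horizon of T rounds.

    Illustrative assumptions (not from the paper):
      Reg(T)  = c_reg  * sqrt(T)
      SReg(T) = c_sreg * sqrt(T)
    and all hidden constants in Theta(.) and O(.) are taken to be 1.

    lower_gap: the principal can guarantee about U* - lower_gap
               against a no-regret agent, with lower_gap = sqrt(Reg(T) / T).
    upper_gap: the principal gets at most about U* + upper_gap
               against a no-swap-regret agent, with upper_gap = SReg(T) / T.
    """
    if T <= 0:
        raise ValueError("T must be a positive number of rounds")
    reg = c_reg * math.sqrt(T)
    sreg = c_sreg * math.sqrt(T)
    lower_gap = math.sqrt(reg / T)
    upper_gap = sreg / T
    return lower_gap, upper_gap


if __name__ == "__main__":
    # Both gaps vanish as T grows, but at different rates.
    for T in (10**2, 10**4, 10**6):
        lower_gap, upper_gap = principal_utility_gaps(T)
        print(f"T={T:>8}: U* - {lower_gap:.4f}  <=  utility  <=  U* + {upper_gap:.4f}")
```

Both gaps tend to zero as $T$ grows, which matches the abstract's message that a principal facing a no-swap-regret agent ends up close to $U^*$; the two rates differ only because of the square root in bound (1).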

Suggested Citation

Tao Lin & Yiling Chen, 2024. "Generalized Principal-Agent Problem with a Learning Agent," Papers 2402.09721, arXiv.org, revised Oct 2025.

Handle: RePEc:arx:papers:2402.09721

References listed on IDEAS

Piotr Dworczak & Alessandro Pavan, 2022. "Preparing for the Worst but Hoping for the Best: Robust (Bayesian) Persuasion," Econometrica, Econometric Society, vol. 90(5), pages 2017-2051, September.
- Pavan, Alessandro & Dworczak, Piotr, 2020. "Preparing for the Worst But Hoping for the Best: Robust (Bayesian) Persuasion," CEPR Discussion Papers 15017, C.E.P.R. Discussion Papers.
Colin Camerer, 1998. "Bounded Rationality in Individual Decision Making," Experimental Economics, Springer;Economic Science Association, vol. 1(2), pages 163-183, September.
- Camerer, Colin, 1998. "Bounded Rationality in Individual Decision Making," Working Papers 1029, California Institute of Technology, Division of the Humanities and Social Sciences.
Jiarui Gan & Minbiao Han & Jibang Wu & Haifeng Xu, 2022. "Generalized Principal-Agency: Contracts, Information, Games and Beyond," Papers 2209.01146, arXiv.org, revised Feb 2024.

Citations

Cited by:

Ce Li & Tao Lin, 2024. "Information Design with Unknown Prior," Papers 2410.05533, arXiv.org, revised Sep 2025.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

David Laibson, 1997. "Golden Eggs and Hyperbolic Discounting," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 112(2), pages 443-478.
- Laibson, David I., 1997. "Golden Eggs and Hyperbolic Discounting," Scholarly Articles 4481499, Harvard University Department of Economics.
Bocqueho, Geraldine & Jacquet, Florence & Reynaud, Arnaud, 2011. "Expected Utility or Prospect Theory Maximizers? Results from a Structural Model based on Field-experiment Data," 2011 International Congress, August 30-September 2, 2011, Zurich, Switzerland 114257, European Association of Agricultural Economists.
Tommaso Denti & Doron Ravid, 2023. "Robust Predictions in Games with Rational Inattention," Papers 2306.09964, arXiv.org.
Asprilla-Echeverria, John, 2024. "How do farmers adapt to water scarcity? Evidence from field experiments," Agricultural Water Management, Elsevier, vol. 297(C).
Roger J. Jiao & Feng Zhou & Chih-Hsing Chu, 2017. "Decision theoretic modeling of affective and cognitive needs for product experience engineering: key issues and a conceptual framework," Journal of Intelligent Manufacturing, Springer, vol. 28(7), pages 1755-1767, October.
Yulia Evsyukova & Federico Innocenti & Niccolò Lomys, 2024. "Optimal Multiple Loan Contracting under Sequential Audits and Contagion Losses," CSEF Working Papers 743, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
Martin Richardson, 2021. "Of hired guns and ideologues: why would a law firm ever retain an honest expert witness?," ANU Working Papers in Economics and Econometrics 2021-678, Australian National University, College of Business and Economics, School of Economics.
Ulrich Schmidt & Horst Zank, 2012. "A genuine foundation for prospect theory," Journal of Risk and Uncertainty, Springer, vol. 45(2), pages 97-113, October.
- Ulrich Schmidt & Horst Zank, 2011. "A Genuine Foundation for Prospect Theory," Economics Discussion Paper Series 1114, Economics, The University of Manchester.
Robison, Lindon J. & Shupp, Robert S. & Myers, Robert J., 2010. "Expected utility paradoxes," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 39(2), pages 187-193, April.
Daniel, Kent & Hirshleifer, David & Teoh, Siew Hong, 2002. "Investor psychology in capital markets: evidence and policy implications," Journal of Monetary Economics, Elsevier, vol. 49(1), pages 139-209, January.
Feng Li & Zhi-Ping Fan & Bing-Bing Cao & Xin Li, 2020. "Logistics Service Mode Selection for Last Mile Delivery: An Analysis Method Considering Customer Utility and Delivery Service Cost," Sustainability, MDPI, vol. 13(1), pages 1-22, December.
Pessali, Huascar & Berger, Bruno, 2010. "A teoria da perspectiva e as mudanças de preferência no mainstream: um prospecto lakatoseano [Prospect theory and preference change in the mainstream of economics: a Lakatosian prospect]," MPRA Paper 26104, University Library of Munich, Germany.
Ralph-C Bayer, 2003. "Income Tax Evasion with Morally Constraint Taxpayers: The Role of Evasion Opportunities and Evasion Cost," School of Economics and Public Policy Working Papers 2003-04, University of Adelaide, School of Economics and Public Policy.
Guru Guruganesh & Jon Schneider & Joshua Wang & Junyao Zhao, 2023. "The Power of Menus in Contract Design," Papers 2306.12667, arXiv.org.
Ce Li & Tao Lin, 2024. "Information Design with Unknown Prior," Papers 2410.05533, arXiv.org, revised Sep 2025.
Marco Persichina, 2024. "Present Bias in Renewable Resource Management and Agent's Welfare," Journal of Interdisciplinary Economics, vol. 36(1), pages 79-97, January.
Semyon Malamud & Andreas Schrimpf, 2021. "Persuasion by Dimension Reduction," Swiss Finance Institute Research Paper Series 21-69, Swiss Finance Institute.
- Semyon Malamud & Andreas Schrimpf, 2021. "Persuasion by Dimension Reduction," Papers 2110.08884, arXiv.org, revised Oct 2022.
Cardenas, Juan-Camilo & Ostrom, Elinor, 2004. "What do people bring into the game? Experiments in the field about cooperation in the commons," Agricultural Systems, Elsevier, vol. 82(3), pages 307-326, December.
- Cardenas, Juan-Camilo & Ostrom, Elinor, 2004. "What do people bring into the game : experiments in the field about cooperation in the commons," CAPRi Working Papers 51816, CGIAR, International Food Policy Research Institute (IFPRI).
- Cárdenas, Juan-Camilo & Ostrom, Elinor, 2004. "What do people bring into the game: experiments in the field about cooperation in the commons," CAPRi working papers 32, International Food Policy Research Institute (IFPRI).
- Juan-Camilo Cardenas & Elinor Ostrom, 2004. "What do people bring into the game? Experiments in the field about cooperation in the commons," Artefactual Field Experiments 00027, The Field Experiments Website.
Tisserand, Jean-Christian & Hopfensitz, Astrid & Blondel, Serge & Loheac, Youenn & Mantilla, César & Mateu, Guillermo & Rosaz, Julie & Rozan, Anne & Willinger, Marc & Sutan, Angela, 2022. "Management of common pool resources in a nation-wide experiment," Ecological Economics, Elsevier, vol. 201(C).
- Jean-Christian Tisserand & Astrid Hopfensitz & Serge Blondel & Youenn Loheac & César Mantilla & Guillermo Mateu & Julie Rosaz & Anne Rozan & Marc Willinger & Angela Sutan, 2022. "Management of common pool resources in a nation-wide experiment," Post-Print hal-03762599, HAL.
- Jean-Christian Tisserand & Astrid Hopfensitz & Serge Blondel & Youenn Loheac & Cesar Mantilla & Guillermo Mateu & Julie Rosaz & Anne Rozan & Marc Willinger & Angela Sutan, 2022. "Management of common pool resources in a nation-wide experiment," Post-Print hal-04325585, HAL.
- Jean-Christian Tisserand & Astrid Hopfensitz & Serge Blondel & Youenn Loheac & César Mantilla & Guillermo Mateu & Julie Rosaz & Anne Rozan & Marc Willinger & Angela Sutan, 2022. "Management of common pool resources in a nation-wide experiment," Post-Print hal-04075051, HAL.
Rodepeter, Ralf & Winter, Joachim, 1999. "Rules of thumb in life-cycle savings models," Sonderforschungsbereich 504 Publications 99-81, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
- Ralf Rodepeter & Joachim K. Winter, 2000. "Rules of Thumb in Life-Cycle Savings Models," Econometric Society World Congress 2000 Contributed Papers 1222, Econometric Society.
- Rodepeter, Ralf & Winter, Joachim, 1999. "Rules of thumb in life-cycle savings models," Papers 99-81, Sonderforschungsbreich 504.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CTA-2024-03-18 (Contract Theory and Applications)
NEP-GTH-2024-03-18 (Game Theory)
NEP-MIC-2024-03-18 (Microeconomics)
NEP-UPT-2024-03-18 (Utility Models and Prospect Theory)
