IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2303.08613.html
   My bibliography  Save this paper

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

Author

Listed:
  • Siyu Chen
  • Jibang Wu
  • Yifan Wu
  • Zhuoran Yang

Abstract

We study the incentivized information acquisition problem, where a principal hires an agent to gather information on her behalf. Such a problem is modeled as a Stackelberg game between the principal and the agent, where the principal announces a scoring rule that specifies the payment, and then the agent then chooses an effort level that maximizes her own profit and reports the information. We study the online setting of such a problem from the principal's perspective, i.e., designing the optimal scoring rule by repeatedly interacting with the strategic agent. We design a provably sample efficient algorithm that tailors the UCB algorithm (Auer et al., 2002) to our model, which achieves a sublinear $T^{2/3}$-regret after $T$ iterations. Our algorithm features a delicate estimation procedure for the optimal profit of the principal, and a conservative correction scheme that ensures the desired agent's actions are incentivized. Furthermore, a key feature of our regret bound is that it is independent of the number of states of the environment.

Suggested Citation

  • Siyu Chen & Jibang Wu & Yifan Wu & Zhuoran Yang, 2023. "Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model," Papers 2303.08613, arXiv.org, revised Aug 2023.
  • Handle: RePEc:arx:papers:2303.08613
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2303.08613
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Myerson, Roger B, 1979. "Incentive Compatibility and the Bargaining Problem," Econometrica, Econometric Society, vol. 47(1), pages 61-73, January.
    2. Jibang Wu & Zixuan Zhang & Zhe Feng & Zhaoran Wang & Zhuoran Yang & Michael I. Jordan & Haifeng Xu, 2022. "Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning," Papers 2202.10678, arXiv.org.
    3. Modibo Camara & Jason Hartline & Aleck Johnsen, 2020. "Mechanisms for a No-Regret Agent: Beyond the Common Prior," Papers 2009.05518, arXiv.org.
    4. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arve, Malin & Zwart, Gijsbert, 2023. "Optimal procurement and investment in new technologies under uncertainty," Journal of Economic Dynamics and Control, Elsevier, vol. 147(C).
    2. Hongpeng Guo & Zhihao Lv & Junyi Hua & Hongxu Yuan & Qingyu Yu, 2021. "Design of Combined Auction Model for Emission Rights of International Forestry Carbon Sequestration and Other Pollutants Based on SMRA," Sustainability, MDPI, vol. 13(20), pages 1-18, October.
    3. Andrea Attar & Thomas Mariotti & François Salanié, 2020. "The Social Costs of Side Trading," The Economic Journal, Royal Economic Society, vol. 130(630), pages 1608-1622.
    4. Azar, Pablo D. & Micali, Silvio, 2018. "Computational principal agent problems," Theoretical Economics, Econometric Society, vol. 13(2), May.
    5. Cramton, Peter & Gibbons, Robert & Klemperer, Paul, 1987. "Dissolving a Partnership Efficiently," Econometrica, Econometric Society, vol. 55(3), pages 615-632, May.
    6. Lau, Stephanie, 2011. "Investment incentives in bilateral trading," Games and Economic Behavior, Elsevier, vol. 73(2), pages 538-552.
    7. Angelica Gianfreda & Francesco Ravazzolo & Luca Rossini, 2023. "Large Time‐Varying Volatility Models for Hourly Electricity Prices," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(3), pages 545-573, June.
    8. Davide Pettenuzzo & Francesco Ravazzolo, 2016. "Optimal Portfolio Choice Under Decision‐Based Model Combinations," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(7), pages 1312-1332, November.
    9. Jianxiong Zhang & Lin Feng & Wansheng Tang, 2014. "Optimal Contract Design of Supplier-Led Outsourcing Based on Pontryagin Maximum Principle," Journal of Optimization Theory and Applications, Springer, vol. 161(2), pages 592-607, May.
    10. Rubio, F.J. & Steel, M.F.J., 2011. "Inference for grouped data with a truncated skew-Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3218-3231, December.
    11. Meirowitz, Adam, 2005. "Deliberative Democracy or Market Democracy: Designing Institutions to Aggregate Preferences and Information," Papers 03-28-2005, Princeton University, Research Program in Political Economy.
    12. Casey Rothschild & Florian Scheuer, 2014. "A Theory of Income Taxation under Multidimensional Skill Heterogeneity," NBER Working Papers 19822, National Bureau of Economic Research, Inc.
    13. Caillaud Bernard & Jullien Bruno, 1991. "Managerial incentives based on insider information," CEPREMAP Working Papers (Couverture Orange) 9128, CEPREMAP.
    14. Strausz, Roland, 2006. "Deterministic versus stochastic mechanisms in principal-agent models," Journal of Economic Theory, Elsevier, vol. 128(1), pages 306-314, May.
    15. Élodie Bertrand, 2006. "La thèse d'efficience du « théorème de Coase ». Quelle critique de la microéconomie ?," Revue économique, Presses de Sciences-Po, vol. 57(5), pages 983-1007.
    16. Gansterer, Margaretha & Hartl, Richard F. & Sörensen, Kenneth, 2020. "Pushing frontiers in auction-based transport collaborations," Omega, Elsevier, vol. 94(C).
    17. Hwang, Eunju, 2022. "Prediction intervals of the COVID-19 cases by HAR models with growth rates and vaccination rates in top eight affected countries: Bootstrap improvement," Chaos, Solitons & Fractals, Elsevier, vol. 155(C).
    18. R de Fondeville & A C Davison, 2018. "High-dimensional peaks-over-threshold inference," Biometrika, Biometrika Trust, vol. 105(3), pages 575-592.
    19. Armantier, Olivier & Treich, Nicolas, 2013. "Eliciting beliefs: Proper scoring rules, incentives, stakes and hedging," European Economic Review, Elsevier, vol. 62(C), pages 17-40.
    20. Francisco Silva, 2016. "Should the Government Provide Public Goods if it Cannot Commit?," Documentos de Trabajo 477, Instituto de Economia. Pontificia Universidad Católica de Chile..

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2303.08613. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.