Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning

Author

Listed:

Jibang Wu
Zixuan Zhang
Zhe Feng
Zhaoran Wang
Zhuoran Yang
Michael I. Jordan
Haifeng Xu

Abstract

In today's economy, it has become important for Internet platforms to consider the sequential information design problem in order to align their long-term interests with the incentives of gig service providers. This paper proposes a novel model of sequential information design, namely Markov persuasion processes (MPPs), in which a sender with an informational advantage seeks to persuade a stream of myopic receivers to take actions that maximize the sender's cumulative utility in a finite-horizon Markovian environment with varying priors and utility functions. Planning in MPPs thus faces the unique challenge of finding a signaling policy that is simultaneously persuasive to the myopic receivers and induces the optimal long-term cumulative utility for the sender. Nevertheless, at the population level, where the model is known, it turns out that we can efficiently determine the optimal (resp. $\epsilon$-optimal) policy with finite (resp. infinite) states and outcomes through a modified formulation of the Bellman equation. Our main technical contribution is to study the MPP in the online reinforcement learning (RL) setting, where the goal is to learn the optimal signaling policy by interacting with the underlying MPP without knowledge of the sender's utility functions, prior distributions, or Markov transition kernels. We design a provably efficient no-regret learning algorithm, the Optimism-Pessimism Principle for Persuasion Process (OP4), which features a novel combination of the optimism and pessimism principles. Our algorithm enjoys sample efficiency by achieving a sublinear $\sqrt{T}$-regret upper bound. Furthermore, both our algorithm and theory can be applied to MPPs with large spaces of outcomes and states via function approximation, and we showcase such a success in the linear setting.
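As a rough illustration of what "persuasive to the myopic receivers" can mean at a single step, the sketch below checks the standard obedience constraints of a direct signaling scheme, that is, a scheme whose signals are action recommendations. It is not taken from the paper: the function name `is_persuasive`, the list-of-lists data layout, and the two-outcome numbers are all assumptions made for illustration, and the multi-step Bellman recursion and the OP4 algorithm are not modeled here.

```python
def is_persuasive(prior, scheme, receiver_utility, tol=1e-9):
    """Check obedience constraints for a direct signaling scheme (illustrative).

    prior[w]               : probability of outcome w
    scheme[w][a]           : probability of recommending action a given outcome w
    receiver_utility[w][a] : myopic receiver's utility for action a at outcome w

    The scheme is treated as persuasive if, for every recommended action a and
    every alternative action alt, following the recommendation is at least as
    good in expectation as deviating to alt.
    """
    n_outcomes = len(prior)
    n_actions = len(scheme[0])
    for a in range(n_actions):
        for alt in range(n_actions):
            # Unnormalized expected gain from obeying recommendation a
            # instead of deviating to alt.
            gain = sum(
                prior[w] * scheme[w][a]
                * (receiver_utility[w][a] - receiver_utility[w][alt])
                for w in range(n_outcomes)
            )
            if gain < -tol:
                return False
    return True


# Hypothetical two-outcome, two-action instance (numbers are assumptions).
# Outcomes: 0 = "bad", 1 = "good". Actions: 0 = "decline", 1 = "accept".
prior = [0.7, 0.3]
receiver_utility = [[1.0, 0.0],   # bad outcome: declining is correct
                    [0.0, 1.0]]   # good outcome: accepting is correct

# Recommend "accept" always when good, and 40% of the time when bad.
partial_disclosure = [[0.6, 0.4],
                      [0.0, 1.0]]

# Recommend "accept" regardless of the outcome (no information).
always_accept = [[0.0, 1.0],
                 [0.0, 1.0]]

print(is_persuasive(prior, partial_disclosure, receiver_utility))
print(is_persuasive(prior, always_accept, receiver_utility))
```

In this toy instance the partially informative scheme satisfies every obedience constraint, while the uninformative "always accept" scheme does not, because a receiver who holds only the prior would rather decline.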

Suggested Citation

Jibang Wu & Zixuan Zhang & Zhe Feng & Zhaoran Wang & Zhuoran Yang & Michael I. Jordan & Haifeng Xu, 2022. "Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning," Papers 2202.10678, arXiv.org.

Handle: RePEc:arx:papers:2202.10678

Citations

Cited by:

Krishnamurthy Iyer & Haifeng Xu & You Zu, 2023. "Markov Persuasion Processes with Endogenous Agent Beliefs," Papers 2307.03181, arXiv.org, revised Jul 2023.
Siyu Chen & Jibang Wu & Yifan Wu & Zhuoran Yang, 2023. "Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model," Papers 2303.08613, arXiv.org, revised Aug 2023.
Natalie Collina & Aaron Roth & Han Shao, 2023. "Efficient Prior-Free Mechanisms for No-Regret Agents," Papers 2311.07754, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Miltiadis Makris & Ludovic Renou, 2018. "Information design in multi-stage games," Working Papers 861, Queen Mary University of London, School of Economics and Finance.
- Miltiadis Makris & Ludovic Renou, 2021. "Information Design in Multi-stage Games," Papers 2102.13482, arXiv.org, revised Apr 2021.
Escudé, Matteo & Sinander, Ludvig, 2023. "Slow persuasion," Theoretical Economics, Econometric Society, vol. 18(1), January.
- Matteo Escudé & Ludvig Sinander, 2019. "Slow persuasion," Papers 1903.09055, arXiv.org, revised Apr 2022.
Ozan Candogan & Philipp Strack, 2021. "Optimal Disclosure of Information to a Privately Informed Receiver," Papers 2101.10431, arXiv.org, revised Jan 2022.
Gu, Jiadong, 2023. "Optimal stress tests and liquidation cost," Journal of Economic Dynamics and Control, Elsevier, vol. 146(C).
Leitner, Yaron & Yilmaz, Bilge, 2019. "Regulating a model," Journal of Financial Economics, Elsevier, vol. 131(2), pages 251-268.
Saed Alizamir & Francis de Véricourt & Shouqiang Wang, 2020. "Warning Against Recurring Risks: An Information Design Approach," Management Science, INFORMS, vol. 66(10), pages 4612-4629, October.
Parakhonyak, Alexei & Vikander, Nick, 2023. "Information design through scarcity and social learning," Journal of Economic Theory, Elsevier, vol. 207(C).
Farzaneh Farhadi & Demosthenis Teneketzis, 2022. "Dynamic Information Design: A Simple Problem on Optimal Sequential Information Disclosure," Dynamic Games and Applications, Springer, vol. 12(2), pages 443-484, June.
Babichenko, Yakov & Talgam-Cohen, Inbal & Xu, Haifeng & Zabarnyi, Konstantin, 2022. "Regret-minimizing Bayesian persuasion," Games and Economic Behavior, Elsevier, vol. 136(C), pages 226-248.
Emir Kamenica & Kyungmin Kim & Andriy Zapechelnyuk, 2021. "Bayesian persuasion and information design: perspectives and open issues," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 72(3), pages 701-704, October.
Koessler, Frederic & Laclau, Marie & Renault, Jérôme & Tomala, Tristan, 2022. "Long information design," Theoretical Economics, Econometric Society, vol. 17(2), May.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2021. "Long Information Design," PSE Working Papers halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long information design," PSE-Ecole d'économie de Paris (Postprint) hal-03700394, HAL.
- Koessler, Frédéric & Laclau, Marie & Renault, Jérôme & Tomala, Tristan, 2022. "Long information design," TSE Working Papers 22-1341, Toulouse School of Economics (TSE).
- Marie Laclau & Frédéric Koessler & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," Post-Print halshs-03342880, HAL.
- Marie Laclau & Frédéric Koessler & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," PSE-Ecole d'économie de Paris (Postprint) halshs-03342880, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long information design," Post-Print hal-03700394, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2021. "Long Information Design," Working Papers halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," Post-Print halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," PSE-Ecole d'économie de Paris (Postprint) halshs-02400053, HAL.
Li, Fei & Song, Yangbo & Zhao, Mofei, 2023. "Global manipulation by local obfuscation," Journal of Economic Theory, Elsevier, vol. 207(C).
Aleksei Smirnov & Egor Starkov, 2019. "Timing of predictions in dynamic cheap talk: experts vs. quacks," ECON - Working Papers 334, Department of Economics - University of Zurich.
Eduardo Perez‐Richet & Vasiliki Skreta, 2022. "Test Design Under Falsification," Econometrica, Econometric Society, vol. 90(3), pages 1109-1142, May.
- Eduardo Perez & Vasiliki Skreta, 2018. "Test Design Under Falsification," SciencePo Working papers Main hal-03393136, HAL.
- Eduardo Perez & Vasiliki Skreta, 2018. "Test Design Under Falsification," SciencePo Working papers hal-03393136, HAL.
- Eduardo Perez & Vasiliki Skreta, 2018. "Test Design Under Falsification," Sciences Po Economics Discussion Papers 2018-13, Sciences Po Departement of Economics.
- Eduardo Perez-Richet & Vasiliki Skreta, 2022. "Test Design Under Falsification," SciencePo Working papers Main hal-03873972, HAL.
- Eduardo Perez & Vasiliki Skreta, 2018. "Test Design Under Falsification," Working Papers hal-03393136, HAL.
- Skreta, Vasiliki & Perez-Richet, Eduardo, 2021. "Test Design under Falsification," CEPR Discussion Papers 15627, C.E.P.R. Discussion Papers.
- Eduardo Perez-Richet & Vasiliki Skreta, 2022. "Test Design Under Falsification," Post-Print hal-03873972, HAL.
- Eduardo Perez & Vasiliki Skreta, 2018. "Test Design Under Falsification," Sciences Po publications 2018-13, Sciences Po.
Zhao, Wei & Mezzetti, Claudio & Renou, Ludovic & Tomala, Tristan, 0. "Contracting over persistent information," Theoretical Economics, Econometric Society.
- Wei Zhao & Claudio Mezzetti & Ludovic Renou & Tristan Tomala, 2020. "Contracting over persistent information," Papers 2007.05983, arXiv.org, revised Mar 2021.
- Renou, Ludovic & ZHAO, Wei & Mezzetti, Claudio & Tomala, Tristan, 2022. "Contracting over Persistent Information," CEPR Discussion Papers 16896, C.E.P.R. Discussion Papers.
Isaiah Andrews & Jesse M. Shapiro, 2021. "A Model of Scientific Communication," Econometrica, Econometric Society, vol. 89(5), pages 2117-2142, September.
- Isaiah Andrews & Jesse M. Shapiro, 2020. "A Model of Scientific Communication," NBER Working Papers 26824, National Bureau of Economic Research, Inc.
Goldstein, Itay & Leitner, Yaron, 2018. "Stress tests and information disclosure," Journal of Economic Theory, Elsevier, vol. 177(C), pages 34-69.
Ding, Haina & Guembel, Alexander & Ozanne, Alessio, 2020. "Market Information in Banking Supervision: The Role of Stress Test Design," TSE Working Papers 20-1144, Toulouse School of Economics (TSE).
Shih-Tang Su & Vijay G. Subramanian & Grant Schoenebeck, 2021. "Bayesian Persuasion in Sequential Trials," Papers 2110.09594, arXiv.org, revised Nov 2021.
Negrelli, Sara, 2020. "Bubbles and persuasion with uncertainty over market sentiment," Games and Economic Behavior, Elsevier, vol. 120(C), pages 67-85.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BAN-2022-03-21 (Banking)
NEP-GTH-2022-03-21 (Game Theory)
NEP-ICT-2022-03-21 (Information and Communication Technologies)
NEP-UPT-2022-03-21 (Utility Models and Prospect Theory)
