Robust Online Learning with Private Information

My bibliography Save this paper

Robust Online Learning with Private Information

Author

Listed:

Kyohei Okumura

Registered:

Abstract

This paper investigates the robustness of online learning algorithms when learners possess private information. No-external-regret algorithms, prevalent in machine learning, are vulnerable to strategic manipulation, allowing an adaptive opponent to extract full surplus. Even standard no-weak-external-regret algorithms, designed for optimal learning in stationary environments, exhibit similar vulnerabilities. This raises a fundamental question: can a learner simultaneously prevent full surplus extraction by adaptive opponents while maintaining optimal performance in well-behaved environments? To address this, we model the problem as a two-player repeated game, where the learner with private information plays against the environment, facing ambiguity about the environment's types: stationary or adaptive. We introduce \emph{partial safety} as a key design criterion for online learning algorithms to prevent full surplus extraction. We then propose the \emph{Explore-Exploit-Punish} (\textsf{EEP}) algorithm and prove that it satisfies partial safety while achieving optimal learning in stationary environments, and has a variant that delivers improved welfare performance. Our findings highlight the risks of applying standard online learning algorithms in strategic settings with adverse selection. We advocate for a shift toward online learning algorithms that explicitly incorporate safeguards against strategic manipulation while ensuring strong learning performance.

Suggested Citation

Kyohei Okumura, 2025. "Robust Online Learning with Private Information," Papers 2505.05341, arXiv.org, revised May 2025.

Handle: RePEc:arx:papers:2505.05341

Download full text from publisher

References listed on IDEAS

Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptative strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.
,, 2012. "A partial folk theorem for games with private learning," Theoretical Economics, Econometric Society, vol. 7(2), May.
- Thomas E. Wiseman, 2011. "A Partial Folk Theorem for Games with Private Learning," 2011 Meeting Papers 181, Society for Economic Dynamics.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Sandholm,W.H., 2003. "Excess payoff dynamics, potential dynamics, and stable games," Working papers 5, Wisconsin Madison - Social Systems.
- Bill Sandholm, 2003. "Excess Payoff Dynamics, Potential Dynamics, and Stable Games," Theory workshop papers 505798000000000042, UCLA Department of Economics.
Ehud Lehrer & Eilon Solan, 2007. "Learning to play partially-specified equilibrium," Levine's Working Paper Archive 122247000000001436, David K. Levine.
Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
Salomon, Antoine & Forges, Françoise, 2015. "Bayesian repeated games and reputation," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 70-104.
- Francoise Forges & Antoine Salomon, 2014. "Bayesian Repeated Games and Reputations," CESifo Working Paper Series 4700, CESifo.
- Antoine Salomon & Francoise Forges, 2015. "Bayesian repeated games and reputation," Post-Print hal-01252921, HAL.
- Francoise Forges & Antoine Salomon, 2014. "Bayesian repeated games and reputation," Working Papers hal-00803919, HAL.
Fudenberg, Drew & Yamamoto, Yuichi, 2011. "Learning from private information in noisy repeated games," Journal of Economic Theory, Elsevier, vol. 146(5), pages 1733-1769, September.
- Fudenberg, Drew & Yamamoto, Yuichi, 2011. "Learning from Private Information in Noisy Repeated Games," Scholarly Articles 9962008, Harvard University Department of Economics.
Karl Schlag & Andriy Zapechelnyuk, 2009. "Decision Making in Uncertain and Changing Environments," Discussion Papers 19, Kyiv School of Economics.
- Karl Schlag & Andriy Zapechelnyuk, 2009. "Decision making in uncertain and changing environments," Economics Working Papers 1160, Department of Economics and Business, Universitat Pompeu Fabra.
- Karl H. Schlag & Andriy Zapechelnyuk, 2009. "Decision Making in Uncertain and Changing Environments," Levine's Working Paper Archive 814577000000000259, David K. Levine.
Basu, Pathikrit & Chatterjee, Kalyan & Hoshino, Tetsuya & Tamuz, Omer, 2020. "Repeated coordination with private learning," Journal of Economic Theory, Elsevier, vol. 190(C).
Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Theory and Decision, Springer, vol. 79(1), pages 51-93, July.
- Jean-François Laslier & Bernard Walliser, 2011. "Stubborn Learning," Working Papers hal-00609501, HAL.
- Jean-François Laslier & Bernard Walliser, 2011. "Stubborn Learning," PSE Working Papers hal-00609501, HAL.
- Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Post-Print halshs-01310229, HAL.
- Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," PSE-Ecole d'économie de Paris (Postprint) halshs-01310229, HAL.
Andriy Zapechelnyuk, 2007. "Better-Reply Strategies with Bounded Recall," Levine's Bibliography 321307000000000961, UCLA Department of Economics.
- Andriy Zapechelnyuk, 2007. "Better-Reply Strategies with Bounded Recall," Discussion Paper Series dp449, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
Sergiu Hart & Andreu Mas-Colell, 2013. "Regret-Based Continuous-Time Dynamics," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 5, pages 99-124, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2003. "Regret-based continuous-time dynamics," Games and Economic Behavior, Elsevier, vol. 45(2), pages 375-394, November.
- Sergiu Hart & Andreu Mas-Colell, 2001. "Regret-Based Continuous-Time Dynamics," Discussion Paper Series dp309, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem, revised Apr 2003.
Eddie Dekel & Yossi Feinberg, 2006. "Non-Bayesian Testing of a Stochastic Prediction," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 73(4), pages 893-906.
- Eddie Dekel & Yossi Feinberg, 2006. "Non-Bayesian Testing of a Stochastic Prediction," Discussion Papers 1418, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Fudenberg, Drew & Takahashi, Satoru, 2011. "Heterogeneous beliefs and local information in stochastic fictitious play," Games and Economic Behavior, Elsevier, vol. 71(1), pages 100-120, January.
- Drew Fudenberg & Satoru Takahashi, 2008. "Heterogeneous Beliefs and Local Information in Stochastic Fictitious Play," Levine's Working Paper Archive 122247000000001695, David K. Levine.
- Takahashi, Satoru & Fudenberg, Drew, 2011. "Heterogeneous beliefs and local information in stochastic fictitious play," Scholarly Articles 27755310, Harvard University Department of Economics.
Hörner, Johannes & Lovo, Stefano & Tomala, Tristan, 2011. "Belief-free equilibria in games with incomplete information: Characterization and existence," Journal of Economic Theory, Elsevier, vol. 146(5), pages 1770-1795, September.
- Stefano Lovo & Tristan Tomala & Johannes Hörner, 2008. "Belief-free equilibria in games with incomplete information: characterization and existence," Working Papers hal-00489877, HAL.
- Stefano Lovo & Johannes Hörner & Tristan Tomala, 2011. "Belief-free equilibria in games with incomplete information: characterization and existence," Post-Print hal-00630299, HAL.
- Johannes Horner & Stefano Lovo & Tristan Tomala, 2009. "Belief-free Equilibria in Games with Incomplete Information: Characterization and Existence," Cowles Foundation Discussion Papers 1739, Cowles Foundation for Research in Economics, Yale University.
- Lovo, Stefano & Tomala, Tristan & Hörner, Johannes, 2009. "Belief-free equilibria in games with incomplete information: characterization and existence," HEC Research Papers Series 921, HEC Paris.
Pathikrit Basu & Kalyan Chatterjee & Tetsuya Hoshino & Omer Tamuz, 2018. "Repeated Coordination with Private Learning," Papers 1809.00051, arXiv.org.
Giovanni Di Bartolomeo & Debora Di Gioacchino, 2004. "Fiscal- Monetary Policy and Debt Management: a Two Stage Dynamic Analysis," Working Papers in Public Economics 74, Department of Economics and Law, Sapienza University of Roma.
Drew Fudenberg & David K Levine, 2005. "Learning and Belief Based Trading," Levine's Working Paper Archive 618897000000000975, David K. Levine.
Beggs, A.W., 2005. "On the convergence of reinforcement learning," Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
Mannor, Shie & Shimkin, Nahum, 2008. "Regret minimization in repeated matrix games with variable stage duration," Games and Economic Behavior, Elsevier, vol. 63(1), pages 227-258, May.
Hofbauer, Josef & Sandholm, William H., 2009. "Stable games and their dynamics," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1665-1693.4, July.
Young, H. Peyton, 2009. "Learning by trial and error," Games and Economic Behavior, Elsevier, vol. 65(2), pages 626-643, March.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CTA-2025-06-09 (Contract Theory and Applications)
NEP-GTH-2025-06-09 (Game Theory)
NEP-MAC-2025-06-09 (Macroeconomics)
NEP-MIC-2025-06-09 (Microeconomics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2505.05341. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Robust Online Learning with Private Information

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data