Regret minimization in repeated matrix games with variable stage duration

Regret minimization in repeated matrix games with variable stage duration

Author

Listed:

Mannor, Shie
Shimkin, Nahum

Abstract

Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol. 39, Princeton Univ. Press, Princeton, NJ, pp. 97-193]. Several classes of no-regret strategies now exist; such strategies secure a long-term average payoff as high as could be obtained by the fixed action that is best, in hindsight, against the observed action sequence of the opponent. We consider an extension of this framework to repeated games with variable stage duration, where the duration of each stage may depend on actions of both players, and the performance measure of interest is the average payoff per unit time. We start by showing that no-regret strategies, in the above sense, do not exist in general. Consequently, we consider two classes of adaptive strategies, one based on Blackwell's approachability theorem and the other on calibrated play, and examine their performance guarantees. We further provide sufficient conditions for existence of no-regret strategies in this model.

Suggested Citation

Mannor, Shie & Shimkin, Nahum, 2008. "Regret minimization in repeated matrix games with variable stage duration," Games and Economic Behavior, Elsevier, vol. 63(1), pages 227-258, May.

Handle: RePEc:eee:gamebe:v:63:y:2008:i:1:p:227-258

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Shie Mannor & Nahum Shimkin, 2003. "The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 28(2), pages 327-345, May.
Fudenberg, Drew & Levine, David K., 1999. "An Easier Way to Calibrate," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 131-137, October.
- Drew Fudenberg & David K. Levine, 1996. "An Easier Way to Calibrate," Levine's Working Paper Archive 2059, David K. Levine.
- Fudenberg, Drew & Levine, David, 1999. "An Easier Way to Calibrate," Scholarly Articles 3203773, Harvard University Department of Economics.
Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
- D. Foster & R. Vohra, 2010. "Calibrated Learning and Correlated Equilibrium," Levine's Working Paper Archive 568, David K. Levine.
Foster, Dean P., 1999. "A Proof of Calibration via Blackwell's Approachability Theorem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 73-78, October.
- Dean P Foster, 1997. "A proof of Calibration via Blackwell's Approachability Theorem," Levine's Working Paper Archive 591, David K. Levine.
- Dean P. Foster, 1997. "A Proof of Calibration Via Blackwell's Approachability Theorem," Discussion Papers 1182, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
- D. Foster & R. Vohra, 2010. "Regret in the On-line Decision Problem," Levine's Working Paper Archive 569, David K. Levine.
Fudenberg, Drew & Levine, David K., 1999. "Conditional Universal Consistency," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 104-130, October.
- Drew Fudenberg & David K. Levine, 1997. "Conditional Universal Consistency," Levine's Working Paper Archive 471, David K. Levine.
- Fudenberg, Drew & Levine, David, 1999. "Conditional Universal Consistency," Scholarly Articles 3204826, Harvard University Department of Economics.
Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
- Fudenberg, Drew & Levine, David, 1995. "Consistency and Cautious Fictitious Play," Scholarly Articles 3198694, Harvard University Department of Economics.
- Drew Fudenberg & David K. Levine, 1996. "Consistency and Cautious Fictitious Play," Levine's Working Paper Archive 470, David K. Levine.
Rustichini, Aldo, 1999. "Minimizing Regret: The General Case," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 224-243, October.
Lehrer, Ehud, 2003. "A wide range no-regret theorem," Games and Economic Behavior, Elsevier, vol. 42(1), pages 101-115, January.
- Ehud Lehrer & Dinah Rosenberg, 2003. "A Wide Range No-Regret Theorem," Game Theory and Information 0312004, University Library of Munich, Germany.
Alvaro Sandroni & Rann Smorodinsky & Rakesh V. Vohra, 2003. "Calibration with Many Checking Rules," Mathematics of Operations Research, INFORMS, vol. 28(1), pages 141-153, February.
Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
Sergiu Hart, 2013. "Adaptive Heuristics," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 11, pages 253-287, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart, 2005. "Adaptive Heuristics," Econometrica, Econometric Society, vol. 73(5), pages 1401-1430, September.
- Sergiu Hart, 2004. "Adaptive Heuristics," Levine's Bibliography 122247000000000471, UCLA Department of Economics.
- Sergiu Hart, 2004. "Adaptive Heuristics," Discussion Paper Series dp372, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptative strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.
Freund, Yoav & Schapire, Robert E., 1999. "Adaptive Game Playing Using Multiplicative Weights," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 79-103, October.
Kalai, Ehud & Lehrer, Ehud & Smorodinsky, Rann, 1999. "Calibrated Forecasting and Merging," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 151-169, October.
- Ehud Kalai, 1995. "Calibrated Forecasting and Merging," Discussion Papers 1144, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Ehud Kalai & Ehud Lehrer & Rann Smorodinsky, 2010. "Calibrated Forecasting and Merging," Levine's Working Paper Archive 584, David K. Levine.
- Ehud Kalai, 1995. "Calibrated Forecasting and Merging," Discussion Papers 1144R, Northwestern University, Center for Mathematical Studies in Economics and Management Science.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Jia Yuan Yu & Shie Mannor & Nahum Shimkin, 2009. "Markov Decision Processes with Arbitrary Reward Processes," Mathematics of Operations Research, INFORMS, vol. 34(3), pages 737-757, August.
Michael Nwogugu, 2020. "Regret Theory And Asset Pricing Anomalies In Incomplete Markets With Dynamic Un-Aggregated Preferences," Papers 2005.01709, arXiv.org.
Andrey Bernstein & Shie Mannor & Nahum Shimkin, 2014. "Opportunistic Approachability and Generalized No-Regret Problems," Mathematics of Operations Research, INFORMS, vol. 39(4), pages 1057-1083, November.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Karl Schlag & Andriy Zapechelnyuk, 2009. "Decision Making in Uncertain and Changing Environments," Discussion Papers 19, Kyiv School of Economics.
- Karl Schlag & Andriy Zapechelnyuk, 2009. "Decision making in uncertain and changing environments," Economics Working Papers 1160, Department of Economics and Business, Universitat Pompeu Fabra.
- Karl H. Schlag & Andriy Zapechelnyuk, 2009. "Decision Making in Uncertain and Changing Environments," Levine's Working Paper Archive 814577000000000259, David K. Levine.
Sandroni, Alvaro & Smorodinsky, Rann, 2004. "Belief-based equilibrium," Games and Economic Behavior, Elsevier, vol. 47(1), pages 157-171, April.
Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.
- Peyton Young, 2002. "Learning Hypothesis Testing and Nash Equilibrium," Economics Working Paper Archive 474, The Johns Hopkins University,Department of Economics.
Eddie Dekel & Yossi Feinberg, 2006. "Non-Bayesian Testing of a Stochastic Prediction," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 73(4), pages 893-906.
- Eddie Dekel & Yossi Feinberg, 2006. "Non-Bayesian Testing of a Stochastic Prediction," Discussion Papers 1418, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Schlag, Karl H. & Zapechelnyuk, Andriy, 2017. "Dynamic benchmark targeting," Journal of Economic Theory, Elsevier, vol. 169(C), pages 145-169.
- Karl H. Schlag & Andriy Zapechelnyuk, 2016. "Dynamic Benchmark Targeting," Working Papers 2016_20, Business School - Economics, University of Glasgow.
Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006. "Stochastic Approximations and Differential Inclusions, Part II: Applications," Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
Ehud Lehrer & Eilon Solan, 2016. "A General Internal Regret-Free Strategy," Dynamic Games and Applications, Springer, vol. 6(1), pages 112-138, March.
Wojciech Olszewski & Alvaro Sandroni, 2006. "Strategic Manipulation of Empirical Tests," Discussion Papers 1425, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Alvaro Sandroni & Wojciech Olszewski, 2008. "Strategic Manipulation of Empirical Tests," PIER Working Paper Archive 08-015, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Yuichi Noguchi, 2009. "Note on universal conditional consistency," International Journal of Game Theory, Springer;Game Theory Society, vol. 38(2), pages 193-207, June.
Feinberg, Yossi & Dekel, Eddie, 2004. "A True Expert Knows which Question Should Be Asked," Research Papers 1856, Stanford University, Graduate School of Business.
- Eddie Dekel & Yossi Feinberg, 2006. "A True Expert Knows which Question Should be Asked," Discussion Papers 1385, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
- Fabrizio Germano & Gábor Lugosi, 2004. "Global Nash convergence of Foster and Young's regret testing," Economics Working Papers 788, Department of Economics and Business, Universitat Pompeu Fabra.
Karl Schlag & Andriy Zapechelnyuk, 2010. "On the Impossibility of Regret Minimization in Repeated Games," Working Papers 676, Queen Mary University of London, School of Economics and Finance.
Ehud Lehrer & Eilon Solan, 2007. "Learning to play partially-specified equilibrium," Levine's Working Paper Archive 122247000000001436, David K. Levine.
Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
Du, Ye & Lehrer, Ehud, 2020. "Constrained no-regret learning," Journal of Mathematical Economics, Elsevier, vol. 88(C), pages 16-24.
Schlag, Karl & Zapechelnyuk, Andriy, 2012. "On the impossibility of achieving no regrets in repeated games," Journal of Economic Behavior & Organization, Elsevier, vol. 81(1), pages 153-158.
Nicolò Cesa-Bianchi & Gábor Lugosi & Gilles Stoltz, 2006. "Regret Minimization Under Partial Monitoring," Mathematics of Operations Research, INFORMS, vol. 31(3), pages 562-580, August.
Dean Foster & Rakesh Vohra, 2011. "Calibration: Respice, Adspice, Prospice," Discussion Papers 1537, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Stoltz, Gilles & Lugosi, Gabor, 2007. "Learning correlated equilibria in games with compact sets of strategies," Games and Economic Behavior, Elsevier, vol. 59(1), pages 187-208, April.
Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptative strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:gamebe:v:63:y:2008:i:1:p:227-258. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622836 .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Regret minimization in repeated matrix games with variable stage duration

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data