Robust Multiarmed Bandit Problems
Author
Abstract
Suggested Citation
DOI: 10.1287/mnsc.2015.2153
Download full text from publisher
References listed on IDEAS
- Hansen, Lars Peter & Sargent, Thomas J., 2007.
"Recursive robust estimation and control without commitment,"
Journal of Economic Theory, Elsevier, vol. 136(1), pages 1-27, September.
- Hansen, Lars Peter & Sargent, Thomas J., 2005. "Recursive robust estimation and control without commitment," Discussion Paper Series 1: Economic Studies 2005,28, Deutsche Bundesbank.
- Felipe Caro & Jérémie Gallien, 2007. "Dynamic Assortment with Demand Learning for Seasonal Consumer Goods," Management Science, INFORMS, vol. 53(2), pages 276-292, February.
- Dimitris Bertsimas & Melvyn Sim, 2004. "The Price of Robustness," Operations Research, INFORMS, vol. 52(1), pages 35-53, February.
- Epstein, Larry G. & Schneider, Martin, 2003.
"Recursive multiple-priors,"
Journal of Economic Theory, Elsevier, vol. 113(1), pages 1-31, November.
- Larry G. Epstein & Martin Schneider, 2001. "Recursive Multiple-Priors," RCER Working Papers 485, University of Rochester - Center for Economic Research (RCER).
- Andrew E. B. Lim & J. George Shanthikumar, 2007. "Relative Entropy, Exponential Utility, and Robust Dynamic Pricing," Operations Research, INFORMS, vol. 55(2), pages 198-214, April.
- Hansen, Lars Peter & Sargent, Thomas J., 2005. "Robust estimation and control under commitment," Journal of Economic Theory, Elsevier, vol. 124(2), pages 258-301, October.
- David B. Brown & James E. Smith, 2013. "Optimal Sequential Exploration: Bandits, Clairvoyants, and Wildcats," Operations Research, INFORMS, vol. 61(3), pages 644-665, June.
- Carri W. Chan & Vivek F. Farias, 2009. "Stochastic Depletion Problems: Effective Myopic Policies for a Class of Dynamic Optimization Problems," Mathematics of Operations Research, INFORMS, vol. 34(2), pages 333-350, May.
- Larry G. Epstein & Martin Schneider, 2007.
"Learning Under Ambiguity,"
The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(4), pages 1275-1303.
- Larry Epstein & Martin Schneider, 2002. "Learning Under Ambiguity," RCER Working Papers 497, University of Rochester - Center for Economic Research (RCER), revised Mar 2005.
- Larry Epstein & Martin Schneider, 2006. "Learning Under Ambiguity," RCER Working Papers 527, University of Rochester - Center for Economic Research (RCER).
- Jonathan Li & Roy Kwon, 2013. "Portfolio selection under model uncertainty: a penalized moment-based optimization approach," Journal of Global Optimization, Springer, vol. 56(1), pages 131-164, May.
- Andrea Grove & Gary A. Berg (ed.), 2014. "Social Business," Springer Books, Springer, edition 127, number 978-3-642-45275-8, December.
- Wolfram Wiesemann & Daniel Kuhn & Berç Rustem, 2013. "Robust Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 38(1), pages 153-183, February.
- Andrew E. B. Lim & J. George Shanthikumar & Gah-Yi Vahn, 2012. "Robust Portfolio Choice with Learning in the Framework of Regret: Single-Period Case," Management Science, INFORMS, vol. 58(9), pages 1732-1746, September.
- David B. Brown & James E. Smith & Peng Sun, 2010. "Information Relaxations and Duality in Stochastic Dynamic Programs," Operations Research, INFORMS, vol. 58(4-part-1), pages 785-801, August.
- Paat Rusmevichientong & Zuo-Jun Max Shen & David B. Shmoys, 2010. "Dynamic Assortment Optimization with a Multinomial Logit Choice Model and Capacity Constraint," Operations Research, INFORMS, vol. 58(6), pages 1666-1680, December.
- Garud N. Iyengar, 2005. "Robust Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 30(2), pages 257-280, May.
- Dimitris Bertsimas & José Niño-Mora, 1996. "Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems," Mathematics of Operations Research, INFORMS, vol. 21(2), pages 257-306, May.
- A. Ben-Tal & A. Nemirovski, 1998. "Robust Convex Optimization," Mathematics of Operations Research, INFORMS, vol. 23(4), pages 769-805, November.
- Ilya O. Ryzhov & Warren B. Powell & Peter I. Frazier, 2012. "The Knowledge Gradient Algorithm for a General Class of Online Learning Problems," Operations Research, INFORMS, vol. 60(1), pages 180-195, February.
- Arnab Nilim & Laurent El Ghaoui, 2005. "Robust Control of Markov Decision Processes with Uncertain Transition Matrices," Operations Research, INFORMS, vol. 53(5), pages 780-798, October.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Nikolsko-Rzhevskyy, Alex & Papell, David H. & Prodan, Ruxandra, 2017. "The Yellen rules," Journal of Macroeconomics, Elsevier, vol. 54(PA), pages 59-71.
- Keskin, Burcu B. & Griffin, Emily C. & Prell, Jonathan O. & Dilkina, Bistra & Ferber, Aaron & MacDonald, John & Hilend, Rowan & Griffis, Stanley & Gore, Meredith L., 2023. "Quantitative Investigation of Wildlife Trafficking Supply Chains: A Review," Omega, Elsevier, vol. 115(C).
- Fu, Jing & Zhang, Lele & Liu, Zhiyuan, 2025. "A restless bandit model for dynamic ride matching with reneging travelers," European Journal of Operational Research, Elsevier, vol. 320(3), pages 581-592.
- Eisenberg, Julia & Krühner, Paul, 2018. "The impact of negative interest rates on optimal capital injections," Insurance: Mathematics and Economics, Elsevier, vol. 82(C), pages 1-10.
- Malekipirbazari, Milad & Çavuş, Özlem, 2024. "Index policy for multiarmed bandit problem with dynamic risk measures," European Journal of Operational Research, Elsevier, vol. 312(2), pages 627-640.
- Michael Jong Kim, 2020. "Variance Regularization in Sequential Bayesian Optimization," Mathematics of Operations Research, INFORMS, vol. 45(3), pages 966-992, August.
- Siwen Wang & Hui Chen & Chunyang Gong & Yanfei Shang & Zhixin Wang, 2024. "The Operation Strategy of a Multi-Microgrid Considering the Interaction of Different Subjects’ Interests," Energies, MDPI, vol. 17(19), pages 1-20, September.
- Flores-Szwagrzak, Karol, 2022. "Learning by Convex Combination," Working Papers 16-2022, Copenhagen Business School, Department of Economics.
- Xu, Jianyu & Chen, Lujie & Tang, Ou, 2021. "An online algorithm for the risk-aware restless bandit," European Journal of Operational Research, Elsevier, vol. 290(2), pages 622-639.
- David B. Brown & Martin B. Haugh, 2017. "Information Relaxation Bounds for Infinite Horizon Markov Decision Processes," Operations Research, INFORMS, vol. 65(5), pages 1355-1379, October.
- Xikui Wang & You Liang & Lysa Porth, 2019. "A Bayesian two‐armed bandit model," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(3), pages 624-636, May.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Michael Jong Kim, 2016. "Robust Control of Partially Observable Failing Systems," Operations Research, INFORMS, vol. 64(4), pages 999-1014, August.
- Shie Mannor & Ofir Mebel & Huan Xu, 2016. "Robust MDPs with k -Rectangular Uncertainty," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1484-1509, November.
- Bren, Austin & Saghafian, Soroush, 2018. "Data-Driven Percentile Optimization for Multi-Class Queueing Systems with Model Ambiguity: Theory and Application," Working Paper Series rwp18-008, Harvard University, John F. Kennedy School of Government.
- Rasouli, Mohammad & Saghafian, Soroush, 2018. "Robust Partially Observable Markov Decision Processes," Working Paper Series rwp18-027, Harvard University, John F. Kennedy School of Government.
- Saghafian, Soroush, 2018. "Ambiguous partially observable Markov decision processes: Structural results and applications," Journal of Economic Theory, Elsevier, vol. 178(C), pages 1-35.
- Andrew J. Keith & Darryl K. Ahner, 2021. "A survey of decision making and optimization under uncertainty," Annals of Operations Research, Springer, vol. 300(2), pages 319-353, May.
- Bakker, Hannah & Dunke, Fabian & Nickel, Stefan, 2020. "A structuring review on multi-stage optimization under uncertainty: Aligning concepts from theory and practice," Omega, Elsevier, vol. 96(C).
- Maximilian Blesch & Philipp Eisenhauer, 2021. "Robust decision-making under risk and ambiguity," Papers 2104.12573, arXiv.org, revised Oct 2021.
- Hansen, Lars Peter & Mayer, Ricardo & Sargent, Thomas, 2010. "Robust hidden Markov LQG problems," Journal of Economic Dynamics and Control, Elsevier, vol. 34(10), pages 1951-1966, October.
- Jose Blanchet & Karthyek Murthy, 2019. "Quantifying Distributional Model Risk via Optimal Transport," Mathematics of Operations Research, INFORMS, vol. 44(2), pages 565-600, May.
- Nicole Bauerle & Alexander Glauner, 2020. "Distributionally Robust Markov Decision Processes and their Connection to Risk Measures," Papers 2007.13103, arXiv.org.
- Xin, Linwei & Goldberg, David A., 2021. "Time (in)consistency of multistage distributionally robust inventory models with moment constraints," European Journal of Operational Research, Elsevier, vol. 289(3), pages 1127-1141.
- Alexander Shapiro, 2016. "Rectangular Sets of Probability Measures," Operations Research, INFORMS, vol. 64(2), pages 528-541, April.
- Erim Kardeş & Fernando Ordóñez & Randolph W. Hall, 2011. "Discounted Robust Stochastic Games and an Application to Queueing Control," Operations Research, INFORMS, vol. 59(2), pages 365-382, April.
- Hui Chen & Nengjiu Ju & Jianjun Miao, 2014.
"Dynamic Asset Allocation with Ambiguous Return Predictability,"
Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 17(4), pages 799-823, October.
- Hui Chen & Nengjiu Ju & Jianjun Miao, "undated". "Dynamic Asset Allocation with Ambiguous Return Predictability," Boston University - Department of Economics - Working Papers Series wp2009-015, Boston University - Department of Economics.
- Hui Chen & Nengjiu Ju & Jianjun Miao, 2008. "Dynamic Asset Allocation with Ambiguous Return Predictability," Boston University - Department of Economics - The Institute for Economic Development Working Papers Series dp-179, Boston University - Department of Economics, revised Feb 2009.
- Zeynep Turgay & Fikri Karaesmen & Egemen Lerzan Örmeci, 2018. "Structural properties of a class of robust inventory and queueing control problems," Naval Research Logistics (NRL), John Wiley & Sons, vol. 65(8), pages 699-716, December.
- Dan A. Iancu & Marek Petrik & Dharmashankar Subramanian, 2015. "Tight Approximations of Dynamic Risk Measures," Mathematics of Operations Research, INFORMS, vol. 40(3), pages 655-682, March.
- Varagapriya, V & Singh, Vikas Vikram & Lisser, Abdel, 2024. "Rank-1 transition uncertainties in constrained Markov decision processes," European Journal of Operational Research, Elsevier, vol. 318(1), pages 167-178.
- Nengjiu Ju & Jianjun Miao, 2012.
"Ambiguity, Learning, and Asset Returns,"
Econometrica, Econometric Society, vol. 80(2), pages 559-591, March.
- Nengjiu Ju & Jianjun Miao, "undated". "Ambiguity, Learning, and Asset Returns," Boston University - Department of Economics - Working Papers Series wp2009-014, Boston University - Department of Economics.
- Jianjun Miao & NENGJIU JU, 2010. "Ambiguity, Learning, And Asset Returns," Boston University - Department of Economics - Working Papers Series WP2010-031, Boston University - Department of Economics.
- Ju, Nengjiu & Miao, Jianjun, 2009. "Ambiguity, Learning, and Asset Returns," MPRA Paper 14737, University Library of Munich, Germany, revised Apr 2009.
- Nengjiu Ju & Jianjun Miao, 2010. "Ambiguity, Learning, and Asset Returns," CEMA Working Papers 438, China Economics and Management Academy, Central University of Finance and Economics.
- René Caldentey & Ying Liu & Ilan Lobel, 2017. "Intertemporal Pricing Under Minimax Regret," Operations Research, INFORMS, vol. 65(1), pages 104-129, February.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:62:y:2016:i:1:p:264-285. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.