Robust Bayesian Dynamic Programming for On-policy Risk-sensitive Reinforcement Learning
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Ahmed, Shabbir & Cakmak, Ulas & Shapiro, Alexander, 2007. "Coherent risk measures in inventory problems," European Journal of Operational Research, Elsevier, vol. 182(1), pages 226-238, October.
- Anthony Coache & Sebastian Jaimungal & 'Alvaro Cartea, 2022. "Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning," Papers 2206.14666, arXiv.org, revised May 2023.
- Shanyu Han & Yang Liu & Xiang Yu, 2025. "Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions," Papers 2505.04553, arXiv.org, revised May 2025.
- Stefan Jaschke & Uwe Küchler, 2001. "Coherent risk measures and good-deal bounds," Finance and Stochastics, Springer, vol. 5(2), pages 181-200.
- Nicole Bäuerle & Jonathan Ott, 2011. "Markov Decision Processes with Average-Value-at-Risk criteria," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 74(3), pages 361-379, December.
- Ethan X. Fang & Zhaoran Wang & Lan Wang, 2023. "Fairness-Oriented Learning for Optimal Individualized Treatment Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 1733-1746, July.
- Tolulope Fadina & Yang Liu & Ruodu Wang, 2024. "A framework for measures of risk under uncertainty," Finance and Stochastics, Springer, vol. 28(2), pages 363-390, April.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Shanyu Han & Yang Liu & Xiang Yu, 2025. "Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions," Papers 2505.04553, arXiv.org, revised May 2025.
- Walter Farkas & Pablo Koch-Medina & Cosimo Munari, 2014. "Beyond cash-additive risk measures: when changing the numéraire fails," Finance and Stochastics, Springer, vol. 18(1), pages 145-173, January.
- Minjiao Zhang & Simge Küçükyavuz & Saumya Goel, 2014. "A Branch-and-Cut Method for Dynamic Decision Making Under Joint Chance Constraints," Management Science, INFORMS, vol. 60(5), pages 1317-1333, May.
- Oleg Bondarenko & Iñaki Longarela, 2009. "A general framework for the derivation of asset price bounds: an application to stochastic volatility option models," Review of Derivatives Research, Springer, vol. 12(2), pages 81-107, July.
- Egging, Ruud & Pichler, Alois & Kalvø, Øyvind Iversen & Walle–Hansen, Thomas Meyer, 2017. "Risk aversion in imperfect natural gas markets," European Journal of Operational Research, Elsevier, vol. 259(1), pages 367-383.
- Leitner Johannes, 2005. "Optimal portfolios with expected loss constraints and shortfall risk optimal martingale measures," Statistics & Risk Modeling, De Gruyter, vol. 23(1/2005), pages 49-66, January.
- Tomasz R. Bielecki & Igor Cialenco & Ismail Iyigunler & Rodrigo Rodriguez, 2012. "Dynamic Conic Finance: Pricing and Hedging in Market Models with Transaction Costs via Dynamic Coherent Acceptability Indices," Papers 1205.4790, arXiv.org, revised Jun 2013.
- Qiu, Ruozhen & Sun, Minghe & Lim, Yun Fong, 2017. "Optimizing (s, S) policies for multi-period inventory models with demand distribution uncertainty: Robust dynamic programing approaches," European Journal of Operational Research, Elsevier, vol. 261(3), pages 880-892.
- Zhongren Chen & Siyu Chen & Zhengling Qi & Xiaohong Chen & Zhuoran Yang, 2025. "Quantile-Optimal Policy Learning under Unmeasured Confounding," Cowles Foundation Discussion Papers 2469, Cowles Foundation for Research in Economics, Yale University.
- Vadim Lesnevski & Barry L. Nelson & Jeremy Staum, 2007. "Simulation of Coherent Risk Measures Based on Generalized Scenarios," Management Science, INFORMS, vol. 53(11), pages 1756-1769, November.
- Fei Sun & Jingchao Li & Jieming Zhou, 2018. "Dynamic risk measures for fluctuations in market volatility under Bochner-Lebesgue spaces," Papers 1806.01166, arXiv.org, revised Jan 2026.
- Charles-Olivier Amédée-Manesme & Fabrice Barthélémy, 2018.
"Ex-ante real estate Value at Risk calculation method,"
Annals of Operations Research, Springer, vol. 262(2), pages 257-285, March.
- Charles-Olivier Amédée-Manesme & Fabrice Barthélémy, 2015. "Ex-ante real estate Value at Risk calculation method," ERES eres2015_56, European Real Estate Society (ERES).
- Borgonovo, Emanuele & Gatti, Stefano, 2013. "Risk analysis with contractual default. Does covenant breach matter?," European Journal of Operational Research, Elsevier, vol. 230(2), pages 431-443.
- Traian A. Pirvu & Gordan Zitkovic, 2007. "Maximizing the Growth Rate under Risk Constraints," Papers 0706.0480, arXiv.org.
- Oh, Sechan & Rhodes, James & Strong, Ray, 2016. "Impact of cost uncertainty on pricing decisions under risk aversion," European Journal of Operational Research, Elsevier, vol. 253(1), pages 144-153.
- Wu, Meng & Zhu, Stuart X. & Teunter, Ruud H., 2013. "Newsvendor problem with random shortage cost under a risk criterion," International Journal of Production Economics, Elsevier, vol. 145(2), pages 790-798.
- Andreas H. Hamel & Frank Heyde, 2021. "Set-Valued T -Translative Functions and Their Applications in Finance," Mathematics, MDPI, vol. 9(18), pages 1-33, September.
- Felipe J. P. Antunes & Yuri F. Saporito & Sebastian Jaimungal, 2025. "Deep Learning and Elicitability for McKean-Vlasov FBSDEs With Common Noise," Papers 2512.14967, arXiv.org.
- Patrick Cheridito & Michael Kupper & Ludovic Tangpi, 2016. "Duality formulas for robust pricing and hedging in discrete time," Papers 1602.06177, arXiv.org, revised Sep 2017.
- Youhua (Frank) Chen & Minghui Xu & Zhe George Zhang, 2009. "Technical Note---A Risk-Averse Newsvendor Model Under the CVaR Criterion," Operations Research, INFORMS, vol. 57(4), pages 1040-1044, August.
More about this item
NEP fields
This paper has been announced in the following NEP Reports:- NEP-RMG-2026-01-12 (Risk Management)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.24580. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.
Printed from https://ideas.repec.org/p/arx/papers/2512.24580.html