IDEAS home Printed from https://ideas.repec.org/a/inm/orijoc/v29y2017i2p332-349.html
   My bibliography  Save this article

Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization

Author

Listed:
  • Somayeh Moazeni

    (School of Systems and Enterprises, Stevens Institute of Technology, Hoboken, New Jersey 07030)

  • Warren B. Powell

    (Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey 08544)

  • Boris Defourny

    (Department of Industrial and Systems Engineering, Lehigh University, Bethlehem, Pennsylvania 18015)

  • Belgacem Bouzaiene-Ayari

    (Department of Operations Research and Financial Engineering, Princeton University, Princeton, New Jersey 08544)

Abstract

This paper presents an algorithmic strategy to nonstationary policy search for finite-horizon, discrete-time Markovian decision problems with large state spaces, constrained action sets, and a risk-sensitive optimality criterion. The methodology relies on modeling time-variant policy parameters by a nonparametric response surface model for an indirect parametrized policy motivated by Bellman’s equation. The policy structure is heuristic when the optimization of the risk-sensitive criterion does not admit a dynamic programming reformulation. Through the interpolating approximation, the level of nonstationarity of the policy, and consequently, the size of the resulting search problem can be adjusted. The computational tractability and the generality of the approach follow from a nested parallel implementation of derivative-free optimization in conjunction with Monte Carlo simulation. We demonstrate the efficiency of the approach on an optimal energy storage charging problem, and illustrate the effect of the risk functional on the improvement achieved by allowing a higher complexity in time variation for the policy.

Suggested Citation

  • Somayeh Moazeni & Warren B. Powell & Boris Defourny & Belgacem Bouzaiene-Ayari, 2017. "Parallel Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization," INFORMS Journal on Computing, INFORMS, vol. 29(2), pages 332-349, May.
  • Handle: RePEc:inm:orijoc:v:29:y:2017:i:2:p:332-349
    DOI: 10.1287/ijoc.2016.0733
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/ijoc.2016.0733
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijoc.2016.0733?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sharon A. Johnson & Jery R. Stedinger & Christine A. Shoemaker & Ying Li & José Alberto Tejada-Guibert, 1993. "Numerical Solution of Continuous-State Dynamic Programs Using Linear and Spline Interpolation," Operations Research, INFORMS, vol. 41(3), pages 484-500, June.
    2. Rene Carmona & Michael Ludkovski, 2010. "Valuation of energy storage: an optimal switching approach," Quantitative Finance, Taylor & Francis Journals, vol. 10(4), pages 359-374.
    3. Rudloff, Birgit & Street, Alexandre & Valladão, Davi M., 2014. "Time consistency and risk averse dynamic decision models: Definition, interpretation and practical consequences," European Journal of Operational Research, Elsevier, vol. 234(3), pages 743-750.
    4. Somayeh Moazeni & Thomas F. Coleman & Yuying Li, 2016. "Smoothing and parametric rules for stochastic mean-CVaR optimal execution strategy," Annals of Operations Research, Springer, vol. 237(1), pages 99-120, February.
    5. Riedel, Frank, 2004. "Dynamic coherent risk measures," Stochastic Processes and their Applications, Elsevier, vol. 112(2), pages 185-200, August.
    6. Luis Rios & Nikolaos Sahinidis, 2013. "Derivative-free optimization: a review of algorithms and comparison of software implementations," Journal of Global Optimization, Springer, vol. 56(3), pages 1247-1293, July.
    7. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    8. Dimitris Bertsimas & Dan A. Iancu & Pablo A. Parrilo, 2010. "Optimality of Affine Policies in Multistage Robust Optimization," Mathematics of Operations Research, INFORMS, vol. 35(2), pages 363-394, May.
    9. Jae Ho Kim & Warren B. Powell, 2011. "Optimal Energy Commitments with Storage and Intermittent Supply," Operations Research, INFORMS, vol. 59(6), pages 1347-1360, December.
    10. Somayeh Moazeni & Thomas Coleman & Yuying Li, 2016. "Smoothing and parametric rules for stochastic mean-CVaR optimal execution strategy," Annals of Operations Research, Springer, vol. 237(1), pages 99-120, February.
    11. Guoming Lai & François Margot & Nicola Secomandi, 2010. "An Approximate Dynamic Programming Approach to Benchmark Practice-Based Heuristics for Natural Gas Storage Valuation," Operations Research, INFORMS, vol. 58(3), pages 564-582, June.
    12. Huizhen Yu & Dimitri P. Bertsekas, 2010. "Error Bounds for Approximations from Projected Linear Equations," Mathematics of Operations Research, INFORMS, vol. 35(2), pages 306-329, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Powell, Warren B., 2019. "A unified framework for stochastic optimization," European Journal of Operational Research, Elsevier, vol. 275(3), pages 795-821.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel R. Jiang & Warren B. Powell, 2015. "Optimal Hour-Ahead Bidding in the Real-Time Electricity Market with Battery Storage Using Approximate Dynamic Programming," INFORMS Journal on Computing, INFORMS, vol. 27(3), pages 525-543, August.
    2. Anna Maria Gambaro & Nicola Secomandi, 2021. "A Discussion of Non‐Gaussian Price Processes for Energy and Commodity Operations," Production and Operations Management, Production and Operations Management Society, vol. 30(1), pages 47-67, January.
    3. Secomandi, Nicola & Seppi, Duane J., 2014. "Real Options and Merchant Operations of Energy and Other Commodities," Foundations and Trends(R) in Technology, Information and Operations Management, now publishers, vol. 6(3-4), pages 161-331, July.
    4. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
    5. Chen, Ruoran & Deng, Tianhu & Huang, Simin & Qin, Ruwen, 2015. "Optimal crude oil procurement under fluctuating price in an oil refinery," European Journal of Operational Research, Elsevier, vol. 245(2), pages 438-445.
    6. Zehua Yang & Victoria C. P. Chen & Michael E. Chang & Melanie L. Sattler & Aihong Wen, 2009. "A Decision-Making Framework for Ozone Pollution Control," Operations Research, INFORMS, vol. 57(2), pages 484-498, April.
    7. Diego Klabjan & Daniel Adelman, 2007. "An Infinite-Dimensional Linear Programming Algorithm for Deterministic Semi-Markov Decision Processes on Borel Spaces," Mathematics of Operations Research, INFORMS, vol. 32(3), pages 528-550, August.
    8. Chen, Victoria C. P., 1999. "Application of orthogonal arrays and MARS to inventory forecasting stochastic dynamic programs," Computational Statistics & Data Analysis, Elsevier, vol. 30(3), pages 317-341, May.
    9. Nadarajah, Selvaprabu & Secomandi, Nicola, 2023. "A review of the operations literature on real options in energy," European Journal of Operational Research, Elsevier, vol. 309(2), pages 469-487.
    10. M. Baglietto & C. Cervellera & M. Sanguineti & R. Zoppoli, 2010. "Management of water resource systems in the presence of uncertainties by nonlinear approximation techniques and deterministic sampling," Computational Optimization and Applications, Springer, vol. 47(2), pages 349-376, October.
    11. De Lara, Michel & Leclère, Vincent, 2016. "Building up time-consistency for risk measures and dynamic optimization," European Journal of Operational Research, Elsevier, vol. 249(1), pages 177-187.
    12. Yangfang (Helen) Zhou & Alan Scheller‐Wolf & Nicola Secomandi & Stephen Smith, 2019. "Managing Wind‐Based Electricity Generation in the Presence of Storage and Transmission Capacity," Production and Operations Management, Production and Operations Management Society, vol. 28(4), pages 970-989, April.
    13. Daniel R. Jiang & Warren B. Powell, 2015. "An Approximate Dynamic Programming Algorithm for Monotone Value Functions," Operations Research, INFORMS, vol. 63(6), pages 1489-1511, December.
    14. Schur, Rouven & Gönsch, Jochen & Hassler, Michael, 2019. "Time-consistent, risk-averse dynamic pricing," European Journal of Operational Research, Elsevier, vol. 277(2), pages 587-603.
    15. Felix, Bastian Joachim & Weber, Christoph, 2012. "Gas storage valuation applying numerically constructed recombining trees," European Journal of Operational Research, Elsevier, vol. 216(1), pages 178-187.
    16. Daniel R. Jiang & Warren B. Powell, 2018. "Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures," Mathematics of Operations Research, INFORMS, vol. 43(2), pages 554-579, May.
    17. Lai Wei & Yongpei Guan, 2014. "Optimal Control of Plug-In Hybrid Electric Vehicles with Market Impact and Risk Attitude," Transportation Science, INFORMS, vol. 48(4), pages 467-482, November.
    18. Löhndorf, Nils & Wozabal, David, 2021. "Gas storage valuation in incomplete markets," European Journal of Operational Research, Elsevier, vol. 288(1), pages 318-330.
    19. Wei Chen & Yun Wang & Mukesh Kumar Mehlawat, 2018. "A hybrid FA–SA algorithm for fuzzy portfolio selection with transaction costs," Annals of Operations Research, Springer, vol. 269(1), pages 129-147, October.
    20. Wei Chen & Yuxi Gai & Pankaj Gupta, 2018. "Efficiency evaluation of fuzzy portfolio in different risk measures via DEA," Annals of Operations Research, Springer, vol. 269(1), pages 103-127, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijoc:v:29:y:2017:i:2:p:332-349. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.