Value function gradient learning for large-scale multistage stochastic programming problems

Value function gradient learning for large-scale multistage stochastic programming problems

Author

Listed:

Lee, Jinkyu
Bae, Sanghyeon
Kim, Woo Chang
Lee, Yongjae

Abstract

A stagewise decomposition algorithm called “value function gradient learning” (VFGL) is proposed for large-scale multistage stochastic convex programs. VFGL finds the parameter values that best fit the gradient of the value function within a given parametric family. Widely used decomposition algorithms for multistage stochastic programming, such as stochastic dual dynamic programming (SDDP), approximate the value function by adding linear subgradient cuts at each iteration. Although this approach has been successful for linear problems, nonlinear problems may suffer from the increasing size of each subproblem as the iteration proceeds. On the other hand, VFGL has a fixed number of parameters; thus, the size of the subproblems remains constant throughout the iteration. Furthermore, VFGL can learn the parameters by means of stochastic gradient descent, which means that it can be easil0y parallelized and does not require a scenario tree approximation of the underlying uncertainties. VFGL was compared with a deterministic equivalent formulation of the multistage stochastic programming problem and SDDP approaches for three illustrative examples: production planning, hydrothermal generation, and the lifetime financial planning problem. Numerical examples show that VFGL generates high-quality solutions and is computationally efficient.

Suggested Citation

Lee, Jinkyu & Bae, Sanghyeon & Kim, Woo Chang & Lee, Yongjae, 2023. "Value function gradient learning for large-scale multistage stochastic programming problems," European Journal of Operational Research, Elsevier, vol. 308(1), pages 321-335.

Handle: RePEc:eee:ejores:v:308:y:2023:i:1:p:321-335
DOI: 10.1016/j.ejor.2022.10.011

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

R. T. Rockafellar & Roger J.-B. Wets, 1991. "Scenarios and Policy Aggregation in Optimization Under Uncertainty," Mathematics of Operations Research, INFORMS, vol. 16(1), pages 119-147, February.
Reuven Y. Rubinstein & Ruth Marcus, 1985. "Efficiency of Multivariate Control Variates in Monte Carlo Simulation," Operations Research, INFORMS, vol. 33(3), pages 661-677, June.
Vincent Guigues, 2014. "SDDP for some interstage dependent risk-averse problems and application to hydro-thermal planning," Computational Optimization and Applications, Springer, vol. 57(1), pages 167-203, January.
Harvey M. Wagner & Thomson M. Whitin, 1958. "Dynamic Version of the Economic Lot Size Model," Management Science, INFORMS, vol. 5(1), pages 89-96, October.
Shapiro, Alexander, 2011. "Analysis of stochastic dual dynamic programming method," European Journal of Operational Research, Elsevier, vol. 209(1), pages 63-72, February.
Ponomareva, K. & Roman, D. & Date, P., 2015. "An algorithm for moment-matching scenario generation with application to financial portfolio optimisation," European Journal of Operational Research, Elsevier, vol. 240(3), pages 678-687.
Jean-Paul Watson & David Woodruff, 2011. "Progressive hedging innovations for a class of stochastic mixed-integer resource allocation problems," Computational Management Science, Springer, vol. 8(4), pages 355-370, November.
David R. Cariño & Terry Kent & David H. Myers & Celine Stacy & Mike Sylvanus & Andrew L. Turner & Kouji Watanabe & William T. Ziemba, 1994. "The Russell-Yasuda Kasai Model: An Asset/Liability Model for a Japanese Insurance Company Using Multistage Stochastic Programming," Interfaces, INFORMS, vol. 24(1), pages 29-49, February.
Merton, Robert C, 1969. "Lifetime Portfolio Selection under Uncertainty: The Continuous-Time Case," The Review of Economics and Statistics, MIT Press, vol. 51(3), pages 247-257, August.
Powell, Warren B., 2019. "A unified framework for stochastic optimization," European Journal of Operational Research, Elsevier, vol. 275(3), pages 795-821.
Staino, Alessandro & Russo, Emilio, 2015. "A moment-matching method to generate arbitrage-free scenarios," European Journal of Operational Research, Elsevier, vol. 246(2), pages 619-630.
Gulpinar, Nalan & Rustem, Berc & Settergren, Reuben, 2004. "Simulation and optimization approaches to scenario tree generation," Journal of Economic Dynamics and Control, Elsevier, vol. 28(7), pages 1291-1315, April.
Z. L. Chen & W. B. Powell, 1999. "Convergent Cutting-Plane and Partial-Sampling Algorithm for Multistage Stochastic Linear Programs with Recourse," Journal of Optimization Theory and Applications, Springer, vol. 102(3), pages 497-524, September.
Karimi, B. & Fatemi Ghomi, S. M. T. & Wilson, J. M., 2003. "The capacitated lot sizing problem: a review of models and algorithms," Omega, Elsevier, vol. 31(5), pages 365-378, October.
P. Girardeau & V. Leclere & A. B. Philpott, 2015. "On the Convergence of Decomposition Methods for Multistage Stochastic Convex Programs," Mathematics of Operations Research, INFORMS, vol. 40(1), pages 130-145, February.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Wu, Weitiao & Li, Yu, 2024. "Pareto truck fleet sizing for bike relocation with stochastic demand: Risk-averse multi-stage approximate stochastic programming," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 183(C).
Jun Zhan & Mei Huang & Xiaojia Sun & Yubo Zhang & Zuowei Chen & Yilin Chen & Yang Li & Chenyang Zhao & Qian Ai, 2025. "Optimisation Strategy for Electricity–Carbon Sharing Operation of Multi-Virtual Power Plants Considering Multivariate Uncertainties," Energies, MDPI, vol. 18(9), pages 1-23, May.
Liu, Nianxin & Yang, Kangyuan & Zhao, Liang & Ye, Zhencheng, 2025. "Stochastic dual dynamic programming for multi-stage stochastic programming of sustainable utility systems," Energy, Elsevier, vol. 338(C).
Chung-Han Hsieh & Jie-Ling Lu, 2024. "On Accelerating Large-Scale Robust Portfolio Optimization," Papers 2408.07879, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Davi Valladão & Thuener Silva & Marcus Poggi, 2019. "Time-consistent risk-constrained dynamic portfolio optimization with transactional costs and time-dependent returns," Annals of Operations Research, Springer, vol. 282(1), pages 379-405, November.
W. Ackooij & X. Warin, 2020. "On conditional cuts for stochastic dual dynamic programming," EURO Journal on Computational Optimization, Springer;EURO - The Association of European Operational Research Societies, vol. 8(2), pages 173-199, June.
Wu, Dexiang & Wu, Desheng Dash, 2020. "A decision support approach for two-stage multi-objective index tracking using improved lagrangian decomposition," Omega, Elsevier, vol. 91(C).
Soares, Murilo Pereira & Street, Alexandre & Valladão, Davi Michel, 2017. "On the solution variability reduction of Stochastic Dual Dynamic Programming applied to energy planning," European Journal of Operational Research, Elsevier, vol. 258(2), pages 743-760.
Simon Thevenin & Yossiri Adulyasak & Jean-François Cordeau, 2022. "Stochastic Dual Dynamic Programming for Multiechelon Lot Sizing with Component Substitution," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3151-3169, November.
Vitor L. de Matos & David P. Morton & Erlon C. Finardi, 2017. "Assessing policy quality in a multistage stochastic program for long-term hydrothermal scheduling," Annals of Operations Research, Springer, vol. 253(2), pages 713-731, June.
Marianne Akian & Jean-Philippe Chancelier & Benoît Tran, 2025. "A stochastic algorithm for deterministic multistage optimization problems," Annals of Operations Research, Springer, vol. 345(1), pages 1-38, February.
Tushar Rathi & Benjamin P. Riley & Angela Flores-Quiroz & Qi Zhang, 2026. "Column generation for multistage stochastic mixed-integer nonlinear programs with discrete state variables," Journal of Global Optimization, Springer, vol. 94(1), pages 95-126, January.
Hua, Yikang & Zhao, Dongfang & Wang, Xin & Li, Xiaopeng, 2019. "Joint infrastructure planning and fleet management for one-way electric car sharing under time-varying uncertain demand," Transportation Research Part B: Methodological, Elsevier, vol. 128(C), pages 185-206.
Guigues, Vincent & Juditsky, Anatoli & Nemirovski, Arkadi, 2021. "Constant Depth Decision Rules for multistage optimization under uncertainty," European Journal of Operational Research, Elsevier, vol. 295(1), pages 223-232.
Zhou, Shaorui & Zhang, Hui & Shi, Ning & Xu, Zhou & Wang, Fan, 2020. "A new convergent hybrid learning algorithm for two-stage stochastic programs," European Journal of Operational Research, Elsevier, vol. 283(1), pages 33-46.
Huang, Zhouchun & Zheng, Qipeng Phil, 2020. "A multistage stochastic programming approach for preventive maintenance scheduling of GENCOs with natural gas contract," European Journal of Operational Research, Elsevier, vol. 287(3), pages 1036-1051.
Kiszka, Adriana & Wozabal, David, 2025. "Stochastic dual dynamic programming for optimal power flow problems under uncertainty," European Journal of Operational Research, Elsevier, vol. 321(3), pages 814-836.
Pritchard, Geoffrey, 2015. "Stochastic inflow modeling for hydropower scheduling problems," European Journal of Operational Research, Elsevier, vol. 246(2), pages 496-504.
Nan Chen & Xiang Ma & Yanchu Liu & Wei Yu, 2024. "Information Relaxation and a Duality-Driven Algorithm for Stochastic Dynamic Programs," Operations Research, INFORMS, vol. 72(6), pages 2302-2320, November.
Fan, Yingjie & Schwartz, Frank & Voß, Stefan, 2017. "Flexible supply chain planning based on variable transportation modes," International Journal of Production Economics, Elsevier, vol. 183(PC), pages 654-666.
Melega, Gislaine Mara & de Araujo, Silvio Alexandre & Jans, Raf, 2018. "Classification and literature review of integrated lot-sizing and cutting stock problems," European Journal of Operational Research, Elsevier, vol. 271(1), pages 1-19.
Hu, Shaolong & Han, Chuanfeng & Dong, Zhijie Sasha & Meng, Lingpeng, 2019. "A multi-stage stochastic programming model for relief distribution considering the state of road network," Transportation Research Part B: Methodological, Elsevier, vol. 123(C), pages 64-87.
Zhicheng Zhu & Yisha Xiang & Bo Zeng, 2021. "Multicomponent Maintenance Optimization: A Stochastic Programming Approach," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 898-914, July.
Ferstl, Robert & Weissensteiner, Alex, 2011. "Asset-liability management under time-varying investment opportunities," Journal of Banking & Finance, Elsevier, vol. 35(1), pages 182-192, January.
- Ferstl, Robert & Weissensteiner, Alex, 2009. "Asset-Liability Management under time-varying Investment Opportunities," MPRA Paper 15068, University Library of Munich, Germany.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:308:y:2023:i:1:p:321-335. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Value function gradient learning for large-scale multistage stochastic programming problems

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data