Integrated estimate-and-optimize decision trees learning for two-stage linear decision-making problems

Author

Ribeiro, Rafaela
Fanzeres, Bruno

Abstract

Many decision-making problems under uncertainty that arise in industry and in the scientific community can be framed as stochastic programs. Traditionally, these problems are addressed with a sequential two-step process, referred to as predict/estimate-then-optimize, in which a predictive distribution of the uncertain parameters is first estimated and then used to prescribe a decision. However, most predictive methods focus on minimizing forecast error without accounting for its impact on decision quality. Moreover, practitioners often emphasize that their main goal is to obtain near-optimal solutions with minimum decision error, rather than least-error predictions. Therefore, in this work, we discuss a new framework that integrates prediction and prescription into the estimation of the predictive distribution that is subsequently used to devise a decision. We focus on decision trees and study decision-making problems representable as contextual two-stage linear programs. First, we propose a workable framework along with a non-convex optimization model that accounts for the impact of the underlying decision-making problem on the predictive distribution estimation process. Then, we recast the non-convex model as a Mixed-Integer Programming (MIP) problem. Acknowledging that the MIP reformulation is difficult to scale to large instances, we devise a computationally efficient heuristic strategy for the estimation problem that leverages the structure intrinsic to decision trees. A key feature of the proposed decision-making framework is its ability to instantly assess decisions by mapping new contexts to a leaf and retrieving the precomputed solution of the corresponding two-stage problem. A set of numerical experiments is conducted to illustrate the capability and effectiveness of the proposed framework using three distinct two-stage decision-making problems. We benchmark the proposed approach against prescriptions devised by several alternative frameworks: five predict/estimate-then-optimize benchmarks that rely on commonly used predictive and distribution estimation methods, and three benchmarks based on integrated predict-and-optimize decision-making processes. We focus on evaluating solution quality and the computational performance of the MIP reformulation.
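
The leaf-lookup step mentioned in the abstract (map a new context to a leaf, then return that leaf's precomputed decision) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the tree is hand-specified rather than learned by the integrated MIP or heuristic, the two-stage problem is a simple newsvendor (order first, sell afterwards), and the names `Node`, `newsvendor_saa`, `build_leaf`, and `prescribe` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Node:
    """Binary tree node; internal nodes split on one feature, leaves store a decision."""
    feature: Optional[int] = None
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    decision: Optional[float] = None  # set on leaves only


def newsvendor_saa(demands: Sequence[float], cost: float, price: float) -> float:
    """Order quantity minimizing the sample-average cost of an assumed newsvendor.

    First stage: order x at unit cost `cost`.
    Second stage (recourse): sell min(x, d) at unit price `price`.
    The average cost is piecewise linear and convex in x with breakpoints at the
    demand samples, so checking the samples as candidates is enough.
    """
    def avg_cost(x: float) -> float:
        return sum(cost * x - price * min(x, d) for d in demands) / len(demands)

    return min(sorted(demands), key=avg_cost)


def build_leaf(demands: Sequence[float], cost: float = 1.0, price: float = 2.0) -> Node:
    """Solve the leaf's two-stage problem once, offline, and store the solution."""
    return Node(decision=newsvendor_saa(demands, cost, price))


def prescribe(root: Node, context: Sequence[float]) -> float:
    """Route a new context to its leaf and return the precomputed decision."""
    node = root
    while node.decision is None:
        go_left = context[node.feature] <= node.threshold
        node = node.left if go_left else node.right
    return node.decision


# Hand-specified tree (assumption): one split on feature 0 at 0.5,
# with made-up demand samples assigned to each leaf.
tree = Node(
    feature=0,
    threshold=0.5,
    left=build_leaf([8, 10, 12, 14, 16]),
    right=build_leaf([28, 30, 32, 34, 36]),
)

low_context_order = prescribe(tree, [0.2])
high_context_order = prescribe(tree, [0.9])
```

In this sketch the optimization work happens once per leaf when the tree is built, so prescribing for a new context only costs a root-to-leaf traversal. How the splits themselves are chosen to reflect decision quality is the subject of the paper and is not reproduced here.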

Suggested Citation

Ribeiro, Rafaela & Fanzeres, Bruno, 2026. "Integrated estimate-and-optimize decision trees learning for two-stage linear decision-making problems," European Journal of Operational Research, Elsevier, vol. 329(2), pages 607-628.

Handle: RePEc:eee:ejores:v:329:y:2026:i:2:p:607-628
DOI: 10.1016/j.ejor.2025.08.048

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Shuaian Wang & Xuecheng Tian, 2023. "A Deficiency of the Predict-Then-Optimize Framework: Decreased Decision Quality with Increased Data Size," Mathematics, MDPI, vol. 11(15), pages 1-9, July.
van Eekelen, Wouter, 2023. "Distributionally robust views on queues and related stochastic models," Other publications TiSEM 9b99fc05-9d68-48eb-ae8c-9, Tilburg University, School of Economics and Management.
Wei Zhang & Kai Wang & Alexandre Jacquillat & Shuaian Wang, 2023. "Optimized Scenario Reduction: Solving Large-Scale Stochastic Programs with Quality Guarantees," INFORMS Journal on Computing, INFORMS, vol. 35(4), pages 886-908, July.
Qi Feng & J. George Shanthikumar & Jian Wu, 2025. "Contextual Data-Integrated Newsvendor Solution with Operational Data Analytics (ODA)," Management Science, INFORMS, vol. 71(11), pages 9384-9403, November.
Ningyuan Chen & Ming Hu, 2023. "Frontiers in Service Science: Data-Driven Revenue Management: The Interplay of Data, Model, and Decisions," Service Science, INFORMS, vol. 15(2), pages 79-91, June.
Tian, Xuecheng & Wang, Shuaian & Laporte, Gilbert & Yang, Ying, 2024. "Determinism versus uncertainty: Examining the worst-case expected performance of data-driven policies," European Journal of Operational Research, Elsevier, vol. 318(1), pages 242-252.
Stratigakos, Akylas & Pineda, Salvador & Morales, Juan Miguel, 2025. "Decision-focused linear pooling for probabilistic forecast combination," International Journal of Forecasting, Elsevier, vol. 41(3), pages 1112-1125.
Huang, Di & Zhang, Jinyu & Liu, Zhiyuan & He, Yiliu & Liu, Pan, 2024. "A novel ranking method based on semi-SPO for battery swapping allocation optimization in a hybrid electric transit system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 188(C).
Qi Hong & Mo Jia & Xuecheng Tian & Zhiyuan Liu & Shuaian Wang, 2025. "A Surrogate Piecewise Linear Loss Function for Contextual Stochastic Linear Programs in Transport," Mathematics, MDPI, vol. 13(12), pages 1-17, June.
Rohit Kannan & Güzin Bayraksan & James R. Luedtke, 2025. "Technical Note—Data-Driven Sample Average Approximation with Covariate Information," Operations Research, INFORMS, vol. 73(6), pages 3245-3259, November.
Viet Anh Nguyen & Fan Zhang & Shanshan Wang & José Blanchet & Erick Delage & Yinyu Ye, 2025. "Robustifying Conditional Portfolio Decisions via Optimal Transport," Operations Research, INFORMS, vol. 73(5), pages 2801-2829, September.
Tobias Sutter & Bart P. G. Van Parys & Daniel Kuhn, 2024. "A Pareto Dominance Principle for Data-Driven Optimization," Operations Research, INFORMS, vol. 72(5), pages 1976-1999, September.
Li Chen & Melvyn Sim & Xun Zhang & Long Zhao & Minglong Zhou, 2026. "Robust Actionable Prescriptive Analytics," Operations Research, INFORMS, vol. 74(1), pages 550-571, January.
Sadana, Utsav & Chenreddy, Abhilash & Delage, Erick & Forel, Alexandre & Frejinger, Emma & Vidal, Thibaut, 2025. "A survey of contextual optimization methods for decision-making under uncertainty," European Journal of Operational Research, Elsevier, vol. 320(2), pages 271-289.
CHADHA, Jagjit & SCHELLEKENS, Philip, "undated". "Monetary policy loss functions: two cheers for the quadratic," Working Papers 1999002, University of Antwerp, Faculty of Business and Economics.
- Jagjit Chadha & Philip Schellekens, 1999. "Monetary policy loss functions: two cheers for the quadratic," Bank of England working papers 101, Bank of England.
- Schellekens, P. & Chadha, J.S., 1999. "Monetary Policy Loss Functions: Two Cheers for the Quadratic," Cambridge Working Papers in Economics 9920, Faculty of Economics, University of Cambridge.
Baker, Erin & Bosetti, Valentina & Salo, Ahti, "undated". "Finding Common Ground when Experts Disagree: Belief Dominance over Portfolios of Alternatives," MITP: Mitigation, Innovation and Transformation Pathways 243147, Fondazione Eni Enrico Mattei (FEEM).
- Erin Baker & Valentina Bosetti & Ahti Salo, 2016. "Finding Common Ground when Experts Disagree: Belief Dominance over Portfolios of Alternatives," Working Papers 2016.46, Fondazione Eni Enrico Mattei.
Serrano, Breno & Minner, Stefan & Schiffer, Maximilian & Vidal, Thibaut, 2024. "Bilevel optimization for feature selection in the data-driven newsvendor problem," European Journal of Operational Research, Elsevier, vol. 315(2), pages 703-714.
Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
Giovanni Cerulli & Francesco Caracciolo, 2025. "Risk-Adjusted Policy Learning and the Social Cost of Uncertainty: Theory and Evidence from CAP evaluation," Papers 2510.05007, arXiv.org.
Weitzman Nagar, 2007. "Asymmetry in Monetary Policy: An Asymmetric Objective Function and a New-Keynesian Model," Bank of Israel Working Papers 2007.02, Bank of Israel.
