Printed from https://ideas.repec.org/a/eee/ejores/v329y2026i2p607-628.html

Integrated estimate-and-optimize decision trees learning for two-stage linear decision-making problems

Author

Listed:
  • Ribeiro, Rafaela
  • Fanzeres, Bruno

Abstract

Several problems of decision-making under uncertainty found in industry and the scientific community can be framed as stochastic programs. Traditionally, these problems are addressed using a sequential two-step process, referred to as predict/estimate-then-optimize, in which a predictive distribution of the uncertain parameters is first estimated and then used to prescribe a decision. However, most predictive methods focus on minimizing forecast error, without accounting for its impact on decision quality. Moreover, practitioners often emphasize that their main goal is to obtain near-optimal solutions with minimum decision error, rather than least-error predictions. Therefore, in this work, we discuss a new framework that integrates prediction and prescription into the estimation of the predictive distribution that is subsequently used to devise a decision. We particularly focus on decision trees and study decision-making problems representable as contextual two-stage linear programs. First, we propose a workable framework along with a non-convex optimization model to account for the impact of the underlying decision-making problem on the predictive distribution estimation process. Then, we recast the non-convex model as a Mixed-Integer Programming (MIP) problem. Acknowledging the difficulty of scaling the MIP reformulation to large instances, we devise a computationally efficient heuristic strategy for the estimation problem that leverages the structure intrinsic to decision trees. A key feature of the proposed decision-making framework is its ability to instantly assess decisions by mapping new contexts to a leaf and retrieving the precomputed solution of the corresponding two-stage problem. A set of numerical experiments is conducted to illustrate the capability and effectiveness of the proposed framework using three distinct two-stage decision-making problems. We benchmark the proposed approach against prescriptions devised by various alternative frameworks. Five predict/estimate-then-optimize benchmarks that rely on commonly used predictive and distribution estimation methods and three benchmarks based on integrated predict-and-optimize decision-making processes are considered. We focus on evaluating solution quality and the computational performance of the MIP reformulation.
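
The leaf-lookup idea mentioned in the abstract (map a new context to a leaf, then retrieve a precomputed solution) can be illustrated with a minimal sketch. This is not the authors' model or code: the one-split tree is fixed by hand rather than learned, the data are made up, and a simple newsvendor stands in for a generic two-stage linear problem. The names `DATA`, `THRESHOLDS`, `COST`, `PRICE`, `leaf_of`, `solve_leaf`, and `prescribe` are all hypothetical.

```python
from bisect import bisect_right

# Hypothetical training data: (context, observed demand) pairs.
DATA = [(0.1, 8), (0.2, 10), (0.3, 12), (0.4, 9),
        (0.6, 20), (0.7, 25), (0.8, 22), (0.9, 30)]

# Hypothetical hand-fixed tree with one split: contexts below 0.5 fall in
# leaf 0, all others in leaf 1. (The paper learns the tree; this sketch does not.)
THRESHOLDS = [0.5]

# Assumed first-stage unit order cost and second-stage unit selling price.
COST, PRICE = 4.0, 10.0


def leaf_of(context):
    """Route a scalar context to a leaf index using the split thresholds."""
    return bisect_right(THRESHOLDS, context)


def expected_cost(order, demands):
    """Order cost minus the average second-stage revenue over the scenarios."""
    revenue = sum(PRICE * min(order, d) for d in demands) / len(demands)
    return COST * order - revenue


def solve_leaf(demands):
    """Solve the leaf's newsvendor problem by enumerating scenario values.

    The objective is piecewise linear and convex in the order quantity, with
    breakpoints at the demand scenarios, so (with COST < PRICE) an optimal
    order can be found among the observed demands.
    """
    return min(sorted(set(demands)), key=lambda q: expected_cost(q, demands))


# Group the training demands by leaf: each leaf's empirical distribution.
leaf_demands = {}
for context, demand in DATA:
    leaf_demands.setdefault(leaf_of(context), []).append(demand)

# Solve one problem per leaf ahead of time.
PRECOMPUTED = {leaf: solve_leaf(ds) for leaf, ds in leaf_demands.items()}


def prescribe(context):
    """Return the precomputed decision of the leaf the context falls into."""
    return PRECOMPUTED[leaf_of(context)]


if __name__ == "__main__":
    for x in (0.25, 0.75):
        print(f"context={x}: leaf={leaf_of(x)}, order={prescribe(x)}")
```

In this sketch, prescribing for a new context costs only a threshold comparison and a dictionary lookup, since the optimization is done once per leaf beforehand.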

Suggested Citation

  • Ribeiro, Rafaela & Fanzeres, Bruno, 2026. "Integrated estimate-and-optimize decision trees learning for two-stage linear decision-making problems," European Journal of Operational Research, Elsevier, vol. 329(2), pages 607-628.
  • Handle: RePEc:eee:ejores:v:329:y:2026:i:2:p:607-628
    DOI: 10.1016/j.ejor.2025.08.048

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S037722172500685X
    Download Restriction: Full text for ScienceDirect subscribers only

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qi Hong & Mo Jia & Xuecheng Tian & Zhiyuan Liu & Shuaian Wang, 2025. "A Surrogate Piecewise Linear Loss Function for Contextual Stochastic Linear Programs in Transport," Mathematics, MDPI, vol. 13(12), pages 1-17, June.
    2. Shuaian Wang & Xuecheng Tian, 2023. "A Deficiency of the Predict-Then-Optimize Framework: Decreased Decision Quality with Increased Data Size," Mathematics, MDPI, vol. 11(15), pages 1-9, July.
    3. van Eekelen, Wouter, 2023. "Distributionally robust views on queues and related stochastic models," Other publications TiSEM 9b99fc05-9d68-48eb-ae8c-9, Tilburg University, School of Economics and Management.
    4. Wei Zhang & Kai Wang & Alexandre Jacquillat & Shuaian Wang, 2023. "Optimized Scenario Reduction: Solving Large-Scale Stochastic Programs with Quality Guarantees," INFORMS Journal on Computing, INFORMS, vol. 35(4), pages 886-908, July.
    5. Sadana, Utsav & Chenreddy, Abhilash & Delage, Erick & Forel, Alexandre & Frejinger, Emma & Vidal, Thibaut, 2025. "A survey of contextual optimization methods for decision-making under uncertainty," European Journal of Operational Research, Elsevier, vol. 320(2), pages 271-289.
    6. Tian, Xuecheng & Wang, Shuaian & Laporte, Gilbert & Yang, Ying, 2024. "Determinism versus uncertainty: Examining the worst-case expected performance of data-driven policies," European Journal of Operational Research, Elsevier, vol. 318(1), pages 242-252.
    7. Stratigakos, Akylas & Pineda, Salvador & Morales, Juan Miguel, 2025. "Decision-focused linear pooling for probabilistic forecast combination," International Journal of Forecasting, Elsevier, vol. 41(3), pages 1112-1125.
    8. Huang, Di & Zhang, Jinyu & Liu, Zhiyuan & He, Yiliu & Liu, Pan, 2024. "A novel ranking method based on semi-SPO for battery swapping allocation optimization in a hybrid electric transit system," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 188(C).
    9. Nowak, Piotr Bolesław, 2016. "The MLE of the mean of the exponential distribution based on grouped data is stochastically increasing," Statistics & Probability Letters, Elsevier, vol. 111(C), pages 49-54.
    10. Athanasia Gavala & Nikolay Gospodinov & Deming Jiang, 2006. "Forecasting volatility," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 25(6), pages 381-400.
    11. CHADHA, Jagjit & SCHELLEKENS, Philip, "undated". "Monetary policy loss functions: two cheers for the quadratic," Working Papers 1999002, University of Antwerp, Faculty of Business and Economics.
    12. Tian, Xuecheng & Yan, Ran & Liu, Yannick & Wang, Shuaian, 2023. "A smart predict-then-optimize method for targeted and cost-effective maritime transportation," Transportation Research Part B: Methodological, Elsevier, vol. 172(C), pages 32-52.
    13. Baker, Erin & Bosetti, Valentina & Salo, Ahti, "undated". "Finding Common Ground when Experts Disagree: Belief Dominance over Portfolios of Alternatives," MITP: Mitigation, Innovation and Transformation Pathways 243147, Fondazione Eni Enrico Mattei (FEEM).
    14. Serrano, Breno & Minner, Stefan & Schiffer, Maximilian & Vidal, Thibaut, 2024. "Bilevel optimization for feature selection in the data-driven newsvendor problem," European Journal of Operational Research, Elsevier, vol. 315(2), pages 703-714.
    15. Camilo Alberto Cárdenas-Hurtado & Aaron Levi Garavito-Acosta & Jorge Hernán Toro-Córdoba, 2018. "Asymmetric Effects of Terms of Trade Shocks on Tradable and Non-tradable Investment Rates: The Colombian Case," Borradores de Economia 1043, Banco de la Republica de Colombia.
    16. Lars M. Hvattum & Arne Løkketangen & Gilbert Laporte, 2006. "Solving a Dynamic and Stochastic Vehicle Routing Problem with a Sample Scenario Hedging Heuristic," Transportation Science, INFORMS, vol. 40(4), pages 421-438, November.
    17. Fritsche, Ulrich & Pierdzioch, Christian & Rülke, Jan-Christoph & Stadtmann, Georg, 2015. "Forecasting the Brazilian real and the Mexican peso: Asymmetric loss, forecast rationality, and forecaster herding," International Journal of Forecasting, Elsevier, vol. 31(1), pages 130-139.
    18. Anastasiou, Andreas, 2017. "Bounds for the normal approximation of the maximum likelihood estimator from m-dependent random variables," Statistics & Probability Letters, Elsevier, vol. 129(C), pages 171-181.
    19. Deprez, Laurens & Antonio, Katrien & Boute, Robert, 2021. "Pricing service maintenance contracts using predictive analytics," European Journal of Operational Research, Elsevier, vol. 290(2), pages 530-545.
    20. Corradi, Valentina & Swanson, Norman R., 2004. "Some recent developments in predictive accuracy testing with nested models and (generic) nonlinear alternatives," International Journal of Forecasting, Elsevier, vol. 20(2), pages 185-199.
