IDEAS home Printed from https://ideas.repec.org/a/inm/orijds/v4y2025i2p101-113.html

Cost-Aware Calibration of Classifiers

Author

Listed:
  • Mochen Yang

    (Department of Information and Decision Sciences, Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455)

  • Xuan Bi

    (Department of Information and Decision Sciences, Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455)

Abstract

Most classification techniques in machine learning are able to produce probability predictions in addition to class predictions. However, these predicted probabilities are often not well calibrated in that they deviate from the actual outcome rates (i.e., the proportion of data instances that actually belong to a certain class). A lack of calibration can jeopardize downstream decision tasks that rely on accurate probability predictions. Although several post hoc calibration methods have been proposed, they generally do not consider the potentially asymmetric costs associated with overprediction versus underprediction. In this research, we formally define the problem of cost-aware calibration and propose a metric to quantify the cost of miscalibration for a given classifier. Next, we propose three approaches to achieve cost-aware calibration, two of which are cost-aware adaptations of existing calibration algorithms; the third one (named MetaCal ) is a Bayes optimal learning algorithm inspired by prior work on cost-aware classification. We carry out systematic empirical evaluations on multiple public data sets to demonstrate the effectiveness of the proposed approaches in reducing the cost of miscalibration. Finally, we generalize the definition and metric as well as solution algorithms of cost-aware calibration to account for nonlinear cost structures that may arise in real-world decision tasks.

Suggested Citation

  • Mochen Yang & Xuan Bi, 2025. "Cost-Aware Calibration of Classifiers," INFORMS Joural on Data Science, INFORMS, vol. 4(2), pages 101-113, April.
  • Handle: RePEc:inm:orijds:v:4:y:2025:i:2:p:101-113
    DOI: 10.1287/ijds.2024.0038
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijds.2024.0038
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijds.2024.0038?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gah-Yi Ban & Cynthia Rudin, 2019. "The Big Data Newsvendor: Practical Insights from Machine Learning," Operations Research, INFORMS, vol. 67(1), pages 90-108, January.
    2. Viaene, Stijn & Dedene, Guido, 2005. "Cost-sensitive learning and decision making revisited," European Journal of Operational Research, Elsevier, vol. 166(1), pages 212-220, October.
    3. Mohsen Bayati & Mark Braverman & Michael Gillam & Karen M Mack & George Ruiz & Mark S Smith & Eric Horvitz, 2014. "Data-Driven Decisions for Reducing Readmissions for Heart Failure: General Methodology and Case Study," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-9, October.
    4. Dimitris Bertsimas & Nathan Kallus, 2020. "From Predictive to Prescriptive Analytics," Management Science, INFORMS, vol. 66(3), pages 1025-1044, March.
    5. Huber, Jakob & Müller, Sebastian & Fleischmann, Moritz & Stuckenschmidt, Heiner, 2019. "A data-driven newsvendor problem: From data to decision," European Journal of Operational Research, Elsevier, vol. 278(3), pages 904-915.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Serrano, Breno & Minner, Stefan & Schiffer, Maximilian & Vidal, Thibaut, 2024. "Bilevel optimization for feature selection in the data-driven newsvendor problem," European Journal of Operational Research, Elsevier, vol. 315(2), pages 703-714.
    2. Yang, Cheng-Hu & Wang, Hai-Tang & Ma, Xin & Talluri, Srinivas, 2023. "A data-driven newsvendor problem: A high-dimensional and mixed-frequency method," International Journal of Production Economics, Elsevier, vol. 266(C).
    3. Liu, Congzheng & Letchford, Adam N. & Svetunkov, Ivan, 2022. "Newsvendor problems: An integrated method for estimation and optimisation," European Journal of Operational Research, Elsevier, vol. 300(2), pages 590-601.
    4. Schmidt, Felix G. & Pibernik, Richard, 2025. "Data-driven inventory control for large product portfolios: A practical application of prescriptive analytics," European Journal of Operational Research, Elsevier, vol. 322(1), pages 254-269.
    5. Corredera, Alberto & Ruiz, Carlos, 2023. "Prescriptive selection of machine learning hyperparameters with applications in power markets: Retailer’s optimal trading," European Journal of Operational Research, Elsevier, vol. 306(1), pages 370-388.
    6. Wang, Wanpeng & Deng, Shiming & Zhang, Yuying, 2025. "Data-driven ordering policies for target oriented newsvendor with censored demand," European Journal of Operational Research, Elsevier, vol. 323(1), pages 86-96.
    7. Thais de Castro Moraes & Jiancheng Qin & Xue-Ming Yuan & Ek Peng Chew, 2023. "Evolving Hybrid Deep Neural Network Models for End-to-End Inventory Ordering Decisions," Logistics, MDPI, vol. 7(4), pages 1-18, November.
    8. Erkip, Nesim Kohen, 2023. "Can accessing much data reshape the theory? Inventory theory under the challenge of data-driven systems," European Journal of Operational Research, Elsevier, vol. 308(3), pages 949-959.
    9. Corredera Barbado, Alberto & Ruiz Mora, Carlos, 2022. "Prescriptive selection of machine learning hyperparameters with applications in power markets: retailer's optimal trading," DES - Working Papers. Statistics and Econometrics. WS 33693, Universidad Carlos III de Madrid. Departamento de Estadística.
    10. Sadana, Utsav & Chenreddy, Abhilash & Delage, Erick & Forel, Alexandre & Frejinger, Emma & Vidal, Thibaut, 2025. "A survey of contextual optimization methods for decision-making under uncertainty," European Journal of Operational Research, Elsevier, vol. 320(2), pages 271-289.
    11. Zhen-Yu Chen & Zhi-Ping Fan & Minghe Sun, 2023. "Machine Learning Methods for Data-Driven Demand Estimation and Assortment Planning Considering Cross-Selling and Substitutions," INFORMS Journal on Computing, INFORMS, vol. 35(1), pages 158-177, January.
    12. Tian, Xuecheng & Wang, Shuaian & Laporte, Gilbert & Yang, Ying, 2024. "Determinism versus uncertainty: Examining the worst-case expected performance of data-driven policies," European Journal of Operational Research, Elsevier, vol. 318(1), pages 242-252.
    13. Felix Wick & Ulrich Kerzel & Martin Hahn & Moritz Wolf & Trapti Singhal & Daniel Stemmer & Jakob Ernst & Michael Feindt, 2021. "Demand Forecasting of Individual Probability Density Functions with Machine Learning," SN Operations Research Forum, Springer, vol. 2(3), pages 1-39, September.
    14. Olivares-Nadal, Alba V., 2024. "Constructing decision rules for multiproduct newsvendors: An integrated estimation-and-optimization framework," European Journal of Operational Research, Elsevier, vol. 315(3), pages 1021-1037.
    15. Viet Anh Nguyen & Fan Zhang & Shanshan Wang & Jose Blanchet & Erick Delage & Yinyu Ye, 2021. "Robustifying Conditional Portfolio Decisions via Optimal Transport," Papers 2103.16451, arXiv.org, revised Apr 2024.
    16. Meng Qi & Ying Cao & Zuo-Jun (Max) Shen, 2022. "Distributionally Robust Conditional Quantile Prediction with Fixed Design," Management Science, INFORMS, vol. 68(3), pages 1639-1658, March.
    17. Jos'e-Manuel Pe~na & Fernando Su'arez & Omar Larr'e & Domingo Ram'irez & Arturo Cifuentes, 2023. "A Modified CTGAN-Plus-Features Based Method for Optimal Asset Allocation," Papers 2302.02269, arXiv.org, revised May 2024.
    18. Luhao Zhang & Jincheng Yang & Rui Gao, 2024. "Optimal Robust Policy for Feature-Based Newsvendor," Management Science, INFORMS, vol. 70(4), pages 2315-2329, April.
    19. Mingyang Fu & Xiaobo Li & Lianmin Zhang, 2024. "Distributionally Robust Newsvendor Under Stochastic Dominance with a Feature-Based Application," Manufacturing & Service Operations Management, INFORMS, vol. 26(5), pages 1962-1977, September.
    20. Adam N. Elmachtoub & Paul Grigas, 2022. "Smart “Predict, then Optimize”," Management Science, INFORMS, vol. 68(1), pages 9-26, January.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:4:y:2025:i:2:p:101-113. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.