IDEAS home Printed from https://ideas.repec.org/a/wly/fufsci/v7y2025i1ne199.html

Calibration Feedback With the Practical Scoring Rule Does Not Improve Calibration of Confidence

Author

Listed:
  • Matthew Martin
  • David R. Mandel

Abstract

People are often overconfident in their probabilistic judgments of future events or the state of their own knowledge. Some training methods have proven effective at reducing bias, but these usually involve intensive training sessions with experienced facilitators. This is not conducive to a scalable and domain‐general training program for improving calibration. In two experiments (N1 = 610, N2 = 871), we examined the effectiveness of a performance feedback calibration training paradigm based on the Practical scoring rule, a modification of the logarithmic scoring rule designed to be more intuitive to facilitate learning. We examined this training regime in comparison to a control group and an outcome feedback group. Participants were tasked with selecting which of two world urban agglomerations had a higher population and to provide their confidence level. The outcome feedback group received information about the correctness of their choice on a trial‐by‐trial basis as well as a summary of their percent correct after each experimental block. The performance feedback group received this information plus the Practical score on a trial‐by‐trial basis and information about their overall over‐ or underconfidence at the end of each block. We also examined whether Actively Open‐Minded Thinking (AOMT) was predictive of calibration and its change across blocks. We found no improvement in calibration due to either training regime. Good calibration overall was predicted by AOMT, but not its change across blocks. The results shed light on the generalizability of other findings showing positive effects of performance training using the Practical scoring rule.

Suggested Citation

  • Matthew Martin & David R. Mandel, 2025. "Calibration Feedback With the Practical Scoring Rule Does Not Improve Calibration of Confidence," Futures & Foresight Science, John Wiley & Sons, vol. 7(1), April.
  • Handle: RePEc:wly:fufsci:v:7:y:2025:i:1:n:e199
    DOI: 10.1002/ffo2.199
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/ffo2.199
    Download Restriction: no

    File URL: https://libkey.io/10.1002/ffo2.199?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Du, Ning & Budescu, David V., 2007. "Does past volatility affect investors' price forecasts and confidence judgements?," International Journal of Forecasting, Elsevier, vol. 23(3), pages 497-511.
    2. David R. Mandel & Daniel Irwin, 2021. "Tracking accuracy of strategic intelligence forecasts: Findings from a long‐term Canadian study," Futures & Foresight Science, John Wiley & Sons, vol. 3(3-4), September.
    3. Ning Du & Sandra Shelton & Ray Whittington, 2012. "Does Supplementing Outcome Feedback with Performance Feedback Improve Probability Judgments?," International Journal of Financial Research, International Journal of Financial Research, Sciedu Press, vol. 3(4), pages 19-32, October.
    4. Benson, P. George & Onkal, Dilek, 1992. "The effects of feedback and training on the performance of probability forecasters," International Journal of Forecasting, Elsevier, vol. 8(4), pages 559-573, December.
    5. Lawrence, Michael & Goodwin, Paul & O'Connor, Marcus & Onkal, Dilek, 2006. "Judgmental forecasting: A review of progress over the last 25 years," International Journal of Forecasting, Elsevier, vol. 22(3), pages 493-518.
    6. Erceg, Nikola & Galić, Zvonimir, 2014. "Overconfidence bias and conjunction fallacy in predicting outcomes of football matches," Journal of Economic Psychology, Elsevier, vol. 42(C), pages 52-62.
    7. Daniel M Benjamin & Spencer P Hey & Amanda MacPherson & Yasmina Hachem & Kara S Smith & Sean X Zhang & Sandy Wong & Samantha Dolter & David R Mandel & Jonathan Kimmelman, 2022. "Principal investigators over-optimistically forecast scientific and operational outcomes for clinical trials," PLOS ONE, Public Library of Science, vol. 17(2), pages 1-13, February.
    8. Shane Frederick, 2005. "Cognitive Reflection and Decision Making," Journal of Economic Perspectives, American Economic Association, vol. 19(4), pages 25-42, Fall.
    9. Chul-Ho Bum & Chulhwan Choi & Kyongmin Lee, 2018. "Irrational Beliefs and Social Adaptation of Online Sports Gamblers According to Addiction Level: A Comparative Study," Sustainability, MDPI, vol. 10(11), pages 1-11, November.
    10. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2012. "Causal Inference without Balance Checking: Coarsened Exact Matching," Political Analysis, Cambridge University Press, vol. 20(1), pages 1-24, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ross Gruetzemacher & Kang Bok Lee & David Paradice, 2024. "Calibration training for improving probabilistic judgments using an interactive app," Futures & Foresight Science, John Wiley & Sons, vol. 6(2), June.
    2. Dan Zhu & Qingwei Wang & John Goddard, 2022. "A new hedging hypothesis regarding prediction interval formation in stock price forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 697-717, July.
    3. Chao Wu & Mahyar Eftekhar, 2024. "Does Volunteering Crowd Out Donations? Evidence from Online Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 26(4), pages 1542-1566, July.
    4. Doron Sonsino & Tal Shavit, 2014. "Return prediction and stock selection from unidentified historical data," Quantitative Finance, Taylor & Francis Journals, vol. 14(4), pages 641-655, April.
    5. Alvarado-Valencia, Jorge & Barrero, Lope H. & Önkal, Dilek & Dennerlein, Jack T., 2017. "Expertise, credibility of system forecasts and integration methods in judgmental demand forecasting," International Journal of Forecasting, Elsevier, vol. 33(1), pages 298-313.
    6. Brent Moritz & Enno Siemsen & Mirko Kremer, 2014. "Judgmental Forecasting: Cognitive Reflection and Decision Speed," Production and Operations Management, Production and Operations Management Society, vol. 23(7), pages 1146-1160, July.
    7. Katsagounos, Ilias & Thomakos, Dimitrios D. & Litsiou, Konstantia & Nikolopoulos, Konstantinos, 2021. "Superforecasting reality check: Evidence from a small pool of experts and expedited identification," European Journal of Operational Research, Elsevier, vol. 289(1), pages 107-117.
    8. Song, Haiyan & Gao, Bastian Z. & Lin, Vera S., 2013. "Combining statistical and judgmental forecasts via a web-based tourism demand forecasting system," International Journal of Forecasting, Elsevier, vol. 29(2), pages 295-310.
    9. Xiaoxiao Niu & Nigel Harvey, 2022. "Point, interval, and density forecasts: Differences in bias, judgment noise, and overall accuracy," Futures & Foresight Science, John Wiley & Sons, vol. 4(3-4), September.
    10. Glaser, Markus & Langer, Thomas & Reynders, Jens & Weber, Martin, 2008. "Scale Dependence of Overconfidence in Stock Market Volatility Forecasts," Sonderforschungsbereich 504 Publications 08-22, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
    11. Ian Durbach & Gilberto Montibeller, 2018. "Predicting in shock: on the impact of negative, extreme, rare, and short lived events on judgmental forecasts," EURO Journal on Decision Processes, Springer;EURO - The Association of European Operational Research Societies, vol. 6(1), pages 213-233, June.
    12. Justin F. Landy, 2016. "Representations of moral violations: Category members and associated features," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 11(5), pages 496-508, September.
    13. Insoo Cho & Peter F. Orazem, 2021. "How endogenous risk preferences and sample selection affect analysis of firm survival," Small Business Economics, Springer, vol. 56(4), pages 1309-1332, April.
    14. David J. Cooper & Krista Saral & Marie Claire Villeval, 2021. "Why Join a Team?," Management Science, INFORMS, vol. 67(11), pages 6980-6997, November.
    15. Jansesberger, Viktoria, 2024. "Storms, floods, landslides and elections in India's growing metropolises: Hotbeds for political protest?," Working Papers 28, University of Konstanz, Cluster of Excellence "The Politics of Inequality. Perceptions, Participation and Policies".
    16. Zakaria Babutsidze & Nobuyuki Hanaki & Adam Zylbersztejn, 2019. "Digital Communication and Swift Trust," Post-Print halshs-02409314, HAL.
    17. Brice Corgnet & Roberto Hernán Gonzalez & Ricardo Mateo, 2015. "Cognitive Reflection and the Diligent Worker: An Experimental Study of Millennials," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-13, November.
    18. Baecke, Philippe & De Baets, Shari & Vanderheyden, Karlien, 2017. "Investigating the added value of integrating human judgement into statistical demand forecasting systems," International Journal of Production Economics, Elsevier, vol. 191(C), pages 85-96.
    19. Francesco Capozza & Ingar Haaland & Christopher Roth & Johannes Wohlfart, 2021. "Studying Information Acquisition in the Field: A Practical Guide and Review," CEBI working paper series 21-15, University of Copenhagen. Department of Economics. The Center for Economic Behavior and Inequality (CEBI).
    20. Luigi Guiso, 2015. "A Test of Narrow Framing and its Origin," Italian Economic Journal: A Continuation of Rivista Italiana degli Economisti and Giornale degli Economisti, Springer;Società Italiana degli Economisti (Italian Economic Association), vol. 1(1), pages 61-100, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:fufsci:v:7:y:2025:i:1:n:e199. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://doi.org/10.1002/(ISSN)2573-5152 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.