IDEAS home Printed from https://ideas.repec.org/p/hal/cesptp/halshs-05371342.html

Large-scale experimental investigation of the reliability of confidence measures

Author

Listed:
  • Clémentine Bouleau

    (PSE - Paris School of Economics - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement - ENPC - École nationale des ponts et chaussées - IP Paris - Institut Polytechnique de Paris, CES - Centre d'économie de la Sorbonne - UP1 - Université Paris 1 Panthéon-Sorbonne - CNRS - Centre National de la Recherche Scientifique)

  • Maël Lebreton

    (PSE - Paris School of Economics - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement - ENPC - École nationale des ponts et chaussées - IP Paris - Institut Polytechnique de Paris, PJSE - Paris Jourdan Sciences Economiques - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement - ENPC - École nationale des ponts et chaussées - IP Paris - Institut Polytechnique de Paris)

  • Nicolas Jacquemet

    (CES - Centre d'économie de la Sorbonne - UP1 - Université Paris 1 Panthéon-Sorbonne - CNRS - Centre National de la Recherche Scientifique, PSE - Paris School of Economics - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement - ENPC - École nationale des ponts et chaussées - IP Paris - Institut Polytechnique de Paris)

Abstract

Whether individuals feel confident about their own actions, choices, or statements being correct, and how these confidence levels differ between individuals are two key primitives for countless behavioral theories and phenomena. In cognitive tasks, individual confidence is typically measured as the average of reports about choice accuracy, but how reliable is the resulting characterization of within- and between-individual confidence remains surprisingly undocumented. Here, we perform a large-scale resampling exercise in the Confidence Database (103 studies, 6000 participants) to investigate the reliability of individual confidence estimates, and of comparisons across individuals' confidence levels. Our results show that confidence estimates are more stable than their choice-accuracy counterpart, reaching a reliability plateau after roughly 50 trials, regardless of a number of task design characteristics. While constituting a reliability upper-bound for task-based confidence measures, and thereby leaving open the question of the reliability of the construct itself, these results characterize the robustness of past and future task designs.

Suggested Citation

  • Clémentine Bouleau & Maël Lebreton & Nicolas Jacquemet, 2025. "Large-scale experimental investigation of the reliability of confidence measures," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-05371342, HAL.
  • Handle: RePEc:hal:cesptp:halshs-05371342
    DOI: 10.1038/s44271-025-00330-6
    Note: View the original document on HAL open archive server: https://shs.hal.science/halshs-05371342v1
    as

    Download full text from publisher

    File URL: https://shs.hal.science/halshs-05371342v1/document
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s44271-025-00330-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Smith, Vernon L & Walker, James M, 1993. "Monetary Rewards and Decision Cost in Experimental Economics," Economic Inquiry, Western Economic Association International, vol. 31(2), pages 245-261, April.
    2. Roey Schurr & Daniel Reznik & Hanna Hillman & Rahul Bhui & Samuel J. Gershman, 2024. "Dynamic computational phenotyping of human cognition," Nature Human Behaviour, Nature, vol. 8(5), pages 917-931, May.
    3. Florian Zimmermann, 2020. "The Dynamics of Motivated Beliefs," American Economic Review, American Economic Association, vol. 110(2), pages 337-361, February.
    4. Rebecca K West & William J Harrison & Natasha Matthews & Jason B Mattingley & David K Sewell, 2023. "Modality independent or modality specific? Common computations underlie confidence judgements in visual and auditory decisions," PLOS Computational Biology, Public Library of Science, vol. 19(7), pages 1-39, July.
    5. Dobromir Rahnev, 2025. "A comprehensive assessment of current methods for measuring metacognition," Nature Communications, Nature, vol. 16(1), pages 1-19, December.
    6. Maël Lebreton & Sophie Bavard & Jean Daunizeau & Stefano Palminteri, 2019. "Assessing inter-individual differences with task-related functional neuroimaging," Nature Human Behaviour, Nature, vol. 3(9), pages 897-905, September.
    7. Pietro Ortoleva & Erik Snowberg, 2015. "Overconfidence in Political Behavior," American Economic Review, American Economic Association, vol. 105(2), pages 504-535, February.
    8. Binnendyk, Jabin & Pennycook, Gordon, 2024. "Individual differences in overconfidence: A new measurement approach," Judgment and Decision Making, Cambridge University Press, vol. 19, pages 1-1, January.
    9. Andreas Pedroni & Renato Frey & Adrian Bruhin & Gilles Dutilh & Ralph Hertwig & Jörg Rieskamp, 2017. "The risk elicitation puzzle," Nature Human Behaviour, Nature, vol. 1(11), pages 803-809, November.
    10. Li, Sophia & Hale, Randall & Moore, Don A., 2025. "Is overconfidence an individual difference?," Judgment and Decision Making, Cambridge University Press, vol. 20, pages 1-1, January.
    11. Muriel Niederle & Lise Vesterlund, 2007. "Do Women Shy Away From Competition? Do Men Compete Too Much?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(3), pages 1067-1101.
    12. Koellinger, Philipp & Minniti, Maria & Schade, Christian, 2007. ""I think I can, I think I can": Overconfidence and entrepreneurial behavior," Journal of Economic Psychology, Elsevier, vol. 28(4), pages 502-527, August.
    13. Karl Schlag & James Tremewan & Joël Weele, 2015. "A penny for your thoughts: a survey of methods for eliciting beliefs," Experimental Economics, Springer;Economic Science Association, vol. 18(3), pages 457-490, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Clémentine Bouleau & Nicolas Jacquemet & Maël Lebreton, 2025. "How large is "large enough" ? Large-scale experimental investigation of the reliability of confidence measures," PSE Working Papers halshs-04893009, HAL.
    2. Peter Schwardmann & Joël van der Weele, 2016. "Deception and Self-Deception," Tinbergen Institute Discussion Papers 16-012/I, Tinbergen Institute.
    3. Gangadharan, Lata & Grossman, Philip J. & Xue, Nina, 2024. "Belief elicitation under competing motivations: Does it matter how you ask?," European Economic Review, Elsevier, vol. 169(C).
    4. Pedro Gonzalez-Fernandez, 2024. "Belief Bias Identification," Papers 2404.09297, arXiv.org, revised Mar 2026.
    5. Thomas Buser & Leonie Gerhards & Joël Weele, 2018. "Responsiveness to feedback as a personal trait," Journal of Risk and Uncertainty, Springer, vol. 56(2), pages 165-192, April.
    6. Lata Gangadharan & Philip J. Grossman & Nina Xue, 2022. "Stepping Stone: Identifying self-image concerns from motivated beliefs: Does it matter how and whom you ask?," Monash Economics Working Papers 2022-05, Monash University, Department of Economics.
    7. Schüssler, Katharina, 2018. "The Influence of Overconfidence and Competition Neglect On Entry Into Competition," Rationality and Competition Discussion Paper Series 87, CRC TRR 190 Rationality and Competition.
    8. Ersoy, Fulya Y., 2025. "Do incentives matter in elicitation of beliefs?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 119(C).
    9. Christoph Drobner, 2020. "Motivated Beliefs and Anticipation of Uncertainty Resolution," Munich Papers in Political Economy 07, Munich School of Politics and Public Policy and the School of Management at the Technical University of Munich.
    10. James C. Cox & Vjollca Sadiraj, 2018. "Incentives," Experimental Economics Center Working Paper Series 2018-01, Experimental Economics Center, Andrew Young School of Policy Studies, Georgia State University.
    11. Banerjee, Ritwik & Gupta, Nabanita Datta & Villeval, Marie Claire, 2020. "Feedback spillovers across tasks, self-confidence and competitiveness," Games and Economic Behavior, Elsevier, vol. 123(C), pages 127-170.
    12. Verheul, Ingrid & Thurik, Roy & Grilo, Isabel & van der Zwan, Peter, 2012. "Explaining preferences and actual involvement in self-employment: Gender and the entrepreneurial personality," Journal of Economic Psychology, Elsevier, vol. 33(2), pages 325-341.
    13. Omar Al-Ubaydli & John List, 2013. "On the Generalizability of Experimental Results in Economics: With A Response To Camerer," Artefactual Field Experiments j0001, The Field Experiments Website.
    14. Assenza, Tiziana, 2021. "The Ability to 'Distill the Truth'," TSE Working Papers 21-1280, Toulouse School of Economics (TSE), revised Mar 2022.
    15. Vanessa Valero, 2022. "Redistribution and beliefs about the source of income inequality," Experimental Economics, Springer;Economic Science Association, vol. 25(3), pages 876-901, June.
    16. Juan Dubra & Jean-Pierre Beno t & Giorgia Romagnoli, 2019. "Belief elicitation when more than money matters," Documentos de Trabajo/Working Papers 1901, Facultad de Ciencias Empresariales y Economia. Universidad de Montevideo..
    17. Jean-Pierre Benoît & Juan Dubra & Giorgia Romagnoli, 2022. "Belief Elicitation When More than Money Matters: Controlling for "Control"," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 837-888, August.
    18. Utteeyo Dasgupta & Subha Mani & Smriti Sharma & Saurabh Singhal, 2017. "Cognitive, socioemotional, and behavioural returns to college quality," WIDER Working Paper Series wp-2017-94, World Institute for Development Economic Research (UNU-WIDER).
    19. Stephen L. Cheung & Lachlan Johnstone, 2025. "True overconfidence, revealed through actions: An experiment," Journal of Risk and Uncertainty, Springer, vol. 70(2), pages 171-199, April.
    20. Thomas Buser & Leonie Gerhards & Joël J. van der Weele, 2016. "Measuring Responsiveness to Feedback as a Personal Trait," Tinbergen Institute Discussion Papers 16-043/I, Tinbergen Institute.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:cesptp:halshs-05371342. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.