Die Interpretation des p-Wertes – Grundsätzliche Missverständnisse [The Interpretation of the p-Value: Fundamental Misconceptions]

Author

Hirschauer Norbert
(Professur für Unternehmensführung im Agribusiness, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))
Mußhoff Oliver
(Arbeitsbereich Landwirtschaftliche Betriebslehre, Georg-August-Universität Göttingen, Platz der Göttinger Sieben 5, 37073 Göttingen)
Grüner Sven
(Professur für Unternehmensführung im Agribusiness, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))
Frey Ulrich
(Professur für Agrar-, Umwelt- und Ernährungspolitik, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))
Theesfeld Insa
(Professur für Agrar-, Umwelt- und Ernährungspolitik, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))
Wagner Peter
(Professur für Landwirtschaftliche Betriebslehre, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))

Abstract

The p-value is often regarded as the gold standard of inferential statistics. The standard approach to evaluating empirical evidence equates low p-values with a high degree of credibility and labels findings with p-values below a certain threshold (e.g., 0.05) as statistically significant. The p-value is also referred to as the error probability. Both terms are problematic because they invite serious misconceptions. In addition, researchers' fixation on obtaining statistically significant results may introduce bias and increase the rate of false discoveries. Misinterpretations of the p-value, and the bias introduced through arbitrary analytical choices (p-hacking), have been discussed critically in the literature for decades. Nonetheless, they seem to persist in empirical research, and criticism of inappropriate approaches has intensified in the recent past, mainly because many studies have proved non-replicable. Unfortunately, the concerns raised in the literature are not only scattered across many academic disciplines but are often also linguistically confusing and differ in their main lines of criticism. Against this background, our methodological comment systematizes the most serious flaws and discusses suggestions for how best to prevent future misuse.
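
The link between arbitrary analytical choices and false discoveries can be illustrated with a small simulation. The sketch below is not taken from the article; it is a minimal illustration under stated assumptions: every null hypothesis is true, outcomes are independent standard-normal draws with known unit variance (so a z-test applies), and the "p-hacking" consists only of measuring several outcomes and reporting the smallest p-value. The function names, sample sizes, and number of simulated studies are all hypothetical choices.

```python
import math
import random
from statistics import NormalDist, fmean


def two_sided_p(x, y):
    """Two-sided p-value of a z-test for equal means.

    Assumes both samples have known unit variance (an illustrative
    simplification, not a general-purpose test).
    """
    n, m = len(x), len(y)
    z = (fmean(x) - fmean(y)) / math.sqrt(1 / n + 1 / m)
    return 2 * (1 - NormalDist().cdf(abs(z)))


def false_positive_rate(n_outcomes, n_studies=2000, n_per_group=30,
                        alpha=0.05, seed=1):
    """Share of simulated studies reporting at least one p < alpha.

    There is no true effect in any study: treated and control groups
    are drawn from the same N(0, 1) distribution. Each study measures
    `n_outcomes` independent outcomes and keeps the smallest p-value.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_studies):
        p_values = []
        for _ in range(n_outcomes):
            treated = [rng.gauss(0, 1) for _ in range(n_per_group)]
            control = [rng.gauss(0, 1) for _ in range(n_per_group)]
            p_values.append(two_sided_p(treated, control))
        if min(p_values) < alpha:
            hits += 1
    return hits / n_studies


def analytic_rate(n_outcomes, alpha=0.05):
    """Probability of at least one p < alpha among independent true nulls."""
    return 1 - (1 - alpha) ** n_outcomes


if __name__ == "__main__":
    for k in (1, 5):
        print(k, false_positive_rate(k), round(analytic_rate(k), 4))
```

With a single pre-specified outcome, the simulated share of "significant" findings should stay close to the nominal 5 percent. When the best of five outcomes is reported, it should approach the analytic value 1 - 0.95^5, roughly 23 percent, even though no effect exists in any simulated study.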

Suggested Citation

Hirschauer Norbert & Mußhoff Oliver & Grüner Sven & Frey Ulrich & Theesfeld Insa & Wagner Peter, 2016. "Die Interpretation des p-Wertes – Grundsätzliche Missverständnisse," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 236(5), pages 557-575, October.

Handle: RePEc:jns:jbstat:v:236:y:2016:i:5:p:557-575
DOI: 10.1515/jbnst-2015-1030

References listed on IDEAS

Deirdre N. McCloskey & Stephen T. Ziliak, 1996. "The Standard Error of Regressions," Journal of Economic Literature, American Economic Association, vol. 34(1), pages 97-114, March.
Armstrong, J. Scott, 2007. "Significance tests harm progress in forecasting," International Journal of Forecasting, Elsevier, vol. 23(2), pages 321-327.
- Armstrong, J. Scott, 2007. "Significance Tests Harm Progress in Forecasting," MPRA Paper 81664, University Library of Munich, Germany.
John P A Ioannidis, 2005. "Why Most Published Research Findings Are False," PLOS Medicine, Public Library of Science, vol. 2(8), pages 1-1, August.
John List & Sally Sadoff & Mathis Wagner, 2011. "So you want to run an experiment, now what? Some simple rules of thumb for optimal experimental design," Experimental Economics, Springer;Economic Science Association, vol. 14(4), pages 439-457, November.
- John A. List & Sally Sadoff & Mathis Wagner, 2009. "So you want to run an experiment, now what? Some Simple Rules of Thumb for Optimal Experimental Design," Carlo Alberto Notebooks 125, Collegio Carlo Alberto.
- John List & Sally Sadoff & Mathis Wagner, 2010. "So you want to run an experiment, now what? Some simple rules of thumb for optimal experimental design," Artefactual Field Experiments 00094, The Field Experiments Website.
- John A. List & Sally Sadoff & Mathis Wagner, 2010. "So you want to run an experiment, now what? Some Simple Rules of Thumb for Optimal Experimental Design," NBER Working Papers 15701, National Bureau of Economic Research, Inc.
- John A. List & Sally Sadoff & Mathis Wagner, 2010. "So you want to run an experiment, now what? Some Simple Rules of Thumb for Optimal Experimental Design," CeRP Working Papers 94, Center for Research on Pensions and Welfare Policies, Turin (Italy).
Sellke T. & Bayarri M. J. & Berger J. O., 2001. "Calibration of p Values for Testing Precise Null Hypotheses," The American Statistician, American Statistical Association, vol. 55, pages 62-71, February.
Walter Krämer, 2011. "The Cult of Statistical Significance – What Economists Should and Should Not Do to Make their Data Talk," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 131(3), pages 455-468.
- Walter Krämer, 2011. "The cult of statistical significance. What economists should and should not do to make their data talk," RatSWD Working Papers 176, German Data Forum (RatSWD).
Maren Duvendack & Richard W. Palmer-Jones & W. Robert Reed, 2014. "Replications in Economics: A Progress Report," Working Papers in Economics 14/26, University of Canterbury, Department of Economics and Finance.

Citations

Cited by:

Hüttel, Silke & Hess, Sebastian, 2023. "Lessons from the p-value debate and the replication crisis for "open Q science" – the editor's perspective or: will the revolution devour its children?," DARE Discussion Papers 2302, Georg-August University of Göttingen, Department of Agricultural Economics and Rural Development (DARE).
Jens Rommel & Meike Weltin, 2021. "Is There a Cult of Statistical Significance in Agricultural Economics?," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(3), pages 1176-1191, September.
- Rommel, Jens & Weltin, Meike, "undated". "Is there a cult of statistical significance in Agricultural Economics?," 57th Annual Conference, Weihenstephan, Germany, September 13-15, 2017 261998, German Association of Agricultural Economists (GEWISOLA).
Heckelei, Thomas & Huettel, Silke & Odening, Martin & Rommel, Jens, "undated". "The replicability crisis and the p-value debate – what are the consequences for the agricultural and food economics community?," Discussion Papers 316369, University of Bonn, Institute for Food and Resource Economics.
Hirschauer Norbert & Grüner Sven & Mußhoff Oliver & Becker Claudia, 2019. "Twenty Steps Towards an Adequate Inferential Interpretation of p-Values in Econometrics," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 239(4), pages 703-721, August.
Alexander Herzog-Stein & Camille Logeay, 2019. "Short-Term macroeconomic evaluation of the German minimum wage with a VAR/VECM," IMK Working Paper 197-2019, IMK at the Hans Boeckler Foundation, Macroeconomic Policy Institute.
Anica Veronika Fietz & Sven Grüner, 2017. "Transparency systems: do businesses in North Rhine-Westphalia (Germany) regret the cancellation of the Smiley scheme?," Agricultural and Food Economics, Springer;Italian Society of Agricultural Economics (SIDEA), vol. 5(1), pages 1-10, December.
Aparo, Nathaline Onek & Odongo, Walter & De Steur, Hans, 2022. "Unraveling heterogeneity in farmer's adoption of mobile phone technologies: A systematic review," Technological Forecasting and Social Change, Elsevier, vol. 185(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Hirschauer Norbert & Grüner Sven & Mußhoff Oliver & Becker Claudia, 2019. "Twenty Steps Towards an Adequate Inferential Interpretation of p-Values in Econometrics," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 239(4), pages 703-721, August.
Kim, Jae H. & Ji, Philip Inyeob, 2015. "Significance testing in empirical finance: A critical review and assessment," Journal of Empirical Finance, Elsevier, vol. 34(C), pages 1-14.
Jesper W. Schneider, 2015. "Null hypothesis significance tests. A mix-up of two different theories: the basis for widespread confusion and numerous misinterpretations," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 411-432, January.
Jyotirmoy Sarkar, 2018. "Will P-Value Triumph over Abuses and Attacks?," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 7(4), pages 66-71, July.
Hirschauer, Norbert & Grüner, Sven & Mußhoff, Oliver & Becker, Claudia & Jantsch, Antje, 2020. "Can p-values be meaningfully interpreted without random sampling?," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 14, pages 71-91.
- Hirschauer, Norbert & Gruener, Sven & Mußhoff, Oliver & Becker, Claudia & Jantsch, Antje, 2019. "Can p-values be meaningfully interpreted without random sampling?," SocArXiv yazr8, Center for Open Science.
Michaelides, Michael, 2021. "Large sample size bias in empirical finance," Finance Research Letters, Elsevier, vol. 41(C).
Black, Bernard & Hollingsworth, Alex & Nunes, Letícia & Simon, Kosali, 2022. "Simulated power analyses for observational studies: An application to the Affordable Care Act Medicaid expansion," Journal of Public Economics, Elsevier, vol. 213(C).
- Bernard Black & Alex Hollingsworth & Leticia Nunes & Kosali Simon, 2019. "Simulated Power Analyses for Observational Studies: An Application to the Affordable Care Act Medicaid Expansion," NBER Working Papers 25568, National Bureau of Economic Research, Inc.
Nicolas Vallois & Dorian Jullien, 2017. "Estimating Rationality in Economics: A History of Statistical Methods in Experimental Economics," Working Papers halshs-01651070, HAL.
Eszter Czibor & David Jimenez‐Gomez & John A. List, 2019. "The Dozen Things Experimental Economists Should Do (More of)," Southern Economic Journal, John Wiley & Sons, vol. 86(2), pages 371-432, October.
- Eszter Czibor & David Jimenez-Gomez & John List, 2019. "The Dozen Things Experimental Economists Should Do (More of)," Artefactual Field Experiments 00648, The Field Experiments Website.
- Eszter Czibor & David Jimenez-Gomez & John A. List, 2019. "The Dozen Things Experimental Economists Should Do (More of)," NBER Working Papers 25451, National Bureau of Economic Research, Inc.
Jae H. Kim & Kamran Ahmed & Philip Inyeob Ji, 2018. "Significance Testing in Accounting Research: A Critical Evaluation Based on Evidence," Abacus, Accounting Foundation, University of Sydney, vol. 54(4), pages 524-546, December.
Nicolas Vallois & Dorian Jullien, 2018. "A history of statistical methods in experimental economics," The European Journal of the History of Economic Thought, Taylor & Francis Journals, vol. 25(6), pages 1455-1492, November.
- Nicolas Vallois & Dorian Jullien, 2018. "A History of Statistical Methods in Experimental Economics," Post-Print halshs-01651070, HAL.
Jens Rommel & Meike Weltin, 2021. "Is There a Cult of Statistical Significance in Agricultural Economics?," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(3), pages 1176-1191, September.
- Rommel, Jens & Weltin, Meike, 2017. "Is there a cult of statistical significance in Agricultural Economics?," 57th Annual Conference, Weihenstephan, Germany, September 13-15, 2017 261998, German Association of Agricultural Economists (GEWISOLA).
Stephan B. Bruns & David I. Stern, 2019. "Lag length selection and p-hacking in Granger causality testing: prevalence and performance of meta-regression models," Empirical Economics, Springer, vol. 56(3), pages 797-830, March.
Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," Economie et Statistique / Economics and Statistics, Institut National de la Statistique et des Etudes Economiques (INSEE), issue 510-511-5, pages 181-200.
- Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," SciencePo Working papers Main hal-02866828, HAL.
- Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-02866828, HAL.
- Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," Post-Print hal-02866828, HAL.
- Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," PSE-Ecole d'économie de Paris (Postprint) hal-02866828, HAL.
Emma von Essen & Marieke Huysentruyt & Topi Miettinen, 2019. "Exploration in Teams and the Encouragement Effect: Theory and Evidence," Economics Working Papers 2019-10, Department of Economics and Business Economics, Aarhus University.
Brian Albert Monroe, 2020. "The statistical power of individual-level risk preference estimation," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 6(2), pages 168-188, December.
Kathryn N. Vasilaky & J. Michelle Brock, 2020. "Power(ful) guidelines for experimental economists," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 6(2), pages 189-212, December.
Blakeley B. McShane & David Gal, 2016. "Blinding Us to the Obvious? The Effect of Statistical Training on the Evaluation of Evidence," Management Science, INFORMS, vol. 62(6), pages 1707-1718, June.
Soyer, Emre & Hogarth, Robin M., 2012. "The illusion of predictability: How regression statistics mislead experts," International Journal of Forecasting, Elsevier, vol. 28(3), pages 695-711.
Pérez, María-Eglée & Pericchi, Luis Raúl, 2014. "Changing statistical significance with the amount of information: The adaptive α significance level," Statistics & Probability Letters, Elsevier, vol. 85(C), pages 20-24.
