IDEAS home Printed from https://ideas.repec.org/a/jns/jbstat/v236y2016i5p557-575.html
   My bibliography  Save this article

Die Interpretation des p-Wertes – Grundsätzliche Missverständnisse

Author

Listed:
  • Hirschauer Norbert
  • Grüner Sven

    (Professur für Unternehmensführung im Agribusiness, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))

  • Mußhoff Oliver

    (Arbeitsbereich Landwirtschaftliche Betriebslehre, Georg-August-Universität Göttingen, Platz der Göttinger Sieben 5, 37073 Göttingen)

  • Frey Ulrich
  • Theesfeld Insa

    (Professur für Agrar-, Umwelt- und Ernährungspolitik, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))

  • Wagner Peter

    (Professur für Landwirtschaftliche Betriebslehre, Martin-Luther-Universität Halle-Wittenberg, 06099 Halle (Saale))

Abstract

Der p-Wert wird vielfach als Goldstandard für Inferenzschlüsse angesehen. Zur Validierung statistischer Zusammenhänge hat sich die Konvention herausgebildet, möglichst geringe p-Werte einzufordern und bei Werten unterhalb gewisser Schwellen (z. B. 0,05) von statistisch signifikanten Ergebnissen zu sprechen. Häufig wird der p-Wert auch als Irrtumswahrscheinlichkeit bezeichnet. Beide Begriffe sind problematisch, da sie Missverständnissen Vorschub leisten. Hinzu kommt, dass das sog. p-hacking, d. h. die gezielte Suche nach Auswertungen, die zu statistisch signifikanten Ergebnissen führen, Verzerrungen hervorrufen und die Rate falscher Entdeckungen (false discovery rate) erhöhen kann. Fehlinterpretationen des p-Wertes und auswertungsbedingte Verzerrungen wurden über die Jahrzehnte hinweg immer wieder kritisch diskutiert. In der empirischen Forschung scheinen sie aber persistent zu sein und in den letzten Jahren wurde die p-Wert-Debatte wegen der Nicht-Reproduzierbarkeit vieler Studien zunehmend intensiv geführt. Angesichts der über die Disziplinen verstreuten und oft auf Einzelaspekte abzielenden Literatur zur p-Wert-Problematik beschreibt dieser Methodenkommentar systematisch die wichtigsten Probleme und diskutiert die entsprechenden Lösungsvorschläge.

Suggested Citation

  • Hirschauer Norbert & Grüner Sven & Mußhoff Oliver & Frey Ulrich & Theesfeld Insa & Wagner Peter, 2016. "Die Interpretation des p-Wertes – Grundsätzliche Missverständnisse," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 236(5), pages 557-575, October.
  • Handle: RePEc:jns:jbstat:v:236:y:2016:i:5:p:557-575
    DOI: 10.1515/jbnst-2015-1030
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jbnst-2015-1030
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jbnst-2015-1030?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Deirdre N. McCloskey & Stephen T. Ziliak, 1996. "The Standard Error of Regressions," Journal of Economic Literature, American Economic Association, vol. 34(1), pages 97-114, March.
    2. Armstrong, J. Scott, 2007. "Significance tests harm progress in forecasting," International Journal of Forecasting, Elsevier, vol. 23(2), pages 321-327.
    3. John P A Ioannidis, 2005. "Why Most Published Research Findings Are False," PLOS Medicine, Public Library of Science, vol. 2(8), pages 1-1, August.
    4. John List & Sally Sadoff & Mathis Wagner, 2011. "So you want to run an experiment, now what? Some simple rules of thumb for optimal experimental design," Experimental Economics, Springer;Economic Science Association, vol. 14(4), pages 439-457, November.
    5. Sellke T. & Bayarri M. J. & Berger J. O., 2001. "Calibration of rho Values for Testing Precise Null Hypotheses," The American Statistician, American Statistical Association, vol. 55, pages 62-71, February.
    6. Walter Krämer, 2011. "The Cult of Statistical Significance – What Economists Should and Should Not Do to Make their Data Talk," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 131(3), pages 455-468.
    7. Maren Duvendack & Richard W. Palmer-Jones & W. Robert Reed, 2015. "Replications in Economics: A Progress Report," Econ Journal Watch, Econ Journal Watch, vol. 12(2), pages 164–191-1, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jens Rommel & Meike Weltin, 2021. "Is There a Cult of Statistical Significance in Agricultural Economics?," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(3), pages 1176-1191, September.
    2. Heckelei, Thomas & Huettel, Silke & Odening, Martin & Rommel, Jens, 2021. "The replicability crisis and the p-value debate – what are the consequences for the agricultural and food economics community?," Discussion Papers 316369, University of Bonn, Institute for Food and Resource Economics.
    3. Alexander Herzog-Stein & Camille Logeay, 2019. "Short-Term macroeconomic evaluation of the German minimum wage with a VAR/VECM," IMK Working Paper 197-2019, IMK at the Hans Boeckler Foundation, Macroeconomic Policy Institute.
    4. Anica Veronika Fietz & Sven Grüner, 2017. "Transparency systems: do businesses in North Rhine-Westphalia (Germany) regret the cancellation of the Smiley scheme?," Agricultural and Food Economics, Springer;Italian Society of Agricultural Economics (SIDEA), vol. 5(1), pages 1-10, December.
    5. Hüttel, Silke & Hess, Sebastian, 2023. "Lessons from the p-value debate and the replication crisis for "open Q science" – the editor's perspective or: will the revolution devour its children?," DARE Discussion Papers 2302, Georg-August University of Göttingen, Department of Agricultural Economics and Rural Development (DARE).
    6. Hirschauer Norbert & Grüner Sven & Mußhoff Oliver & Becker Claudia, 2019. "Twenty Steps Towards an Adequate Inferential Interpretation of p-Values in Econometrics," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 239(4), pages 703-721, August.
    7. Aparo, Nathaline Onek & Odongo, Walter & De Steur, Hans, 2022. "Unraveling heterogeneity in farmer's adoption of mobile phone technologies: A systematic review," Technological Forecasting and Social Change, Elsevier, vol. 185(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hirschauer Norbert & Grüner Sven & Mußhoff Oliver & Becker Claudia, 2019. "Twenty Steps Towards an Adequate Inferential Interpretation of p-Values in Econometrics," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 239(4), pages 703-721, August.
    2. Kim, Jae H. & Ji, Philip Inyeob, 2015. "Significance testing in empirical finance: A critical review and assessment," Journal of Empirical Finance, Elsevier, vol. 34(C), pages 1-14.
    3. Jesper W. Schneider, 2015. "Null hypothesis significance tests. A mix-up of two different theories: the basis for widespread confusion and numerous misinterpretations," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 411-432, January.
    4. Jyotirmoy Sarkar, 2018. "Will P†Value Triumph over Abuses and Attacks?," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 7(4), pages 66-71, July.
    5. Hirschauer, Norbert & Grüner, Sven & Mußhoff, Oliver & Becker, Claudia & Jantsch, Antje, 2020. "Can p-values be meaningfully interpreted without random sampling?," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 14, pages 71-91.
    6. Stephen T. Ziliak & Deirdre N. McCloskey, 2013. "We Agree That Statistical Significance Proves Essentially Nothing: A Rejoinder to Thomas Mayer," Econ Journal Watch, Econ Journal Watch, vol. 10(1), pages 97-107, January.
    7. Michaelides, Michael, 2021. "Large sample size bias in empirical finance," Finance Research Letters, Elsevier, vol. 41(C).
    8. Black, Bernard & Hollingsworth, Alex & Nunes, Letícia & Simon, Kosali, 2022. "Simulated power analyses for observational studies: An application to the Affordable Care Act Medicaid expansion," Journal of Public Economics, Elsevier, vol. 213(C).
    9. Nicolas Vallois & Dorian Jullien, 2017. "Estimating Rationality in Economics: A History of Statistical Methods in Experimental Economics," Working Papers halshs-01651070, HAL.
    10. Eszter Czibor & David Jimenez‐Gomez & John A. List, 2019. "The Dozen Things Experimental Economists Should Do (More of)," Southern Economic Journal, John Wiley & Sons, vol. 86(2), pages 371-432, October.
    11. Jae H. Kim & Kamran Ahmed & Philip Inyeob Ji, 2018. "Significance Testing in Accounting Research: A Critical Evaluation Based on Evidence," Abacus, Accounting Foundation, University of Sydney, vol. 54(4), pages 524-546, December.
    12. Nicolas Vallois & Dorian Jullien, 2018. "A history of statistical methods in experimental economics," The European Journal of the History of Economic Thought, Taylor & Francis Journals, vol. 25(6), pages 1455-1492, November.
    13. Mueller-Langer, Frank & Fecher, Benedikt & Harhoff, Dietmar & Wagner, Gert G., 2019. "Replication studies in economics—How many and which papers are chosen for replication, and why?," Research Policy, Elsevier, vol. 48(1), pages 62-83.
    14. Jens Rommel & Meike Weltin, 2021. "Is There a Cult of Statistical Significance in Agricultural Economics?," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(3), pages 1176-1191, September.
    15. Stephan B. Bruns & David I. Stern, 2019. "Lag length selection and p-hacking in Granger causality testing: prevalence and performance of meta-regression models," Empirical Economics, Springer, vol. 56(3), pages 797-830, March.
    16. Denis Fougère & Nicolas Jacquemet, 2019. "Causal Inference and Impact Evaluation," Economie et Statistique / Economics and Statistics, Institut National de la Statistique et des Etudes Economiques (INSEE), issue 510-511-5, pages 181-200.
    17. Emma von Essen & Marieke Huysentruyt & Topi Miettinen, 2019. "Exploration in Teams and the Encouragement Effect: Theory and Evidence," Economics Working Papers 2019-10, Department of Economics and Business Economics, Aarhus University.
    18. Brian Albert Monroe, 2020. "The statistical power of individual-level risk preference estimation," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 6(2), pages 168-188, December.
    19. Kathryn N. Vasilaky & J. Michelle Brock, 2020. "Power(ful) guidelines for experimental economists," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 6(2), pages 189-212, December.
    20. Blakeley B. McShane & David Gal, 2016. "Blinding Us to the Obvious? The Effect of Statistical Training on the Evaluation of Evidence," Management Science, INFORMS, vol. 62(6), pages 1707-1718, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jns:jbstat:v:236:y:2016:i:5:p:557-575. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.