
As good as it gets? A new approach to estimating possible prediction performance

Author

Listed:
  • David Anderson
  • Margret Bjarnadottir

Abstract

How much information does a dataset contain about an outcome of interest? To answer this question, an estimate is generated for a given dataset of the minimum absolute prediction error that any model could achieve for the outcome variable. The estimate is produced using a constrained omniscient model that requires only that identical observations receive identical predictions and that very similar observations receive similar predictions. The resulting bounds on predictive performance are typically within 10% of the performance of the true model and perform well across a range of simulated and real-world datasets. Three applications of the methodology are discussed: measuring data quality, model evaluation, and quantifying the amount of irreducible error in a prediction problem.
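
As a rough illustration of the first constraint only (identical observations must receive identical predictions), the sketch below computes the smallest mean absolute error any predictor could reach under that rule. It is not the authors' method: the similarity constraint described in the abstract is omitted entirely, and the function name `duplicate_only_mae_bound` and the toy data are assumptions made for this example. It relies on the standard fact that a single shared prediction minimizes total absolute error when it equals the median of the group's outcomes.

```python
from collections import defaultdict
from statistics import median


def duplicate_only_mae_bound(X, y):
    """Lowest achievable mean absolute error when rows with identical
    features must share one prediction (hypothetical helper; ignores
    the similarity constraint described in the abstract)."""
    # Group outcome values by their exact feature vector.
    groups = defaultdict(list)
    for row, target in zip(X, y):
        groups[tuple(row)].append(target)

    # Within each group, the median minimizes the sum of absolute errors.
    total_abs_error = 0.0
    for targets in groups.values():
        best_prediction = median(targets)
        total_abs_error += sum(abs(t - best_prediction) for t in targets)

    return total_abs_error / len(y)


# Toy data (made up): three identical rows with different outcomes,
# plus one unique row that an omniscient model can predict exactly.
X = [(1, 0), (1, 0), (1, 0), (2, 5)]
y = [10.0, 12.0, 20.0, 7.0]
bound = duplicate_only_mae_bound(X, y)
print(bound)
```

When every row is unique, this simplified bound is zero, which suggests why an additional constraint on similar observations is needed to obtain a useful estimate on continuous data.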

Suggested Citation

  • David Anderson & Margret Bjarnadottir, 2024. "As good as it gets? A new approach to estimating possible prediction performance," PLOS ONE, Public Library of Science, vol. 19(10), pages 1-18, October.
  • Handle: RePEc:plo:pone00:0296904
    DOI: 10.1371/journal.pone.0296904

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0296904
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0296904&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David McMillan & Isabel Ruiz & Alan Speight, 2010. "Correlations and spillovers among three euro rates: evidence using realised variance," The European Journal of Finance, Taylor & Francis Journals, vol. 16(8), pages 753-767.
    2. Pierpaolo D’Urso & Vincenzina Vitale, 2021. "Modeling Local BES Indicators by Copula-Based Bayesian Networks," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 153(3), pages 823-847, February.
    3. Dovern, Jonas, 2006. "Predicting GDP components: do leading indicators increase predictability?," Kiel Advanced Studies Working Papers 436, Kiel Institute for the World Economy (IfW Kiel).
    4. James Derbyshire, 2020. "Answers to questions on uncertainty in geography: Old lessons and new scenario tools," Environment and Planning A, vol. 52(4), pages 710-727, June.
    5. Nina Bočková & Barbora Volná & Mirko Dohnal, 2025. "SME Gender-Related Innovation: A Non-Numerical Trend Analysis Using Positive, Zero, and Negative Quantities," Papers 2504.08493, arXiv.org.
    6. Alessandra Luati & Tommaso Proietti & Marco Reale, 2012. "The Variance Profile," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 607-621, June.
    7. Klenk, Nicole L. & Hickey, Gordon M., 2011. "A virtual and anonymous, deliberative and analytic participation process for planning and evaluation: The Concept Mapping Policy Delphi," International Journal of Forecasting, Elsevier, vol. 27(1), pages 152-165, January.
    8. Juha-Miikka Nurmilaakso, 2014. "Coordination costs and ICT investments: an economic analysis," Netnomics, Springer, vol. 15(2), pages 57-67, September.
    9. Sridevi Narayanan & Chee Keong Choong & Lin Sea Lau, 2020. "An investigation on the role of good governance as a mediating factor in the FDI-Growth nexus: An ASEAN Perspective," Economics Bulletin, AccessEcon, vol. 40(4), pages 2769-2779.
    10. Ralf Elbert & Lowis Seikowsky, 2017. "The influences of behavioral biases, barriers and facilitators on the willingness of forwarders’ decision makers to modal shift from unimodal road freight transport to intermodal road–rail freight tra," Journal of Business Economics, Springer, vol. 87(8), pages 1083-1123, November.
    11. Konstantin A. Kholodilin & Boriss Siliverstovs, 2009. "Do forecasters inform or reassure?," KOF Working papers 09-215, KOF Swiss Economic Institute, ETH Zurich.
    12. Paulo Esteves, 2003. "Uncertainty and Risk Analysis: an Application to the Projections for the Portuguese Economy in 2004," Economic Bulletin and Financial Stability Report Articles and Banco de Portugal Economic Studies, Banco de Portugal, Economics and Research Department.
    13. Francis X. Diebold, 1998. "The Past, Present, and Future of Macroeconomic Forecasting," Journal of Economic Perspectives, American Economic Association, vol. 12(2), pages 175-192, Spring.
    14. John W. Galbraith, 1999. "Content Horizons For Forecasts Of Economic Time Series," Departmental Working Papers 1999-01, McGill University, Department of Economics.
    15. David Hand & Niall Adams, 2000. "Defining attributes for scorecard construction in credit scoring," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(5), pages 527-540.
    16. Davidson, Ian & Tayi, Giri, 2009. "Data preparation using data quality matrices for classification mining," European Journal of Operational Research, Elsevier, vol. 197(2), pages 764-772, September.
    17. Even, Adir & Shankaranarayanan, G. & Berger, Paul D., 2010. "Managing the Quality of Marketing Data: Cost/benefit Tradeoffs and Optimal Configuration," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 209-221.
    18. Ruan, Xinfeng & Zhang, Jin E., 2018. "Risk-neutral moments in the crude oil market," Energy Economics, Elsevier, vol. 72(C), pages 583-600.
    19. Hofer Helmut & Weyerstraß Klaus & Schmidt Torsten, 2011. "Practice and Prospects of Medium-term Economic Forecasting," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(1), pages 153-171, February.
    20. Dovern, Jonas, 2024. "Eliciting expectation uncertainty from private households," International Journal of Forecasting, Elsevier, vol. 40(1), pages 113-123.
