IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0296904.html
   My bibliography  Save this article

As good as it gets? A new approach to estimating possible prediction performance

Author

Listed:
  • David Anderson
  • Margret Bjarnadottir

Abstract

How much information does a dataset contain about an outcome of interest? To answer this question, estimates are generated for a given dataset, representing the minimum possible absolute prediction error for an outcome variable that any model could achieve. The estimate is produced using a constrained omniscient model that mandates only that identical observations receive identical predictions, and that observations which are very similar to each other receive predictions that are alike. It is demonstrated that the resulting prediction accuracy bounds function effectively on both simulated data and real-world datasets. This method generates bounds on predictive performance typically within 10% of the performance of the true model, and performs well across a range of simulated and real datasets. Three applications of the methodology are discussed: measuring data quality, model evaluation, and quantifying the amount of irreducible error in a prediction problem.

Suggested Citation

  • David Anderson & Margret Bjarnadottir, 2024. "As good as it gets? A new approach to estimating possible prediction performance," PLOS ONE, Public Library of Science, vol. 19(10), pages 1-18, October.
  • Handle: RePEc:plo:pone00:0296904
    DOI: 10.1371/journal.pone.0296904
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0296904
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0296904&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0296904?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. A. S. C. Ehrenberg & J. A. Bound, 1993. "Predictability and Prediction," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 156(2), pages 167-194, March.
    2. Ron S. Kenett & Galit Shmueli, 2014. "On information quality," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 177(1), pages 3-38, January.
    3. Francis X. Diebold & Lutz Kilian, 2001. "Measuring predictability: theory and macroeconomic applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(6), pages 657-669.
    4. Neth, Hansjörg & Meder, Björn & Kothiyal, Amit & Gigerenzer, Gerd, 2014. "`Homo heuristicus` in the financial world: From risk management to managing uncertainty," Journal of Risk Management in Financial Institutions, Henry Stewart Publications, vol. 7(2), pages 134-144, March.
    5. Donald Ballou & Richard Wang & Harold Pazer & Giri Kumar Tayi, 1998. "Modeling Information Manufacturing Systems to Determine Information Product Quality," Management Science, INFORMS, vol. 44(4), pages 462-484, April.
    6. Luka Jovanovic & Dejan Jovanovic & Nebojsa Bacanin & Ana Jovancai Stakic & Milos Antonijevic & Hesham Magd & Ravi Thirumalaisamy & Miodrag Zivkovic, 2022. "Multi-Step Crude Oil Price Prediction Based on LSTM Approach Tuned by Salp Swarm Algorithm with Disputation Operator," Sustainability, MDPI, vol. 14(21), pages 1-29, November.
    7. Wright, George & Goodwin, Paul, 2009. "Decision making and planning under low levels of predictability: Enhancing the scenario method," International Journal of Forecasting, Elsevier, vol. 25(4), pages 813-825, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Berkowitz, J. & Birgean, I. & Kilian, L., 1999. "On the Finite-Sample Accuracy of Nonparametric Resampling Algorithms for Economic Time Series," Papers 99-01, Michigan - Center for Research on Economic & Social Theory.
    2. Meissner, Philip & Brands, Christian & Wulf, Torsten, 2017. "Quantifiying blind spots and weak signals in executive judgment: A structured integration of expert judgment into the scenario development process," International Journal of Forecasting, Elsevier, vol. 33(1), pages 244-253.
    3. Pierpaolo D’Urso & Vincenzina Vitale, 2020. "Bayesian Networks Model Averaging for Bes Indicators," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 151(3), pages 897-919, October.
    4. Cairns, George & Wright, George & Fairbrother, Peter, 2016. "Promoting articulated action from diverse stakeholders in response to public policy scenarios: A case analysis of the use of ‘scenario improvisation’ method," Technological Forecasting and Social Change, Elsevier, vol. 103(C), pages 97-108.
    5. Hofer Helmut & Weyerstraß Klaus & Schmidt Torsten, 2011. "Practice and Prospects of Medium-term Economic Forecasting," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(1), pages 153-171, February.
    6. Derbyshire, James, 2017. "Potential surprise theory as a theoretical foundation for scenario planning," Technological Forecasting and Social Change, Elsevier, vol. 124(C), pages 77-87.
    7. David McMillan & Isabel Ruiz & Alan Speight, 2010. "Correlations and spillovers among three euro rates: evidence using realised variance," The European Journal of Finance, Taylor & Francis Journals, vol. 16(8), pages 753-767.
    8. Pierpaolo D’Urso & Vincenzina Vitale, 2021. "Modeling Local BES Indicators by Copula-Based Bayesian Networks," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 153(3), pages 823-847, February.
    9. Dovern, Jonas, 2006. "Predicting GDP components: do leading indicators increase predictability?," Kiel Advanced Studies Working Papers 436, Kiel Institute for the World Economy (IfW Kiel).
    10. Federica Cugnata & Silvia Salini, 2014. "Model-based approach for importance–performance analysis," Quality & Quantity: International Journal of Methodology, Springer, vol. 48(6), pages 3053-3064, November.
    11. Kunc, Martin & O'Brien, Frances A., 2017. "Exploring the development of a methodology for scenario use: Combining scenario and resource mapping approaches," Technological Forecasting and Social Change, Elsevier, vol. 124(C), pages 150-159.
    12. James Derbyshire, 2020. "Answers to questions on uncertainty in geography: Old lessons and new scenario tools," Environment and Planning A, , vol. 52(4), pages 710-727, June.
    13. Xitong Li & Hongwei Zhu & Luo Zuo, 2021. "Reporting Technologies and Textual Readability: Evidence from the XBRL Mandate," Information Systems Research, INFORMS, vol. 32(3), pages 1025-1042, September.
    14. Nina Bov{c}kov'a & Barbora Voln'a & Mirko Dohnal, 2025. "SME Gender-Related Innovation: A Non-Numerical Trend Analysis Using Positive, Zero, and Negative Quantities," Papers 2504.08493, arXiv.org.
    15. Arbrie Jashari & Victor Tiberius & Marina Dabić, 2022. "Tracing the progress of scenario research in business and management," Futures & Foresight Science, John Wiley & Sons, vol. 4(2), June.
    16. Alessandra Luati & Tommaso Proietti & Marco Reale, 2012. "The Variance Profile," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 607-621, June.
    17. Liu, Bai & Yang, Dazhi & Mayer, Martin János & Coimbra, Carlos F.M. & Kleissl, Jan & Kay, Merlinde & Wang, Wenting & Bright, Jamie M. & Xia, Xiang’ao & Lv, Xin & Srinivasan, Dipti & Wu, Yan & Beyer, H, 2023. "Predictability and forecast skill of solar irradiance over the contiguous United States," Renewable and Sustainable Energy Reviews, Elsevier, vol. 182(C).
    18. Klenk, Nicole L. & Hickey, Gordon M., 2011. "A virtual and anonymous, deliberative and analytic participation process for planning and evaluation: The Concept Mapping Policy Delphi," International Journal of Forecasting, Elsevier, vol. 27(1), pages 152-165, January.
    19. Timothy Cogley & Giorgio E. Primiceri & Thomas J. Sargent, 2010. "Inflation-Gap Persistence in the US," American Economic Journal: Macroeconomics, American Economic Association, vol. 2(1), pages 43-69, January.
    20. Hendry, David F. & Hubrich, Kirstin, 2006. "Forecasting economic aggregates by disaggregates," Working Paper Series 589, European Central Bank.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0296904. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.