IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2603.04275.html

Statistical Inference for Score Decompositions

Author

Listed:
  • Timo Dimitriadis
  • Marius Puke

Abstract

We introduce inference methods for score decompositions, which partition scoring functions for predictive assessment into three interpretable components: miscalibration, discrimination, and uncertainty. Our estimation and inference relies on a linear recalibration of the forecasts, which is applicable to general multi-step ahead point forecasts such as means and quantiles due to its validity for both smooth and non-smooth scoring functions. This approach ensures desirable finite-sample properties, enables asymptotic inference, and establishes a direct connection to the classical Mincer-Zarnowitz regression. The resulting inference framework facilitates tests for equal forecast calibration or discrimination, which yield three key advantages. They enhance the information content of predictive ability tests by decomposing scores, deliver higher statistical power in certain scenarios, and formally connect scoring-function-based evaluation to traditional calibration tests, such as financial backtests. Applications demonstrate the method's utility. We find that for survey inflation forecasts, discrimination abilities can differ significantly even when overall predictive ability does not. In an application to financial risk models, our tests provide deeper insights into the calibration and information content of volatility and Value-at-Risk forecasts. By disentangling forecast accuracy from backtest performance, the method exposes critical shortcomings in current banking regulation.

Suggested Citation

  • Timo Dimitriadis & Marius Puke, 2026. "Statistical Inference for Score Decompositions," Papers 2603.04275, arXiv.org.
  • Handle: RePEc:arx:papers:2603.04275
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2603.04275
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Diebold, Francis X & Gunther, Todd A & Tay, Anthony S, 1998. "Evaluating Density Forecasts with Applications to Financial Risk Management," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 39(4), pages 863-883, November.
    2. Whitney Newey & Kenneth West, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    3. Natalia Nolde & Johanna F. Ziegel, 2016. "Elicitability and backtesting: Perspectives for banking regulation," Papers 1608.05498, arXiv.org, revised Feb 2017.
    4. Sebastian Bayer & Timo Dimitriadis, 2022. "Regression-Based Expected Shortfall Backtesting [Backtesting Expected Shortfall]," Journal of Financial Econometrics, Oxford University Press, vol. 20(3), pages 437-471.
    5. Jack Fosten & Daniel Gutknecht & Marc-Oliver Pohle, 2024. "Testing Quantile Forecast Optimality," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(4), pages 1367-1378, October.
    6. Timo Dimitriadis & Yannick Hoga, 2026. "Regressions under Adverse Conditions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 44(1), pages 227-241, January.
    7. Jacob A. Mincer & Victor Zarnowitz, 1969. "The Evaluation of Economic Forecasts," NBER Chapters, in: Economic Forecasts and Expectations: Analysis of Forecasting Behavior and Performance, pages 3-46, National Bureau of Economic Research, Inc.
    8. Gneiting, Tilmann, 2011. "Making and Evaluating Point Forecasts," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 746-762.
    9. George-Marios Angeletos & Zhen Huo & Karthik A. Sastry, 2021. "Imperfect Macroeconomic Expectations: Evidence and Theory," NBER Macroeconomics Annual, University of Chicago Press, vol. 35(1), pages 1-86.
    10. Newey, Whitney K, 1994. "The Asymptotic Variance of Semiparametric Estimators," Econometrica, Econometric Society, vol. 62(6), pages 1349-1382, November.
    11. Nieto, Maria Rosa & Ruiz, Esther, 2016. "Frontiers in VaR forecasting and backtesting," International Journal of Forecasting, Elsevier, vol. 32(2), pages 475-501.
    12. Yannick Hoga & Matei Demetrescu, 2023. "Monitoring Value-at-Risk and Expected Shortfall Forecasts," Management Science, INFORMS, vol. 69(5), pages 2954-2971, May.
    13. Murphy, Allan H. & Winkler, Robert L., 1992. "Diagnostic verification of probability forecasts," International Journal of Forecasting, Elsevier, vol. 7(4), pages 435-455, March.
    14. Alexander Henzi & Johanna F Ziegel, 2022. "Valid sequential inference on probability forecast performance [A comparison of the ECMWF, MSC, and NCEP global ensemble prediction systems]," Biometrika, Biometrika Trust, vol. 109(3), pages 647-663.
    15. Ghysels, Eric & Kvedaras, Virmantas & Zemlys, Vaidotas, 2016. "Mixed Frequency Data Sampling Regression Models: The R Package midasr," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 72(i04).
    16. Patton, Andrew J. & Ziegel, Johanna F. & Chen, Rui, 2019. "Dynamic semiparametric models for expected shortfall (and Value-at-Risk)," Journal of Econometrics, Elsevier, vol. 211(2), pages 388-413.
    17. Ghysels, Eric & Santa-Clara, Pedro & Valkanov, Rossen, 2006. "Predicting volatility: getting the most out of return data sampled at different frequencies," Journal of Econometrics, Elsevier, vol. 131(1-2), pages 59-95.
    18. Duchesne, Pierre & Lafaye De Micheaux, Pierre, 2010. "Computing the distribution of quadratic forms: Further comparisons between the Liu-Tang-Zhang approximation and exact methods," Computational Statistics & Data Analysis, Elsevier, vol. 54(4), pages 858-862, April.
    19. Jia Li & Zhipeng Liao & Rogier Quaedvlieg, 2022. "Conditional Superior Predictive Ability," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 89(2), pages 843-875.
    20. Dimitriadis, Timo & Gneiting, Tilmann & Jordan, Alexander I. & Vogel, Peter, 2024. "Evaluating probabilistic classifiers: The triptych," International Journal of Forecasting, Elsevier, vol. 40(3), pages 1101-1122.
    21. Fulvio Corsi, 2009. "A Simple Approximate Long-Memory Model of Realized Volatility," Journal of Financial Econometrics, Oxford University Press, vol. 7(2), pages 174-196, Spring.
    22. Tim Bollerslev & Benjamin Hood & John Huss & Lasse Heje Pedersen, 2018. "Risk Everywhere: Modeling and Managing Volatility," The Review of Financial Studies, Society for Financial Studies, vol. 31(7), pages 2729-2773.
    23. Gaglianone, Wagner Piazza & Lima, Luiz Renato & Linton, Oliver & Smith, Daniel R., 2011. "Evaluating Value-at-Risk Models via Quantile Regression," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 150-160.
    24. Davidson, James, 1994. "Stochastic Limit Theory: An Introduction for Econometricians," OUP Catalogue, Oxford University Press, number 9780198774037.
    25. Graham Elliott & Dalia Ghanem & Fabian Krüger, 2016. "Forecasting Conditional Probabilities of Binary Outcomes under Misspecification," The Review of Economics and Statistics, MIT Press, vol. 98(4), pages 742-755, October.
    26. West, Kenneth D, 1996. "Asymptotic Inference about Predictive Ability," Econometrica, Econometric Society, vol. 64(5), pages 1067-1084, September.
    27. Roopesh Ranjan & Tilmann Gneiting, 2010. "Combining probability forecasts," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(1), pages 71-91, January.
    28. Alexander Henzi & Johanna F Ziegel, 2022. "Correction to: ‘Valid sequential inference on probability forecast performance’ [Valid sequential inference on probability forecast performance]," Biometrika, Biometrika Trust, vol. 109(4), pages 1181-1182.
    29. Hansen, Peter Reinhard, 2005. "A Test for Superior Predictive Ability," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 365-380, October.
    30. Paul H. Kupiec, 1995. "Techniques for verifying the accuracy of risk measurement models," Finance and Economics Discussion Series 95-24, Board of Governors of the Federal Reserve System (U.S.).
    31. Pedro Bordalo & Nicola Gennaioli & Yueran Ma & Andrei Shleifer, 2020. "Overreaction in Macroeconomic Expectations," American Economic Review, American Economic Association, vol. 110(9), pages 2748-2782, September.
    32. Olivier Coibion & Yuriy Gorodnichenko, 2015. "Information Rigidity and the Expectations Formation Process: A Simple Framework and New Facts," American Economic Review, American Economic Association, vol. 105(8), pages 2644-2678, August.
    33. Zeileis, Achim, 2004. "Econometric Computing with HC and HAC Covariance Matrix Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i10).
    34. Andrew J. Patton, 2020. "Comparing Possibly Misspecified Forecasts," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(4), pages 796-809, October.
    35. Yo Joong Choe & Aaditya Ramdas, 2024. "Comparing Sequential Forecasters," Operations Research, INFORMS, vol. 72(4), pages 1368-1387, July.
    36. Faust, Jon & Wright, Jonathan H., 2013. "Forecasting Inflation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 2-56, Elsevier.
    37. Robert F. Engle & Simone Manganelli, 2004. "CAViaR: Conditional Autoregressive Value at Risk by Regression Quantiles," Journal of Business & Economic Statistics, American Statistical Association, vol. 22, pages 367-381, October.
    38. Jeremy Berkowitz & Peter Christoffersen & Denis Pelletier, 2011. "Evaluating Value-at-Risk Models with Desk-Level Data," Management Science, INFORMS, vol. 57(12), pages 2213-2227, December.
    39. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    40. Tobias Fissler & Yannick Hoga, 2024. "Backtesting Systemic Risk Forecasts Using Multi-Objective Elicitability," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(2), pages 485-498, April.
    41. Clark, Todd E. & McCracken, Michael W., 2001. "Tests of equal forecast accuracy and encompassing for nested models," Journal of Econometrics, Elsevier, vol. 105(1), pages 85-110, November.
    42. Antonio F. Galvao & Jungmo Yoon, 2024. "HAC Covariance Matrix Estimation in Quantile Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(547), pages 2305-2316, July.
    43. Tilmann Gneiting & Fadoua Balabdaoui & Adrian E. Raftery, 2007. "Probabilistic forecasts, calibration and sharpness," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(2), pages 243-268, April.
    44. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    45. Bollerslev, Tim, 1986. "Generalized autoregressive conditional heteroskedasticity," Journal of Econometrics, Elsevier, vol. 31(3), pages 307-327, April.
    46. Clark, Todd & McCracken, Michael, 2013. "Advances in Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1107-1201, Elsevier.
    47. Raffaella Giacomini & Halbert White, 2006. "Tests of Conditional Predictive Ability," Econometrica, Econometric Society, vol. 74(6), pages 1545-1578, November.
    48. Glosten, Lawrence R & Jagannathan, Ravi & Runkle, David E, 1993. "On the Relation between the Expected Value and the Volatility of the Nominal Excess Return on Stocks," Journal of Finance, American Finance Association, vol. 48(5), pages 1779-1801, December.
    49. Leland E. Farmer & Emi Nakamura & Jón Steinsson, 2024. "Learning about the Long Run," Journal of Political Economy, University of Chicago Press, vol. 132(10), pages 3334-3377.
    50. Timo Dimitriadis & Yannick Hoga, 2022. "Dynamic CoVaR Modeling and Estimation," Papers 2206.14275, arXiv.org, revised Jan 2025.
    51. Hajo Holzmann & Matthias Eulert, 2014. "The role of the information set for forecasting - with applications to risk management," Papers 1404.7653, arXiv.org.
    52. Patton, Andrew J., 2011. "Volatility forecast comparison using imperfect volatility proxies," Journal of Econometrics, Elsevier, vol. 160(1), pages 246-256, January.
    53. Yannick Hoga & Timo Dimitriadis, 2023. "On Testing Equal Conditional Predictive Ability Under Measurement Error," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(2), pages 364-376, April.
    54. Hansen, Bruce E., 1996. "Stochastic Equicontinuity for Unbounded Dependent Heterogeneous Arrays," Econometric Theory, Cambridge University Press, vol. 12(2), pages 347-359, June.
    55. Peter R. Hansen & Asger Lunde & James M. Nason, 2011. "The Model Confidence Set," Econometrica, Econometric Society, vol. 79(2), pages 453-497, March.
    56. Kemal Guler & Pin T. Ng & Zhijie Xiao, 2017. "Mincer–Zarnowitz quantile and expectile regressions for forecast evaluations under aysmmetric loss functions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 36(6), pages 651-679, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Timo Dimitriadis & iaochun Liu & Julie Schnaitmann, 2023. "Encompassing Tests for Value at Risk and Expected Shortfall Multistep Forecasts Based on Inference on the Boundary," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 412-444.
    3. Li, Jia & Patton, Andrew J., 2018. "Asymptotic inference about predictive accuracy using high frequency data," Journal of Econometrics, Elsevier, vol. 203(2), pages 223-240.
    4. Dimitriadis, Timo & Schnaitmann, Julie, 2021. "Forecast encompassing tests for the expected shortfall," International Journal of Forecasting, Elsevier, vol. 37(2), pages 604-621.
    5. Zhanyi Jiao & Qiuqi Wang & Yimiao Zhao, 2025. "Comparative e-backtests for general risk measures," Papers 2511.05840, arXiv.org, revised Mar 2026.
    6. Jack Fosten & Daniel Gutknecht & Marc-Oliver Pohle, 2023. "Testing Quantile Forecast Optimality," Papers 2302.02747, arXiv.org, revised Oct 2023.
    7. Erik Kole & Thijs Markwat & Anne Opschoor & Dick van Dijk, 2017. "Forecasting Value-at-Risk under Temporal and Portfolio Aggregation," Journal of Financial Econometrics, Oxford University Press, vol. 15(4), pages 649-677.
    8. A. Amendola & V. Candila, 2016. "Evaluation of volatility predictions in a VaR framework," Quantitative Finance, Taylor & Francis Journals, vol. 16(5), pages 695-709, May.
    9. Gaglianone, Wagner Piazza & Marins, Jaqueline Terra Moura, 2017. "Evaluation of exchange rate point and density forecasts: An application to Brazil," International Journal of Forecasting, Elsevier, vol. 33(3), pages 707-728.
    10. Daniel Borup & Martin Thyrsgaard, 2017. "Statistical tests for equal predictive ability across multiple forecasting methods," CREATES Research Papers 2017-19, Department of Economics and Business Economics, Aarhus University.
    11. Santos, Douglas G. & Candido, Osvaldo & Tófoli, Paula V., 2022. "Forecasting risk measures using intraday and overnight information," The North American Journal of Economics and Finance, Elsevier, vol. 60(C).
    12. Herrera, Ana María & Hu, Liang & Pastor, Daniel, 2018. "Forecasting crude oil price volatility," International Journal of Forecasting, Elsevier, vol. 34(4), pages 622-635.
    13. Sander Barendse & Erik Kole & Dick van Dijk, 2023. "Backtesting Value-at-Risk and Expected Shortfall in the Presence of Estimation Error," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 528-568.
    14. Fuentes, Fernanda & Herrera, Rodrigo & Clements, Adam, 2023. "Forecasting extreme financial risk: A score-driven approach," International Journal of Forecasting, Elsevier, vol. 39(2), pages 720-735.
    15. Fritzsch, Simon & Timphus, Maike & Weiß, Gregor, 2024. "Marginals versus copulas: Which account for more model risk in multivariate risk forecasting?," Journal of Banking & Finance, Elsevier, vol. 158(C).
    16. Lukas Bauer, 2025. "Evaluating financial tail risk forecasts: Testing Equal Predictive Ability," Papers 2505.23333, arXiv.org.
    17. Nieto, Maria Rosa & Ruiz, Esther, 2016. "Frontiers in VaR forecasting and backtesting," International Journal of Forecasting, Elsevier, vol. 32(2), pages 475-501.
    18. Buccheri, Giuseppe & Renò, Roberto & Vocalelli, Giorgio, 2025. "Taking advantage of biased proxies for forecast evaluation," Journal of Econometrics, Elsevier, vol. 251(C).
    19. Tobias Fissler & Yannick Hoga, 2021. "Backtesting Systemic Risk Forecasts using Multi-Objective Elicitability," Papers 2104.10673, arXiv.org, revised Feb 2022.
    20. Mauro Bernardi & Leopoldo Catania, 2016. "Comparison of Value-at-Risk models using the MCS approach," Computational Statistics, Springer, vol. 31(2), pages 579-608, June.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2603.04275. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.