Calibration tests for count data

Authors

Wei Wei
Leonhard Held

Abstract

Calibration, the statistical consistency of forecast distributions and observations, is a central requirement for probabilistic predictions. Calibration of continuous forecasts has been widely discussed, and significance tests are commonly used to detect whether a prediction model is miscalibrated. However, calibration tests for discrete forecasts are rare, especially for distributions with unlimited support. In this paper, we propose two types of calibration tests for count data: tests based on conditional exceedance probabilities and tests based on proper scoring rules. For the latter, three scoring rules are considered: the ranked probability score, the logarithmic score and the Dawid-Sebastiani score. Simulation studies show that all the different tests have good control of the type I error rate and sufficient power under miscalibration. As an illustration, we apply the methodology to weekly data on meningococcal disease incidence in Germany, 2001–2006. The results show that the test approach is powerful in detecting miscalibrated forecasts.

Copyright Sociedad de Estadística e Investigación Operativa 2014
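To make the score-based idea concrete, here is a minimal Python sketch of one way such a test could look. It is not taken from the paper, and the abstract does not give the test statistics. The sketch assumes Poisson forecast distributions, independent observations, a normal approximation for the standardized mean score, and a truncated sum over the unbounded support. All function names (`support_bound`, `null_moments`, `calibration_z_test`, and the score helpers) are hypothetical.

```python
import math

def support_bound(mu):
    """Truncation point for sums over the unbounded support (an assumption of this sketch)."""
    return int(mu + 10.0 * math.sqrt(mu) + 20.0)

def poisson_pmf(k, mu):
    """Poisson probability P(Y = k) for mean mu > 0, computed in log space."""
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))

def log_score(y, mu):
    """Logarithmic score: negative log predictive probability of y."""
    return -(y * math.log(mu) - mu - math.lgamma(y + 1))

def ds_score(y, mu):
    """Dawid-Sebastiani score; a Poisson forecast has mean and variance mu."""
    return (y - mu) ** 2 / mu + math.log(mu)

def rp_score(y, mu):
    """Ranked probability score: sum over k of (F(k) - 1{y <= k})^2, truncated."""
    total, cdf = 0.0, 0.0
    for k in range(max(y, support_bound(mu)) + 1):
        cdf += poisson_pmf(k, mu)
        total += (cdf - (1.0 if y <= k else 0.0)) ** 2
    return total

def null_moments(score, mu):
    """Mean and variance of score(Y, mu) when Y ~ Poisson(mu), i.e. under calibration."""
    ks = range(support_bound(mu) + 1)
    probs = [poisson_pmf(k, mu) for k in ks]
    vals = [score(k, mu) for k in ks]
    mean = sum(p * v for p, v in zip(probs, vals))
    var = sum(p * (v - mean) ** 2 for p, v in zip(probs, vals))
    return mean, var

def calibration_z_test(score, ys, mus):
    """Standardize the total observed score by its null mean and variance."""
    observed = sum(score(y, mu) for y, mu in zip(ys, mus))
    moments = [null_moments(score, mu) for mu in mus]
    expected = sum(m for m, _ in moments)
    variance = sum(v for _, v in moments)  # independence assumed
    z = (observed - expected) / math.sqrt(variance)
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return z, p_value

# Illustrative, made-up data: forecasts with mean 2 but counts of 6 throughout.
z, p = calibration_z_test(ds_score, [6] * 50, [2.0] * 50)
print(z, p)
```

Under calibration each observed score should be close to its expectation under the forecast distribution, so a large absolute z value points to miscalibration. The same function accepts `log_score` or `rp_score` in place of `ds_score`.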

Suggested Citation

Wei Wei & Leonhard Held, 2014. "Calibration tests for count data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(4), pages 787-805, December.

Handle: RePEc:spr:testjl:v:23:y:2014:i:4:p:787-805
DOI: 10.1007/s11749-014-0380-8

Keywords

Calibration test; Count data; Predictive distribution; Proper scoring rules; 62M20 Prediction