IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v49y2000i2p247-268.html
   My bibliography  Save this article

Diagnostic checks for discrete data regression models using posterior predictive simulations

Author

Listed:
  • A. Gelman
  • Y. Goegebeur
  • F. Tuerlinckx
  • I. Van Mechelen

Abstract

Model checking with discrete data regressions can be difficult because the usual methods such as residual plots have complicated reference distributions that depend on the parameters in the model. Posterior predictive checks have been proposed as a Bayesian way to average the results of goodness‐of‐fit tests in the presence of uncertainty in estimation of the parameters. We try this approach using a variety of discrepancy variables for generalized linear models fitted to a historical data set on behavioural learning. We then discuss the general applicability of our findings in the context of a recent applied example on which we have worked. We find that the following discrepancy variables work well, in the sense of being easy to interpret and sensitive to important model failures: structured displays of the entire data set, general discrepancy variables based on plots of binned or smoothed residuals versus predictors and specific discrepancy variables created on the basis of the particular concerns arising in an application. Plots of binned residuals are especially easy to use because their predictive distributions under the model are sufficiently simple that model checks can often be made implicitly. The following discrepancy variables did not work well: scatterplots of latent residuals defined from an underlying continuous model and quantile–quantile plots of these residuals.

Suggested Citation

  • A. Gelman & Y. Goegebeur & F. Tuerlinckx & I. Van Mechelen, 2000. "Diagnostic checks for discrete data regression models using posterior predictive simulations," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 49(2), pages 247-268.
  • Handle: RePEc:bla:jorssc:v:49:y:2000:i:2:p:247-268
    DOI: 10.1111/1467-9876.00190
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/1467-9876.00190
    Download Restriction: no

    File URL: https://libkey.io/10.1111/1467-9876.00190?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. R. B. O'Hara & S. Lampila & M. Orell, 2009. "Estimation of Rates of Births, Deaths, and Immigration from Mark–Recapture Data," Biometrics, The International Biometric Society, vol. 65(1), pages 275-281, March.
    2. Eijffinger, Sylvester & Mahieu, Ronald & Raes, Louis, 2018. "Inferring hawks and doves from voting records," European Journal of Political Economy, Elsevier, vol. 51(C), pages 107-120.
    3. Robert Kapłon, 2006. "A retrospective review of categorical data analysis – theory and marketing practice," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 16(1), pages 55-72.
    4. Carolina Costa Mota Paraíba & Natalia Bochkina & Carlos Alberto Ribeiro Diniz, 2018. "Bayesian truncated beta nonlinear mixed-effects models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(2), pages 320-346, January.
    5. Silvia Montagna & Vanessa Orani & Raffaele Argiento, 2021. "Bayesian isotonic logistic regression via constrained splines: an application to estimating the serve advantage in professional tennis," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(2), pages 573-604, June.
    6. Patrizia Ordine & Claudio Lupi, 2009. "Family Income and Students' Mobility," Giornale degli Economisti, GDE (Giornale degli Economisti e Annali di Economia), Bocconi University, vol. 68(1), pages 1-23, April.
    7. Clough, Brian J. & Russell, Matthew B. & Domke, Grant M. & Woodall, Christopher W. & Radtke, Philip J., 2016. "Comparing tree foliage biomass models fitted to a multispecies, felled-tree biomass dataset for the United States," Ecological Modelling, Elsevier, vol. 333(C), pages 79-91.
    8. Thomas J. Steenburgh & Andrew Ainslie & Peder Hans Engebretson, 2003. "Massively Categorical Variables: Revealing the Information in Zip Codes," Marketing Science, INFORMS, vol. 22(1), pages 40-57, August.
    9. Meulders, Michel & Boeck, Paul De & Mechelen, Iven Van, 2001. "Probability matrix decomposition models and main-effects generalized linear models for the analysis of replicated binary associations," Computational Statistics & Data Analysis, Elsevier, vol. 38(2), pages 217-233, December.
    10. repec:jss:jstsof:45:i02 is not listed on IDEAS
    11. Andrew Gelman & Ginger L. Chew & Michael Shnaidman, 2004. "Bayesian Analysis of Serial Dilution Assays," Biometrics, The International Biometric Society, vol. 60(2), pages 407-417, June.
    12. Wagner, Helga & Duller, Christine, 2012. "Bayesian model selection for logistic regression models with random intercept," Computational Statistics & Data Analysis, Elsevier, vol. 56(5), pages 1256-1274.
    13. Andrew Gelman, 2003. "A Bayesian Formulation of Exploratory Data Analysis and Goodness‐of‐fit Testing," International Statistical Review, International Statistical Institute, vol. 71(2), pages 369-382, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:49:y:2000:i:2:p:247-268. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.