IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v155y2021ics016794732030195x.html
   My bibliography  Save this article

A Bayesian goodness-of-fit test for regression

Author

Listed:
  • Barrientos, Andrés F.
  • Canale, Antonio

Abstract

Regression models are widely used statistical procedures, and the validation of their assumptions plays a crucial role in the data analysis process. Unfortunately, validating assumptions usually depends on the availability of tests tailored to the specific model of interest. A novel Bayesian goodness-of-fit hypothesis testing approach is presented for a broad class of regression models the response variable of which is univariate and continuous. The proposed approach relies on a suitable transformation of the response variable and a Bayesian prior induced by a predictor-dependent mixture model. Hypothesis testing is performed via Bayes factor, the asymptotic properties of which are discussed. The method is implemented by means of a Markov chain Monte Carlo algorithm, and its performance is illustrated using simulated and real data sets.

Suggested Citation

  • Barrientos, Andrés F. & Canale, Antonio, 2021. "A Bayesian goodness-of-fit test for regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
  • Handle: RePEc:eee:csdana:v:155:y:2021:i:c:s016794732030195x
    DOI: 10.1016/j.csda.2020.107104
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794732030195X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2020.107104?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. De Iorio, Maria & Muller, Peter & Rosner, Gary L. & MacEachern, Steven N., 2004. "An ANOVA Model for Dependent Random Measures," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 205-215, January.
    2. Kim‐Anh Do & Peter Müller & Feng Tang, 2005. "A Bayesian mixture model for differential gene expression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 627-644, June.
    3. ROSS McVINISH & JUDITH ROUSSEAU & KERRIE MENGERSEN, 2009. "Bayesian Goodness of Fit Testing with Mixtures of Triangular Distributions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(2), pages 337-354, June.
    4. A. Canale & A. Lijoi & B. Nipoti & I. Prünster, 2017. "On the Pitman–Yor process with spike and slab base measure," Biometrika, Biometrika Trust, vol. 104(3), pages 681-697.
    5. Chung, Yeonseung & Dunson, David B., 2009. "Nonparametric Bayes Conditional Distribution Modeling With Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1646-1660.
    6. Fan J. & Huang L-S., 2001. "Goodness-of-Fit Tests for Parametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 640-652, June.
    7. Pena, Edsel A. & Slate, Elizabeth H., 2006. "Global Validation of Linear Model Assumptions," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 341-354, March.
    8. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    9. Jarque, Carlos M. & Bera, Anil K., 1980. "Efficient tests for normality, homoscedasticity and serial independence of regression residuals," Economics Letters, Elsevier, vol. 6(3), pages 255-259.
    10. Brockwell, A.E., 2011. "Acknowledgement of priority to: “Universal residuals: A multivariate transformation. Statistics & Probability Letters 27, 2007, 1473–1478”," Statistics & Probability Letters, Elsevier, vol. 81(12), pages 1822-1822.
    11. Miller, Forrest R. & Neill, James W., 2016. "Lack of fit tests for linear regression models with many predictor variables using minimal weighted maximal matchings," Journal of Multivariate Analysis, Elsevier, vol. 150(C), pages 14-26.
    12. Norets, Andriy & Pelenis, Justinas, 2014. "Posterior Consistency In Conditional Density Estimation By Covariate Dependent Mixtures," Econometric Theory, Cambridge University Press, vol. 30(3), pages 606-646, June.
    13. Brockwell, A.E., 2007. "Universal residuals: A multivariate transformation," Statistics & Probability Letters, Elsevier, vol. 77(14), pages 1473-1478, August.
    14. David B. Dunson & Ju-Hyun Park, 2008. "Kernel stick-breaking processes," Biometrika, Biometrika Trust, vol. 95(2), pages 307-323.
    15. repec:dau:papers:123456789/369 is not listed on IDEAS
    16. Basu S. & Chib S., 2003. "Marginal Likelihood and Bayes Factors for Dirichlet Process Mixture Models," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 224-235, January.
    17. Yang, Mingan, 2012. "Bayesian variable selection for logistic mixed model with nonparametric random effects," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2663-2674.
    18. Cribari-Neto, Francisco & Zeileis, Achim, 2010. "Beta Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 34(i02).
    19. Teran Hidalgo, Sebastian J. & Wu, Michael C. & Engel, Stephanie M. & Kosorok, Michael R., 2018. "Goodness-of-fit test for nonparametric regression models: Smoothing spline ANOVA models as example," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 135-155.
    20. Michele Guindani & Peter Müller & Song Zhang, 2009. "A Bayesian discovery procedure," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 905-925, November.
    21. Dunson, David B. & Herring, Amy H. & Engel, Stephanie M., 2008. "Bayesian Selection and Clustering of Polymorphisms in Functionally Related Genes," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 534-546, June.
    22. Berger J. O & Guglielmi A., 2001. "Bayesian and Conditional Frequentist Testing of a Parametric Model Versus Nonparametric Alternatives," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 174-184, March.
    23. Andrés F. Barrientos & Alejandro Jara & Fernando A. Quintana, 2017. "Fully Nonparametric Regression for Bounded Data Using Dependent Bernstein Polynomials," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 806-825, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhou, Haiming & Huang, Xianzheng, 2022. "Bayesian beta regression for bounded responses with unknown supports," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Norets, Andriy, 2015. "Bayesian regression with nonparametric heteroskedasticity," Journal of Econometrics, Elsevier, vol. 185(2), pages 409-419.
    2. Canale, Antonio & Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2023. "Inner spike and slab Bayesian nonparametric models," Econometrics and Statistics, Elsevier, vol. 27(C), pages 120-135.
    3. Debdeep Pati & David Dunson, 2014. "Bayesian nonparametric regression with varying residual density," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 1-31, February.
    4. Norets, Andriy & Pelenis, Justinas, 2012. "Bayesian modeling of joint and conditional distributions," Journal of Econometrics, Elsevier, vol. 168(2), pages 332-346.
    5. Pelenis, Justinas, 2014. "Bayesian regression with heteroscedastic error density and parametric mean function," Journal of Econometrics, Elsevier, vol. 178(P3), pages 624-638.
    6. Pelenis, Justinas, 2012. "Bayesian Semiparametric Regression," Economics Series 285, Institute for Advanced Studies.
    7. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    8. Laura Liu, 2018. "Density Forecasts in Panel Data Models : A Semiparametric Bayesian Perspective," Finance and Economics Discussion Series 2018-036, Board of Governors of the Federal Reserve System (U.S.).
    9. Huang, Yifan & Meng, Shengwang, 2020. "A Bayesian nonparametric model and its application in insurance loss prediction," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 84-94.
    10. Andrés F. Barrientos & Alejandro Jara & Fernando A. Quintana, 2017. "Fully Nonparametric Regression for Bounded Data Using Dependent Bernstein Polynomials," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 806-825, April.
    11. Surya T. Tokdar & Ryan Martin, 2021. "Bayesian Test of Normality Versus a Dirichlet Process Mixture Alternative," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 66-96, May.
    12. Zhou, Haiming & Huang, Xianzheng, 2022. "Bayesian beta regression for bounded responses with unknown supports," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).
    13. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    14. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick–breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    15. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 803-825, June.
    16. Luai Al-Labadi, 2021. "The two-sample problem via relative belief ratio," Computational Statistics, Springer, vol. 36(3), pages 1791-1808, September.
    17. Fuentes-García, Ruth & Mena, Ramsés H. & Walker, Stephen G., 2009. "A nonparametric dependent process for Bayesian regression," Statistics & Probability Letters, Elsevier, vol. 79(8), pages 1112-1119, April.
    18. Laura Liu, 2017. "Density Forecasts in Panel Models: A semiparametric Bayesian Perspective," PIER Working Paper Archive 17-006, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 28 Apr 2017.
    19. Kelter, Riko, 2022. "Power analysis and type I and type II error rates of Bayesian nonparametric two-sample tests for location-shifts based on the Bayes factor under Cauchy priors," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    20. Hatjispyros, Spyridon J. & Nicoleris, Theodoros & Walker, Stephen G., 2016. "Random density functions with common atoms and pairwise dependence," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 236-249.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:155:y:2021:i:c:s016794732030195x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.