IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v96y2016icp120-132.html
   My bibliography  Save this article

Frequentist nonparametric goodness-of-fit tests via marginal likelihood ratios

Author

Listed:
  • Hart, Jeffrey D.
  • Choi, Taeryon
  • Yi, Seongbaek

Abstract

A nonparametric procedure for testing the goodness of fit of a parametric density is investigated. The test statistic is the ratio of two marginal likelihoods corresponding to a kernel estimate and the parametric model. The marginal likelihood for the kernel estimate is obtained by proposing a prior for the estimate’s bandwidth, and then integrating the product of this prior and a leave-one-out kernel likelihood. Properties of the kernel-based marginal likelihood depend importantly on the kernel used. In particular, a specific, somewhat heavy-tailed, kernel K0 yields better performing marginal likelihood ratios than does the popular Gaussian kernel. Monte Carlo is used to compare the power of the new test with that of the Shapiro–Wilk test, the Kolmogorov–Smirnov test, and a recently proposed goodness-of-fit test based on empirical likelihood ratios. Properties of these tests are considered when testing the fit of normal and double exponential distributions. The new test is used to establish a claim made in the astronomy literature concerning the distribution of nebulae brightnesses in the Andromeda galaxy. Generalizations to the multivariate case are also described.

Suggested Citation

  • Hart, Jeffrey D. & Choi, Taeryon & Yi, Seongbaek, 2016. "Frequentist nonparametric goodness-of-fit tests via marginal likelihood ratios," Computational Statistics & Data Analysis, Elsevier, vol. 96(C), pages 120-132.
  • Handle: RePEc:eee:csdana:v:96:y:2016:i:c:p:120-132
    DOI: 10.1016/j.csda.2015.10.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947315002686
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2015.10.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Xibin & King, Maxwell L. & Hyndman, Rob J., 2006. "A Bayesian approach to bandwidth selection for multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 3009-3031, July.
    2. Vexler, Albert & Gurevich, Gregory, 2010. "Empirical likelihood ratios applied to goodness-of-fit tests based on sample entropy," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 531-545, February.
    3. Zhang, Xibin & Brooks, Robert D. & King, Maxwell L., 2009. "A Bayesian approach to bandwidth selection for multivariate kernel regression with an application to state-price density estimation," Journal of Econometrics, Elsevier, vol. 153(1), pages 21-32, November.
    4. Gerda Claeskens & Nils Lid Hjort, 2004. "Goodness of Fit via Non‐parametric Likelihood Ratios," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 31(4), pages 487-513, December.
    5. Miecznikowski, Jeffrey & Vexler, Albert & Shepherd, Lori, 2013. "dbEmpLikeGOF: An R Package for Nonparametric Likelihood Ratio Tests for Goodness-of-Fit and Two-Sample Comparisons Based on Sample Entropy," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 54(i03).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Song Li & Mervyn J. Silvapulle & Param Silvapulle & Xibin Zhang, 2015. "Bayesian Approaches to Nonparametric Estimation of Densities on the Unit Interval," Econometric Reviews, Taylor & Francis Journals, vol. 34(3), pages 394-412, March.
    2. Guohua Feng & Chuan Wang & Xibin Zhang, 2019. "Estimation of inefficiency in stochastic frontier models: a Bayesian kernel approach," Journal of Productivity Analysis, Springer, vol. 51(1), pages 1-19, February.
    3. Anastasios Panagiotelis & Michael S. Smith & Peter J. Danaher, 2014. "From Amazon to Apple: Modeling Online Retail Sales, Purchase Incidence, and Visit Behavior," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 32(1), pages 14-29, January.
    4. Pavia, Jose M., 2015. "Testing Goodness-of-Fit with the Kernel Density Estimator: GoFKernel," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 66(c01).
    5. Rong Zhang & Brett A. Inder & Xibin Zhang, 2012. "Parameter estimation for a discrete-response model with double rules of sample selection: A Bayesian approach," Monash Econometrics and Business Statistics Working Papers 5/12, Monash University, Department of Econometrics and Business Statistics.
    6. Xibin Zhang & Maxwell L. King & Han Lin Shang, 2011. "Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density," Monash Econometrics and Business Statistics Working Papers 10/11, Monash University, Department of Econometrics and Business Statistics.
    7. Filippone, Maurizio & Sanguinetti, Guido, 2011. "Approximate inference of the bandwidth in multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3104-3122, December.
    8. Xibin Zhang & Maxwell L. King & Han Lin Shang, 2016. "Bayesian Bandwidth Selection for a Nonparametric Regression Model with Mixed Types of Regressors," Econometrics, MDPI, vol. 4(2), pages 1-27, April.
    9. Bagkavos, Dimitrios & Patil, Prakash N., 2023. "Goodness-of-fit testing for normal mixture densities," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    10. Hadi Alizadeh Noughabi & Albert Vexler, 2016. "An efficient correction to the density-based empirical likelihood ratio goodness-of-fit test for the inverse Gaussian distribution," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(16), pages 2988-3003, December.
    11. Catalina Bolance & Montserrat Guillen & David Pitt, 2014. "Non-parametric Models for Univariate Claim Severity Distributions - an approach using R," Working Papers 2014-01, Universitat de Barcelona, UB Riskcenter.
    12. Hu, Shuowen & Poskitt, D.S. & Zhang, Xibin, 2012. "Bayesian adaptive bandwidth kernel density estimation of irregular multivariate distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 732-740.
    13. Olivier Thas, 2009. "Comments on: Goodness-of-fit tests in mixed modes," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 18(2), pages 260-264, August.
    14. Madeleine Cule & Richard Samworth & Michael Stewart, 2010. "Maximum likelihood estimation of a multi‐dimensional log‐concave density," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(5), pages 545-607, November.
    15. Asok K. Nanda & Shovan Chowdhury, 2021. "Shannon's Entropy and Its Generalisations Towards Statistical Inference in Last Seven Decades," International Statistical Review, International Statistical Institute, vol. 89(1), pages 167-185, April.
    16. Mukhopadhyay, Subhadeep & Ghosh, Anil K., 2011. "Bayesian multiscale smoothing in supervised and semi-supervised kernel discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2344-2353, July.
    17. Zhang, Rong & Inder, Brett A. & Zhang, Xibin, 2015. "Bayesian estimation of a discrete response model with double rules of sample selection," Computational Statistics & Data Analysis, Elsevier, vol. 86(C), pages 81-96.
    18. Kenneth L. Sørensen & Rune Vejlin, 2014. "Return To Experience And Initial Wage Level: Do Low Wage Workers Catch Up?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(6), pages 984-1006, September.
    19. Zhang, Xibin & King, Maxwell L. & Shang, Han Lin, 2014. "A sampling algorithm for bandwidth estimation in a nonparametric regression model with a flexible error density," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 218-234.
    20. Ana M. Monteiro & Antonio A. F. Santos, 2020. "Conditional risk-neutral density from option prices by local polynomial kernel smoothing with no-arbitrage constraints," Review of Derivatives Research, Springer, vol. 23(1), pages 41-61, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:96:y:2016:i:c:p:120-132. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.