IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2009i9p3344-3357.html
   My bibliography  Save this article

Non-parametric k-sample tests: Density functions vs distribution functions

Author

Listed:
  • Martínez-Camblor, Pablo
  • de Uña-Álvarez, Jacobo

Abstract

Tests for the comparison of k samples based on kernel density estimators (KDE) are introduced. The Double Minimum method as a new and useful procedure for the crucial problem of bandwidth selection is developed. The statistical power of the proposed tests, as well as the impact of the smoothing degree and the performance of the Double Minimum algorithm, are studied via Monte Carlo simulations. Finally, the results of the tests based on the KDE are compared to those of the traditional k-sample tests based on empirical distribution functions (EDF), and to other tests based on the likelihood ratio introduced in the recent literature. Two main conclusions are obtained. First, the proposed bandwidth selection method attains quasi-optimal results. Second, the simulations suggest that KDE-based tests are the most powerful when the underlying populations are different in shape, and that the L1 distance among densities leads to optimal results in the considered situations.

Suggested Citation

  • Martínez-Camblor, Pablo & de Uña-Álvarez, Jacobo, 2009. "Non-parametric k-sample tests: Density functions vs distribution functions," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3344-3357, July.
  • Handle: RePEc:eee:csdana:v:53:y:2009:i:9:p:3344-3357
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(09)00045-0
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Alba Fernández, V. & Jiménez Gamero, M.D. & Muñoz Garcia, J., 2008. "A test for the two-sample problem based on empirical characteristic functions," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3730-3748, March.
    2. Fan, Yanqin, 1998. "Goodness-Of-Fit Tests Based On Kernel Density Estimators With Fixed Smoothing Parameters," Econometric Theory, Cambridge University Press, vol. 14(5), pages 604-621, October.
    3. Duc Devroye & J. Beirlant & R. Cao & R. Fraiman & P. Hall & M. Jones & Gábor Lugosi & E. Mammen & J. Marron & C. Sánchez-Sellero & J. Uña & F. Udina & L. Devroye, 1997. "Universal smoothing factor selection in density estimation: theory and practice," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 6(2), pages 223-320, December.
    4. Fan, Yanqin, 1994. "Testing the Goodness of Fit of a Parametric Density Function by Kernel Method," Econometric Theory, Cambridge University Press, vol. 10(2), pages 316-356, June.
    5. Zhang, Jin & Wu, Yuehua, 2007. "k-Sample tests based on the likelihood ratio," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4682-4691, May.
    6. Anderson, N. H. & Hall, P. & Titterington, D. M., 1994. "Two-Sample Test Statistics for Measuring Discrepancies Between Two Multivariate Probability Density Functions Using Kernel-Based Density Estimates," Journal of Multivariate Analysis, Elsevier, vol. 50(1), pages 41-54, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Martínez-Camblor, Pablo & Corral, Norberto, 2011. "Repeated measures analysis for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3244-3256, December.
    2. Álvarez-Esteban, P.C. & del Barrio, E. & Cuesta-Albertos, J.A. & Matrán, C., 2013. "Searching for a common pooling pattern among several samples," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 1-14.
    3. Chen, Yuhui & Hanson, Timothy E., 2014. "Bayesian nonparametric k-sample tests for censored and uncensored data," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 335-346.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pavia, Jose M., 2015. "Testing Goodness-of-Fit with the Kernel Density Estimator: GoFKernel," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 66(c01).
    2. Pablo Martínez-Camblor & Jacobo Uña-Álvarez, 2013. "Studying the bandwidth in $$k$$ -sample smooth tests," Computational Statistics, Springer, vol. 28(2), pages 875-892, April.
    3. Tenreiro, Carlos, 2009. "On the choice of the smoothing parameter for the BHEP goodness-of-fit test," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1038-1053, February.
    4. Scaillet, Olivier, 2007. "Kernel-based goodness-of-fit tests for copulas with fixed smoothing parameters," Journal of Multivariate Analysis, Elsevier, vol. 98(3), pages 533-543, March.
    5. Marcelo Fernandes & Eduardo Mendes & Olivier Scaillet, 2015. "Testing for symmetry and conditional symmetry using asymmetric kernels," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(4), pages 649-671, August.
    6. Graciela Boente & Daniela Rodriguez & Wenceslao González Manteiga, 2014. "Goodness-of-fit Test for Directional Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(1), pages 259-275, March.
    7. Delgado, Miguel A. & Song, Xiaojun, 2018. "Nonparametric tests for conditional symmetry," Journal of Econometrics, Elsevier, vol. 206(2), pages 447-471.
    8. Jiménez Gamero, M.D. & Muñoz García, J. & Pino Mejías, R., 2005. "Testing goodness of fit for the distribution of errors in multivariate linear models," Journal of Multivariate Analysis, Elsevier, vol. 95(2), pages 301-322, August.
    9. Bagkavos, Dimitrios & Patil, Prakash N. & Wood, Andrew T.A., 2023. "Nonparametric goodness-of-fit testing for a continuous multivariate parametric model," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    10. Juan Carlos Pardo-Fernández & María Dolores Jiménez-Gamero & Anouar El Ghouch, 2015. "A Non-parametric ANOVA-type Test for Regression Curves Based on Characteristic Functions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 197-213, March.
    11. Li, Qi & Maasoumi, Esfandiar & Racine, Jeffrey S., 2009. "A nonparametric test for equality of distributions with mixed categorical and continuous data," Journal of Econometrics, Elsevier, vol. 148(2), pages 186-200, February.
    12. Pablo Martínez-Camblor, 2010. "Comparing k-independent and right censored samples based on the likelihood ratio," Computational Statistics, Springer, vol. 25(3), pages 363-374, September.
    13. Henze, N. & Klar, B. & Zhu, L. X., 2005. "Checking the adequacy of the multivariate semiparametric location shift model," Journal of Multivariate Analysis, Elsevier, vol. 93(2), pages 238-256, April.
    14. Wenceslao González-Manteiga & Rosa Crujeiras, 2013. "An updated review of Goodness-of-Fit tests for regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 361-411, September.
    15. Alba Fernández, V. & Jiménez Gamero, M.D. & Muñoz Garcia, J., 2008. "A test for the two-sample problem based on empirical characteristic functions," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3730-3748, March.
    16. Carlos Tenreiro, 2022. "On automatic kernel density estimate-based tests for goodness-of-fit," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(3), pages 717-748, September.
    17. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    18. Hagmann, M. & Scaillet, O., 2007. "Local multiplicative bias correction for asymmetric kernel density estimators," Journal of Econometrics, Elsevier, vol. 141(1), pages 213-249, November.
    19. Fernandes, Marcelo & Grammig, Joachim, 2005. "Nonparametric specification tests for conditional duration models," Journal of Econometrics, Elsevier, vol. 127(1), pages 35-68, July.
    20. Jean-David Fermanian & Dominique Guégan, 2021. "Fair learning with bagging," Documents de travail du Centre d'Economie de la Sorbonne 21034, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2009:i:9:p:3344-3357. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.