
Testing equality of a large number of densities under mixing conditions

Author

Listed:
  • Marta Cousido-Rocha (University of Vigo)
  • Jacobo Uña-Álvarez (University of Vigo)
  • Jeffrey D. Hart (Texas A&M University)

Abstract

In certain settings, such as microarray data, the sampling information is formed by a large number of possibly dependent small data sets. In special applications, for example in order to perform clustering, the researcher aims to verify whether all data sets have a common distribution. For this reason we propose a formal test for the null hypothesis that all data sets come from a single distribution. The asymptotic setting is that in which the number of small data sets goes to infinity, while the sample size remains fixed. The asymptotic null distribution of the proposed test is derived under mixing conditions on the sequence of small data sets, and the power properties of our test under two reasonable fixed alternatives are investigated. A simulation study is conducted, showing that the test respects the nominal level, and that it has a power which tends to 1 when the number of data sets tends to infinity. An illustration involving microarray data is provided.
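The setting described in the abstract (many small, possibly dependent data sets, with the sample size fixed) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the AR(1) dependence across data sets, the function names `simulate_small_datasets` and `ecdf_discrepancy`, and in particular the discrepancy measure, which is a simple stand-in comparing each data set's empirical CDF with the pooled one. It is not the test statistic proposed in the paper, and no calibration of a critical value is attempted.

```python
import numpy as np


def simulate_small_datasets(k, n, rho=0.5, shift=None, seed=0):
    """Return a (k, n) array holding k small data sets of size n.

    Assumption for illustration: rows follow a stationary Gaussian AR(1)
    process in the data-set index, so neighbouring data sets are dependent.
    Without a shift, every observation has a N(0, 1) marginal distribution.
    """
    rng = np.random.default_rng(seed)
    x = np.empty((k, n))
    x[0] = rng.standard_normal(n)
    innovation_scale = np.sqrt(1.0 - rho ** 2)  # keeps the marginal variance at 1
    for i in range(1, k):
        x[i] = rho * x[i - 1] + innovation_scale * rng.standard_normal(n)
    if shift is not None:
        # One location shift per data set gives a fixed alternative.
        x = x + np.asarray(shift, dtype=float)[:, None]
    return x


def ecdf_discrepancy(x):
    """Average squared distance between each row's ECDF and the pooled ECDF.

    Hypothetical stand-in statistic, evaluated at the pooled sample points.
    """
    k, n = x.shape
    pooled = np.sort(x.ravel())
    pooled_cdf = np.arange(1, pooled.size + 1) / pooled.size
    total = 0.0
    for row in np.sort(x, axis=1):
        row_cdf = np.searchsorted(row, pooled, side="right") / n
        total += np.mean((row_cdf - pooled_cdf) ** 2)
    return total / k


# Many data sets (k large), few observations in each (n fixed and small).
k, n = 300, 4
null_data = simulate_small_datasets(k, n)
# Alternative: every second data set is shifted by 3 units.
shift = np.where(np.arange(k) % 2 == 0, 0.0, 3.0)
alt_data = simulate_small_datasets(k, n, shift=shift)
print(ecdf_discrepancy(null_data), ecdf_discrepancy(alt_data))
```

In this sketch the discrepancy is larger under the shifted alternative than under the common-distribution null. Turning such a quantity into a valid test would require its null distribution under dependence, which is what the paper derives for its own statistic.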

Suggested Citation

  • Marta Cousido-Rocha & Jacobo Uña-Álvarez & Jeffrey D. Hart, 2019. "Testing equality of a large number of densities under mixing conditions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(4), pages 1203-1228, December.
  • Handle: RePEc:spr:testjl:v:28:y:2019:i:4:d:10.1007_s11749-018-00625-3
    DOI: 10.1007/s11749-018-00625-3

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-018-00625-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Quessy, Jean-François & Éthier, François, 2012. "Cramér–von Mises and characteristic function tests for the two and k-sample problems with dependent data," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 2097-2111.
    2. Dehling, Herold & Wendler, Martin, 2010. "Central limit theorem and the bootstrap for U-statistics of strongly mixing data," Journal of Multivariate Analysis, Elsevier, vol. 101(1), pages 126-137, January.
    3. D. Zhan & J. D. Hart, 2014. "Testing equality of a large number of densities," Biometrika, Biometrika Trust, vol. 101(2), pages 449-464.

    Citations

    Cited by:

    1. Cousido-Rocha, Marta & de Uña-Álvarez, Jacobo & Hart, Jeffrey D., 2019. "A two-sample test for the equality of univariate marginal distributions for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    2. M. D. Jiménez-Gamero & M. Cousido-Rocha & M. V. Alba-Fernández & F. Jiménez-Jiménez, 2022. "Testing the equality of a large number of populations," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 1-21, March.
    3. Jiménez-Gamero, M. Dolores & Franco-Pereira, Alba M., 2021. "Testing the equality of a large number of means of functional data," Journal of Multivariate Analysis, Elsevier, vol. 185(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiménez-Gamero, M.D. & Alba-Fernández, M.V. & Jodrá, P. & Barranco-Chamorro, I., 2017. "Fast tests for the two-sample problem based on the empirical characteristic function," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 137(C), pages 390-410.
    2. Giuseppe Cavaliere & Dimitris N. Politis & Anders Rahbek & Paul Doukhan & Gabriel Lang & Anne Leucht & Michael H. Neumann, 2015. "Recent developments in bootstrap methods for dependent data," Journal of Time Series Analysis, Wiley Blackwell, vol. 36(3), pages 290-314, May.
    3. M. D. Jiménez-Gamero & M. Cousido-Rocha & M. V. Alba-Fernández & F. Jiménez-Jiménez, 2022. "Testing the equality of a large number of populations," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 1-21, March.
    4. Zdeněk Hlávka & Marie Hušková & Simos G. Meintanis, 2020. "Change-point methods for multivariate time-series: paired vectorial observations," Statistical Papers, Springer, vol. 61(4), pages 1351-1383, August.
    5. Hwang, Eunju & Shin, Dong Wan, 2012. "Strong consistency of the stationary bootstrap under ψ-weak dependence," Statistics & Probability Letters, Elsevier, vol. 82(3), pages 488-495.
    6. Wyłupek, Grzegorz, 2023. "A nonparametric test for paired data," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    7. Dehling, Herold & Sharipov, Olimjon Sh. & Wendler, Martin, 2015. "Bootstrap for dependent Hilbert space-valued random variables with application to von Mises statistics," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 200-215.
    8. Meintanis, Simos G. & Ushakov, Nikolai G., 2016. "Nonparametric probability weighted empirical characteristic function and applications," Statistics & Probability Letters, Elsevier, vol. 108(C), pages 52-61.
    9. Doukhan, Paul & Lang, Gabriel & Leucht, Anne & Neumann, Michael H., 2014. "Dependent wild bootstrap for the empirical process," Working Papers 35246, University of Mannheim, Department of Economics.
    10. Tabacu, Lucia, 2018. "Weak convergence of the linear rank statistics under strong mixing conditions," Statistics & Probability Letters, Elsevier, vol. 132(C), pages 28-34.
    11. Junwei Hu & Lihong Wang, 2023. "A weighted U-statistic based change point test for multivariate time series," Statistical Papers, Springer, vol. 64(3), pages 753-778, June.
    12. G. I. Rivas-Martínez & M. D. Jiménez-Gamero & J. L. Moreno-Rebollo, 2019. "A two-sample test for the error distribution in nonparametric regression based on the characteristic function," Statistical Papers, Springer, vol. 60(4), pages 1369-1395, August.
    13. Olimjon Sharipov & Martin Wendler, 2012. "Bootstrap for the sample mean and for U-statistics of mixing and near-epoch dependent processes," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(2), pages 317-342.
    14. Jiménez-Gamero, M. Dolores & Franco-Pereira, Alba M., 2021. "Testing the equality of a large number of means of functional data," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    15. Ghanem, Dalia, 2017. "Testing identifying assumptions in nonseparable panel data models," Journal of Econometrics, Elsevier, vol. 197(2), pages 202-217.
    16. Cousido-Rocha, Marta & de Uña-Álvarez, Jacobo & Hart, Jeffrey D., 2019. "A two-sample test for the equality of univariate marginal distributions for high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    17. Zdeněk Hlávka & Marie Hušková & Claudia Kirch & Simos G. Meintanis, 2017. "Fourier-type tests involving martingale difference processes," Econometric Reviews, Taylor & Francis Journals, vol. 36(4), pages 468-492, April.
    18. Atchadé, Yves F. & Cattaneo, Matias D., 2014. "A martingale decomposition for quadratic forms of Markov chains (with applications)," Stochastic Processes and their Applications, Elsevier, vol. 124(1), pages 646-677.
    19. Wendler, Martin, 2011. "Bahadur representation for U-quantiles of dependent data," Journal of Multivariate Analysis, Elsevier, vol. 102(6), pages 1064-1079, July.
    20. Lee, Jiyon, 2015. "A semiparametric single index model with heterogeneous impacts on an unobserved variable," Journal of Econometrics, Elsevier, vol. 184(1), pages 13-36.
