IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1902.02920.html
   My bibliography  Save this paper

Testing the Order of Multivariate Normal Mixture Models

Author

Listed:
  • Hiroyuki Kasahara
  • Katsumi Shimotsu

Abstract

Finite mixtures of multivariate normal distributions have been widely used in empirical applications in diverse fields such as statistical genetics and statistical finance. Testing the number of components in multivariate normal mixture models is a long-standing challenge even in the most important case of testing homogeneity. This paper develops likelihood-based tests of the null hypothesis of $M_0$ components against the alternative hypothesis of $M_0 + 1$ components for a general $M_0 \geq 1$. For heteroscedastic normal mixtures, we propose an EM test and derive the asymptotic distribution of the EM test statistic. For homoscedastic normal mixtures, we derive the asymptotic distribution of the likelihood ratio test statistic. We also derive the asymptotic distribution of the likelihood ratio test statistic and EM test statistic under local alternatives and show the validity of parametric bootstrap. The simulations show that the proposed test has good finite sample size and power properties.

Suggested Citation

  • Hiroyuki Kasahara & Katsumi Shimotsu, 2019. "Testing the Order of Multivariate Normal Mixture Models," Papers 1902.02920, arXiv.org.
  • Handle: RePEc:arx:papers:1902.02920
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1902.02920
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Hanfeng Chen & Jiahua Chen & John D. Kalbfleisch, 2004. "Testing for a finite mixture model with two components," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(1), pages 95-115, February.
    2. Jiahua Chen & Pengfei Li & Yuejiao Fu, 2012. "Inference on the Order of a Normal Mixture," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1096-1105, September.
    3. Lemdani, Mohamed & Pons, Odile, 1997. "Likelihood ratio tests for genetic linkage," Statistics & Probability Letters, Elsevier, vol. 33(1), pages 15-22, April.
    4. Chen, Jiahua & Tan, Xianming, 2009. "Inference for multivariate normal mixtures," Journal of Multivariate Analysis, Elsevier, vol. 100(7), pages 1367-1383, August.
    5. P. Li & J. Chen & P. Marriott, 2009. "Non-finite Fisher information and homogeneity: an EM approach," Biometrika, Biometrika Trust, vol. 96(2), pages 411-426.
    6. Hanfeng Chen & Jiahua Chen & John D. Kalbfleisch, 2001. "A modified likelihood ratio test for homogeneity in finite mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(1), pages 19-29.
    7. Li, Pengfei & Chen, Jiahua, 2010. "Testing the Order of a Finite Mixture," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1084-1092.
    8. Hong‐Tu Zhu & Heping Zhang, 2004. "Hypothesis testing in mixture regression models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(1), pages 3-16, February.
    9. Hiroyuki Kasahara & Katsumi Shimotsu, 2015. "Testing the Number of Components in Normal Mixture Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1632-1645, December.
    10. He, Yi & Pan, Wei & Lin, Jizhen, 2006. "Cluster analysis using multivariate normal mixture models to detect differential gene expression with microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 641-658, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Paul Schrimpf & Michio Suzuki & Hiroyuki Kasahara, 2015. "Identification and Estimation of Production Function with Unobserved Heterogeneity," 2015 Meeting Papers 924, Society for Economic Dynamics.
    2. Yu Hao & Hiroyuki Kasahara, 2022. "Testing the Number of Components in Finite Mixture Normal Regression Model with Panel Data," Papers 2210.02824, arXiv.org, revised Jun 2023.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kasahara Hiroyuki & Shimotsu Katsumi, 2012. "Testing the Number of Components in Finite Mixture Models," Global COE Hi-Stat Discussion Paper Series gd12-259, Institute of Economic Research, Hitotsubashi University.
    2. Yu Hao & Hiroyuki Kasahara, 2022. "Testing the Number of Components in Finite Mixture Normal Regression Model with Panel Data," Papers 2210.02824, arXiv.org, revised Jun 2023.
    3. Wong, Tony S.T. & Lam, Kwok Fai & Zhao, Victoria X., 2018. "Asymptotic null distribution of the modified likelihood ratio test for homogeneity in finite mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 248-257.
    4. Wichitchan, Supawadee & Yao, Weixin & Yang, Guangren, 2019. "Hypothesis testing for finite mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 180-189.
    5. Yang Ning & Yong Chen, 2015. "A Class of Pseudolikelihood Ratio Tests for Homogeneity in Exponential Tilt Mixture Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 504-517, June.
    6. Ye, Mao & Lu, Zhao-Hua & Li, Yimei & Song, Xinyuan, 2019. "Finite mixture of varying coefficient model: Estimation and component selection," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 452-474.
    7. Holzmann, Hajo & Schwaiger, Florian, 2016. "Testing for the number of states in hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 318-330.
    8. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    9. Alexander D. Stead & Phill Wheat & William H. Greene, 2023. "On hypothesis testing in latent class and finite mixture stochastic frontier models, with application to a contaminated normal-half normal model," Journal of Productivity Analysis, Springer, vol. 60(1), pages 37-48, August.
    10. Charnigo, Richard & Fan, Qian & Bittel, Douglas & Dai, Hongying, 2013. "Testing unilateral versus bilateral normal contamination," Statistics & Probability Letters, Elsevier, vol. 83(1), pages 163-167.
    11. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    12. Chuan Hong & Yang Ning & Shuang Wang & Hao Wu & Raymond J. Carroll & Yong Chen, 2017. "PLEMT: A Novel Pseudolikelihood-Based EM Test for Homogeneity in Generalized Exponential Tilt Mixture Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1393-1404, October.
    13. Jiaying Gu & Roger Koenker & Stanislav Volgushev, 2017. "Testing for homogeneity in mixture models," CeMMAP working papers 39/17, Institute for Fiscal Studies.
    14. Bagkavos, Dimitrios & Patil, Prakash N., 2023. "Goodness-of-fit testing for normal mixture densities," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    15. Jiaying Gu & Roger Koenker & Stanislav Volgushev, 2017. "Testing for homogeneity in mixture models," CeMMAP working papers CWP39/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    16. Christian Ritz, 2013. "Penalized likelihood ratio tests for repeated measurement models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(3), pages 534-547, September.
    17. Dannemann, Jörn & Holzmann, Hajo, 2010. "Testing for two components in a switching regression model," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1592-1604, June.
    18. Zhu, Hongtu & Zhang, Heping, 2006. "Asymptotics for estimation and testing procedures under loss of identifiability," Journal of Multivariate Analysis, Elsevier, vol. 97(1), pages 19-45, January.
    19. Andrew Sweeting, 2009. "The strategic timing incentives of commercial radio stations: An empirical analysis using multiple equilibria," RAND Journal of Economics, RAND Corporation, vol. 40(4), pages 710-742, December.
    20. Dante Amengual & Xinyue Bei & Marine Carrasco & Enrique Sentana, 2022. "Score-type tests for normal mixtures," Working Papers wp2022_2213, CEMFI.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1902.02920. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.