IDEAS home Printed from https://ideas.repec.org/a/spr/aistmt/v75y2023i1d10.1007_s10463-022-00835-5.html
   My bibliography  Save this article

Semiparametric modelling of two-component mixtures with stochastic dominance

Author

Listed:
  • Jingjing Wu

    (University of Calgary)

  • Tasnima Abedin

    (Alberta Health Services)

  • Qiang Zhao

    (Shandong Normal University)

Abstract

In this work, we studied a two-component mixture model with stochastic dominance constraint, a model arising naturally from many genetic studies. To model the stochastic dominance, we proposed a semiparametric modelling of the log of density ratio. More specifically, when the log of the ratio of two component densities is in a linear regression form, the stochastic dominance is immediately satisfied. For the resulting semiparametric mixture model, we proposed two estimators, maximum empirical likelihood estimator (MELE) and minimum Hellinger distance estimator (MHDE), and investigated their asymptotic properties such as consistency and normality. In addition, to test the validity of the proposed semiparametric model, we developed Kolmogorov–Smirnov type tests based on the two estimators. The finite-sample performance, in terms of both efficiency and robustness, of the two estimators and the tests were examined and compared via both thorough Monte Carlo simulation studies and real data analysis.

Suggested Citation

  • Jingjing Wu & Tasnima Abedin & Qiang Zhao, 2023. "Semiparametric modelling of two-component mixtures with stochastic dominance," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(1), pages 39-70, February.
  • Handle: RePEc:spr:aistmt:v:75:y:2023:i:1:d:10.1007_s10463-022-00835-5
    DOI: 10.1007/s10463-022-00835-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-022-00835-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10463-022-00835-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Karlis, Dimitris & Xekalaki, Evdokia, 1998. "Minimum Hellinger distance estimation for Poisson mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 29(1), pages 81-103, November.
    2. Woo, Mi-Ja & Sriram, T.N., 2007. "Robust estimation of mixture complexity for count data," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4379-4392, May.
    3. Woo, Mi-Ja & Sriram, T.N., 2006. "Robust Estimation of Mixture Complexity," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1475-1486, December.
    4. Biao Zhang, 2006. "A Partial Empirical Likelihood Based Score Test Under a Semiparametric Finite Mixture Model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 58(4), pages 707-719, December.
    5. Karunamuni, Rohana J. & Wu, Jingjing, 2011. "One-step minimum Hellinger distance estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3148-3164, December.
    6. Wu, Jingjing & Karunamuni, Rohana & Zhang, Biao, 2010. "Minimum Hellinger distance estimation in a two-sample semiparametric model," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1102-1122, May.
    7. Zudi Lu & Yer Van Hui & Andy H. Lee, 2003. "Minimum Hellinger Distance Estimation for Finite Mixtures of Poisson Regression Models and Its Applications," Biometrics, The International Biometric Society, vol. 59(4), pages 1016-1026, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tang, Qingguo & Karunamuni, Rohana J., 2013. "Minimum distance estimation in a finite mixture regression model," Journal of Multivariate Analysis, Elsevier, vol. 120(C), pages 185-204.
    2. Wu, Jingjing & Karunamuni, Rohana J., 2012. "Efficient Hellinger distance estimates for semiparametric models," Journal of Multivariate Analysis, Elsevier, vol. 107(C), pages 1-23.
    3. Jingjing Wu & Rohana J. Karunamuni, 2018. "Efficient and robust tests for semiparametric models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(4), pages 761-788, August.
    4. Karunamuni, Rohana J. & Wu, Jingjing, 2011. "One-step minimum Hellinger distance estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3148-3164, December.
    5. Qingguo Tang & R. J. Karunamuni, 2018. "Robust variable selection for finite mixture regression models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(3), pages 489-521, June.
    6. Takada, Teruko, 2009. "Simulated minimum Hellinger distance estimation of stochastic volatility models," Computational Statistics & Data Analysis, Elsevier, vol. 53(6), pages 2390-2403, April.
    7. Chee, Chew-Seng, 2017. "A mixture model-based nonparametric approach to estimating a count distribution," Computational Statistics & Data Analysis, Elsevier, vol. 109(C), pages 34-44.
    8. Umashanger, T. & Sriram, T.N., 2009. "L2E estimation of mixture complexity for count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4243-4254, October.
    9. Woo, Mi-Ja & Sriram, T.N., 2007. "Robust estimation of mixture complexity for count data," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4379-4392, May.
    10. Wu, Jingjing & Karunamuni, Rohana & Zhang, Biao, 2010. "Minimum Hellinger distance estimation in a two-sample semiparametric model," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1102-1122, May.
    11. Jingjing Wu & Guoqiang Chen & Zeny Feng, 2017. "An Efficient Semiparametric Approach for Marker Gene Selection and Patient Classification," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 1(2), pages 40-49, April.
    12. Yayuan Zhu & Jingjing Wu & Xuewen Lu, 2013. "Minimum Hellinger distance estimation for a two-sample semiparametric cure rate model with censored survival data," Computational Statistics, Springer, vol. 28(6), pages 2495-2518, December.
    13. Liming Xiang & Kelvin K. W. Yau & Yer Van Hui & Andy H. Lee, 2008. "Minimum Hellinger Distance Estimation for k-Component Poisson Mixture with Random Effects," Biometrics, The International Biometric Society, vol. 64(2), pages 508-518, June.
    14. Ye, Mao & Lu, Zhao-Hua & Li, Yimei & Song, Xinyuan, 2019. "Finite mixture of varying coefficient model: Estimation and component selection," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 452-474.
    15. Stéphane Bonhomme & Koen Jochmans & Jean-Marc Robin, 2016. "Non-parametric estimation of finite mixtures from repeated measurements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 211-229, January.
    16. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    17. Kasahara, Hiroyuki & 笠原, 博幸 & Shimotsu, Katsumi & 下津, 克己, 2010. "Nonparametric Identification of Multivariate Mixtures," Discussion Papers 2010-09, Graduate School of Economics, Hitotsubashi University.
    18. Derek S. Young & Xi Chen & Dilrukshi C. Hewage & Ricardo Nilo-Poyanco, 2019. "Finite mixture-of-gamma distributions: estimation, inference, and model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1053-1082, December.
    19. Jamie L. Cross & Lennart Hoogerheide & Paul Labonne & Herman K. van Dijk, 2023. "Bayesian Mode Inference for Discrete Distributions in Economics and Finance," Working Papers No 11/2023, Centre for Applied Macro- and Petroleum economics (CAMP), BI Norwegian Business School.
    20. Hiroyuki Kasahara & Katsumi Shimotsu, 2014. "Non-parametric identification and estimation of the number of components in multivariate mixtures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 97-111, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aistmt:v:75:y:2023:i:1:d:10.1007_s10463-022-00835-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.