
On the inferential implications of decreasing weight structures in mixture models

Author

Listed:
  • De Blasi, Pierpaolo
  • Martínez, Asael Fabian
  • Mena, Ramsés H.
  • Prünster, Igor

Abstract

Bayesian estimation of nonparametric mixture models relies strongly on available representations of discrete random probability measures. In particular, the ordering of the mixing weights plays an important role in the identifiability of component-specific parameters, which in turn affects the convergence properties of posterior samplers. The geometric process mixture model provides a simple alternative to models based on the Dirichlet process that effectively addresses these issues. However, the mixing weights of this model may decay too fast for modeling data with a large number of components, which creates a need for different decay rates. Some variants of the geometric process featuring different decay behaviors, while preserving the decreasing structure, are presented and investigated. An asymptotic characterization of the number of distinct values in a sample from the corresponding mixing measure is also given, highlighting the inferential implications of different prior specifications. The analysis is completed by a simulation study in the context of density estimation. It shows that, by controlling the decay rate, the mixture model is able to capture data with a large number of components.
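
As a rough illustration of the contrast drawn in the abstract, the sketch below compares the two weight structures. It is not taken from the paper: it assumes the standard forms w_j = lam * (1 - lam)^(j - 1) for the geometric weights and v_j ~ Beta(1, theta) stick-breaking for the Dirichlet process, and the function names, the truncation level and the values lam = 0.3 and theta = 2.0 are hypothetical choices made only for this example.

```python
import random


def geometric_weights(lam, n_terms):
    """First n_terms geometric weights w_j = lam * (1 - lam)**(j - 1).

    Assumed form of the geometric process weights; they are
    decreasing in j by construction.
    """
    return [lam * (1.0 - lam) ** j for j in range(n_terms)]


def stick_breaking_weights(theta, n_terms, rng):
    """First n_terms Dirichlet process weights via stick-breaking.

    Each stick fraction is v_j ~ Beta(1, theta); the weights are
    not ordered in general.
    """
    weights, remaining = [], 1.0
    for _ in range(n_terms):
        v = rng.betavariate(1.0, theta)
        weights.append(v * remaining)
        remaining *= 1.0 - v
    return weights


def is_decreasing(ws):
    """True if each weight is no larger than the one before it."""
    return all(a >= b for a, b in zip(ws, ws[1:]))


def count_above(ws, eps):
    """Number of weights larger than eps, a crude proxy for how many
    components the weight sequence can support."""
    return sum(1 for w in ws if w > eps)


rng = random.Random(0)  # fixed seed so the illustration is repeatable
geo = geometric_weights(lam=0.3, n_terms=20)
dp = stick_breaking_weights(theta=2.0, n_terms=20, rng=rng)

print("geometric weights decreasing:", is_decreasing(geo))
print("stick-breaking weights decreasing:", is_decreasing(dp))
print("geometric weights above 0.01:", count_above(geo, 0.01))
```

Under these assumptions, a larger lam makes the geometric weights fall off faster, so fewer components carry non-negligible mass. This is the kind of behavior the abstract describes as potentially too fast for data with many components.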

Suggested Citation

  • De Blasi, Pierpaolo & Martínez, Asael Fabian & Mena, Ramsés H. & Prünster, Igor, 2020. "On the inferential implications of decreasing weight structures in mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 147(C).
  • Handle: RePEc:eee:csdana:v:147:y:2020:i:c:s0167947320300311
    DOI: 10.1016/j.csda.2020.106940

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320300311
    Download Restriction: Full text for ScienceDirect subscribers only.


    References listed on IDEAS

    1. Omiros Papaspiliopoulos & Gareth O. Roberts, 2008. "Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models," Biometrika, Biometrika Trust, vol. 95(1), pages 169-186.
    2. Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2014. "Dependent mixture models: Clustering and borrowing information," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 417-433.
    3. Hatjispyros, Spyridon J. & Merkatas, Christos & Nicoleris, Theodoros & Walker, Stephen G., 2018. "Dependent mixtures of geometric weights priors," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 1-18.
    4. Sylvia Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    5. Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2007. "Controlling the reinforcement in Bayesian non‐parametric mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 715-740, September.
    6. Fuentes-García, Ruth & Mena, Ramsés H. & Walker, Stephen G., 2009. "A nonparametric dependent process for Bayesian regression," Statistics & Probability Letters, Elsevier, vol. 79(8), pages 1112-1119, April.
    7. Sara Wade & Stephen G. Walker & Sonia Petrone, 2014. "A Predictive Study of Dirichlet Process Mixture Models for Curve Fitting," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(3), pages 580-605, September.
    8. Griffin, J.E. & Steel, M.F.J., 2011. "Stick-breaking autoregressive processes," Journal of Econometrics, Elsevier, vol. 162(2), pages 383-396, June.
    9. Ghosal, Subhashis & van der Vaart, Aad, 2017. "Fundamentals of Nonparametric Bayesian Inference," Cambridge Books, Cambridge University Press, number 9780521878265.
    10. Gutiérrez, Luis & Gutiérrez-Peña, Eduardo & Mena, Ramsés H., 2014. "Bayesian nonparametric classification for spectroscopy data," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 56-68.
    11. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.

    Citations


    Cited by:

    1. Hatjispyros, Spyridon J. & Merkatas, Christos & Walker, Stephen G., 2023. "Mixture models with decreasing weights," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    2. José J. Quinlan & Fernando A. Quintana & Garritt L. Page, 2021. "On a class of repulsive mixture models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 445-461, June.
    3. Pierpaolo De Blasi & Ramsés H. Mena & Igor Prünster, 2022. "Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 143-165, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Billio, Monica & Casarin, Roberto & Rossini, Luca, 2019. "Bayesian nonparametric sparse VAR models," Journal of Econometrics, Elsevier, vol. 212(1), pages 97-115.
    2. Pierpaolo De Blasi & Ramsés H. Mena & Igor Prünster, 2022. "Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 143-165, February.
    3. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick-breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    4. Wang, Ketong & Porter, Michael D., 2018. "Optimal Bayesian clustering using non-negative matrix factorization," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 395-411.
    5. Im, Yunju & Tan, Aixin, 2021. "Bayesian subgroup analysis in regression using mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 162(C).
    6. Weixuan Zhu & Fabrizio Leisen, 2015. "A multivariate extension of a vector of two-parameter Poisson-Dirichlet processes," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(1), pages 89-105, March.
    7. Federico Bassetti & Roberto Casarin & Francesco Ravazzolo, 2018. "Bayesian Nonparametric Calibration and Combination of Predictive Distributions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 675-685, April.
    8. Sylvia Frühwirth-Schnatter & Gertraud Malsiner-Walli, 2019. "From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 33-64, March.
    9. Isadora Antoniano-Villalobos & Stephen G. Walker, 2016. "A Nonparametric Model for Stationary Time Series," Journal of Time Series Analysis, Wiley Blackwell, vol. 37(1), pages 126-142, January.
    10. Hatjispyros, Spyridon J. & Merkatas, Christos & Nicoleris, Theodoros & Walker, Stephen G., 2018. "Dependent mixtures of geometric weights priors," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 1-18.
    11. Monica Billio & Roberto Casarin & Luca Rossini, 2016. "Bayesian nonparametric sparse seemingly unrelated regression model (SUR)," Working Papers 2016:20, Department of Economics, University of Venice "Ca' Foscari".
    12. Fuentes-García, Ruth & Mena, Ramsés H. & Walker, Stephen G., 2019. "Modal posterior clustering motivated by Hopfield’s network," Computational Statistics & Data Analysis, Elsevier, vol. 137(C), pages 92-100.
    13. Walker, Stephen G., 2023. "Comparing weak and strong convergence of density functions," Statistics & Probability Letters, Elsevier, vol. 200(C).
    14. Lancelot F. James & Antonio Lijoi & Igor Prünster, 2009. "Posterior Analysis for Normalized Random Measures with Independent Increments," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(1), pages 76-97, March.
    15. Li, Mingyang & Meng, Hongdao & Zhang, Qingpeng, 2017. "A nonparametric Bayesian modeling approach for heterogeneous lifetime data with covariates," Reliability Engineering and System Safety, Elsevier, vol. 167(C), pages 95-104.
    16. Ryan Martin, 2021. "A Survey of Nonparametric Mixing Density Estimation via the Predictive Recursion Algorithm," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 97-121, May.
    17. Luis E. Nieto-Barajas & Peter Müller & Yuan Ji & Yiling Lu & Gordon B. Mills, 2012. "A Time-Series DDP for Functional Proteomics Profiles," Biometrics, The International Biometric Society, vol. 68(3), pages 859-868, September.
    18. Peter Müller & Fernando A. Quintana & Garritt Page, 2018. "Nonparametric Bayesian inference in applications," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 175-206, June.
    19. Ruth Fuentes-García & Ramsés Mena & Stephen Walker, 2010. "A Probability for Classification Based on the Dirichlet Process Mixture Model," Journal of Classification, Springer;The Classification Society, vol. 27(3), pages 389-403, November.
    20. Grazian, Clara & Villa, Cristiano & Liseo, Brunero, 2020. "On a loss-based prior for the number of components in mixture models," Statistics & Probability Letters, Elsevier, vol. 158(C).
