IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v69y2007i4p715-740.html
   My bibliography  Save this article

Controlling the reinforcement in Bayesian non‐parametric mixture models

Author

Listed:
  • Antonio Lijoi
  • Ramsés H. Mena
  • Igor Prünster

Abstract

Summary. The paper deals with the problem of determining the number of components in a mixture model. We take a Bayesian non‐parametric approach and adopt a hierarchical model with a suitable non‐parametric prior for the latent structure. A commonly used model for such a problem is the mixture of Dirichlet process model. Here, we replace the Dirichlet process with a more general non‐parametric prior obtained from a generalized gamma process. The basic feature of this model is that it yields a partition structure for the latent variables which is of Gibbs type. This relates to the well‐known (exchangeable) product partition models. If compared with the usual mixture of Dirichlet process model the advantage of the generalization that we are examining relies on the availability of an additional parameter σ belonging to the interval (0,1): it is shown that such a parameter greatly influences the clustering behaviour of the model. A value of σ that is close to 1 generates a large number of clusters, most of which are of small size. Then, a reinforcement mechanism which is driven by σ acts on the mass allocation by penalizing clusters of small size and favouring those few groups containing a large number of elements. These features turn out to be very useful in the context of mixture modelling. Since it is difficult to specify a priori the reinforcement rate, it is reasonable to specify a prior for σ. Hence, the strength of the reinforcement mechanism is controlled by the data.

Suggested Citation

  • Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2007. "Controlling the reinforcement in Bayesian non‐parametric mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 715-740, September.
  • Handle: RePEc:bla:jorssb:v:69:y:2007:i:4:p:715-740
    DOI: 10.1111/j.1467-9868.2007.00609.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2007.00609.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2007.00609.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lancelot F. James & Antonio Lijoi & Igor Pruenster, 2005. "Bayesian Inference via Classes of Normalized Random Measures," ICER Working Papers - Applied Mathematics Series 5-2005, ICER - International Centre for Economic Research.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pierpaolo De Blasi & Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster & Mattteo Ruggiero, 2013. "Are Gibbs-type priors the most natural generalization of the Dirichlet process?," DEM Working Papers Series 054, University of Pavia, Department of Economics and Management.
    2. Raffaele Argiento & Alessandra Guglielmi & Antonio Pievatolo, 2014. "Estimation, prediction and interpretation of NGG random effects models: an application to Kevlar fibre failure times," Statistical Papers, Springer, vol. 55(3), pages 805-826, August.
    3. J. E. Griffin & M. Kolossiatis & M. F. J. Steel, 2013. "Comparing distributions by using dependent normalized random-measure mixtures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 499-529, June.
    4. Kolossiatis, M. & Griffin, J.E. & Steel, M.F.J., 2011. "Modeling overdispersion with the normalized tempered stable distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2288-2301, July.
    5. Fuentes-García, Ruth & Mena, Ramsés H. & Walker, Stephen G., 2019. "Modal posterior clustering motivated by Hopfield’s network," Computational Statistics & Data Analysis, Elsevier, vol. 137(C), pages 92-100.
    6. Corradin, Riccardo & Nieto-Barajas, Luis Enrique & Nipoti, Bernardo, 2022. "Optimal stratification of survival data via Bayesian nonparametric mixtures," Econometrics and Statistics, Elsevier, vol. 22(C), pages 17-38.
    7. Cesari, Oriana & Favaro, Stefano & Nipoti, Bernardo, 2014. "Posterior analysis of rare variants in Gibbs-type species sampling models," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 79-98.
    8. Antonio Canale & Igor Prünster, 2017. "Robustifying Bayesian nonparametric mixtures for count data," Biometrics, The International Biometric Society, vol. 73(1), pages 174-184, March.
    9. Peter Müeller & Fernando A. Quintana & Garritt Page, 2018. "Nonparametric Bayesian inference in applications," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 175-206, June.
    10. Lancelot F. James & Antonio Lijoi & Igor Prünster, 2009. "Posterior Analysis for Normalized Random Measures with Independent Increments," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(1), pages 76-97, March.
    11. De Blasi, Pierpaolo & Martínez, Asael Fabian & Mena, Ramsés H. & Prünster, Igor, 2020. "On the inferential implications of decreasing weight structures in mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 147(C).
    12. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "A New Estimator of the Discovery Probability," Biometrics, The International Biometric Society, vol. 68(4), pages 1188-1196, December.
    13. Antonio Lijoi & Bernardo Nipoti & Igor Prünster, 2013. "Dependent mixture models: clustering and borrowing information," DEM Working Papers Series 046, University of Pavia, Department of Economics and Management.
    14. Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2014. "Dependent mixture models: Clustering and borrowing information," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 417-433.
    15. Pierpaolo De Blasi & Ramsés H. Mena & Igor Prünster, 2022. "Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 143-165, February.
    16. Fabrizio Leisen & Antonio Lijoi, 2010. "Vectors of two-parameter Poisson-Dirichlet processes," Quaderni di Dipartimento 119, University of Pavia, Department of Economics and Quantitative Methods.
    17. Antonio Lijoi & Igor Pruenster, 2009. "Distributional Properties of means of Random Probability Measures," ICER Working Papers - Applied Mathematics Series 22-2009, ICER - International Centre for Economic Research.
    18. Julyan Arbel & Stefano Favaro, 2021. "Approximating Predictive Probabilities of Gibbs-Type Priors," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 496-519, February.
    19. Zhu, Weixuan & Leisen, Fabrizio, 2013. "A multivariate extension of a vector of Poisson- Dirichlet processes," DES - Working Papers. Statistics and Econometrics. WS ws132220, Universidad Carlos III de Madrid. Departamento de Estadística.
    20. François Caron & Emily B. Fox, 2017. "Sparse graphs using exchangeable random measures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1295-1366, November.
    21. Wang, Ketong & Porter, Michael D., 2018. "Optimal Bayesian clustering using non-negative matrix factorization," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 395-411.
    22. Subharup Guha & Rex Jung & David Dunson, 2022. "Predicting phenotypes from brain connection structure," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(3), pages 639-668, June.
    23. Ruth Fuentes–García & Ramsés Mena & Stephen Walker, 2010. "A Probability for Classification Based on the Dirichlet Process Mixture Model," Journal of Classification, Springer;The Classification Society, vol. 27(3), pages 389-403, November.
    24. Matteo Ruggiero & Matteo Sordello, 2018. "Clustering dynamics in a class of normalised generalised gamma dependent priors," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(1), pages 83-98, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Antonio Lijoi & Igor Pruenster & Stephen G. Walker, 2008. "Bayesian nonparametric estimators derived from conditional Gibbs structures," ICER Working Papers - Applied Mathematics Series 06-2008, ICER - International Centre for Economic Research.
    2. Cerquetti, Annalisa, 2007. "A note on Bayesian nonparametric priors derived from exponentially tilted Poisson-Kingman models," Statistics & Probability Letters, Elsevier, vol. 77(18), pages 1705-1711, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:69:y:2007:i:4:p:715-740. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.