IDEAS home Printed from https://ideas.repec.org/p/ems/eureir/15112.html
   My bibliography  Save this paper

A Bayesian approach to two-mode clustering

Author

Listed:
  • van Dijk, A.
  • van Rosmalen, J.M.
  • Paap, R.

Abstract

We develop a new Bayesian approach to estimate the parameters of a latent-class model for the joint clustering of both modes of two-mode data matrices. Posterior results are obtained using a Gibbs sampler with data augmentation. Our Bayesian approach has three advantages over existing methods. First, we are able to do statistical inference on the model parameters, which would not be possible using frequentist estimation procedures. In addition, the Bayesian approach allows us to provide statistical criteria for determining the optimal numbers of clusters. Finally, our Gibbs sampler has fewer problems with local optima in the likelihood function and empty classes than the EM algorithm used in a frequentist approach. We apply the Bayesian estimation method of the latent-class two-mode clustering model to two empirical data sets. The first data set is the Supreme Court voting data set of Doreian, Batagelj, and Ferligoj (2004). The second data set comprises the roll call votes of the United States House of Representatives in 2007. For both data sets, we show how two-mode clustering can provide useful insights.

Suggested Citation

  • van Dijk, A. & van Rosmalen, J.M. & Paap, R., 2009. "A Bayesian approach to two-mode clustering," Econometric Institute Research Papers EI 2009-06, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
  • Handle: RePEc:ems:eureir:15112
    as

    Download full text from publisher

    File URL: https://repub.eur.nl/pub/15112/ei2009-06.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Geweke, John, 2007. "Interpretation and inference in mixture models: Simple MCMC works," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3529-3550, April.
    2. Han C. & Carlin B. P., 2001. "Markov Chain Monte Carlo Methods for Computing Bayes Factors: A Comparative Review," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1122-1132, September.
    3. Schepers, Jan & van Mechelen, Iven & Ceulemans, Eva, 2006. "Three-mode partitioning," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 1623-1642, December.
    4. Joost Rosmalen & Patrick Groenen & Javier Trejos & William Castillo, 2009. "Optimization Strategies for Two-Mode Partitioning," Journal of Classification, Springer;The Classification Society, vol. 26(2), pages 155-181, August.
    5. Jon P. Nelson, 2002. ""Green" Voting And Ideology: Lcv Scores And Roll-Call Voting In The U.S. Senate, 1988-1998," The Review of Economics and Statistics, MIT Press, vol. 84(3), pages 518-529, August.
    6. Sylvia Fruhwirth-Schnatter, 2004. "Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques," Econometrics Journal, Royal Economic Society, vol. 7(1), pages 143-167, June.
    7. Jan Schepers & Eva Ceulemans & Iven Mechelen, 2008. "Selecting Among Multi-Mode Partitioning Models of Different Complexities: A Comparison of Four Model Selection Criteria," Journal of Classification, Springer;The Classification Society, vol. 25(1), pages 67-85, June.
    8. Fruhwirth-Schnatter S., 2001. "Markov Chain Monte Carlo Estimation of Classical and Dynamic Switching and Mixture Models," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 194-209, March.
    9. Wayne DeSarbo & Duncan Fong & John Liechty & M. Kim Saxton, 2004. "A hierarchical bayesian procedure for two-mode cluster analysis," Psychometrika, Springer;The Psychometric Society, vol. 69(4), pages 547-572, December.
    10. de Leeuw, Jan, 2006. "Principal component analysis of binary data by iterated singular value decomposition," Computational Statistics & Data Analysis, Elsevier, vol. 50(1), pages 21-39, January.
    11. Glenn Milligan & Martha Cooper, 1985. "An examination of procedures for determining the number of clusters in a data set," Psychometrika, Springer;The Psychometric Society, vol. 50(2), pages 159-179, June.
    12. Govaert, Gérard & Nadif, Mohamed, 2008. "Block clustering with Bernoulli mixture models: Comparison of different approaches," Computational Statistics & Data Analysis, Elsevier, vol. 52(6), pages 3233-3245, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Eleni Matechou & Ivy Liu & Daniel Fernández & Miguel Farias & Bergljot Gjelsvik, 2016. "Biclustering Models for Two-Mode Ordinal Data," Psychometrika, Springer;The Psychometric Society, vol. 81(3), pages 611-624, September.
    2. Alessandro Casa & Charles Bouveyron & Elena Erosheva & Giovanna Menardi, 2021. "Co-clustering of Time-Dependent Data via the Shape Invariant Model," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 626-649, October.
    3. Aghiles Salah & Mohamed Nadif, 2019. "Directional co-clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 591-620, September.
    4. Aurore Lomet & Gérard Govaert & Yves Grandvalet, 2018. "Model selection for Gaussian latent block clustering with the integrated classification likelihood," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 489-508, September.
    5. Daniel Fernández & Richard Arnold & Shirley Pledger & Ivy Liu & Roy Costilla, 2019. "Finite mixture biclustering of discrete type multivariate data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 117-143, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David Ardia, 2009. "Bayesian estimation of a Markov-switching threshold asymmetric GARCH model with Student-t innovations," Econometrics Journal, Royal Economic Society, vol. 12(1), pages 105-126, March.
    2. Keane, Michael & Stavrunova, Olena, 2016. "Adverse selection, moral hazard and the demand for Medigap insurance," Journal of Econometrics, Elsevier, vol. 190(1), pages 62-78.
    3. Luc Bauwens & Jean-François Carpantier & Arnaud Dufays, 2017. "Autoregressive Moving Average Infinite Hidden Markov-Switching Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(2), pages 162-182, April.
    4. Yin, Ming, 2015. "Estimating Gaussian Mixture Autoregressive model with Sequential Monte Carlo algorithm: A parallel GPU implementation," MPRA Paper 88111, University Library of Munich, Germany, revised 2018.
    5. Philippe J. Deschamps, 2008. "Comparing smooth transition and Markov switching autoregressive models of US unemployment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(4), pages 435-462.
    6. Bauwens, Luc & Dufays, Arnaud & Rombouts, Jeroen V.K., 2014. "Marginal likelihood for Markov-switching and change-point GARCH models," Journal of Econometrics, Elsevier, vol. 178(P3), pages 508-522.
    7. Philippe J. Deschamps, 2008. "Comparing smooth transition and Markov switching autoregressive models of US unemployment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(4), pages 435-462.
    8. J. Vera & Rodrigo Macías & Willem Heiser, 2013. "Cluster Differences Unfolding for Two-Way Two-Mode Preference Rating Data," Journal of Classification, Springer;The Classification Society, vol. 30(3), pages 370-396, October.
    9. Li, Mingliang & Mumford, Kevin J. & Tobias, Justin L., 2012. "A Bayesian analysis of payday loans and their regulation," Journal of Econometrics, Elsevier, vol. 171(2), pages 205-216.
    10. Hoogerheide, Lennart & Opschoor, Anne & van Dijk, Herman K., 2012. "A class of adaptive importance sampling weighted EM algorithms for efficient and robust posterior and predictive simulation," Journal of Econometrics, Elsevier, vol. 171(2), pages 101-120.
    11. de Pooter, M.D. & Ravazzolo, F. & Segers, R. & van Dijk, H.K., 2008. "Bayesian near-boundary analysis in basic macroeconomic time series models," Econometric Institute Research Papers EI 2008-13, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    12. Çakmaklı, Cem & Paap, Richard & van Dijk, Dick, 2013. "Measuring and predicting heterogeneous recessions," Journal of Economic Dynamics and Control, Elsevier, vol. 37(11), pages 2195-2216.
    13. Paap, Richard & Segers, Rene & van Dijk, Dick, 2009. "Do Leading Indicators Lead Peaks More Than Troughs?," Journal of Business & Economic Statistics, American Statistical Association, vol. 27(4), pages 528-543.
    14. Vlad-Cosmin Bulai & Alexandra Horobeț & Lucian Belascu, 2019. "Improving Local Governments’ Financial Sustainability by Using Open Government Data: An Application of High-Granularity Estimates of Personal Income Levels in Romania," Sustainability, MDPI, vol. 11(20), pages 1-11, October.
    15. Deschamps, Philippe J., 2012. "Bayesian estimation of generalized hyperbolic skewed student GARCH models," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3035-3054.
    16. Aßmann, Christian & Boysen-Hogrefe, Jens, 2011. "A Bayesian approach to model-based clustering for binary panel probit models," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 261-279, January.
    17. Jan Schepers & Iven Mechelen & Eva Ceulemans, 2011. "The Real-Valued Model of Hierarchical Classes," Journal of Classification, Springer;The Classification Society, vol. 28(3), pages 363-389, October.
    18. Jan Schepers & Eva Ceulemans & Iven Mechelen, 2008. "Selecting Among Multi-Mode Partitioning Models of Different Complexities: A Comparison of Four Model Selection Criteria," Journal of Classification, Springer;The Classification Society, vol. 25(1), pages 67-85, June.
    19. Ardia, David & Baştürk, Nalan & Hoogerheide, Lennart & van Dijk, Herman K., 2012. "A comparative study of Monte Carlo methods for efficient evaluation of marginal likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3398-3414.
    20. Qian, Hang, 2009. "Bayesian Portfolio Selection with Gaussian Mixture Returns," MPRA Paper 32688, University Library of Munich, Germany.

    More about this item

    Keywords

    MCMC; latent-class model; model-based clustering; two-mode data;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ems:eureir:15112. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: RePub (email available below). General contact details of provider: https://edirc.repec.org/data/feeurnl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.