Printed from https://ideas.repec.org/a/jss/jstsof/v076i09.html

blockcluster: An R Package for Model-Based Co-Clustering

Authors

  • Bhatia, Parmeet Singh
  • Iovleff, Serge
  • Govaert, Gérard

Abstract

Simultaneous clustering of rows and columns, usually called bi-clustering, co-clustering, or block clustering, is an important technique in two-way data analysis. A new and efficient standard approach has recently been proposed based on the latent block model (Govaert and Nadif 2003), which addresses the block clustering problem on both the individual and variable sets. This article presents our R package blockcluster for co-clustering of binary, contingency, and continuous data based on these models. We give a brief review of model-based block clustering methods and show how the R package blockcluster can be used for co-clustering.

Suggested Citation

  • Bhatia, Parmeet Singh & Iovleff, Serge & Govaert, Gérard, 2017. "blockcluster: An R Package for Model-Based Co-Clustering," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 76(i09).
  • Handle: RePEc:jss:jstsof:v:076:i09
    DOI: http://hdl.handle.net/10.18637/jss.v076.i09

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v076i09/v76i09.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v076i09/blockcluster_4.2.3.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v076i09/v76i09.R
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v076i09/v76i09-replication.zip
    Download Restriction: no


    References listed on IDEAS

    1. Govaert, Gérard & Nadif, Mohamed, 2008. "Block clustering with Bernoulli mixture models: Comparison of different approaches," Computational Statistics & Data Analysis, Elsevier, vol. 52(6), pages 3233-3245, February.
    2. Hathaway, Richard J., 1986. "Another interpretation of the EM algorithm for mixture distributions," Statistics & Probability Letters, Elsevier, vol. 4(2), pages 53-56, March.

    Citations

    Cited by:

    1. C. Biernacki & J. Jacques & C. Keribin, 2023. "A Survey on Model-Based Co-Clustering: High Dimension and Estimation Challenges," Journal of Classification, Springer;The Classification Society, vol. 40(2), pages 332-381, July.
    2. Blazquez-Soriano, Amparo & Ramos-Sandoval, Rosmery, 2022. "Information transfer as a tool to improve the resilience of farmers against the effects of climate change: The case of the Peruvian National Agrarian Innovation System," Agricultural Systems, Elsevier, vol. 200(C).
    3. Zaheer Ahmed & Alberto Cassese & Gerard Breukelen & Jan Schepers, 2023. "E-ReMI: Extended Maximal Interaction Two-mode Clustering," Journal of Classification, Springer;The Classification Society, vol. 40(2), pages 298-331, July.
    4. Ferraro, Maria Brigida & Giordani, Paolo & Vichi, Maurizio, 2021. "A class of two-mode clustering algorithms in a fuzzy setting," Econometrics and Statistics, Elsevier, vol. 18(C), pages 63-78.
    5. Selosse, Margot & Jacques, Julien & Biernacki, Christophe, 2020. "Model-based co-clustering for mixed type data," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    6. Gong, Tingnan & Zhang, Weiping & Chen, Yu, 2023. "Uncovering block structures in large rectangular matrices," Journal of Multivariate Analysis, Elsevier, vol. 198(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gérard Govaert & Mohamed Nadif, 2018. "Mutual information, phi-squared and model-based co-clustering for contingency tables," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 455-488, September.
    2. Bergé, Laurent R. & Bouveyron, Charles & Corneli, Marco & Latouche, Pierre, 2019. "The latent topic block model for the co-clustering of textual interaction data," Computational Statistics & Data Analysis, Elsevier, vol. 137(C), pages 247-270.
    3. Michael Salter-Townshend & Thomas Murphy, 2014. "Mixtures of biased sentiment analysers," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(1), pages 85-103, March.
    4. Blazquez-Soriano, Amparo & Ramos-Sandoval, Rosmery, 2022. "Information transfer as a tool to improve the resilience of farmers against the effects of climate change: The case of the Peruvian National Agrarian Innovation System," Agricultural Systems, Elsevier, vol. 200(C).
    5. Gilles Celeux & Gilda Soromenho, 1996. "An entropy criterion for assessing the number of clusters in a mixture model," Journal of Classification, Springer;The Classification Society, vol. 13(2), pages 195-212, September.
    6. Ferraro, Maria Brigida, 2024. "Fuzzy k-Means: history and applications," Econometrics and Statistics, Elsevier, vol. 30(C), pages 110-123.
    7. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    8. Rawya Zreik & Pierre Latouche & Charles Bouveyron, 2017. "The dynamic random subgraph model for the clustering of evolving networks," Computational Statistics, Springer, vol. 32(2), pages 501-533, June.
    9. Haedo, Christian & Mouchart, Michel, 2019. "Two-mode clustering through profiles of regions and sectors," LIDAM Discussion Papers ISBA 2019014, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    10. van Dijk, A. & van Rosmalen, J.M. & Paap, R., 2009. "A Bayesian approach to two-mode clustering," Econometric Institute Research Papers EI 2009-06, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    11. Vu, Duy & Aitkin, Murray, 2015. "Variational algorithms for biclustering models," Computational Statistics & Data Analysis, Elsevier, vol. 89(C), pages 12-24.
    12. Haedo, Christian & Mouchart, Michel, 2018. "Automatic biclustering of regions and sectors," LIDAM Discussion Papers ISBA 2018026, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    13. Di Zio, Marco & Guarnera, Ugo & Rocci, Roberto, 2007. "A mixture of mixture models for a classification problem: The unity measure error," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2573-2585, February.
    14. Tatiana Makhalova & Martin Trnecka, 2021. "From-below Boolean matrix factorization algorithm based on MDL," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(1), pages 37-56, March.
    15. Carlo Cavicchia & Maurizio Vichi & Giorgia Zaccaria, 2022. "Gaussian mixture model with an extended ultrametric covariance structure," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 399-427, June.
    16. Alessandro Casa & Charles Bouveyron & Elena Erosheva & Giovanna Menardi, 2021. "Co-clustering of Time-Dependent Data via the Shape Invariant Model," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 626-649, October.
    17. C. Biernacki & J. Jacques & C. Keribin, 2023. "A Survey on Model-Based Co-Clustering: High Dimension and Estimation Challenges," Journal of Classification, Springer;The Classification Society, vol. 40(2), pages 332-381, July.
    18. Francesca Martella & Maurizio Vichi, 2012. "Clustering microarray data using model-based double K-means," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(9), pages 1853-1869, April.
    19. Roy Costilla & Ivy Liu & Richard Arnold & Daniel Fernández, 2019. "Bayesian model-based clustering for longitudinal ordinal data," Computational Statistics, Springer, vol. 34(3), pages 1015-1038, September.
    20. Gupta, Mayetri, 2014. "An evolutionary Monte Carlo algorithm for Bayesian block clustering of data matrices," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 375-391.
