IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v40y2025i8d10.1007_s00180-025-01625-2.html

A community detection algorithm based on spectral co-clustering and weight self-adjustment in attributed stochastic co-block models

Author

Listed:
  • Yuxin Zhang

    (University of Science and Technology of China)

  • Jie Liu

    (University of Science and Technology of China)

  • Yang Yang

    (Shanghai Jiao Tong University)

Abstract

The Degree-Corrected Stochastic co-Block Model (DC-ScBM) is widely utilized for detecting the community structure in directed networks. It can flexibly depict the topology of edges in directed graphs. However, in practice, node attributes provide an additional source of information that can be leveraged for community detection, which is not considered in the DC-ScBM. Therefore, there is a critical need to develop models and detection methods for node-attributed directed networks, especially when the goal is to discover important nodes or special community structures. We generalize the DC-ScBM using the multiplicative form to fuse edges and node attributes and describe the extent of influence of node attributes on each community. Then, a detection algorithm based on spectral co-clustering and feature weight self-adjustment (Spcc-SA) is developed. The algorithm aims to minimize normalized cut (Ncut), and iteratively detects the sending and receiving communities and the weights of node attributes, so that node attributes with stronger signals are given greater weights. Numerical studies demonstrate that the Spcc-SA algorithm outperforms existing methods across a variety of node attributes and network topologies. Especially when attribute values differ greatly and the community structure is distinct, the normalized mutual information of Spcc-SA in the sending and receiving communities can reach 0.6 and 0.8, respectively. Furthermore, We apply this algorithm to real world datasets, including the Enron email, world trade, and Weddell Sea network, demonstrating that the algorithm can effectively detect interesting community structures.

Suggested Citation

  • Yuxin Zhang & Jie Liu & Yang Yang, 2025. "A community detection algorithm based on spectral co-clustering and weight self-adjustment in attributed stochastic co-block models," Computational Statistics, Springer, vol. 40(8), pages 4247-4275, November.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:8:d:10.1007_s00180-025-01625-2
    DOI: 10.1007/s00180-025-01625-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-025-01625-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-025-01625-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Nan, Dong-Yang & Yu, Wei & Liu, Xiao & Zhang, Yun-Peng & Dai, Wei-Di, 2018. "A framework of community detection based on individual labels in attribute networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 523-536.
    2. M. E. J. Newman & Aaron Clauset, 2016. "Structure and inference in annotated networks," Nature Communications, Nature, vol. 7(1), pages 1-11, September.
    3. Fengqin Tang & Chunning Wang & Jinxia Su & Yuanyuan Wang, 2020. "Spectral clustering-based community detection using graph distance and node attributes," Computational Statistics, Springer, vol. 35(1), pages 69-94, March.
    4. N. Binkiewicz & J. T. Vogelstein & K. Rohe, 2017. "Covariate-assisted spectral clustering," Biometrika, Biometrika Trust, vol. 104(2), pages 361-377.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fengqin Tang & Chunning Wang & Jinxia Su & Yuanyuan Wang, 2020. "Spectral clustering-based community detection using graph distance and node attributes," Computational Statistics, Springer, vol. 35(1), pages 69-94, March.
    2. Junhui Cai & Dan Yang & Ran Chen & Wu Zhu & Haipeng Shen & Linda Zhao, 2021. "Network regression and supervised centrality estimation," Papers 2111.12921, arXiv.org, revised Feb 2025.
    3. Lucy L. Gao & Daniela Witten & Jacob Bien, 2022. "Testing for association in multiview network data," Biometrics, The International Biometric Society, vol. 78(3), pages 1018-1030, September.
    4. Fengqin Tang & Xuejing Zhao & Cuixia Li, 2023. "Community Detection in Multilayer Networks Based on Matrix Factorization and Spectral Embedding Method," Mathematics, MDPI, vol. 11(7), pages 1-19, March.
    5. Yunpeng Zhao & Qing Pan & Chengan Du, 2019. "Logistic regression augmented community detection for network data with application in identifying autism‐related gene pathways," Biometrics, The International Biometric Society, vol. 75(1), pages 222-234, March.
    6. Heather Mathews & Alexander Volfovsky, 2023. "Community informed experimental design," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(4), pages 1141-1166, October.
    7. Zhou, Bin & Yan, Xiao-Yong & Xu, Xiao-Ke & Xu, Xiao-Ting & Wang, Nianxin, 2018. "Evolutionary of online social networks driven by pareto wealth distribution and bidirectional preferential attachment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 507(C), pages 427-434.
    8. Thorben Funke & Till Becker, 2019. "Stochastic block models: A comparison of variants and inference methods," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-40, April.
    9. Juan Pablo Bascur & Suzan Verberne & Nees Jan Eck & Ludo Waltman, 2025. "Which topics are best represented by science maps? An analysis of clustering effectiveness for citation and text similarity networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(2), pages 1181-1199, February.
    10. Guo, Li & Tao, Yubo & Härdle, Wolfgang Karl, 2019. "Dynamic Network Perspective of Cryptocurrencies," IRTG 1792 Discussion Papers 2019-009, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    11. Jun Liu & Jiangzhou Wang & Binghui Liu, 2020. "Community Detection of Multi-Layer Attributed Networks via Penalized Alternating Factorization," Mathematics, MDPI, vol. 8(2), pages 1-20, February.
    12. Qiuping Wang & Yuan Zhang & Ting Yan, 2023. "Asymptotic theory in network models with covariates and a growing number of node parameters," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(2), pages 369-392, April.
    13. Shujie Ma & Liangjun Su & Yichong Zhang, 2020. "Detecting Latent Communities in Network Formation Models," Economics and Statistics Working Papers 12-2020, Singapore Management University, School of Economics.
    14. Zhou, Bin & Xu, Xiao-Ting & Liu, Jian-Guo & Xu, Xiao-Ke & Wang, Nianxin, 2019. "Information interaction model for the mobile communication networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 525(C), pages 1170-1176.
    15. Chang, Zhenhai & Yin, Xianjun & Jia, Caiyan & Wang, Xiaoyang, 2018. "Mixture models with entropy regularization for community detection in networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 496(C), pages 339-350.
    16. Louit, Sydney & Clark, Evan A. & Gelbard, Alexander H. & Vivek, Niketna & Yan, Jun & Zhang, Panpan, 2025. "CALF-SBM: A covariate-assisted latent factor stochastic block model," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 667(C).
    17. Anna Badalyan & Nicolò Ruggeri & Caterina De Bacco, 2024. "Structure and inference in hypergraphs with node attributes," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    18. Martina Contisciani & Federico Battiston & Caterina De Bacco, 2022. "Inference of hyperedges and overlapping communities in hypergraphs," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    19. Alireza Ermagun & Nazanin Tajik, 2021. "Recovery patterns and physics of the network," PLOS ONE, Public Library of Science, vol. 16(1), pages 1-20, January.
    20. Zihe Zhou & Bo Tian, 2019. "Research on Community Detection of Online Social Network Members Based on the Sparse Subspace Clustering Approach," Future Internet, MDPI, vol. 11(12), pages 1-16, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:40:y:2025:i:8:d:10.1007_s00180-025-01625-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.