IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v102y2015i1d10.1007_s11192-014-1377-5.html
   My bibliography  Save this article

An entropy-based social network community detecting method and its application to scientometrics

Author

Listed:
  • Yongli Li

    (Harbin Institute of Technology
    Università di Siena)

  • Guijie Zhang

    (Harbin Institute of Technology)

  • Yuqiang Feng

    (Harbin Institute of Technology)

  • Chong Wu

    (Harbin Institute of Technology)

Abstract

Community structure is one of the important properties of social networks in general and in particular the citation networks in the field of scientometrics. A majority of existing methods are not proper for detecting communities in a directed network, and thus hinders their applications in the citation networks. In this paper, we provide a novel method which not only overcomes the above mentioned disability, but also has a relative low algorithm time complexity which facilitates the application in large scale networks. We use the concept of Shannon entropy to measure a network’s information and then consider the process of detecting communities as a process of information loss. Based on this idea, we develop an optimal model to depict the process of detecting communities and further introduce the principle of dynamic programming to solve the model. A simulation test is also designed to examine the model’s accuracy in discovering the community structure and identifying the optimal community number. Finally, we apply our method in a citation network from the journal Scientometrics and then provide several insights on promising research topics through the detected communities by our method.

Suggested Citation

  • Yongli Li & Guijie Zhang & Yuqiang Feng & Chong Wu, 2015. "An entropy-based social network community detecting method and its application to scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 1003-1017, January.
  • Handle: RePEc:spr:scient:v:102:y:2015:i:1:d:10.1007_s11192-014-1377-5
    DOI: 10.1007/s11192-014-1377-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-014-1377-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-014-1377-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Marc Correa & Lucinio González-Sabaté & Ignacio Serrano, 2013. "Home bias effect in the management literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(1), pages 417-433, April.
    2. Guang Yu & Yi-Jun Li, 2010. "Identification of referencing and citation processes of scientific journals based on the citation distribution model," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(2), pages 249-261, February.
    3. Gergely Palla & Imre Derényi & Illés Farkas & Tamás Vicsek, 2005. "Uncovering the overlapping community structure of complex networks in nature and society," Nature, Nature, vol. 435(7043), pages 814-818, June.
    4. Per O Seglen, 1992. "How representative is the journal impact factor?," Research Evaluation, Oxford University Press, vol. 2(3), pages 143-149, December.
    5. Sameer Kumar & Jariah Mohd. Jan, 2014. "Research collaboration networks of two OIC nations: comparative study between Turkey and Malaysia in the field of ‘Energy Fuels’, 2009–2011," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 387-414, January.
    6. Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.
    7. Yan, Erjia & Ding, Ying & Milojević, Staša & Sugimoto, Cassidy R., 2012. "Topics in dynamic research communities: An exploratory study for the field of information retrieval," Journal of Informetrics, Elsevier, vol. 6(1), pages 140-153.
    8. Erjia Yan & Ying Ding & Elin K. Jacob, 2012. "Overlaying communities and topics: an analysis on publication networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 499-513, February.
    9. Theresa Velden & Carl Lagoze, 2013. "The extraction of community structures from publication networks to support ethnographic observations of field differences in scientific communication," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(12), pages 2405-2427, December.
    10. Chaomei Chen & Fidelia Ibekwe-SanJuan & Jianhua Hou, 2010. "The structure and dynamics of cocitation clusters: A multiple-perspective cocitation analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(7), pages 1386-1409, July.
    11. He, Bing & Ding, Ying & Tang, Jie & Reguramalingam, Vignesh & Bollen, Johan, 2013. "Mining diversity subgraph in multidisciplinary scientific collaboration networks: A meso perspective," Journal of Informetrics, Elsevier, vol. 7(1), pages 117-128.
    12. Theresa Velden & Asif-ul Haque & Carl Lagoze, 2010. "A new approach to analyzing patterns of collaboration in co-authorship networks: mesoscopic analysis and interpretation," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 219-242, October.
    13. Theresa Velden & Carl Lagoze, 2013. "The extraction of community structures from publication networks to support ethnographic observations of field differences in scientific communication," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 64(12), pages 2405-2427, December.
    14. Georg Groh & Christoph Fuchs, 2011. "Multi-modal social networks for modeling scientific fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 569-590, November.
    15. Choong Kwai Fatt & Ephrance Abu Ujum & Kuru Ratnavelu, 2010. "The structure of collaboration in the Journal of Finance," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(3), pages 849-860, December.
    16. Rodriguez, Marko A. & Pepe, Alberto, 2008. "On the relationship between the structural and socioacademic communities of a coauthorship network," Journal of Informetrics, Elsevier, vol. 2(3), pages 195-201.
    17. Francesca Pallotti & Alessandro Lomi & Daniele Mascia, 2013. "From network ties to network structures: Exponential Random Graph Models of interorganizational relations," Quality & Quantity: International Journal of Methodology, Springer, vol. 47(3), pages 1665-1685, April.
    18. T. S. Evans & R. Lambiotte & P. Panzarasa, 2011. "Community structure and patterns of scientific collaboration in Business and Management," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(1), pages 381-396, October.
    19. Wolfgang Glänzel & Balázs Schlemmer & Bart Thijs, 2003. "Better late than never? On the chance to become highly cited only beyond the standard bibliometric time horizon," Scientometrics, Springer;Akadémiai Kiadó, vol. 58(3), pages 571-586, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    2. Guijie Zhang & Luning Liu & Fangfang Wei, 2019. "Key nodes mining in the inventor–author knowledge diffusion network," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 721-735, March.
    3. Wang, Feifei & Jia, Chenran & Wang, Xiaohan & Liu, Junwan & Xu, Shuo & Liu, Yang & Yang, Chenyuyan, 2019. "Exploring all-author tripartite citation networks: A case study of gene editing," Journal of Informetrics, Elsevier, vol. 13(3), pages 856-873.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liliana Arroyo Moliner & Eva Gallardo-Gallardo & Pedro Gallo de Puelles, 2017. "Understanding scientific communities: a social network approach to collaborations in Talent Management research," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1439-1462, December.
    2. Theresa Velden & Shiyan Yan & Carl Lagoze, 2017. "Mapping the cognitive structure of astrophysics by infomap clustering of the citation network and topic affinity analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1033-1051, May.
    3. Yanto Chandra, 2018. "Mapping the evolution of entrepreneurship as a field of research (1990–2013): A scientometric analysis," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-24, January.
    4. Carusi, Chiara & Bianchi, Giuseppe, 2019. "Scientific community detection via bipartite scholar/journal graph co-clustering," Journal of Informetrics, Elsevier, vol. 13(1), pages 354-386.
    5. Rodica Ioana Lung & Noémi Gaskó & Mihai Alexandru Suciu, 2018. "A hypergraph model for representing scientific output," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1361-1379, December.
    6. Guijie Zhang & Luning Liu & Fangfang Wei, 2019. "Key nodes mining in the inventor–author knowledge diffusion network," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 721-735, March.
    7. Sameer Kumar & Bernd Markscheffel, 2016. "Bonded-communities in HantaVirus research: a research collaboration network (RCN) analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(1), pages 533-550, October.
    8. Theresa Velden & Kevin W. Boyack & Jochen Gläser & Rob Koopman & Andrea Scharnhorst & Shenghui Wang, 2017. "Comparison of topic extraction approaches and their results," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1169-1221, May.
    9. Guijie Zhang & Luning Liu & Yuqiang Feng & Zhen Shao & Yongli Li, 2014. "Cext-N index: a network node centrality measure for collaborative relationship distribution," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 291-307, October.
    10. Li, Yongli & Wu, Chong & Wang, Xiaoyu & Luo, Peng, 2014. "A network-based and multi-parameter model for finding influential authors," Journal of Informetrics, Elsevier, vol. 8(3), pages 791-799.
    11. Yan, Erjia, 2014. "Research dynamics: Measuring the continuity and popularity of research topics," Journal of Informetrics, Elsevier, vol. 8(1), pages 98-110.
    12. Jianlin Zhou & An Zeng & Ying Fan & Zengru Di, 2018. "Identifying important scholars via directed scientific collaboration networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 1327-1343, March.
    13. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    14. Hakyeon Lee & Pilsung Kang, 2018. "Identifying core topics in technology and innovation management studies: a topic model approach," The Journal of Technology Transfer, Springer, vol. 43(5), pages 1291-1317, October.
    15. Erjia Yan, 2014. "Topic-based Pagerank: toward a topic-level scientific evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(2), pages 407-437, August.
    16. Katalin Orosz & Illés J. Farkas & Péter Pollner, 2016. "Quantifying the changing role of past publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(2), pages 829-853, August.
    17. Ren, Fu-Xin & Shen, Hua-Wei & Cheng, Xue-Qi, 2012. "Modeling the clustering in citation networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(12), pages 3533-3539.
    18. Yongli Li & Chong Wu & Zizheng Wang, 2015. "An information-theoretic approach for detecting communities in networks," Quality & Quantity: International Journal of Methodology, Springer, vol. 49(4), pages 1719-1733, July.
    19. Lin Zhang & Wolfgang Glänzel, 2017. "A citation-based cross-disciplinary study on literature ageing: part II—diachronous aspects," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1559-1572, June.
    20. Guillaume Cabanac & Gilles Hubert & Béatrice Milard, 2015. "Academic careers in Computer Science: continuance and transience of lifetime co-authorships," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 135-150, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:102:y:2015:i:1:d:10.1007_s11192-014-1377-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.