IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v432y2015icp206-215.html
   My bibliography  Save this article

Sampling promotes community structure in social and information networks

Author

Listed:
  • Blagus, Neli
  • Šubelj, Lovro
  • Weiss, Gregor
  • Bajec, Marko

Abstract

Any network studied in the literature is inevitably just a sampled representative of its real-world analogue. Additionally, network sampling is lately often applied to large networks to allow for their faster and more efficient analysis. Nevertheless, the changes in network structure introduced by sampling are still far from understood. In this paper, we study the presence of characteristic groups of nodes in sampled social and information networks. We consider different network sampling techniques including random node and link selection, network exploration and expansion. We first observe that the structure of social networks reveals densely linked groups like communities, while the structure of information networks is better described by modules of structurally equivalent nodes. However, despite these notable differences, the structure of sampled networks exhibits stronger characterization by community-like groups than the original networks, irrespective of their type and consistently across various sampling techniques. Hence, rich community structure commonly observed in social and information networks is to some extent merely an artifact of sampling.

Suggested Citation

  • Blagus, Neli & Šubelj, Lovro & Weiss, Gregor & Bajec, Marko, 2015. "Sampling promotes community structure in social and information networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 432(C), pages 206-215.
  • Handle: RePEc:eee:phsmap:v:432:y:2015:i:c:p:206-215
    DOI: 10.1016/j.physa.2015.03.048
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437115003106
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2015.03.048?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. L. Šubelj & M. Bajec, 2012. "Ubiquitousness of link-density and link-pattern communities in real-world networks," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 85(1), pages 1-11, January.
    2. Lovro Šubelj & Slavko Žitnik & Neli Blagus & Marko Bajec, 2014. "Node Mixing And Group Structure Of Complex Software Networks," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 17(07n08), pages 1-26.
    3. Gergely Palla & Imre Derényi & Illés Farkas & Tamás Vicsek, 2005. "Uncovering the overlapping community structure of complex networks in nature and society," Nature, Nature, vol. 435(7043), pages 814-818, June.
    4. Blagus, Neli & Šubelj, Lovro & Bajec, Marko, 2014. "Assessing the effectiveness of real-world network simplification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 413(C), pages 134-146.
    5. Borut Lužar & Zoran Levnajić & Janez Povh & Matjaž Perc, 2014. "Community Structure and the Evolution of Interdisciplinarity in Slovenia's Scientific Collaboration Network," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-5, April.
    6. J. Reichardt & D. R. White, 2007. "Role models for complex networks," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 60(2), pages 217-224, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Blagus, Neli & Šubelj, Lovro & Bajec, Marko, 2017. "Empirical comparison of network sampling: How to choose the most appropriate method?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 477(C), pages 136-148.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Šubelj, Lovro & Bajec, Marko, 2014. "Group detection in complex networks: An algorithm and comparison of the state of the art," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 397(C), pages 144-156.
    2. Barigozzi, Matteo & Fagiolo, Giorgio & Mangioni, Giuseppe, 2011. "Identifying the community structure of the international-trade multi-network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(11), pages 2051-2066.
    3. Xiaofeng Wang & Gongshen Liu & Jianhua Li & Jan P Nees, 2017. "Locating Structural Centers: A Density-Based Clustering Method for Community Detection," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-23, January.
    4. Stefan Pinkert & Jörg Schultz & Jörg Reichardt, 2010. "Protein Interaction Networks—More Than Mere Modules," PLOS Computational Biology, Public Library of Science, vol. 6(1), pages 1-13, January.
    5. Ma, Guoshuai & Yuhua, Qian & Zhang, Yayu & Yan, Hongren & Cheng, Honghong & Hu, Zhiguo, 2022. "The recognition of kernel research team," Journal of Informetrics, Elsevier, vol. 16(4).
    6. Jorge Peña & Yannick Rochat, 2012. "Bipartite Graphs as Models of Population Structures in Evolutionary Multiplayer Games," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-13, September.
    7. Shang, Jiaxing & Liu, Lianchen & Li, Xin & Xie, Feng & Wu, Cheng, 2016. "Targeted revision: A learning-based approach for incremental community detection in dynamic networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 443(C), pages 70-85.
    8. Ying Song & Zhiwen Zheng & Yunmei Shi & Bo Wang, 2023. "GLOD: The Local Greedy Expansion Method for Overlapping Community Detection in Dynamic Provenance Networks," Mathematics, MDPI, vol. 11(15), pages 1-16, July.
    9. Zhang, Zhiwei & Wang, Zhenyu, 2015. "Mining overlapping and hierarchical communities in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 421(C), pages 25-33.
    10. Masa Tsuchiya & Vincent Piras & Alessandro Giuliani & Masaru Tomita & Kumar Selvarajoo, 2010. "Collective Dynamics of Specific Gene Ensembles Crucial for Neutrophil Differentiation: The Existence of Genome Vehicles Revealed," PLOS ONE, Public Library of Science, vol. 5(8), pages 1-10, August.
    11. Wu, Zhihao & Lin, Youfang & Wan, Huaiyu & Tian, Shengfeng & Hu, Keyun, 2012. "Efficient overlapping community detection in huge real-world networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(7), pages 2475-2490.
    12. Nie, Yanyi & Li, Wenyao & Pan, Liming & Lin, Tao & Wang, Wei, 2022. "Markovian approach to tackle competing pathogens in simplicial complex," Applied Mathematics and Computation, Elsevier, vol. 417(C).
    13. Rizman Žalik, Krista & Žalik, Borut, 2014. "A local multiresolution algorithm for detecting communities of unbalanced structures," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 407(C), pages 380-393.
    14. Yang, Xu-Hua & Lou, Shun-Li & Chen, Guang & Chen, Sheng-Yong & Huang, Wei, 2013. "Scale-free networks via attaching to random neighbors," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(17), pages 3531-3536.
    15. Zhang, Shihua & Wang, Rui-Sheng & Zhang, Xiang-Sun, 2007. "Identification of overlapping community structure in complex networks using fuzzy c-means clustering," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 374(1), pages 483-490.
    16. Dai, Liang & Derudder, Ben & Liu, Xingjian, 2018. "Transport network backbone extraction: A comparison of techniques," Journal of Transport Geography, Elsevier, vol. 69(C), pages 271-281.
    17. Chen, Lei & Kou, Yingxin & Li, Zhanwu & Xu, An & Wu, Cheng, 2018. "Empirical research on complex networks modeling of combat SoS based on data from real war-game, Part I: Statistical characteristics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 490(C), pages 754-773.
    18. Giorgio Gronchi & Marco Raglianti & Fabio Giovannelli, 2021. "Network Theory and Switching Behaviors: A User Guide for Analyzing Electronic Records Databases," Future Internet, MDPI, vol. 13(9), pages 1-12, August.
    19. Amulyashree Sridhar & Sharvani GS & AH Manjunatha Reddy & Biplab Bhattacharjee & Kalyan Nagaraj, 2019. "The Eminence of Co-Expressed Ties in Schizophrenia Network Communities," Data, MDPI, vol. 4(4), pages 1-23, November.
    20. Shen Wang & Jun Wu & Yutao Zhang, 2018. "Consumer preference–enabled intelligent energy management for smart cities using game theoretic social tie," International Journal of Distributed Sensor Networks, , vol. 14(4), pages 15501477187, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:432:y:2015:i:c:p:206-215. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.