
On network backbone extraction for modeling online collective behavior

Author

Listed:
  • Carlos Henrique Gomes Ferreira
  • Fabricio Murai
  • Ana P C Silva
  • Martino Trevisan
  • Luca Vassio
  • Idilio Drago
  • Marco Mellia
  • Jussara M Almeida

Abstract

Collective user behavior in social media applications often drives several important online and offline phenomena linked to the spread of opinions and information. Several studies have focused on the analysis of such phenomena using networks to model user interactions, represented by edges. However, only a fraction of the edges contributes to the actual investigation. Even worse, the often large number of non-relevant edges may obfuscate the salient interactions, blurring the underlying structures and user communities that capture the collective behavior patterns driving the target phenomenon. To solve this issue, researchers have proposed several network backbone extraction techniques to obtain a reduced and representative version of the network that better explains the phenomenon of interest. Each technique has its own assumptions and its own procedure to extract the backbone. However, the literature lacks a clear methodology to highlight such assumptions, discuss how they affect the choice of a method, and offer validation strategies in scenarios where no ground truth exists. In this work, we fill this gap by proposing a principled methodology for comparing and selecting the most appropriate backbone extraction method given a phenomenon of interest. We characterize ten state-of-the-art techniques in terms of their assumptions, requirements, and other aspects that one must consider to apply them in practice. We present four steps to apply, evaluate, and select the best method(s) for a given target phenomenon. We validate our approach using two case studies with different requirements: online discussions on Instagram and coordinated behavior in WhatsApp groups. We show that different methods can produce very different backbones, underscoring that the choice of an adequate method is of utmost importance to reveal valuable knowledge about the particular phenomenon under investigation.
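
To make the idea of a backbone concrete, here is a minimal Python sketch of one widely used approach, a disparity-filter-style significance test on edge weights. It is not taken from the article, and the abstract does not name the ten techniques it characterizes, so this should be read only as an illustration of the general idea. The function name `disparity_backbone`, the edge-dictionary input format, the default threshold `alpha=0.05`, and the toy weights are all assumptions made for this example.

```python
from collections import defaultdict


def disparity_backbone(edges, alpha=0.05):
    """Keep edges whose weight is unexpectedly large for at least one endpoint.

    edges: dict mapping (u, v) -> positive weight of an undirected graph
           (hypothetical input format chosen for this sketch).
    alpha: significance threshold (assumed default).
    """
    strength = defaultdict(float)  # sum of edge weights per node
    degree = defaultdict(int)      # number of incident edges per node
    for (u, v), w in edges.items():
        strength[u] += w
        strength[v] += w
        degree[u] += 1
        degree[v] += 1

    def p_value(node, w):
        k = degree[node]
        if k < 2:
            # A node with a single edge gives no basis for the test.
            return 1.0
        # Null model: the node's strength is split uniformly at random
        # among its k edges, which gives p = (1 - w / s) ** (k - 1).
        return (1.0 - w / strength[node]) ** (k - 1)

    return {
        (u, v): w
        for (u, v), w in edges.items()
        if min(p_value(u, w), p_value(v, w)) < alpha
    }


# Toy network: one heavy interaction and four weak ones around node "a".
example_edges = {
    ("a", "b"): 100,
    ("a", "c"): 1,
    ("a", "d"): 1,
    ("a", "e"): 1,
    ("a", "f"): 1,
}
print(disparity_backbone(example_edges))
```

On this toy network only the heavy edge between `a` and `b` passes the test, which mirrors the abstract's point that a backbone discards non-relevant edges to expose the salient interactions. A different technique, or a different `alpha`, could retain a different set of edges.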

Suggested Citation

  • Carlos Henrique Gomes Ferreira & Fabricio Murai & Ana P C Silva & Martino Trevisan & Luca Vassio & Idilio Drago & Marco Mellia & Jussara M Almeida, 2022. "On network backbone extraction for modeling online collective behavior," PLOS ONE, Public Library of Science, vol. 17(9), pages 1-36, September.
  • Handle: RePEc:plo:pone00:0274218
    DOI: 10.1371/journal.pone.0274218

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0274218
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0274218&type=printable
    Download Restriction: no




