
Flexible sampling large-scale social networks by self-adjustable random walk

Author

Listed:
  • Xu, Xiao-Ke
  • Zhu, Jonathan J.H.

Abstract

Online social networks (OSNs) have become an increasingly attractive gold mine for academic and commercial researchers. However, research on OSNs faces a number of difficult challenges. One bottleneck lies in the massive size and frequent unavailability of OSN population data. Sampling is perhaps the only feasible solution to these problems. How to draw samples that represent the underlying OSNs has remained a formidable task for a number of conceptual and methodological reasons. In particular, most empirically driven studies on network sampling are confined to simulated data or sub-graph data, which are fundamentally different from real, complete-graph OSNs. In the current study, we propose a flexible sampling method, called Self-Adjustable Random Walk (SARW), and test it against the population data of a real large-scale OSN. We evaluate the strengths of the sampling method in comparison with four prevailing methods: uniform, breadth-first search (BFS), random walk (RW), and revised RW (i.e., MHRW) sampling. We try to combine both induced-edge and external-edge information of sampled nodes in the same sampling process. Our results show that the SARW sampling method is able to generate unbiased samples of OSNs with maximal precision and minimal cost. The study is helpful for the practice of OSN research by providing a much-needed sampling tool, for the methodological development of large-scale network sampling by comparative evaluations of existing sampling methods, and for the theoretical understanding of human networks by highlighting discrepancies and contradictions between existing knowledge/assumptions and large-scale real OSN data.
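The abstract does not spell out how SARW works, so the following is only a minimal sketch of two of the baseline methods it names, the simple random walk (RW) and the Metropolis-Hastings random walk (MHRW). It is not the authors' implementation. The toy graph `adj`, the function names, the seeds, and the step count are all hypothetical choices made for illustration. The sketch shows why a plain RW over-samples high-degree nodes, while MHRW corrects for this by accepting a move from node u to neighbor v with probability min(1, deg(u)/deg(v)).

```python
import random


def random_walk(adj, start, steps, rng):
    """Simple random walk: move to a uniformly chosen neighbor at every step."""
    node = start
    visited = [node]
    for _ in range(steps):
        node = rng.choice(adj[node])
        visited.append(node)
    return visited


def mhrw(adj, start, steps, rng):
    """Metropolis-Hastings random walk whose target distribution is uniform over nodes."""
    node = start
    visited = [node]
    for _ in range(steps):
        candidate = rng.choice(adj[node])
        # Accept with probability min(1, deg(node) / deg(candidate)); otherwise stay put.
        if rng.random() < len(adj[node]) / len(adj[candidate]):
            node = candidate
        visited.append(node)
    return visited


# Hypothetical toy graph (undirected): node 0 is a hub, plus one extra edge 1-2.
adj = {
    0: [1, 2, 3, 4],
    1: [0, 2],
    2: [0, 1],
    3: [0],
    4: [0],
}

steps = 50_000  # assumed walk length, chosen only so the frequencies settle
rw_visits = random_walk(adj, start=0, steps=steps, rng=random.Random(1))
mh_visits = mhrw(adj, start=0, steps=steps, rng=random.Random(1))

# Share of visits that land on the hub (node 0) under each walk.
rw_share = rw_visits.count(0) / len(rw_visits)
mhrw_share = mh_visits.count(0) / len(mh_visits)

print(f"RW hub share:   {rw_share:.3f}")
print(f"MHRW hub share: {mhrw_share:.3f}")
```

On this graph the hub holds 4 of the 10 edge endpoints, so the RW's long-run visit share for node 0 should approach 0.4, whereas a uniform sample over the five nodes would give 0.2, which is what MHRW targets. Degree bias of this kind is a well-known property of plain random-walk sampling.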

Suggested Citation

  • Xu, Xiao-Ke & Zhu, Jonathan J.H., 2016. "Flexible sampling large-scale social networks by self-adjustable random walk," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 463(C), pages 356-365.
  • Handle: RePEc:eee:phsmap:v:463:y:2016:i:c:p:356-365
    DOI: 10.1016/j.physa.2016.07.055

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437116304861
    Download Restriction: Full text for ScienceDirect subscribers only. The journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.

    File URL: https://libkey.io/10.1016/j.physa.2016.07.055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Citations



    Cited by:

    1. Fuentes, Emilio Aced & Santini, Simone, 2021. "Network navigation with non-Lèvy superdiffusive random walks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 580(C).
    2. Xu, Xiao-Ke & Wang, Xue & Xiao, Jing, 2018. "Inferring parent–child relationships by a node-remove centrality framework in online social networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 505(C), pages 222-232.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Blazquez-Soriano, Amparo & Ramos-Sandoval, Rosmery, 2022. "Information transfer as a tool to improve the resilience of farmers against the effects of climate change: The case of the Peruvian National Agrarian Innovation System," Agricultural Systems, Elsevier, vol. 200(C).
    2. Martin L. Weitzman, 2015. "A Voting Architecture for the Governance of Free-Driver Externalities, with Application to Geoengineering," Scandinavian Journal of Economics, Wiley Blackwell, vol. 117(4), pages 1049-1068, October.
    3. Wei Zhong, 2017. "Simulating influenza pandemic dynamics with public risk communication and individual responsive behavior," Computational and Mathematical Organization Theory, Springer, vol. 23(4), pages 475-495, December.
    4. Guo Weilong & Minca Andreea & Wang Li, 2016. "The topology of overlapping portfolio networks," Statistics & Risk Modeling, De Gruyter, vol. 33(3-4), pages 139-155, December.
    5. Kobayashi, Teruyoshi & Takaguchi, Taro, 2018. "Identifying relationship lending in the interbank market: A network approach," Journal of Banking & Finance, Elsevier, vol. 97(C), pages 20-36.
    6. Konstantinos Antoniadis & Kostas Zafiropoulos & Vasiliki Vrana, 2016. "A Method for Assessing the Performance of e-Government Twitter Accounts," Future Internet, MDPI, vol. 8(2), pages 1-18, April.
    7. Maness, Michael & Cirillo, Cinzia, 2016. "An indirect latent informational conformity social influence choice model: Formulation and case study," Transportation Research Part B: Methodological, Elsevier, vol. 93(PA), pages 75-101.
    8. Lomi, Alessandro & Fonti, Fabio, 2012. "Networks in markets and the propensity of companies to collaborate: An empirical test of three mechanisms," Economics Letters, Elsevier, vol. 114(2), pages 216-220.
    9. Zhang, Xuxi & Liu, Xianping & Lewis, Frank L. & Wang, Xia, 2020. "Bipartite tracking consensus of nonlinear multi-agent systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    10. Bing Han & Liyan Yang, 2013. "Social Networks, Information Acquisition, and Asset Prices," Management Science, INFORMS, vol. 59(6), pages 1444-1457, June.
    11. Dimitrios Karamanis, 2022. "Defence partnerships, military expenditure, investment, and economic growth: an analysis in PESCO countries," GreeSE – Hellenic Observatory Papers on Greece and Southeast Europe 173, Hellenic Observatory, LSE.
    12. Levent V. Orman, 2016. "Information markets over trust networks," Electronic Commerce Research, Springer, vol. 16(4), pages 529-551, December.
    13. Zhu, Yu-Xiao & Cao, Yan-Yan & Chen, Ting & Qiu, Xiao-Yan & Wang, Wei & Hou, Rui, 2018. "Crossover phenomena in growth pattern of social contagions with restricted contact," Chaos, Solitons & Fractals, Elsevier, vol. 114(C), pages 408-414.
    14. Pablo Galaso & Adrián Rodríguez Miranda & Sebastian Goinheix, 2018. "Local development, social capital and social network analysis: evidence from Uruguay," Revista de Estudios Regionales, Universidades Públicas de Andalucía, vol. 3, pages 137-163.
    15. Takahiro Ezaki & Naoki Masuda, 2017. "Reinforcement learning account of network reciprocity," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-8, December.
    16. Mariann Ollar & Marzena Rostek, 2011. "Information Aggregation and Innovation in Market Design," Working Papers 11-12, NET Institute.
    17. Mr. Jorge A Chan-Lau, 2017. "Variance Decomposition Networks: Potential Pitfalls and a Simple Solution," IMF Working Papers 2017/107, International Monetary Fund.
    18. Lillo, Felipe & Valdés, Rodrigo, 2016. "Dynamics of financial markets and transaction costs: A graph-based study," Research in International Business and Finance, Elsevier, vol. 38(C), pages 455-465.
    19. Usha Sridhar & Sridhar Mandyam, 2016. "Loan Allocation and Guarantee Structure for Group Borrower Networks in Microfinance," Studies in Microeconomics, , vol. 4(2), pages 100-114, December.
    20. Arifovic, Jasmina & Eaton, B. Curtis & Walker, Graeme, 2015. "The coevolution of beliefs and networks," Journal of Economic Behavior & Organization, Elsevier, vol. 120(C), pages 46-63.
