IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v424y2015icp254-268.html
   My bibliography  Save this article

Sampling social networks using shortest paths

Author

Listed:
  • Rezvanian, Alireza
  • Meybodi, Mohammad Reza

Abstract

In recent years, online social networks (OSN) have emerged as a platform of sharing variety of information about people, and their interests, activities, events and news from real worlds. Due to the large scale and access limitations (e.g., privacy policies) of online social network services such as Facebook and Twitter, it is difficult to access the whole public network in a limited amount of time. For this reason researchers try to study and characterize OSN by taking appropriate and reliable samples from the network. In this paper, we propose to use the concept of shortest path for sampling social networks. The proposed sampling method first finds the shortest paths between several pairs of nodes selected according to some criteria. Then the edges in these shortest paths are ranked according to the number of times that each edge has appeared in the set of found shortest paths. The sampled network is then computed as a subgraph of the social network which contains a percentage of highly ranked edges. In order to investigate the performance of the proposed sampling method, we provide a number of experiments on synthetic and real networks. Experimental results show that the proposed sampling method outperforms the existing method such as random edge sampling, random node sampling, random walk sampling and Metropolis–Hastings random walk sampling in terms of relative error (RE), normalized root mean square error (NMSE), and Kolmogorov–Smirnov (KS) test.

Suggested Citation

  • Rezvanian, Alireza & Meybodi, Mohammad Reza, 2015. "Sampling social networks using shortest paths," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 424(C), pages 254-268.
  • Handle: RePEc:eee:phsmap:v:424:y:2015:i:c:p:254-268
    DOI: 10.1016/j.physa.2015.01.030
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437115000321
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2015.01.030?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Pablo M. Gleiser & Leon Danon, 2003. "Community Structure In Jazz," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 6(04), pages 565-573.
    2. Qi Gao & Xintong Ding & Feng Pan & Weixing Li, 2014. "An improved sampling method of complex network," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 25(05), pages 1-11.
    3. M. Goldstein & S. Morris & G. Yen, 2004. "Problems with fitting to the power-law distribution," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 41(2), pages 255-258, September.
    4. Rezvanian, Alireza & Rahmati, Mohammad & Meybodi, Mohammad Reza, 2014. "Sampling from complex networks using distributed learning automata," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 396(C), pages 224-234.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fenner, Trevor & Levene, Mark & Loizou, George, 2010. "Predicting the long tail of book sales: Unearthing the power-law exponent," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(12), pages 2416-2421.
    2. Zhang, Yun & Liu, Yongguo & Li, Jieting & Zhu, Jiajing & Yang, Changhong & Yang, Wen & Wen, Chuanbiao, 2020. "WOCDA: A whale optimization based community detection algorithm," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 539(C).
    3. Rafael González-Val, 2012. "A Nonparametric Estimation of the Local Zipf Exponent for all US Cities," Environment and Planning B, , vol. 39(6), pages 1119-1130, December.
    4. Rafael González-Val, 2021. "The Probability Distribution of Worldwide Forest Areas," Sustainability, MDPI, vol. 13(3), pages 1-19, January.
    5. Rafael González‐Val, 2019. "Historical urban growth in Europe (1300–1800)," Papers in Regional Science, Wiley Blackwell, vol. 98(2), pages 1115-1136, April.
    6. Marcus Berliant & Hiroki Watanabe, 2015. "Explaining the size distribution of cities: Extreme economies," Quantitative Economics, Econometric Society, vol. 6(1), pages 153-187, March.
    7. Klabunde, Anna, 2014. "Computational Economic Modeling of Migration," Ruhr Economic Papers 471, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    8. Fenner, Trevor & Levene, Mark & Loizou, George, 2005. "A stochastic evolutionary model exhibiting power-law behaviour with an exponential cutoff," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 355(2), pages 641-656.
    9. Liu, X. & Murata, T., 2010. "Advanced modularity-specialized label propagation algorithm for detecting communities in networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(7), pages 1493-1500.
    10. Moradabadi, Behnaz & Meybodi, Mohammad Reza, 2016. "Link prediction based on temporal similarity metrics using continuous action set learning automata," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 460(C), pages 361-373.
    11. Etienne Côme & Nicolas Jouvin & Pierre Latouche & Charles Bouveyron, 2021. "Hierarchical clustering with discrete latent variable models and the integrated classification likelihood," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(4), pages 957-986, December.
    12. Namtirtha, Amrita & Dutta, Animesh & Dutta, Biswanath, 2018. "Identifying influential spreaders in complex networks based on kshell hybrid method," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 499(C), pages 310-324.
    13. Peltonen, Tuomas A. & Scheicher, Martin & Vuillemey, Guillaume, 2014. "The network structure of the CDS market and its determinants," Journal of Financial Stability, Elsevier, vol. 13(C), pages 118-133.
    14. Mike, Szabolcs & Farmer, J. Doyne, 2008. "An empirical behavioral model of liquidity and volatility," Journal of Economic Dynamics and Control, Elsevier, vol. 32(1), pages 200-234, January.
    15. Politi, Mauro & Scalas, Enrico, 2008. "Fitting the empirical distribution of intertrade durations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(8), pages 2025-2034.
    16. Fan Jiang & Niancai Liu, 2018. "The hierarchical status of international academic awards in social sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 2091-2115, December.
    17. Wang, Zhixiao & Zhao, Ya & Xi, Jingke & Du, Changjiang, 2016. "Fast ranking influential nodes in complex networks using a k-shell iteration factor," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 461(C), pages 171-181.
    18. Zareie, Ahmad & Sheikhahmadi, Amir, 2019. "EHC: Extended H-index Centrality measure for identification of users’ spreading influence in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 514(C), pages 141-155.
    19. Hu, Fang & Liu, Jia & Li, Liuhuan & Liang, Jun, 2020. "Community detection in complex networks using Node2vec with spectral clustering," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    20. Xu, Shuang & Wang, Pei, 2017. "Identifying important nodes by adaptive LeaderRank," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 469(C), pages 654-664.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:424:y:2015:i:c:p:254-268. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.