IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v501y2018icp248-263.html
   My bibliography  Save this article

Statistical similarity measures for link prediction in heterogeneous complex networks

Author

Listed:
  • Shakibian, Hadi
  • Charkari, Nasrollah Moghadam

Abstract

The majority of the link prediction measures in heterogeneous complex networks rely on the nodes connectivities while less attention has been paid to the importance of the nodes and paths. In this paper, we propose some new meta-path based statistical similarity measures to properly perform link prediction task. The main idea in the proposed measures is to drive some co-occurrence events in a number of co-occurrence matrices that are occurred between the visited nodes obeying a meta-path. The extracted co-occurrence matrices are analyzed in terms of the energy, inertia, local homogeneity, correlation, and information measure of correlation to determine various information theoretic measures. We evaluate the proposed measures, denoted as link energy, link inertia, link local homogeneity, link correlation, and link information measure of correlation, using a standard DBLP network data set. The results of the AUC score and Precision rate indicate the validity and accuracy of the proposed measures in comparison to the popular meta-path based similarity measures.

Suggested Citation

  • Shakibian, Hadi & Charkari, Nasrollah Moghadam, 2018. "Statistical similarity measures for link prediction in heterogeneous complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 501(C), pages 248-263.
  • Handle: RePEc:eee:phsmap:v:501:y:2018:i:c:p:248-263
    DOI: 10.1016/j.physa.2018.02.189
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437118302759
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2018.02.189?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xu, Zhongqi & Pu, Cunlai & Yang, Jian, 2016. "Link prediction based on path entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 456(C), pages 294-301.
    2. Camilo Akimushkin & Diego Raphael Amancio & Osvaldo Novais Oliveira Jr., 2017. "Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-15, January.
    3. Shaobin Huang & Tianyang Lv & Xizhe Zhang & Yange Yang & Weimin Zheng & Chao Wen, 2014. "Identifying Node Role in Social Network Based on Multiple Indicators," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-16, August.
    4. Li, Huajiao & An, Haizhong & Wang, Yue & Huang, Jiachen & Gao, Xiangyun, 2016. "Evolutionary features of academic articles co-keyword network and keywords co-occurrence network: Based on two-mode affiliation network," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 450(C), pages 657-669.
    5. Yin, Likang & Zheng, Haoyang & Bian, Tian & Deng, Yong, 2017. "An evidential link prediction method and link predictability based on Shannon entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 482(C), pages 699-712.
    6. Fei Tan & Yongxiang Xia & Boyao Zhu, 2014. "Link Prediction in Complex Networks: A Mutual Information Perspective," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-8, September.
    7. Zhong, Xiang & Liu, Jiajun & Gao, Yong & Wu, Lun, 2017. "Analysis of co-occurrence toponyms in web pages based on complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 466(C), pages 462-475.
    8. Loet Leydesdorff & Liwen Vaughan, 2006. "Co‐occurrence matrices and their applications in information science: Extending ACA to the Web environment," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 57(12), pages 1616-1628, October.
    9. Li, Jichao & Ge, Bingfeng & Yang, Kewei & Chen, Yingwu & Tan, Yuejin, 2017. "Meta-path based heterogeneous combat network link prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 482(C), pages 507-523.
    10. Bryan S. Graham, 2017. "An Econometric Model of Network Formation With Degree Heterogeneity," Econometrica, Econometric Society, vol. 85, pages 1033-1063, July.
    11. Qiuju Zhou & Loet Leydesdorff, 2016. "The normalization of occurrence and Co-occurrence matrices in bibliometrics using Cosine similarities and Ochiai coefficients," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(11), pages 2805-2814, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gu, Shuang & Li, Keping & Feng, Tao & Yan, Dongyang & Liu, Yanyan, 2022. "The prediction of potential risk path in railway traffic events," Reliability Engineering and System Safety, Elsevier, vol. 222(C).
    2. Tofighy, Sajjad & Charkari, Nasrollah Moghadam & Ghaderi, Foad, 2022. "Link prediction in multiplex networks using intralayer probabilistic distance and interlayer co-evolving factors," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 606(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kumar, Ajay & Singh, Shashank Sheshar & Singh, Kuldeep & Biswas, Bhaskar, 2020. "Link prediction techniques, applications, and performance: A survey," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 553(C).
    2. Wu, Jiehua & Shen, Jing & Zhou, Bei & Zhang, Xiayan & Huang, Bohuai, 2019. "General link prediction with influential node identification," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 996-1007.
    3. Samuel Zanferdini Oliva & Livia Oliveira-Ciabati & Denise Gazotto Dezembro & Mário Sérgio Adolfi Júnior & Maísa Carvalho Silva & Hugo Cesar Pessotti & Juliana Tarossi Pollettini, 2021. "Text structuring methods based on complex network: a systematic review," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 1471-1493, February.
    4. Yao, Yabing & Zhang, Ruisheng & Yang, Fan & Tang, Jianxin & Yuan, Yongna & Hu, Rongjing, 2018. "Link prediction in complex networks based on the interactions among paths," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 510(C), pages 52-67.
    5. García-Lillo, Francisco & Seva-Larrosa, Pedro & Sánchez-García, Eduardo, 2023. "What is going on in entrepreneurship research? A bibliometric and SNA analysis," Journal of Business Research, Elsevier, vol. 158(C).
    6. Loet Leydesdorff & Dieter Franz Kogler & Bowen Yan, 2017. "Mapping patent classifications: portfolio and statistical analysis, and the comparison of strengths and weaknesses," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1573-1591, September.
    7. Gallego-Losada, María-Jesús & Montero-Navarro, Antonio & García-Abajo, Elisa & Gallego-Losada, Rocío, 2023. "Digital financial inclusion. Visualizing the academic literature," Research in International Business and Finance, Elsevier, vol. 64(C).
    8. Hyejin Park & Han Woo Park, 2018. "Two-side face of knowledge building using scientometric analysis," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(6), pages 2815-2836, November.
    9. Lee, Jiyon, 2018. "A spatial latent class model," Economics Letters, Elsevier, vol. 162(C), pages 62-68.
    10. Jiang, Syuan-Yi, 2022. "Transition and innovation ecosystem – investigating technologies, focal actors, and institution in eHealth innovations," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    11. van Eck, N.J.P. & Waltman, L., 2009. "How to Normalize Co-Occurrence Data? An Analysis of Some Well-Known Similarity Measures," ERIM Report Series Research in Management ERS-2009-001-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    12. Chih‐Sheng Hsieh & Lung‐Fei Lee & Vincent Boucher, 2020. "Specification and estimation of network formation and network interaction models with the exponential probability distribution," Quantitative Economics, Econometric Society, vol. 11(4), pages 1349-1390, November.
    13. Geert Ridder & Shuyang Sheng, 2020. "Two-Step Estimation of a Strategic Network Formation Model with Clustering," Papers 2001.03838, arXiv.org, revised Nov 2022.
    14. Jesper W. Schneider & Birger Larsen & Peter Ingwersen, 2009. "A comparative study of first and all-author co-citation counting, and two different matrix generation approaches applied for author co-citation analyses," Scientometrics, Springer;Akadémiai Kiadó, vol. 80(1), pages 103-130, July.
    15. Jimi Adams & Ryan Light, 2014. "Mapping Interdisciplinary Fields: Efficiencies, Gaps and Redundancies in HIV/AIDS Research," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-13, December.
    16. Georg Groh & Christoph Fuchs, 2011. "Multi-modal social networks for modeling scientific fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 569-590, November.
    17. Wei-Feng Tung & Ting-Yu Lee, 2013. "Rank-mediated collaborative tagging recommendation service using video-tag relationship prediction," Information Systems Frontiers, Springer, vol. 15(4), pages 627-635, September.
    18. Chen, Mingli & Fernández-Val, Iván & Weidner, Martin, 2021. "Nonlinear factor models for network and panel data," Journal of Econometrics, Elsevier, vol. 220(2), pages 296-324.
    19. Hao Wang & Sanhong Deng & Xinning Su, 2016. "A study on construction and analysis of discipline knowledge structure of Chinese LIS based on CSSCI," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(3), pages 1725-1759, December.
    20. Wu, Hanjun & Hong Tsui, Kan Wai & Ngo, Thanh & Lin, Yi-Hsin, 2020. "Impacts of aviation subsidies on regional wellbeing: Systematic review, meta-analysis and future research directions," Transport Policy, Elsevier, vol. 99(C), pages 215-239.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:501:y:2018:i:c:p:248-263. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.