IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v15y2021i1s175115772030643x.html
   My bibliography  Save this article

Learning multi-resolution representations of research patterns in bibliographic networks

Author

Listed:
  • Lee, O-Joun
  • Jeon, Hyeon-Ju
  • Jung, Jason J.

Abstract

This study aims at representing research patterns of bibliographic entities (e.g., scholars, papers, and venues) with a fixed-length vector. Bibliographic network structures rooted in the entities are incredibly diverse, and this diversity increases in the outstanding entities. Thus, despite their significant volume, the outstanding entities obtain minimal learning opportunities, whereas low-performance entities are over-represented. This study solves the problem by representing the patterns of the entities rather than depicting individual entities in a precise manner. First, we describe structures rooted in the entities using the Weisfeiler–Lehman (WL) relabeling process. Each subgraph generated by the relabeling process provides information on the scholars, kinds of papers they published, standards of venues in which the papers were published, and types of their collaborators. We assume that a subgraph depicts the research patterns of bibliographic entities, such as the preference of a scholar in choosing either a few highly impactful papers or numerous papers of moderate impact. Then, we simplify the subgraphs according to multiple levels of detailedness. Original subgraphs represent the individuality of the entities, and simplified subgraphs represent the entities sharing the same research patterns. In addition, simplified subgraphs balance the learning opportunities of high- and low-performance entities by co-occurring with both types of entities. We embed the subgraphs using the Skip-Gram method. If the results of the embedding represent the research patterns of the entities, the obtained vectors should be able to represent various aspects of the research performance in both the short-term and long-term durations regardless of the performances of the entities. Therefore, we conducted experiments for predicting 23 performance indicators during four time periods for four performance groups (top 1%, 5%, 10%, and all entities) using only the vector representations. The proposed model outperformed the existing network embedding methods in terms of both accuracy and variance.

Suggested Citation

  • Lee, O-Joun & Jeon, Hyeon-Ju & Jung, Jason J., 2021. "Learning multi-resolution representations of research patterns in bibliographic networks," Journal of Informetrics, Elsevier, vol. 15(1).
  • Handle: RePEc:eee:infome:v:15:y:2021:i:1:s175115772030643x
    DOI: 10.1016/j.joi.2020.101126
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S175115772030643X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2020.101126?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gert Sabidussi, 1966. "The centrality index of a graph," Psychometrika, Springer;The Psychometric Society, vol. 31(4), pages 581-603, December.
    2. Bordons, María & Aparicio, Javier & González-Albo, Borja & Díaz-Faes, Adrián A., 2015. "The relationship between the research performance of scientists and their position in co-authorship networks in three fields," Journal of Informetrics, Elsevier, vol. 9(1), pages 135-144.
    3. Yan, Xiangbin & Zhai, Li & Fan, Weiguo, 2013. "C-index: A weighted network node centrality measure for collaboration competence," Journal of Informetrics, Elsevier, vol. 7(1), pages 223-239.
    4. Mariani, Manuel Sebastian & Medo, Matúš & Zhang, Yi-Cheng, 2016. "Identification of milestone papers through time-balanced network centrality," Journal of Informetrics, Elsevier, vol. 10(4), pages 1207-1223.
    5. Serge Galam, 2011. "Tailor based allocations for multiple authorship: a fractional gh-index," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(1), pages 365-379, October.
    6. Yi Zhang & Fen Zhao & Jianguo Lu, 2019. "P2V: large-scale academic paper embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 399-432, October.
    7. Sabine Loudcher & Wararat Jakawat & Edmundo Pavel Soriano Morales & Cécile Favre, 2015. "Combining OLAP and information networks for bibliographic data analysis: a survey," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 471-487, May.
    8. Alireza Abbasi & Jorn Altmann & Junseok Hwang, 2009. "Evaluating Scholars Based on their Academic Collaboration Activities: The RC-Index and CC-Index for Quantifying Collaboration Activities of Researchers and Scientific Communities," TEMEP Discussion Papers 200915, Seoul National University; Technology Management, Economics, and Policy Program (TEMEP), revised Sep 2009.
    9. Perianes-Rodríguez, Antonio & Chinchilla-Rodríguez, Zaida & Vargas-Quesada, Benjamín & Olmeda Gómez, Carlos & Moya-Anegón, Félix, 2009. "Synthetic hybrid indicators based on scientific collaboration to quantify and evaluate individual research results," Journal of Informetrics, Elsevier, vol. 3(2), pages 91-101.
    10. Anil, Akash & Singh, Sanasam Ranbir, 2020. "Effect of class imbalance in heterogeneous network embedding: An empirical study," Journal of Informetrics, Elsevier, vol. 14(2).
    11. Chao Gao & Zhen Wang & Xianghua Li & Zili Zhang & Wei Zeng, 2016. "PR-Index: Using the h-Index and PageRank for Determining True Impact," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-13, September.
    12. Wu, Zhihao & Lin, Youfang & Wang, Jing & Gregory, Steve, 2016. "Link prediction with node clustering coefficient," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 452(C), pages 1-8.
    13. Leonardo Reyes-Gonzalez & Claudia N. Gonzalez-Brambila & Francisco Veloso, 2016. "Using co-authorship and citation analysis to identify research groups: a new way to assess performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(3), pages 1171-1191, September.
    14. Abramo, Giovanni & Cicero, Tindaro & D’Angelo, Ciriaco Andrea, 2013. "Individual research performance: A proposal for comparing apples to oranges," Journal of Informetrics, Elsevier, vol. 7(2), pages 528-539.
    15. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alireza Abbasi & Mahdi Jalili & Abolghasem Sadeghi-Niaraki, 2018. "Influence of network-based structural and power diversity on research performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 579-590, October.
    2. Dunaiski, Marcel & Geldenhuys, Jaco & Visser, Willem, 2019. "Globalised vs averaged: Bias and ranking performance on the author level," Journal of Informetrics, Elsevier, vol. 13(1), pages 299-313.
    3. Dunaiski, Marcel & Geldenhuys, Jaco & Visser, Willem, 2019. "On the interplay between normalisation, bias, and performance of paper impact metrics," Journal of Informetrics, Elsevier, vol. 13(1), pages 270-290.
    4. Mutz, Rüdiger & Daniel, Hans-Dieter, 2018. "The bibliometric quotient (BQ), or how to measure a researcher’s performance capacity: A Bayesian Poisson Rasch model," Journal of Informetrics, Elsevier, vol. 12(4), pages 1282-1295.
    5. Zhai, Li & Yan, Xiangbin, 2022. "A directed collaboration network for exploring the order of scientific collaboration," Journal of Informetrics, Elsevier, vol. 16(4).
    6. Wang, Jingjing & Xu, Shuqi & Mariani, Manuel S. & Lü, Linyuan, 2021. "The local structure of citation networks uncovers expert-selected milestone papers," Journal of Informetrics, Elsevier, vol. 15(4).
    7. Yu Zhang & Min Wang & Morteza Saberi & Elizabeth Chang, 2022. "Analysing academic paper ranking algorithms using test data and benchmarks: an investigation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 4045-4074, July.
    8. Anna Tietze & Philip Hofmann, 2019. "The h-index and multi-author hm-index for individual researchers in condensed matter physics," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 171-185, April.
    9. Alireza Abbasi & Liaquat Hossain & Shahadat Uddin & Kim J. R. Rasmussen, 2011. "Evolutionary dynamics of scientific collaboration networks: multi-levels and cross-time analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(2), pages 687-710, November.
    10. Nasirian, Farzaneh & Mahdavi Pajouh, Foad & Balasundaram, Balabhaskar, 2020. "Detecting a most closeness-central clique in complex networks," European Journal of Operational Research, Elsevier, vol. 283(2), pages 461-475.
    11. Abbasi, Alireza & Altmann, Jörn & Hossain, Liaquat, 2011. "Identifying the effects of co-authorship networks on the performance of scholars: A correlation and regression analysis of performance measures and social network analysis measures," Journal of Informetrics, Elsevier, vol. 5(4), pages 594-607.
    12. Guijie Zhang & Luning Liu & Yuqiang Feng & Zhen Shao & Yongli Li, 2014. "Cext-N index: a network node centrality measure for collaborative relationship distribution," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 291-307, October.
    13. Enrico di Bella & Luca Gandullia & Sara Preti, 2021. "Analysis of scientific collaboration network of Italian Institute of Technology," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8517-8539, October.
    14. Xu, Shuqi & Mariani, Manuel Sebastian & Lü, Linyuan & Medo, Matúš, 2020. "Unbiased evaluation of ranking metrics reveals consistent performance in science and technology citation data," Journal of Informetrics, Elsevier, vol. 14(1).
    15. Itsuki Kageyama & Karin Kurata & Shuto Miyashita & Yeongjoo Lim & Shintaro Sengoku & Kota Kodama, 2022. "A Bibliometric Analysis of Wearable Device Research Trends 2001–2022—A Study on the Reversal of Number of Publications and Research Trends in China and the USA," IJERPH, MDPI, vol. 19(24), pages 1-19, December.
    16. Mariani, Manuel Sebastian & Medo, Matúš & Lafond, François, 2019. "Early identification of important patents: Design and validation of citation network metrics," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 644-654.
    17. Hector G. Ceballos & Sara E. Garza & Francisco J. Cantu, 2018. "Factors influencing the formation of intra-institutional formal research groups: group prediction from collaboration, organisational, and topical networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(1), pages 181-216, January.
    18. Jiang, Xiaorui & Zhuge, Hai, 2019. "Forward search path count as an alternative indirect citation impact indicator," Journal of Informetrics, Elsevier, vol. 13(4).
    19. Yu, Dejian & Pan, Tianxing, 2021. "Tracing the main path of interdisciplinary research considering citation preference: A case from blockchain domain," Journal of Informetrics, Elsevier, vol. 15(2).
    20. Jing Tu, 2019. "What connections lead to good scientific performance?," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(2), pages 587-604, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:15:y:2021:i:1:s175115772030643x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.