IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v128y2023i3d10.1007_s11192-023-04642-4.html
   My bibliography  Save this article

An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomains

Author

Listed:
  • Seyyed Reza Taher Harikandeh

    (Shahid Beheshti University)

  • Sadegh Aliakbary

    (Shahid Beheshti University)

  • Soroush Taheri

    (Shahid Beheshti University)

Abstract

The study of topic evolution aims to analyze the behavior of different research fields by utilizing various features such as the relationships between articles. In recent years, many published papers consider more than one field of study which has led to a significant increase in the number of inter-field and interdisciplinary articles. Therefore, we can analyze the similarity/dissimilarity and convergence/divergence of research fields based on topic analysis of the published papers. Our research intends to create a methodology for studying the evolution of the research fields. In this paper, we propose an embedding approach for modeling each research topics as a multidimensional vector. Using this model, we measure the topic’s distances over the years and investigate how topics evolve over time. The proposed similarity metric showed many advantages over other alternatives (such as Jaccard similarity) and it resulted in better stability and accuracy. As a case study, we applied the proposed method to subsets of computer science for experimental purposes, and the results were quite comprehensible and coherent.

Suggested Citation

  • Seyyed Reza Taher Harikandeh & Sadegh Aliakbary & Soroush Taheri, 2023. "An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomains," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1567-1582, March.
  • Handle: RePEc:spr:scient:v:128:y:2023:i:3:d:10.1007_s11192-023-04642-4
    DOI: 10.1007/s11192-023-04642-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04642-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04642-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peng Liu & Haoxiang Xia, 2015. "Structure and evolution of co-authorship network in an interdisciplinary research field," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(1), pages 101-134, April.
    2. Jian Xu & Yi Bu & Ying Ding & Sinan Yang & Hongli Zhang & Chen Yu & Lin Sun, 2018. "Understanding the formation of interdisciplinary research from the perspective of keyword evolution: a case study on joint attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 973-995, November.
    3. Luciano Kay & Nils Newman & Jan Youtie & Alan L. Porter & Ismael Rafols, 2014. "Patent overlay mapping: Visualizing technological distance," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(12), pages 2432-2443, December.
    4. Ashkan Ebadi & Pengcheng Xi & Stéphane Tremblay & Bruce Spencer & Raman Pall & Alexander Wong, 2021. "Understanding the temporal evolution of COVID-19 research through machine learning and natural language processing," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 725-739, January.
    5. Chen, Baitong & Tsutsui, Satoshi & Ding, Ying & Ma, Feicheng, 2017. "Understanding the topic evolution in a scientific domain: An exploratory study for the field of information retrieval," Journal of Informetrics, Elsevier, vol. 11(4), pages 1175-1189.
    6. Jung, Sukhwan & Yoon, Wan Chul, 2020. "An alternative topic model based on Common Interest Authors for topic evolution analysis," Journal of Informetrics, Elsevier, vol. 14(3).
    7. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    8. Masood, Muhammad Ali & Abbasi, Rabeeh Ayaz, 2021. "Using graph embedding and machine learning to identify rebels on twitter," Journal of Informetrics, Elsevier, vol. 15(1).
    9. Yi Zhang & Xiaojing Cai & Caroline V. Fry & Mengjia Wu & Caroline S. Wagner, 2021. "Topic evolution, disruption and resilience in early COVID-19 research," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4225-4253, May.
    10. Soroush Taheri & Sadegh Aliakbary, 2022. "Research trend prediction in computer science publications: a deep neural network approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 849-869, February.
    11. Qian, Yue & Liu, Yu & Sheng, Quan Z., 2020. "Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligence," Journal of Informetrics, Elsevier, vol. 14(3).
    12. Min Song & Go Eun Heo & Su Yeon Kim, 2014. "Analyzing topic evolution in bioinformatics: investigation of dynamics of the field with conference data in DBLP," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 397-428, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qian, Yue & Liu, Yu & Sheng, Quan Z., 2020. "Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligence," Journal of Informetrics, Elsevier, vol. 14(3).
    2. Jian Xu & Ying Ding & Yi Bu & Shuqing Deng & Chen Yu & Yimin Zou & Andrew Madden, 2019. "Interdisciplinary scholarly communication: an exploratory study for the field of joint attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1597-1619, June.
    3. Qiang Gao & Xiao Huang & Ke Dong & Zhentao Liang & Jiang Wu, 2022. "Semantic-enhanced topic evolution analysis: a combination of the dynamic topic model and word2vec," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1543-1563, March.
    4. Wang, Xiaoguang & He, Jing & Huang, Han & Wang, Hongyu, 2022. "MatrixSim: A new method for detecting the evolution paths of research topics," Journal of Informetrics, Elsevier, vol. 16(4).
    5. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    6. Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
    7. Zhentao Liang & Jin Mao & Kun Lu & Gang Li, 2021. "Finding citations for PubMed: a large-scale comparison between five freely available bibliographic data sources," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9519-9542, December.
    8. Mauricio Marrone, 2020. "Application of entity linking to identify research fronts and trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 357-379, January.
    9. Zhao, Yi & Liu, Lifan & Zhang, Chengzhi, 2022. "Is coronavirus-related research becoming more interdisciplinary? A perspective of co-occurrence analysis and diversity measure of scientific articles," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    10. Takano, Yasutomo & Kajikawa, Yuya, 2019. "Extracting commercialization opportunities of the Internet of Things: Measuring text similarity between papers and patents," Technological Forecasting and Social Change, Elsevier, vol. 138(C), pages 45-68.
    11. Lukun Zheng & Yuhang Jiang, 2022. "Combining dissimilarity measures for quantifying changes in research fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 3751-3765, July.
    12. Jeff Alstott & Giorgio Triulzi & Bowen Yan & Jianxi Luo, 2017. "Mapping technology space by normalizing patent networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(1), pages 443-479, January.
    13. Chaker Jebari & Enrique Herrera-Viedma & Manuel Jesus Cobo, 2021. "The use of citation context to detect the evolution of research topics: a large-scale analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 2971-2989, April.
    14. Mingers, John & Leydesdorff, Loet, 2015. "A review of theory and practice in scientometrics," European Journal of Operational Research, Elsevier, vol. 246(1), pages 1-19.
    15. Hofmann, Peter & Keller, Robert & Urbach, Nils, 2019. "Inter-technology relationship networks: Arranging technologies through text mining," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 202-213.
    16. Kai Hu & Kunlun Qi & Siluo Yang & Shengyu Shen & Xiaoqiang Cheng & Huayi Wu & Jie Zheng & Stephen McClure & Tianxing Yu, 2018. "Identifying the “Ghost City” of domain topics in a keyword semantic space combining citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 1141-1157, March.
    17. Huichen Gao & Shijuan Wang, 2022. "The Intellectual Structure of Research on Rural-to-Urban Migrants: A Bibliometric Analysis," IJERPH, MDPI, vol. 19(15), pages 1-19, August.
    18. Jung, Sukhwan & Segev, Aviv, 2022. "DAC: Descendant-aware clustering algorithm for network-based topic emergence prediction," Journal of Informetrics, Elsevier, vol. 16(3).
    19. Hengmin Zhu & Li Qian & Wang Qin & Jing Wei & Chao Shen, 2022. "Evolution analysis of online topics based on ‘word-topic’ coupling network," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 3767-3792, July.
    20. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:128:y:2023:i:3:d:10.1007_s11192-023-04642-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.