IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v127y2022i9d10.1007_s11192-022-04273-1.html
   My bibliography  Save this article

Identification of topic evolution: network analytics with piecewise linear representation and word embedding

Author

Listed:
  • Lu Huang

    (Beijing Institute of Technology)

  • Xiang Chen

    (Beijing Institute of Technology)

  • Yi Zhang

    (University of Technology Sydney)

  • Changtian Wang

    (Beijing Institute of Technology)

  • Xiaoli Cao

    (Beijing Institute of Technology)

  • Jiarun Liu

    (Beijing Institute of Technology)

Abstract

Understanding the evolutionary relationships among scientific topics and learning the evolutionary process of innovations is a crucial issue for strategic decision makers in governments, firms and funding agencies when they carry out forward-looking research activities. However, traditional co-word network analysis on topic identification cannot effectively excavate semantic relationship from the context, and fixed time window method cannot scientifically reflect the evolution process of topics. This study proposes a framework of identifying topic evolutionary pathways based on network analytics: Firstly, keyword networks are constructed, in which a piecewise linear representation method is used for dividing time periods and a Word2Vec mode is used for capturing semantics from the context of titles and abstracts; Secondly, a community detection algorithm is used to identify topics in networks; Finally, evolutionary relationships between topics are represented by measuring the topic similarity between adjacent time periods, and then topic evolutionary pathways are identified and visualized. An empirical study on information science demonstrates the reliability of the methodology, with subsequent empirical validations.

Suggested Citation

  • Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
  • Handle: RePEc:spr:scient:v:127:y:2022:i:9:d:10.1007_s11192-022-04273-1
    DOI: 10.1007/s11192-022-04273-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-022-04273-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-022-04273-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ying Yang & Mingzhi Wu & Lei Cui, 2012. "Integration of three visualization methods based on co-word analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 659-673, February.
    2. Gianfranco Di Vaio & Jacob Louis Weisdorf, 2010. "Ranking economic history journals: a citation-based impact-adjusted analysis," Cliometrica, Journal of Historical Economics and Econometric History, Association Française de Cliométrie (AFC), vol. 4(1), pages 1-17, January.
    3. Hanlin You & Mengjun Li & Keith W. Hipel & Jiang Jiang & Bingfeng Ge & Hante Duan, 2017. "Development trend forecasting for coherent light generator technology based on patent citation network analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 297-315, April.
    4. Zhang, Yi & Lu, Jie & Liu, Feng & Liu, Qian & Porter, Alan & Chen, Hongshu & Zhang, Guangquan, 2018. "Does deep learning help topic extraction? A kernel k-means clustering method with word embedding," Journal of Informetrics, Elsevier, vol. 12(4), pages 1099-1117.
    5. Chungil Chae & Jeong-Ha Yim & Jaeeun Lee & Sung Jun Jo & Jeong Rok Oh, 2020. "The Bibliometric Keywords Network Analysis of Human Resource Management Research Trends: The Case of Human Resource Management Journals in South Korea," Sustainability, MDPI, vol. 12(14), pages 1-37, July.
    6. Richard Klavans & Kevin W. Boyack, 2017. "Which Type of Citation Analysis Generates the Most Accurate Taxonomy of Scientific and Technical Knowledge?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(4), pages 984-998, April.
    7. Bo Wang & Shengbo Liu & Kun Ding & Zeyuan Liu & Jing Xu, 2014. "Identifying technological topics and institution-topic distribution probability for patent competitive intelligence analysis: a case study in LTE technology," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 685-704, October.
    8. Park, Inchae & Yoon, Byungun, 2018. "Technological opportunity discovery for technological convergence based on the prediction of technology knowledge flow in a citation network," Journal of Informetrics, Elsevier, vol. 12(4), pages 1199-1222.
    9. Hong Wu & Huifang Yi & Chang Li, 2021. "An integrated approach for detecting and quantifying the topic evolutions of patent technology: a case study on graphene field," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6301-6321, August.
    10. Yan Wang & Zhiyuan Liu & Maosong Sun, 2015. "Incorporating Linguistic Knowledge for Learning Distributed Word Representations," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-20, April.
    11. Ding, Ying, 2011. "Community detection: Topological vs. topical," Journal of Informetrics, Elsevier, vol. 5(4), pages 498-514.
    12. Pardeep Sud & Mike Thelwall, 2014. "Evaluating altmetrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1131-1143, February.
    13. Wanying Ding & Chaomei Chen, 2014. "Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(10), pages 2084-2097, October.
    14. Zao Liu, 2005. "Visualizing the intellectual structure in urban studies: A journal co-citation analysis (1992-2002)," Scientometrics, Springer;Akadémiai Kiadó, vol. 62(3), pages 385-402, March.
    15. Marie Katsurai & Shunsuke Ono, 2019. "TrendNets: mapping emerging research trends from dynamic co-word networks via sparse representation," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1583-1598, December.
    16. Chen, Baitong & Tsutsui, Satoshi & Ding, Ying & Ma, Feicheng, 2017. "Understanding the topic evolution in a scientific domain: An exploratory study for the field of information retrieval," Journal of Informetrics, Elsevier, vol. 11(4), pages 1175-1189.
    17. Jeong, Do-Heon & Song, Min, 2014. "Time gap analysis by the topic model-based temporal technique," Journal of Informetrics, Elsevier, vol. 8(3), pages 776-790.
    18. Florian Rabitz & Alin Olteanu & Jurgita Jurkevičienė & Agnė Budžytė, 2021. "A topic network analysis of the system turn in the environmental sciences," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 2107-2140, March.
    19. Lin Qi & Yuwei Wang & Jindong Chen & Mengjie Liao & Jian Zhang & Rosa M. Benito, 2021. "Culture under Complex Perspective: A Classification for Traditional Chinese Cultural Elements Based on NLP and Complex Networks," Complexity, Hindawi, vol. 2021, pages 1-15, April.
    20. Huang, Lu & Chen, Xiang & Ni, Xingxing & Liu, Jiarun & Cao, Xiaoli & Wang, Changtian, 2021. "Tracking the dynamics of co-word networks for emerging topic identification," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
    21. Jianhua Hou & Xiucai Yang & Chaomei Chen, 2018. "Emerging trends and new developments in information science: a document co-citation analysis (2009–2016)," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 869-892, May.
    22. Chyi-Kwei Yau & Alan Porter & Nils Newman & Arho Suominen, 2014. "Clustering scientific documents with topic modeling," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(3), pages 767-786, September.
    23. Fang Zhang & Shengli Wu, 2021. "Measuring academic entities’ impact by content-based citation analysis in a heterogeneous academic network," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 7197-7222, August.
    24. Teh, Yee Whye & Jordan, Michael I. & Beal, Matthew J. & Blei, David M., 2006. "Hierarchical Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1566-1581, December.
    25. Xu, Haiyun & Winnink, Jos & Yue, Zenghui & Liu, Ziqiang & Yuan, Guoting, 2020. "Topic-linked innovation paths in science and technology," Journal of Informetrics, Elsevier, vol. 14(2).
    26. Xiaoling Sun & Kun Ding, 2018. "Identifying and tracking scientific and technological knowledge memes from citation networks of publications and patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1735-1748, September.
    27. Qikai Cheng & Jiamin Wang & Wei Lu & Yong Huang & Yi Bu, 2020. "Keyword-citation-keyword network: a new perspective of discipline knowledge structure analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 1923-1943, September.
    28. Gergely Palla & Albert-László Barabási & Tamás Vicsek, 2007. "Quantifying social group evolution," Nature, Nature, vol. 446(7136), pages 664-667, April.
    29. Zhang, Yi & Wu, Mengjia & Miao, Wen & Huang, Lu & Lu, Jie, 2021. "Bi-layer network analytics: A methodology for characterizing emerging general-purpose technologies," Journal of Informetrics, Elsevier, vol. 15(4).
    30. Katherine W. McCain, 2008. "Assessing an author's influence using time series historiographic mapping: The oeuvre of conrad hal waddington (1905–1975)," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 59(4), pages 510-525, February.
    31. Jie Chen & Jialin Chen & Shu Zhao & Yanping Zhang & Jie Tang, 2020. "Exploiting word embedding for heterogeneous topic model towards patent recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2091-2108, December.
    32. Antonio Moreno & Christian Terwiesch, 2014. "Doing Business with Strangers: Reputation in Online Service Marketplaces," Information Systems Research, INFORMS, vol. 25(4), pages 865-886, December.
    33. Kai Hu & Huayi Wu & Kunlun Qi & Jingmin Yu & Siluo Yang & Tianxing Yu & Jie Zheng & Bo Liu, 2018. "A domain keyword analysis approach extending Term Frequency-Keyword Active Index with Google Word2Vec model," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 1031-1068, March.
    34. Zhang, Yi & Porter, Alan L. & Hu, Zhengyin & Guo, Ying & Newman, Nils C., 2014. "“Term clumping” for technical intelligence: A case study on dye-sensitized solar cells," Technological Forecasting and Social Change, Elsevier, vol. 85(C), pages 26-39.
    35. Xiaoguang Wang & Qikai Cheng & Wei Lu, 2014. "Analyzing evolution of research topics with NEViewer: a new method based on dynamic co-word networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1253-1271, November.
    36. Péter Érdi & Kinga Makovi & Zoltán Somogyvári & Katherine Strandburg & Jan Tobochnik & Péter Volf & László Zalányi, 2013. "Prediction of emerging technologies based on analysis of the US patent citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(1), pages 225-242, April.
    37. Liang-xing Su & Peng-hui Lyu & Zheng Yang & Shuai Ding & Kai-le Zhou, 2015. "Scientometric cognitive and evaluation on smart city related construction and building journals data," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(1), pages 449-470, October.
    38. Zhikun Ding & Rongsheng Liu & Zongjie Li & Cheng Fan, 2020. "A Thematic Network-Based Methodology for the Research Trend Identification in Building Energy Management," Energies, MDPI, vol. 13(18), pages 1-33, September.
    39. Qian, Yue & Liu, Yu & Sheng, Quan Z., 2020. "Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligence," Journal of Informetrics, Elsevier, vol. 14(3).
    40. Chanwoo Jeong & Sion Jang & Eunjeong Park & Sungchul Choi, 2020. "A context-aware citation recommendation model with BERT and graph convolutional networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 1907-1922, September.
    41. Yi Zhang & Guangquan Zhang & Donghua Zhu & Jie Lu, 2017. "Scientific evolutionary pathways: Identifying and visualizing relationships for scientific topics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(8), pages 1925-1939, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Xiaoguang & He, Jing & Huang, Han & Wang, Hongyu, 2022. "MatrixSim: A new method for detecting the evolution paths of research topics," Journal of Informetrics, Elsevier, vol. 16(4).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Yi & Wu, Mengjia & Miao, Wen & Huang, Lu & Lu, Jie, 2021. "Bi-layer network analytics: A methodology for characterizing emerging general-purpose technologies," Journal of Informetrics, Elsevier, vol. 15(4).
    2. Wang, Xiaoguang & He, Jing & Huang, Han & Wang, Hongyu, 2022. "MatrixSim: A new method for detecting the evolution paths of research topics," Journal of Informetrics, Elsevier, vol. 16(4).
    3. Huang, Lu & Chen, Xiang & Ni, Xingxing & Liu, Jiarun & Cao, Xiaoli & Wang, Changtian, 2021. "Tracking the dynamics of co-word networks for emerging topic identification," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
    4. Lu Huang & Xiang Chen & Yi Zhang & Yihe Zhu & Suyi Li & Xingxing Ni, 2021. "Dynamic network analytics for recommending scientific collaborators," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(11), pages 8789-8814, November.
    5. Qiang Gao & Xiao Huang & Ke Dong & Zhentao Liang & Jiang Wu, 2022. "Semantic-enhanced topic evolution analysis: a combination of the dynamic topic model and word2vec," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1543-1563, March.
    6. Lu Huang & Yijie Cai & Erdong Zhao & Shengting Zhang & Yue Shu & Jiao Fan, 2022. "Measuring the interdisciplinarity of Information and Library Science interactions using citation analysis and semantic analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6733-6761, November.
    7. Zhang, Yi & Lu, Jie & Liu, Feng & Liu, Qian & Porter, Alan & Chen, Hongshu & Zhang, Guangquan, 2018. "Does deep learning help topic extraction? A kernel k-means clustering method with word embedding," Journal of Informetrics, Elsevier, vol. 12(4), pages 1099-1117.
    8. Huailan Liu & Zhiwang Chen & Jie Tang & Yuan Zhou & Sheng Liu, 2020. "Mapping the technology evolution path: a novel model for dynamic topic detection and tracking," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2043-2090, December.
    9. Jung, Sukhwan & Segev, Aviv, 2022. "DAC: Descendant-aware clustering algorithm for network-based topic emergence prediction," Journal of Informetrics, Elsevier, vol. 16(3).
    10. Yi Zhang & Xiaojing Cai & Caroline V. Fry & Mengjia Wu & Caroline S. Wagner, 2021. "Topic evolution, disruption and resilience in early COVID-19 research," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4225-4253, May.
    11. Yuan Zhou & Heng Lin & Yufei Liu & Wei Ding, 2019. "A novel method to identify emerging technologies using a semi-supervised topic clustering model: a case of 3D printing industry," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 167-185, July.
    12. Li, Xin & Xie, Qianqian & Daim, Tugrul & Huang, Lucheng, 2019. "Forecasting technology trends using text mining of the gaps between science and technology: The case of perovskite solar cell technology," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 432-449.
    13. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    14. Ting Xiong & Liang Zhou & Ying Zhao & Xiaojuan Zhang, 2022. "Mining semantic information of co-word network to improve link prediction performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 2981-3004, June.
    15. Qian, Yue & Liu, Yu & Sheng, Quan Z., 2020. "Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligence," Journal of Informetrics, Elsevier, vol. 14(3).
    16. Hengmin Zhu & Li Qian & Wang Qin & Jing Wei & Chao Shen, 2022. "Evolution analysis of online topics based on ‘word-topic’ coupling network," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 3767-3792, July.
    17. Huang, Ying & Li, Ruinan & Zou, Fang & Jiang, Lidan & Porter, Alan L. & Zhang, Lin, 2022. "Technology life cycle analysis: From the dynamic perspective of patent citation networks," Technological Forecasting and Social Change, Elsevier, vol. 181(C).
    18. Xiaoguang Wang & Qikai Cheng & Wei Lu, 2014. "Analyzing evolution of research topics with NEViewer: a new method based on dynamic co-word networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1253-1271, November.
    19. Sun, Bixuan & Kolesnikov, Sergey & Goldstein, Anna & Chan, Gabriel, 2021. "A dynamic approach for identifying technological breakthroughs with an application in solar photovoltaics," Technological Forecasting and Social Change, Elsevier, vol. 165(C).
    20. Porter, Alan L. & Chiavetta, Denise & Newman, Nils C., 2020. "Measuring tech emergence: A contest," Technological Forecasting and Social Change, Elsevier, vol. 159(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:127:y:2022:i:9:d:10.1007_s11192-022-04273-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.