IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v120y2019i1d10.1007_s11192-019-03126-8.html
   My bibliography  Save this article

A novel method to identify emerging technologies using a semi-supervised topic clustering model: a case of 3D printing industry

Author

Listed:
  • Yuan Zhou

    (Tsinghua University)

  • Heng Lin

    (Huazhong University of Science and Technology)

  • Yufei Liu

    (Tsinghua University
    Center for Strategic Studies, Chinese Academy of Engineering)

  • Wei Ding

    (Huazhong University of Science and Technology)

Abstract

There have been recent attempts to identify emerging technologies by using topic-based analysis, but many of them have methodological deficiencies. First, analyses are unsupervised, and unsupervised methods cannot incorporate supervised knowledge that is needed to better identify technological domains. Second, those methods lack semantic interpretation, as many of them still remain at word-level analyses, we developed a novel technology-identification method that uses a semi-supervised topic clustering model (Labeled Dirichlet Multi Mixture model) to integrate technological domain knowledge. The model also generates a sentence-level semantic technological topic description through the topic description method (Various-aspects Sentence-level Description) on information extraction. We used this novel method to analyze the technology of the 3D printing industry, and successfully identified emerging technologies by differentiating new topics from the traditional topics, the results effectively demonstrated the semantic technological topic description by showing sentences. This method could be of great interest to technology forecasters and relevant policy-makers.

Suggested Citation

  • Yuan Zhou & Heng Lin & Yufei Liu & Wei Ding, 2019. "A novel method to identify emerging technologies using a semi-supervised topic clustering model: a case of 3D printing industry," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 167-185, July.
  • Handle: RePEc:spr:scient:v:120:y:2019:i:1:d:10.1007_s11192-019-03126-8
    DOI: 10.1007/s11192-019-03126-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-019-03126-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-019-03126-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Lin & Liu, Xinhai & Janssens, Frizo & Liang, Liming & Glänzel, Wolfgang, 2010. "Subject clustering analysis based on ISI category classification," Journal of Informetrics, Elsevier, vol. 4(2), pages 185-193.
    2. Hanning Guo & Scott Weingart & Katy Börner, 2011. "Mixed-indicators model for identifying emerging research areas," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(1), pages 421-435, October.
    3. de Rassenfosse, Gaétan & Dernis, Hélène & Guellec, Dominique & Picci, Lucio & van Pottelsberghe de la Potterie, Bruno, 2013. "The worldwide count of priority patents: A new indicator of inventive activity," Research Policy, Elsevier, vol. 42(3), pages 720-737.
    4. Bo Wang & Shengbo Liu & Kun Ding & Zeyuan Liu & Jing Xu, 2014. "Identifying technological topics and institution-topic distribution probability for patent competitive intelligence analysis: a case study in LTE technology," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 685-704, October.
    5. Small, Henry & Boyack, Kevin W. & Klavans, Richard, 2014. "Identifying emerging topics in science and technology," Research Policy, Elsevier, vol. 43(8), pages 1450-1467.
    6. Kevin W. Boyack & Richard Klavans, 2010. "Co‐citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    7. Ivana Roche & Dominique Besagni & Claire François & Marianne Hörlesberger & Edgar Schiebel, 2010. "Identification and characterisation of technological topics in the field of Molecular Biology," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(3), pages 663-676, March.
    8. Janghyeok Yoon & Kwangsoo Kim, 2011. "Identifying rapidly evolving technological trends for R&D planning using SAO-based semantic patent networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 213-228, July.
    9. Edgar Schiebel & Marianne Hörlesberger & Ivana Roche & Claire François & Dominique Besagni, 2010. "An advanced diffusion model to identify emergent research issues: the case of optoelectronic devices," Scientometrics, Springer;Akadémiai Kiadó, vol. 83(3), pages 765-781, June.
    10. Yuan Zhou & Xin Li & Rasmus Lema & Frauke Urban, 2016. "Comparing the knowledge bases of wind turbine firms in Asia and Europe: Patent trajectories, networks, and globalisation," Science and Public Policy, Oxford University Press, vol. 43(4), pages 476-491.
    11. Lu, Louis Y.Y. & Liu, John S., 2016. "A novel approach to identify the major research themes and development trajectory: The case of patenting research," Technological Forecasting and Social Change, Elsevier, vol. 103(C), pages 71-82.
    12. Yuan Zhou & Meijuan Pan & Frauke Urban, 2018. "Comparing the International Knowledge Flow of China’s Wind and Solar Photovoltaic (PV) Industries: Patent Analysis and Implications for Sustainable Development," Sustainability, MDPI, vol. 10(6), pages 1-34, June.
    13. Rotolo, Daniele & Hicks, Diana & Martin, Ben R., 2015. "What is an emerging technology?," Research Policy, Elsevier, vol. 44(10), pages 1827-1843.
    14. Ta-Shun Cho & Hsin-Yu Shih, 2011. "Patent citation network analysis of core and emerging technologies in Taiwan: 1997–2008," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(3), pages 795-811, December.
    15. Hyunseok Park & Janghyeok Yoon & Kwangsoo Kim, 2012. "Identifying patent infringement using SAO based semantic technological similarities," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 515-529, February.
    16. Jeong, Do-Heon & Song, Min, 2014. "Time gap analysis by the topic model-based temporal technique," Journal of Informetrics, Elsevier, vol. 8(3), pages 776-790.
    17. Breitzman, Anthony & Thomas, Patrick, 2015. "The Emerging Clusters Model: A tool for identifying emerging technologies across multiple patent systems," Research Policy, Elsevier, vol. 44(1), pages 195-205.
    18. Wolfgang Glänzel & Sarah Heeffer & Bart Thijs, 2017. "Lexical analysis of scientific publications for nano-level scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1897-1906, June.
    19. Venugopalan, Subhashini & Rai, Varun, 2015. "Topic based classification and pattern identification in patents," Technological Forecasting and Social Change, Elsevier, vol. 94(C), pages 236-250.
    20. Chyi-Kwei Yau & Alan Porter & Nils Newman & Arho Suominen, 2014. "Clustering scientific documents with topic modeling," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(3), pages 767-786, September.
    21. Kevin W. Boyack, 2017. "Investigating the effect of global data on topic detection," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 999-1015, May.
    22. Kevin W. Boyack & Richard Klavans, 2010. "Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    23. Woo Hyoung Lee, 2008. "How to identify emerging research fields using scientometrics: An example in the field of Information Security," Scientometrics, Springer;Akadémiai Kiadó, vol. 76(3), pages 503-525, September.
    24. Shenghui Wang & Rob Koopman, 2017. "Clustering articles based on semantic similarity," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1017-1031, May.
    25. Jing Zhang & Xiaomin Liu & Lili Wu, 2016. "The study of subject-classification based on journal coupling and expert subject-classification system," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1149-1170, June.
    26. Katharina Maria Hofer & Angela Elisabeth Smejkal & F. Zeynep Bilgin & Gerhard A. Wuehrer, 2010. "Conference proceedings as a matter of bibliometric studies: the Academy of International Business 2006–2008," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(3), pages 845-862, September.
    27. Ding, Ying, 2011. "Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks," Journal of Informetrics, Elsevier, vol. 5(1), pages 187-203.
    28. Zhang, Yi & Porter, Alan L. & Hu, Zhengyin & Guo, Ying & Newman, Nils C., 2014. "“Term clumping” for technical intelligence: A case study on dye-sensitized solar cells," Technological Forecasting and Social Change, Elsevier, vol. 85(C), pages 26-39.
    29. Yawei Wang & Frauke Urban & Yuan Zhou & Luyi Chen, 2018. "Comparing the Technology Trajectories of Solar PV and Solar Water Heaters in China: Using a Patent Lens," Sustainability, MDPI, vol. 10(11), pages 1-29, November.
    30. Xin Ying An & Qing Qiang Wu, 2011. "Co-word analysis of the trends in stem cells field based on subject heading weighting," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 133-144, July.
    31. Loet Leydesdorff & Ismael Rafols, 2009. "A global map of science based on the ISI subject categories," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(2), pages 348-362, February.
    32. S. Phineas Upham & Henry Small, 2010. "Emerging research fronts in science and technology: patterns of new knowledge development," Scientometrics, Springer;Akadémiai Kiadó, vol. 83(1), pages 15-38, April.
    33. Waltman, Ludo & van Eck, Nees Jan & Noyons, Ed C.M., 2010. "A unified approach to mapping and clustering of bibliometric networks," Journal of Informetrics, Elsevier, vol. 4(4), pages 629-635.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yuan Zhou & Fang Dong & Yufei Liu & Liang Ran, 2021. "A deep learning framework to early identify emerging technologies in large-scale outlier patents: an empirical study of CNC machine tool," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 969-994, February.
    2. Huailan Liu & Zhiwang Chen & Jie Tang & Yuan Zhou & Sheng Liu, 2020. "Mapping the technology evolution path: a novel model for dynamic topic detection and tracking," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2043-2090, December.
    3. Dejing Kong & Jianzhong Yang & Lingfeng Li, 2020. "Early identification of technological convergence in numerical control machine tool: a deep learning approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 1983-2009, December.
    4. Yuan Zhou & Fang Dong & Yufei Liu & Zhaofu Li & JunFei Du & Li Zhang, 2020. "Forecasting emerging technologies using data augmentation and deep learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(1), pages 1-29, April.
    5. Guannan Xu & Weijie Hu & Yuanyuan Qiao & Yuan Zhou, 2020. "Mapping an innovation ecosystem using network clustering and community identification: a multi-layered framework," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 2057-2081, September.
    6. Benjamin M. Knisely & Holly H. Pavliscsak, 2023. "Research proposal content extraction using natural language processing and semi-supervised clustering: A demonstration and comparative analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(5), pages 3197-3224, May.
    7. Guo Chen & Jing Chen & Yu Shao & Lu Xiao, 2023. "Automatic noise reduction of domain-specific bibliographic datasets using positive-unlabeled learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1187-1204, February.
    8. Wooseok Jang & Yongtae Park & Hyeonju Seol, 2021. "Identifying emerging technologies using expert opinions on the future: A topic modeling and fuzzy clustering approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6505-6532, August.
    9. Peichao Dai & Ruxu Sheng & Zhongzhen Miao & Zanxu Chen & Yuan Zhou, 2021. "Analysis of Spatial–Temporal Characteristics of Industrial Land Supply Scale in Relation to Industrial Structure in China," Land, MDPI, vol. 10(11), pages 1-18, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuo Xu & Liyuan Hao & Xin An & Hongshen Pang & Ting Li, 2020. "Review on emerging research topics with key-route main path analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 607-624, January.
    2. Xu, Shuo & Hao, Liyuan & An, Xin & Yang, Guancan & Wang, Feifei, 2019. "Emerging research topics detection with multiple machine learning models," Journal of Informetrics, Elsevier, vol. 13(4).
    3. Rotolo, Daniele & Hicks, Diana & Martin, Ben R., 2015. "What is an emerging technology?," Research Policy, Elsevier, vol. 44(10), pages 1827-1843.
    4. Huang, Lu & Chen, Xiang & Ni, Xingxing & Liu, Jiarun & Cao, Xiaoli & Wang, Changtian, 2021. "Tracking the dynamics of co-word networks for emerging topic identification," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
    5. Inchae Park & Byungun Yoon, 2018. "Identifying Promising Research Frontiers of Pattern Recognition through Bibliometric Analysis," Sustainability, MDPI, vol. 10(11), pages 1-32, November.
    6. Jochen Gläser & Wolfgang Glänzel & Andrea Scharnhorst, 2017. "Same data—different results? Towards a comparative approach to the identification of thematic structures in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 981-998, May.
    7. Xu, Shuo & Hao, Liyuan & Yang, Guancan & Lu, Kun & An, Xin, 2021. "A topic models based framework for detecting and forecasting emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    8. Zhou, Yuan & Dong, Fang & Kong, Dejing & Liu, Yufei, 2019. "Unfolding the convergence process of scientific knowledge for the early identification of emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 144(C), pages 205-220.
    9. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    10. Samira Ranaei & Arho Suominen & Alan Porter & Stephen Carley, 2020. "Evaluating technological emergence using text analytics: two case technologies and three approaches," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 215-247, January.
    11. Porter, Alan L. & Chiavetta, Denise & Newman, Nils C., 2020. "Measuring tech emergence: A contest," Technological Forecasting and Social Change, Elsevier, vol. 159(C).
    12. Kwon, Seokbeom & Liu, Xiaoyu & Porter, Alan L. & Youtie, Jan, 2019. "Research addressing emerging technological ideas has greater scientific impact," Research Policy, Elsevier, vol. 48(9), pages 1-1.
    13. Kyebambe, Moses Ntanda & Cheng, Ge & Huang, Yunqing & He, Chunhui & Zhang, Zhenyu, 2017. "Forecasting emerging technologies: A supervised learning approach through patent analysis," Technological Forecasting and Social Change, Elsevier, vol. 125(C), pages 236-244.
    14. Yuan Zhou & Fang Dong & Yufei Liu & Liang Ran, 2021. "A deep learning framework to early identify emerging technologies in large-scale outlier patents: an empirical study of CNC machine tool," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 969-994, February.
    15. Suominen, Arho & Peng, Haoshu & Ranaei, Samira, 2019. "Examining the dynamics of an emerging research network using the case of triboelectric nanogenerators," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 820-830.
    16. Sohrabi, Babak & Khalilijafarabad, Ahmad, 2018. "Systematic method for finding emergence research areas as data quality," Technological Forecasting and Social Change, Elsevier, vol. 137(C), pages 280-287.
    17. Puccetti, Giovanni & Giordano, Vito & Spada, Irene & Chiarello, Filippo & Fantoni, Gualtiero, 2023. "Technology identification from patent texts: A novel named entity recognition method," Technological Forecasting and Social Change, Elsevier, vol. 186(PB).
    18. Hric, Darko & Kaski, Kimmo & Kivelä, Mikko, 2018. "Stochastic block model reveals maps of citation patterns and their evolution in time," Journal of Informetrics, Elsevier, vol. 12(3), pages 757-783.
    19. Frank Havemann & Jochen Gläser & Michael Heinz, 2017. "Memetic search for overlapping topics based on a local evaluation of link communities," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1089-1118, May.
    20. Lee, Changyong & Kwon, Ohjin & Kim, Myeongjung & Kwon, Daeil, 2018. "Early identification of emerging technologies: A machine learning approach using multiple patent indicators," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 291-303.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:120:y:2019:i:1:d:10.1007_s11192-019-03126-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.