IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v111y2017i2d10.1007_s11192-017-2290-5.html
   My bibliography  Save this article

Multi-source data fusion study in scientometrics

Author

Listed:
  • Hai-Yun Xu

    (Chengdu Library of Chinese Academy of Sciences)

  • Zeng-Hui Yue

    (Jining Medical University)

  • Chao Wang

    (Chengdu Library of Chinese Academy of Sciences)

  • Kun Dong

    (Chengdu Library of Chinese Academy of Sciences)

  • Hong-Shen Pang

    (Chinese Academy of Sciences)

  • Zhengbiao Han

    (Nanjing Agricultural University)

Abstract

This paper provides an introduction to multi-source data fusion (MSDF) and comprehensively overviews the ingredients and challenges of MSDF in scientometrics. As compared to the MSDF methods in the sensor and other fields, and considering the features of scientometrics, in this paper an application model and procedure of MSDF in scientometrics are proposed. The model and procedure can be divided into three parts: data type integration, fusion of data relations, and ensemble clustering. Furthermore, the fusion of data relations can be divided into cross-integration of multi-mode data and matrix fusion of multi-relational data. To obtain a clearer and deeper analysis of the MSDF model, this paper further focuses on the application of MSDF in topic identification based on text analysis of scientific literatures. This paper also discusses the application of MSDF for the exploration of scientific literatures. Finally, the most suitable MSDF methods for different situations are discussed.

Suggested Citation

  • Hai-Yun Xu & Zeng-Hui Yue & Chao Wang & Kun Dong & Hong-Shen Pang & Zhengbiao Han, 2017. "Multi-source data fusion study in scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 773-792, May.
  • Handle: RePEc:spr:scient:v:111:y:2017:i:2:d:10.1007_s11192-017-2290-5
    DOI: 10.1007/s11192-017-2290-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-017-2290-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-017-2290-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Calero-Medina, Clara & Noyons, Ed C.M., 2008. "Combining mapping and citation network analysis for a better understanding of the scientific development: The case of the absorptive capacity field," Journal of Informetrics, Elsevier, vol. 2(4), pages 272-279.
    2. Peter van den Besselaar & Gaston Heimeriks, 2006. "Mapping research topics using word-reference co-occurrences: A method and an exploratory case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(3), pages 377-393, September.
    3. S. Ravikumar & Ashutosh Agrahari & S. N. Singh, 2015. "Mapping the intellectual structure of scientometrics: a co-word analysis of the journal Scientometrics (2005–2010)," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 929-955, January.
    4. Richard Klavans & Kevin W. Boyack, 2006. "Quantitative evaluation of large maps of science," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(3), pages 475-499, September.
    5. He, Xiaofeng & Zha, Hongyuan & H.Q. Ding, Chris & D. Simon, Horst, 2002. "Web document clustering using hyperlink structures," Computational Statistics & Data Analysis, Elsevier, vol. 41(1), pages 19-45, November.
    6. Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
    7. Naoki Shibata & Yuya Kajikawa & Yoshiyuki Takeda & Katsumori Matsushima, 2009. "Comparative study on methods of detecting research fronts using different types of citation," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(3), pages 571-580, March.
    8. Ahlgren, Per & Colliander, Cristian, 2009. "Document–document similarity approaches and science mapping: Experimental comparison of five approaches," Journal of Informetrics, Elsevier, vol. 3(1), pages 49-63.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xu, Haiyun & Yue, Zenghui & Pang, Hongshen & Elahi, Ehsan & Li, Jing & Wang, Lu, 2022. "Integrative model for discovering linked topics in science and technology," Journal of Informetrics, Elsevier, vol. 16(2).
    2. Xu, Haiyun & Winnink, Jos & Yue, Zenghui & Liu, Ziqiang & Yuan, Guoting, 2020. "Topic-linked innovation paths in science and technology," Journal of Informetrics, Elsevier, vol. 14(2).
    3. Xu, Haiyun & Winnink, Jos & Yue, Zenghui & Zhang, Huiling & Pang, Hongshen, 2021. "Multidimensional Scientometric indicators for the detection of emerging research topics," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
    4. Kun Dong & Haiyun Xu & Rui Luo & Ling Wei & Shu Fang, 2018. "An integrated method for interdisciplinary topic identification and prediction: a case study on information science and library science," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 849-868, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang, Siluo & Han, Ruizhen & Wolfram, Dietmar & Zhao, Yuehua, 2016. "Visualizing the intellectual structure of information science (2006–2015): Introducing author keyword coupling analysis," Journal of Informetrics, Elsevier, vol. 10(1), pages 132-150.
    2. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    3. Yi-Ming Wei & Jin-Wei Wang & Tianqi Chen & Bi-Ying Yu & Hua Liao, 2018. "Frontiers of Low-Carbon Technologies: Results from Bibliographic Coupling with Sliding Window," CEEP-BIT Working Papers 116, Center for Energy and Environmental Policy Research (CEEP), Beijing Institute of Technology.
    4. Ludo Waltman & Nees Jan Eck, 2012. "A new methodology for constructing a publication-level classification system of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2378-2392, December.
    5. Michel Zitt, 2015. "Meso-level retrieval: IR-bibliometrics interplay and hybrid citation-words methods in scientific fields delineation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2223-2245, March.
    6. Sung Kim & Derek Hansen & Richard Helps, 2018. "Computing research in the academy: insights from theses and dissertations," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(1), pages 135-158, January.
    7. Hiran H. Lathabai & Susan George & Thara Prabhakaran & Manoj Changat, 2018. "An integrated approach to path analysis for weighted citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1871-1904, December.
    8. Mu-hsuan Huang & Chia-Pin Chang, 2015. "A comparative study on detecting research fronts in the organic light-emitting diode (OLED) field using bibliographic coupling and co-citation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2041-2057, March.
    9. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    10. Marianne Hörlesberger & Ivana Roche & Dominique Besagni & Thomas Scherngell & Claire François & Pascal Cuxac & Edgar Schiebel & Michel Zitt & Dirk Holste, 2013. "A concept for inferring ‘frontier research’ in grant proposals," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(2), pages 129-148, November.
    11. Xinhai Liu & Wolfgang Glänzel & Bart Moor, 2012. "Optimal and hierarchical clustering of large-scale hybrid networks for scientific mapping," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(2), pages 473-493, May.
    12. Shuo Xu & Junwan Liu & Dongsheng Zhai & Xin An & Zheng Wang & Hongshen Pang, 2018. "Overlapping thematic structures extraction with mixed-membership stochastic blockmodel," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 61-84, October.
    13. Alfonso Ávila-Robinson & Shintaro Sengoku, 2017. "Tracing the knowledge-building dynamics in new stem cell technologies through techno-scientific networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1691-1720, September.
    14. Alicja Grześkowiak, 2016. "Assessment of Participation in Cultural Activities in Poland by Selected Multivariate Methods," European Journal of Social Sciences Education and Research Articles, Revistia Research and Publishing, vol. 3, January -.
    15. Yunpeng Zhao & Qing Pan & Chengan Du, 2019. "Logistic regression augmented community detection for network data with application in identifying autism‐related gene pathways," Biometrics, The International Biometric Society, vol. 75(1), pages 222-234, March.
    16. Wu, Han-Ming & Tien, Yin-Jing & Chen, Chun-houh, 2010. "GAP: A graphical environment for matrix visualization and cluster analysis," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 767-778, March.
    17. José E. Chacón, 2021. "Explicit Agreement Extremes for a 2 × 2 Table with Given Marginals," Journal of Classification, Springer;The Classification Society, vol. 38(2), pages 257-263, July.
    18. F. Marta L. Di Lascio & Andrea Menapace & Roberta Pappadà, 2024. "A spatially‐weighted AMH copula‐based dissimilarity measure for clustering variables: An application to urban thermal efficiency," Environmetrics, John Wiley & Sons, Ltd., vol. 35(1), February.
    19. Yifan Zhu & Chongzhi Di & Ying Qing Chen, 2019. "Clustering Functional Data with Application to Electronic Medication Adherence Monitoring in HIV Prevention Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(2), pages 238-261, July.
    20. Irene Vrbik & Paul McNicholas, 2015. "Fractionally-Supervised Classification," Journal of Classification, Springer;The Classification Society, vol. 32(3), pages 359-381, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:111:y:2017:i:2:d:10.1007_s11192-017-2290-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.