IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v96y2013i3d10.1007_s11192-012-0896-1.html
   My bibliography  Save this article

Do second-order similarities provide added-value in a hybrid approach?

Author

Listed:
  • Bart Thijs

    (Centre for R&D Monitoring (ECOOM) and Department of MSI, KU Leuven)

  • Edgar Schiebel

    (AIT Austrian Institute of Technology GmbH)

  • Wolfgang Glänzel

    (Centre for R&D Monitoring (ECOOM) and Department of MSI, KU Leuven
    Department of Science Policy and Scientometrics, LHAS)

Abstract

Recent studies on first- and second-order similarities have shown that the latter one outperforms the first one as input for document clustering or partitioning applications. First-order similarities based on bibliographic coupling or on lexical approaches come with specific methodological issues like sparse matrices, sensitive to spelling variances or context differences. Second-order similarities were proposed to tackle these problems and take the lexical context into account. But also a hybrid combination of both types of similarities proved an important improvement which integrates the strengths of the two approaches and diminishes their weaknesses. In this paper we extend the notion of second-order similarity by applying it in the context of the hybrid approach. We conclude that there is no added value for the clearly defined clusters but that the second-order similarity can provide an additional viewpoint for the more general clusters.

Suggested Citation

  • Bart Thijs & Edgar Schiebel & Wolfgang Glänzel, 2013. "Do second-order similarities provide added-value in a hybrid approach?," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 667-677, September.
  • Handle: RePEc:spr:scient:v:96:y:2013:i:3:d:10.1007_s11192-012-0896-1
    DOI: 10.1007/s11192-012-0896-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-012-0896-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-012-0896-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Edgar Schiebel, 2012. "Visualization of research fronts and knowledge bases by three-dimensional areal densities of bibliographically coupled publications and co-citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(2), pages 557-566, May.
    2. Wolfgang Glänzel & Bart Thijs, 2012. "Using ‘core documents’ for detecting and labelling new emerging topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(2), pages 399-416, May.
    3. Kevin W. Boyack & Richard Klavans, 2010. "Co‐citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    4. Wolfgang Glänzel & Frizo Janssens & Bart Thijs, 2009. "A comparative analysis of publication activity and citation impact based on the core literature in bioinformatics," Scientometrics, Springer;Akadémiai Kiadó, vol. 79(1), pages 109-129, April.
    5. Kevin W. Boyack & Richard Klavans, 2010. "Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    6. Alexander Kopcsa & Edgar Schiebel, 1998. "Science and technology mapping: A new iteration model for representing multidimensional relationships," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(1), pages 7-17.
    7. Wolfgang Glänzel & Bart Thijs, 2011. "Using ‘core documents’ for the representation of clusters and topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 297-309, July.
    8. Ahlgren, Per & Colliander, Cristian, 2009. "Document–document similarity approaches and science mapping: Experimental comparison of five approaches," Journal of Informetrics, Elsevier, vol. 3(1), pages 49-63.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bart Thijs & Lin Zhang & Wolfgang Glänzel, 2015. "Bibliographic coupling and hierarchical clustering for the validation and improvement of subject-classification schemes," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1453-1467, December.
    2. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    3. Wolfgang Glänzel & Bart Thijs, 2017. "Using hybrid methods and ‘core documents’ for the representation of clusters and topics: the astronomy dataset," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1071-1087, May.
    4. Ávila-Robinson, Alfonso & Islam, Nazrul & Sengoku, Shintaro, 2022. "Exploring the knowledge base of innovation research: Towards an emerging innovation model," Technological Forecasting and Social Change, Elsevier, vol. 182(C).
    5. Yun, Jinhyuk, 2022. "Generalization of bibliographic coupling and co-citation using the node split network," Journal of Informetrics, Elsevier, vol. 16(2).
    6. Emili Vizuete-Luciano & Oktay Güzel & José M. Merigó, 2023. "Bibliometric research of the Pay-What-You-Want Topic," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 22(5), pages 413-426, October.
    7. Christian Weismayer & Ilona Pezenka, 2017. "Identifying emerging research fields: a longitudinal latent semantic keyword analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1757-1785, December.
    8. Nauman Majeed & Sulaiman Ainin, 2021. "Visualizing the evolution and landscape of socio-economic impact research," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(2), pages 637-659, April.
    9. Yun, Jinhyuk & Ahn, Sejung & Lee, June Young, 2020. "Return to basics: Clustering of scientific literature using structural information," Journal of Informetrics, Elsevier, vol. 14(4).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mu-Hsuan Huang & Chia-Pin Chang, 2014. "Detecting research fronts in OLED field using bibliographic coupling with sliding window," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(3), pages 1721-1744, March.
    2. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    3. Rons, Nadine, 2018. "Bibliometric approximation of a scientific specialty by combining key sources, title words, authors and references," Journal of Informetrics, Elsevier, vol. 12(1), pages 113-132.
    4. Shuo Xu & Liyuan Hao & Xin An & Hongshen Pang & Ting Li, 2020. "Review on emerging research topics with key-route main path analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 607-624, January.
    5. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    6. Ludo Waltman & Nees Jan Eck, 2012. "A new methodology for constructing a publication-level classification system of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2378-2392, December.
    7. Liu, Yunmei & Yang, Liu & Chen, Min, 2021. "A new citation concept: Triangular citation in the literature," Journal of Informetrics, Elsevier, vol. 15(2).
    8. Michel Zitt, 2015. "Meso-level retrieval: IR-bibliometrics interplay and hybrid citation-words methods in scientific fields delineation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2223-2245, March.
    9. Bart Thijs & Wolfgang Glänzel, 2018. "The contribution of the lexical component in hybrid clustering, the case of four decades of “Scientometrics”," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 21-33, April.
    10. Cristian Colliander & Per Ahlgren, 2012. "Experimental comparison of first and second-order similarities in a scientometric context," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 675-685, February.
    11. Wolfgang Glänzel & Bart Thijs, 2012. "Using ‘core documents’ for detecting and labelling new emerging topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(2), pages 399-416, May.
    12. Mu-hsuan Huang & Chia-Pin Chang, 2015. "A comparative study on detecting research fronts in the organic light-emitting diode (OLED) field using bibliographic coupling and co-citation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2041-2057, March.
    13. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    14. Xu, Shuo & Hao, Liyuan & An, Xin & Yang, Guancan & Wang, Feifei, 2019. "Emerging research topics detection with multiple machine learning models," Journal of Informetrics, Elsevier, vol. 13(4).
    15. Wolfgang Glänzel, 2015. "Bibliometrics-aided retrieval: where information retrieval meets scientometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2215-2222, March.
    16. Peter Mutschke & Philipp Mayr & Philipp Schaer & York Sure, 2011. "Science models as value-added services for scholarly information systems," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(1), pages 349-364, October.
    17. Wolfgang Glänzel & Bart Thijs, 2017. "Using hybrid methods and ‘core documents’ for the representation of clusters and topics: the astronomy dataset," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 1071-1087, May.
    18. Gómez-Núñez, Antonio J. & Batagelj, Vladimir & Vargas-Quesada, Benjamín & Moya-Anegón, Félix & Chinchilla-Rodríguez, Zaida, 2014. "Optimizing SCImago Journal & Country Rank classification by community detection," Journal of Informetrics, Elsevier, vol. 8(2), pages 369-383.
    19. Zhang, Yi & Shang, Lining & Huang, Lu & Porter, Alan L. & Zhang, Guangquan & Lu, Jie & Zhu, Donghua, 2016. "A hybrid similarity measure method for patent portfolio analysis," Journal of Informetrics, Elsevier, vol. 10(4), pages 1108-1130.
    20. Alfonso Ávila-Robinson & Shintaro Sengoku, 2017. "Tracing the knowledge-building dynamics in new stem cell technologies through techno-scientific networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1691-1720, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:96:y:2013:i:3:d:10.1007_s11192-012-0896-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.