IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v105y2015i3d10.1007_s11192-015-1641-3.html
   My bibliography  Save this article

Bibliographic coupling and hierarchical clustering for the validation and improvement of subject-classification schemes

Author

Listed:
  • Bart Thijs

    (KU Leuven)

  • Lin Zhang

    (KU Leuven
    North China University of Water Conservancy and Electric Power)

  • Wolfgang Glänzel

    (KU Leuven
    Library of the Hungarian Academy of Sciences)

Abstract

An attempt is made to cluster journals from the complete Web of Science database by using bibliographic coupling similarities. Since the sparseness of the underlying similarity matrix proved inappropriate for this exercise, second-order similarities have been used. Only 0.12 % out of 8282 journals had to be removed from the classification as being singletons. The quality at three hierarchical levels with 6, 14 and 24 clusters substantiated the applicability of this method. Cluster labelling was made on the basis of the about 70 subfields of the Leuven–Budapest subject-classification scheme that also allowed the comparison with the existing two-level journal classification system developed in Leuven. The further comparison with the 22 field classification system of the Essential Science Indicators does, however, reveal larger deviations.

Suggested Citation

  • Bart Thijs & Lin Zhang & Wolfgang Glänzel, 2015. "Bibliographic coupling and hierarchical clustering for the validation and improvement of subject-classification schemes," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1453-1467, December.
  • Handle: RePEc:spr:scient:v:105:y:2015:i:3:d:10.1007_s11192-015-1641-3
    DOI: 10.1007/s11192-015-1641-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-015-1641-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-015-1641-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lin Zhang & Frizo Janssens & Liming Liang & Wolfgang Glänzel, 2010. "Journal cross-citation analysis for validation and improvement of journal-based subject classification in bibliometric research," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(3), pages 687-706, March.
    2. E. Bassecoulard & M. Zitt, 1999. "Indicators in a research institute: A multi-level classification of scientific journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 44(3), pages 323-345, March.
    3. Francis Narin & Mark Carpenter & Nancy C. Berlt, 1972. "Interrelationships of scientific journals," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 23(5), pages 323-331, September.
    4. Bart Thijs & Edgar Schiebel & Wolfgang Glänzel, 2013. "Do second-order similarities provide added-value in a hybrid approach?," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 667-677, September.
    5. Frizo Janssens & Wolfgang Glänzel & Bart Moor, 2008. "A hybrid mapping of information science," Scientometrics, Springer;Akadémiai Kiadó, vol. 75(3), pages 607-631, June.
    6. Wolfgang Glänzel & András Schubert, 2003. "A new classification scheme of science fields and subfields designed for scientometric evaluation purposes," Scientometrics, Springer;Akadémiai Kiadó, vol. 56(3), pages 357-367, March.
    7. Wolfgang Glänzel & Bart Thijs, 2011. "Using ‘core documents’ for the representation of clusters and topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(1), pages 297-309, July.
    8. Ahlgren, Per & Colliander, Cristian, 2009. "Document–document similarity approaches and science mapping: Experimental comparison of five approaches," Journal of Informetrics, Elsevier, vol. 3(1), pages 49-63.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    2. Wang, Qi & Waltman, Ludo, 2016. "Large-scale analysis of the accuracy of the journal classification systems of Web of Science and Scopus," Journal of Informetrics, Elsevier, vol. 10(2), pages 347-364.
    3. Haitham Nobanee & Fayrouz Aksam Elsaied & Nouf Alhammadi & Noora Wazir, 2023. "Bibliometric analysis and visualization of green, sustainable, and environmental insurance research," Journal of Financial Services Marketing, Palgrave Macmillan, vol. 28(4), pages 631-648, December.
    4. Gerson Pech & Catarina Delgado & Silvio Paolo Sorella, 2022. "Classifying papers into subfields using Abstracts, Titles, Keywords and KeyWords Plus through pattern detection and optimization procedures: An application in Physics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(11), pages 1513-1528, November.
    5. Yu-Wei Chang, 2019. "Are articles in library and information science (LIS) journals primarily contributed to by LIS authors?," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 81-104, October.
    6. Rey-Long Liu, 2017. "A new bibliographic coupling measure with descriptive capability," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(2), pages 915-935, February.
    7. Wuestman, Mignon L. & Hoekman, Jarno & Frenken, Koen, 2019. "The geography of scientific citations," Research Policy, Elsevier, vol. 48(7), pages 1771-1780.
    8. Lin Zhang & Beibei Sun & Fei Shu & Ying Huang, 2022. "Comparing paper level classifications across different methods and systems: an investigation of Nature publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7633-7651, December.
    9. Cristóbal Urbano & Jordi Ardanuy, 2020. "Cross-disciplinary collaboration versus coexistence in LIS serials: analysis of authorship affiliations in four European countries," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 575-602, July.
    10. Pech, Gerson & Delgado, Catarina, 2021. "Screening the most highly cited papers in longitudinal bibliometric studies and systematic literature reviews of a research field or journal: Widespread used metrics vs a percentile citation-based app," Journal of Informetrics, Elsevier, vol. 15(3).
    11. Sander Zwanenburg & Maryam Nakhoda & Peter Whigham, 2022. "Toward greater consistency and validity in measuring interdisciplinarity: a systematic and conceptual evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7769-7788, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michel Zitt, 2015. "Meso-level retrieval: IR-bibliometrics interplay and hybrid citation-words methods in scientific fields delineation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2223-2245, March.
    2. Lin Zhang & Frizo Janssens & Liming Liang & Wolfgang Glänzel, 2010. "Journal cross-citation analysis for validation and improvement of journal-based subject classification in bibliometric research," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(3), pages 687-706, March.
    3. Wang, Qi & Waltman, Ludo, 2016. "Large-scale analysis of the accuracy of the journal classification systems of Web of Science and Scopus," Journal of Informetrics, Elsevier, vol. 10(2), pages 347-364.
    4. Jing Zhang & Xiaomin Liu & Lili Wu, 2016. "The study of subject-classification based on journal coupling and expert subject-classification system," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1149-1170, June.
    5. Shuo Xu & Junwan Liu & Dongsheng Zhai & Xin An & Zheng Wang & Hongshen Pang, 2018. "Overlapping thematic structures extraction with mixed-membership stochastic blockmodel," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 61-84, October.
    6. Ludo Waltman & Nees Jan Eck, 2012. "A new methodology for constructing a publication-level classification system of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2378-2392, December.
    7. Lin Zhang & Beibei Sun & Fei Shu & Ying Huang, 2022. "Comparing paper level classifications across different methods and systems: an investigation of Nature publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7633-7651, December.
    8. Wolfgang Glänzel & Lin Zhang, 2018. "Scientometric research assessment in the developing world: A tribute to Michael J. Moravcsik from the perspective of the twenty-first century," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1517-1532, June.
    9. Bart Thijs & Wolfgang Glänzel, 2018. "The contribution of the lexical component in hybrid clustering, the case of four decades of “Scientometrics”," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 21-33, April.
    10. Antonio J. Gómez-Núñez & Benjamín Vargas-Quesada & Félix Moya-Anegón & Wolfgang Glänzel, 2011. "Improving SCImago Journal & Country Rank (SJR) subject classification through reference analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 89(3), pages 741-758, December.
    11. Yu-Wei Chang, 2019. "Are articles in library and information science (LIS) journals primarily contributed to by LIS authors?," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 81-104, October.
    12. Dejian Yu & Wanru Wang & Shuai Zhang & Wenyu Zhang & Rongyu Liu, 2017. "Hybrid self-optimized clustering model based on citation links and textual features to detect research topics," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-21, October.
    13. C.M. Calero Medina & T.N. Leeuwen, 2012. "Seed journal citation network maps: A method based on network theory," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(6), pages 1226-1234, June.
    14. Yuen-Hsien Tseng & Ming-Yueh Tsay, 2013. "Journal clustering of library and information science for subfield delineation using the bibliometric analysis toolkit: CATAR," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(2), pages 503-528, May.
    15. Leydesdorff, Loet & Bornmann, Lutz & Zhou, Ping, 2016. "Construction of a pragmatic base line for journal classifications and maps based on aggregated journal-journal citation relations," Journal of Informetrics, Elsevier, vol. 10(4), pages 902-918.
    16. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    17. Richard Klavans & Kevin W. Boyack, 2010. "Toward an objective, reliable and accurate method for measuring research leadership," Scientometrics, Springer;Akadémiai Kiadó, vol. 82(3), pages 539-553, March.
    18. Hakyeon Lee, 2015. "Uncovering the multidisciplinary nature of technology management: journal citation network analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 51-75, January.
    19. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    20. Erjia Yan, 2014. "Topic-based Pagerank: toward a topic-level scientific evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(2), pages 407-437, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:105:y:2015:i:3:d:10.1007_s11192-015-1641-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.