IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v90y2012i2d10.1007_s11192-011-0491-x.html
   My bibliography  Save this article

Experimental comparison of first and second-order similarities in a scientometric context

Author

Listed:
  • Cristian Colliander

    (Umeå University)

  • Per Ahlgren

    (Umeå University
    Stockholm University)

Abstract

The measurement of similarity between objects plays a role in several scientific areas. In this article, we deal with document–document similarity in a scientometric context. We compare experimentally, using a large dataset, first-order with second-order similarities with respect to the overall quality of partitions of the dataset, where the partitions are obtained on the basis of optimizing weighted modularity. The quality of a partition is defined in terms of textual coherence. The results show that the second-order approach consistently outperforms the first-order approach. Each difference between the two approaches in overall partition quality values is significant at the 0.01 level.

Suggested Citation

  • Cristian Colliander & Per Ahlgren, 2012. "Experimental comparison of first and second-order similarities in a scientometric context," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 675-685, February.
  • Handle: RePEc:spr:scient:v:90:y:2012:i:2:d:10.1007_s11192-011-0491-x
    DOI: 10.1007/s11192-011-0491-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-011-0491-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-011-0491-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Leo Egghe, 2010. "On the relation between the association strength and other similarity measures," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(7), pages 1502-1504, July.
    2. Leo Egghe, 2010. "Good properties of similarity measures and their complementarity," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(10), pages 2151-2160, October.
    3. van Eck, N.J.P. & Waltman, L., 2009. "How to Normalize Co-Occurrence Data? An Analysis of Some Well-Known Similarity Measures," ERIM Report Series Research in Management ERS-2009-001-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    4. Per Ahlgren & Bo Jarneving & Ronald Rousseau, 2003. "Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 54(6), pages 550-560, April.
    5. Kevin W. Boyack & Richard Klavans, 2010. "Co‐citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    6. Patrick Glenisson & Wolfgang Glänzel & Olle Persson, 2005. "Combining full-text analysis and bibliometric indicators. A pilot study," Scientometrics, Springer;Akadémiai Kiadó, vol. 63(1), pages 163-180, March.
    7. Leo Egghe & Loet Leydesdorff, 2009. "The relation between Pearson's correlation coefficient r and Salton's cosine measure," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(5), pages 1027-1036, May.
    8. Timothy Cribbin, 2011. "Discovering latent topical structure by second‐order similarity analysis," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 62(6), pages 1188-1207, June.
    9. Kevin W. Boyack & Richard Klavans & Katy Börner, 2005. "Mapping the backbone of science," Scientometrics, Springer;Akadémiai Kiadó, vol. 64(3), pages 351-374, August.
    10. Leo Egghe, 2009. "New relations between similarity measures for vectors based on vector norms," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(2), pages 232-239, February.
    11. Kevin W. Boyack & Richard Klavans, 2010. "Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(12), pages 2389-2404, December.
    12. Per Ahlgren & Bo Jarneving, 2008. "Bibliographic coupling, common abstract stems and clustering: A comparison of two document-document similarity approaches in the context of science mapping," Scientometrics, Springer;Akadémiai Kiadó, vol. 76(2), pages 273-290, August.
    13. Peters, H. P. F. & van Raan, A. F. J., 1993. "Co-word-based science maps of chemical engineering. Part I: Representations by direct multidimensional scaling," Research Policy, Elsevier, vol. 22(1), pages 23-45, February.
    14. Nees Jan van Eck & Ludo Waltman, 2009. "How to normalize cooccurrence data? An analysis of some well‐known similarity measures," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(8), pages 1635-1651, August.
    15. Timothy Cribbin, 2011. "Discovering latent topical structure by second-order similarity analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(6), pages 1188-1207, June.
    16. Ahlgren, Per & Colliander, Cristian, 2009. "Document–document similarity approaches and science mapping: Experimental comparison of five approaches," Journal of Informetrics, Elsevier, vol. 3(1), pages 49-63.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fabian Meyer-Brötz & Edgar Schiebel & Leo Brecht, 2017. "Experimental evaluation of parameter settings in calculation of hybrid similarities: effects of first- and second-order similarity, edge cutting, and weighting factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1307-1325, June.
    2. Ludo Waltman & Nees Jan Eck, 2012. "A new methodology for constructing a publication-level classification system of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2378-2392, December.
    3. Cristian Colliander & Per Ahlgren, 2019. "Comparison of publication-level approaches to ex-post citation normalization," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 283-300, July.
    4. Yun, Jinhyuk, 2022. "Generalization of bibliographic coupling and co-citation using the node split network," Journal of Informetrics, Elsevier, vol. 16(2).
    5. Dejian Yu & Wanru Wang & Shuai Zhang & Wenyu Zhang & Rongyu Liu, 2017. "Hybrid self-optimized clustering model based on citation links and textual features to detect research topics," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-21, October.
    6. Sergey Shashnov & Maxim Kotsemir, 2018. "Research landscape of the BRICS countries: current trends in research output, thematic structures of publications, and the relative influence of partners," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 1115-1155, November.
    7. Yang, Hyeonchae & Jung, Woo-Sung, 2016. "Structural efficiency to manipulate public research institution networks," Technological Forecasting and Social Change, Elsevier, vol. 110(C), pages 21-32.
    8. Guadalupe Palacios-Núñez & Gabriel Vélez-Cuartas & Juan D. Botero, 2018. "Developmental tendencies in the academic field of intellectual property through the identification of invisible colleges," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1561-1574, June.
    9. Yun, Jinhyuk & Ahn, Sejung & Lee, June Young, 2020. "Return to basics: Clustering of scientific literature using structural information," Journal of Informetrics, Elsevier, vol. 14(4).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    2. Yang, Siluo & Han, Ruizhen & Wolfram, Dietmar & Zhao, Yuehua, 2016. "Visualizing the intellectual structure of information science (2006–2015): Introducing author keyword coupling analysis," Journal of Informetrics, Elsevier, vol. 10(1), pages 132-150.
    3. Wolfram, Dietmar & Zhao, Yuehua, 2014. "A comparison of journal similarity across six disciplines using citing discipline analysis," Journal of Informetrics, Elsevier, vol. 8(4), pages 840-853.
    4. Kraker, Peter & Schlögl, Christian & Jack, Kris & Lindstaedt, Stefanie, 2015. "Visualization of co-readership patterns from an online reference management system," Journal of Informetrics, Elsevier, vol. 9(1), pages 169-182.
    5. Nees Jan Eck & Ludo Waltman, 2010. "Software survey: VOSviewer, a computer program for bibliometric mapping," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(2), pages 523-538, August.
    6. Nassiri, Isar & Masoudi-Nejad, Ali & Jalili, Mahdi & Moeini, Ali, 2013. "Normalized Similarity Index: An adjusted index to prioritize article citations," Journal of Informetrics, Elsevier, vol. 7(1), pages 91-98.
    7. Rodolfo Modrigais Strauss Nunes & Susana Carla Farias Pereira, 2022. "Intellectual structure and trends in the humanitarian operations field," Annals of Operations Research, Springer, vol. 319(1), pages 1099-1157, December.
    8. Serhat Burmaoglu & Ozcan Saritas, 2019. "An evolutionary analysis of the innovation policy domain: Is there a paradigm shift?," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 823-847, March.
    9. Ludo Waltman & Nees Jan Eck, 2012. "A new methodology for constructing a publication-level classification system of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(12), pages 2378-2392, December.
    10. Jun-Ping Qiu & Ke Dong & Hou-Qiang Yu, 2014. "Comparative study on structure and correlation among author co-occurrence networks in bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1345-1360, November.
    11. Adrián Kovács & Bart Looy & Bruno Cassiman, 2015. "Exploring the scope of open innovation: a bibliometric review of a decade of research," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(3), pages 951-983, September.
    12. Inchae Park & Keeeun Lee & Byungun Yoon, 2015. "Exploring Promising Research Frontiers Based on Knowledge Maps in the Solar Cell Technology Field," Sustainability, MDPI, vol. 7(10), pages 1-30, October.
    13. María de la Cruz del Río-Rama & Claudia Patricia Maldonado-Erazo & José Álvarez-García & Amador Durán-Sánchez, 2020. "Cultural and Natural Resources in Tourism Island: Bibliometric Mapping," Sustainability, MDPI, vol. 12(2), pages 1-26, January.
    14. Claudia Patricia Maldonado-Erazo & José Álvarez-García & María de la Cruz del Río-Rama & Amador Durán-Sánchez, 2021. "Scientific Mapping on the Impact of Climate Change on Cultural and Natural Heritage: A Systematic Scientometric Analysis," Land, MDPI, vol. 10(1), pages 1-19, January.
    15. Francisco García-Lillo & Enrique Claver-Cortés & Bartolomé Marco-Lajara & Mercedes Úbeda-García, 2017. "Mapping the Intellectual Structure of Research on ‘Born Global’ Firms and INVs: A Citation/Co-citation Analysis," Management International Review, Springer, vol. 57(4), pages 631-652, August.
    16. Wang, Xiaoli & Daim, Tugrul & Huang, Lucheng & Li, Zhiqiang & Shaikh, Ruqia & Kassi, Diby Francois, 2022. "Monitoring the development trend and competition status of high technologies using patent analysis and bibliographic coupling: The case of electronic design automation technology," Technology in Society, Elsevier, vol. 71(C).
    17. Shubham Singhania & Jagvinder Singh & Deepti Aggrawal, 2023. "Gender diversity on board and corporate sustainability: a quantitative review based on bibliometric mapping," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 14(1), pages 267-286, February.
    18. Sara Sassetti & Giacomo Marzi & Vincenzo Cavaliere & Cristiano Ciappei, 2018. "Entrepreneurial cognition and socially situated approach: a systematic and bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1675-1718, September.
    19. Sjögårde, Peter & Ahlgren, Per, 2018. "Granularity of algorithmically constructed publication-level classifications of research publications: Identification of topics," Journal of Informetrics, Elsevier, vol. 12(1), pages 133-152.
    20. Manuel Castriotta & Maria Chiara Guardo, 2016. "Disentangling the automotive technology structure: a patent co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 819-837, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:90:y:2012:i:2:d:10.1007_s11192-011-0491-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.