IDEAS home Printed from https://ideas.repec.org/a/bla/jamest/v38y1987i6p420-442.html
   My bibliography  Save this article

Pictures of relevance: A geometric analysis of similarity measures

Author

Listed:
  • William P. Jones
  • George W. Furnas

Abstract

We want computer systems that can help us assess the similarity or relevance of existing objects (e.g., documents, functions, commands, etc.) to a statement of our current needs (e.g., the query). Towards this end, a variety of similarity measures have been proposed. However, the relationship between a measure's formula and its performance is not always obvious. A geometric analysis is advanced and its utility demonstrated through its application to six conventional information retrieval similarity measures and a seventh spreading activation measure. All seven similarity measures work with a representational scheme wherein a query and the database objects are represented as vectors of term weights. A geometric analysis characterizes each similarity measure by the nature of its iso‐similarity contours in an n‐space containing query and object vectors. This analysis reveals important differences among the similarity measures and suggests conditions in which these differences will affect retrieval performance. The cosine coefficient, for example, is shown to be insensitive to between‐document differences in the magnitude of term weights while the inner product measure is sometimes overly affected by such differences. The context‐sensitive spreading activation measure may overcome both of these limitations and deserves further study. The geometric analysis is intended to complement, and perhaps to guide, the empirical analysis of similarity measures. © 1987 John Wiley & Sons, Inc.

Suggested Citation

  • William P. Jones & George W. Furnas, 1987. "Pictures of relevance: A geometric analysis of similarity measures," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 38(6), pages 420-442, November.
  • Handle: RePEc:bla:jamest:v:38:y:1987:i:6:p:420-442
    DOI: 10.1002/(SICI)1097-4571(198711)38:63.0.CO;2-S
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/(SICI)1097-4571(198711)38:63.0.CO;2-S
    Download Restriction: no

    File URL: https://libkey.io/10.1002/(SICI)1097-4571(198711)38:63.0.CO;2-S?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Loet Leydesdorff, 2007. "Mapping interdisciplinarity at the interfaces between the Science Citation Index and the Social Science Citation Index," Scientometrics, Springer;Akadémiai Kiadó, vol. 71(3), pages 391-405, June.
    2. Serhat Burmaoglu & Ozcan Saritas, 2019. "An evolutionary analysis of the innovation policy domain: Is there a paradigm shift?," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 823-847, March.
    3. Takayuki Hayashi, 2003. "Bibliometric analysis on additionality of Japanese R&D programmes," Scientometrics, Springer;Akadémiai Kiadó, vol. 56(3), pages 301-316, March.
    4. Joon Hyung Cho & Jungpyo Lee & So Young Sohn, 2021. "Predicting future technological convergence patterns based on machine learning using link prediction," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5413-5429, July.
    5. H. Simon & N. Sick, 2016. "Technological distance measures: new perspectives on nearby and far away," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1299-1320, June.
    6. Agnieszka Stojanowska & Justyna Rybak & Marta Bożym & Tomasz Olszowski & Jan Stefan Bihałowicz, 2020. "Spider Webs and Lichens as Bioindicators of Heavy Metals: A Comparison Study in the Vicinity of a Copper Smelter (Poland)," Sustainability, MDPI, vol. 12(19), pages 1-13, September.
    7. Czinkota, Thomas, 2012. "Zeitpunktsignale zum aktiven Portfoliomanagement [Time-Point-Signals for Active Portfolio Management]," MPRA Paper 39565, University Library of Munich, Germany.
    8. Wilfred Dolfsma & Loet Leydesdorff, 2010. "The citation field of evolutionary economics," Journal of Evolutionary Economics, Springer, vol. 20(5), pages 645-664, October.
    9. Loet Leydesdorff & Ping Zhou, 2007. "Nanotechnology as a field of science: Its delineation in terms of journals and patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 70(3), pages 693-713, March.
    10. Aloys Prinz, 2019. "The microeconomics of mobile payments," Netnomics, Springer, vol. 20(2), pages 129-151, December.
    11. Robert Braam & Peter Besselaar, 2014. "Indicators for the dynamics of research organizations: a biomedical case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 99(3), pages 949-971, June.
    12. Ping Zhou & Loet Leydesdorff, 2007. "The citation impacts and citation environments of Chinese journals in mathematics," Scientometrics, Springer;Akadémiai Kiadó, vol. 72(2), pages 185-200, August.
    13. Leydesdorff, Loet & Wagner, Caroline S. & Bornmann, Lutz, 2019. "Interdisciplinarity as diversity in citation patterns among journals: Rao-Stirling diversity, relative variety, and the Gini coefficient," Journal of Informetrics, Elsevier, vol. 13(1), pages 255-269.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jamest:v:38:y:1987:i:6:p:420-442. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.