IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v120y2019i1d10.1007_s11192-019-03125-9.html
   My bibliography  Save this article

Discovering related scientific literature beyond semantic similarity: a new co-citation approach

Author

Listed:
  • Oscar Rodriguez-Prieto

    (Universidad de Oviedo)

  • Lourdes Araujo

    (Universidad Nacional de Educación a Distancia (UNED)
    Escuela Nacional de Sanidad)

  • Juan Martinez-Romo

    (Universidad Nacional de Educación a Distancia (UNED)
    Escuela Nacional de Sanidad)

Abstract

We propose a new approach to recommend scientific literature, a domain in which the efficient organization and search of information is crucial. The proposed system relies on the hypothesis that two scientific articles are semantically related if they are co-cited more frequently than they would be by pure chance. This relationship can be quantified by the probability of co-citation, obtained from a null model that statistically defines what we consider pure chance. Looking for article pairs that minimize this probability, the system is able to recommend a ranking of articles in response to a given article. This system is included in the co-occurrence paradigm of the field. More specifically, it is based on co-cites so it can produce recommendations more focused on relatedness than on similarity. Evaluation has been performed on the ACL Anthology collection and on the DBLP dataset, and a new corpus has been compiled to evaluate the capacity of the proposal to find relationships beyond similarity. Results show that the system is able to provide, not only articles similar to the submitted one, but also articles presenting other kind of relations, thus providing diversity, i.e. connections to new topics.

Suggested Citation

  • Oscar Rodriguez-Prieto & Lourdes Araujo & Juan Martinez-Romo, 2019. "Discovering related scientific literature beyond semantic similarity: a new co-citation approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 105-127, July.
  • Handle: RePEc:spr:scient:v:120:y:2019:i:1:d:10.1007_s11192-019-03125-9
    DOI: 10.1007/s11192-019-03125-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-019-03125-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-019-03125-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kim, Ha Jin & Jeong, Yoo Kyung & Song, Min, 2016. "Content- and proximity-based author co-citation analysis using citation sentences," Journal of Informetrics, Elsevier, vol. 10(4), pages 954-966.
    2. Ying Ding & Erjia Yan & Arthur Frazho & James Caverlee, 2009. "PageRank for ranking authors in co‐citation networks," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(11), pages 2229-2243, November.
    3. Henry Small, 1973. "Co‐citation in the scientific literature: A new measure of the relationship between two documents," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 24(4), pages 265-269, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kamal Sanguri & Atanu Bhuyan & Sabyasachi Patra, 2020. "A semantic similarity adjusted document co-citation analysis: a case of tourism supply chain," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 233-269, October.
    2. Tianshuang Qiu & Chuanming Yu & Yunci Zhong & Lu An & Gang Li, 2021. "A scientific citation recommendation model integrating network and text representations," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(11), pages 9199-9221, November.
    3. Zafar Ali & Irfan Ullah & Amin Khan & Asim Ullah Jan & Khan Muhammad, 2021. "An overview and evaluation of citation recommendation models," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4083-4119, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Neelam Kaushal & Rahul Pratap Singh Kaurav & Brijesh Sivathanu & Neeraj Kaushik, 2023. "Artificial intelligence and HRM: identifying future research Agenda using systematic literature review and bibliometric analysis," Management Review Quarterly, Springer, vol. 73(2), pages 455-493, June.
    2. Paúl Carrión-Mero & Néstor Montalván-Burbano & Fernando Morante-Carballo & Adolfo Quesada-Román & Boris Apolo-Masache, 2021. "Worldwide Research Trends in Landslide Science," IJERPH, MDPI, vol. 18(18), pages 1-24, September.
    3. Satish Kumar & Riya Sureka & Sisira Colombage, 2020. "Capital structure of SMEs: a systematic literature review and bibliometric analysis," Management Review Quarterly, Springer, vol. 70(4), pages 535-565, November.
    4. Prathap, Gangan & Ujum, Ephrance Abu & Kumar, Sameer & Ratnavelu, Kuru, 2021. "Scoring the resourcefulness of researchers using bibliographic coupling patterns," Journal of Informetrics, Elsevier, vol. 15(3).
    5. Moshe Blidstein & Maayan Zhitomirsky-Geffet, 2022. "Towards a new generic framework for citation network generation and analysis in the humanities," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(7), pages 4275-4297, July.
    6. Yadav, Pratyush & Pervin, Nargis, 2022. "Towards efficient navigation in digital libraries: Leveraging popularity, semantics and communities to recommend scholarly articles," Journal of Informetrics, Elsevier, vol. 16(4).
    7. Shubham Sharma & Usha Lenka, 2022. "On the shoulders of giants: uncovering key themes of organizational unlearning research in mainstream management journals," Review of Managerial Science, Springer, vol. 16(6), pages 1599-1695, August.
    8. Kent Baker, H. & Pandey, Nitesh & Kumar, Satish & Haldar, Arunima, 2020. "A bibliometric analysis of board diversity: Current status, development, and future research directions," Journal of Business Research, Elsevier, vol. 108(C), pages 232-246.
    9. Edison Jair Duque Oliva & Pedro Duque, 2022. "Tendencias emergentes en la literatura sobre el compromiso del cliente: un análisis bibliométrico," Estudios Gerenciales, Universidad Icesi, vol. 38(162), pages 120-132, March.
    10. Yong Huang & Yi Bu & Ying Ding & Wei Lu, 2018. "Number versus structure: towards citing cascades," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 2177-2193, December.
    11. Kamal Sanguri & Atanu Bhuyan & Sabyasachi Patra, 2020. "A semantic similarity adjusted document co-citation analysis: a case of tourism supply chain," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 233-269, October.
    12. Guan-Can Yang & Gang Li & Chun-Ya Li & Yun-Hua Zhao & Jing Zhang & Tong Liu & Dar-Zen Chen & Mu-Hsuan Huang, 2015. "Using the comprehensive patent citation network (CPC) to evaluate patent value," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1319-1346, December.
    13. Gaviria-Marin, Magaly & Merigó, José M. & Baier-Fuentes, Hugo, 2019. "Knowledge management: A global examination based on bibliometric analysis," Technological Forecasting and Social Change, Elsevier, vol. 140(C), pages 194-220.
    14. Filippo Corsini & Rafael Laurenti & Franziska Meinherz & Francesco Paolo Appio & Luca Mora, 2019. "The Advent of Practice Theories in Research on Sustainable Consumption: Past, Current and Future Directions of the Field," Sustainability, MDPI, vol. 11(2), pages 1-19, January.
    15. Pamela E. Sandstrom, 2001. "Scholarly communication as a socioecological system," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(3), pages 573-605, July.
    16. Serhat Burmaoglu & Ozcan Saritas, 2019. "An evolutionary analysis of the innovation policy domain: Is there a paradigm shift?," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 823-847, March.
    17. Andreas Bjurström & Merritt Polk, 2011. "Climate change and interdisciplinarity: a co-citation analysis of IPCC Third Assessment Report," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(3), pages 525-550, June.
    18. Rey-Long Liu, 2017. "A new bibliographic coupling measure with descriptive capability," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(2), pages 915-935, February.
    19. Giovanni Matteo & Pierfrancesco Nardi & Stefano Grego & Caterina Guidi, 2018. "Bibliometric analysis of Climate Change Vulnerability Assessment research," Environment Systems and Decisions, Springer, vol. 38(4), pages 508-516, December.
    20. Livio Cricelli & Michele Grimaldi & Silvia Vermicelli, 2022. "Crowdsourcing and open innovation: a systematic literature review, an integrated framework and a research agenda," Review of Managerial Science, Springer, vol. 16(5), pages 1269-1310, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:120:y:2019:i:1:d:10.1007_s11192-019-03125-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.