IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v127y2022i5d10.1007_s11192-022-04339-0.html
   My bibliography  Save this article

SentCite: a sentence-level citation recommender based on the salient similarity among multiple segments

Author

Listed:
  • Hei-Chia Wang

    (National Cheng Kung University
    National Cheng Kung University)

  • Jen-Wei Cheng

    (National Cheng Kung University)

  • Che-Tsung Yang

    (National Cheng Kung University
    National Cheng Kung University)

Abstract

Efficiently making adequate citations is becoming more challenging due to the rapidly increasing volume of publications. In practice, citing the appropriate references is a time-consuming and skill-required task. Accordingly, many studies have tried to help by providing citation-oriented support. In this field, citation recommendation is a significant research area because it addresses the problems of required profound skills and information overload. In this paper, we propose a sentence-level citation recommender, SentCite, that can identify the sentences that need links to references and can recommend citations. SentCite employs the convolutional recurrent neural network to extract the citing sentences and recommends citations based on the salient similarity between the sentences among the abstract, full text, and in-link context of the target papers. Unlike some other research in the big data domain, the recommended quality papers in this application are very limited. We proposed undersampling inlink context awareness to avoid overfitting problems. SentCite can recommend the most appropriate papers for the given sentences and outperforms other context-based methods in terms of improvement in mean reciprocal rank (MRR) 31.8%, mean average precision (MAP) 30.1%, and normalized discounted cumulative gain (NDCG) 33.8%.

Suggested Citation

  • Hei-Chia Wang & Jen-Wei Cheng & Che-Tsung Yang, 2022. "SentCite: a sentence-level citation recommender based on the salient similarity among multiple segments," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2521-2546, May.
  • Handle: RePEc:spr:scient:v:127:y:2022:i:5:d:10.1007_s11192-022-04339-0
    DOI: 10.1007/s11192-022-04339-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-022-04339-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-022-04339-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Raja Habib & Muhammad Tanvir Afzal, 2019. "Sections-based bibliographic coupling for research paper recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(2), pages 643-656, May.
    2. Metin Doslu & Haluk O. Bingol, 2016. "Context sensitive article ranking with citation context analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(2), pages 653-671, August.
    3. Guo Zhang & Ying Ding & Staša Milojević, 2013. "Citation content analysis (CCA): A framework for syntactic and semantic analysis of citation content," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 64(7), pages 1490-1503, July.
    4. Xu, Shuqi & Mariani, Manuel Sebastian & Lü, Linyuan & Medo, Matúš, 2020. "Unbiased evaluation of ranking metrics reveals consistent performance in science and technology citation data," Journal of Informetrics, Elsevier, vol. 14(1).
    5. Marc Bertin & Iana Atanassova & Cassidy R. Sugimoto & Vincent Lariviere, 2016. "The linguistic patterns and rhetorical structure of citation context: an approach using n-grams," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(3), pages 1417-1434, December.
    6. Shutian Ma & Heng Zhang & Chengzhi Zhang & Xiaozhong Liu, 2021. "Chronological citation recommendation with time preference," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 2991-3010, April.
    7. Natsuo Onodera & Fuyuki Yoshikane, 2015. "Factors affecting citation rates of research articles," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(4), pages 739-764, April.
    8. Guo Zhang & Ying Ding & Staša Milojević, 2013. "Citation content analysis (CCA): A framework for syntactic and semantic analysis of citation content," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(7), pages 1490-1503, July.
    9. Shutian Ma & Chengzhi Zhang & Xiaozhong Liu, 2020. "A review of citation recommendation: from textual content to enriched context," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1445-1472, March.
    10. Chanwoo Jeong & Sion Jang & Eunjeong Park & Sungchul Choi, 2020. "A context-aware citation recommendation model with BERT and graph convolutional networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 1907-1922, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shicheng Tan & Tao Zhang & Shu Zhao & Yanping Zhang, 2023. "Self-supervised scientific document recommendation based on contrastive learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5027-5049, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sehrish Iqbal & Saeed-Ul Hassan & Naif Radi Aljohani & Salem Alelyani & Raheel Nawaz & Lutz Bornmann, 2021. "A decade of in-text citation analysis based on natural language processing and machine learning techniques: an overview of empirical studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6551-6599, August.
    2. Iman Tahamtan & Lutz Bornmann, 2019. "What do citation counts measure? An updated review of studies on citations in scientific documents published between 2006 and 2018," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(3), pages 1635-1684, December.
    3. Lutz Bornmann & Robin Haunschild & Sven E. Hug, 2018. "Visualizing the context of citations referencing papers published by Eugene Garfield: a new type of keyword co-occurrence analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(2), pages 427-437, February.
    4. Tahamtan, Iman & Bornmann, Lutz, 2018. "Core elements in the process of citing publications: Conceptual overview of the literature," Journal of Informetrics, Elsevier, vol. 12(1), pages 203-216.
    5. Yang, Jinqing & Liu, Zhifeng, 2022. "The effect of citation behaviour on knowledge diffusion and intellectual structure," Journal of Informetrics, Elsevier, vol. 16(1).
    6. Ruhao Zhang & Junpeng Yuan, 2022. "Enhanced author bibliographic coupling analysis using semantic and syntactic citation information," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 7681-7706, December.
    7. Chaker Jebari & Enrique Herrera-Viedma & Manuel Jesus Cobo, 2021. "The use of citation context to detect the evolution of research topics: a large-scale analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 2971-2989, April.
    8. Liu, Xiaojuan & Wang, Chenlin & Chen, Dar-Zen & Huang, Mu-Hsuan, 2022. "Exploring perception of retraction based on mentioned status in post-retraction citations," Journal of Informetrics, Elsevier, vol. 16(3).
    9. Adilson Vital & Diego R. Amancio, 2022. "A comparative analysis of local similarity metrics and machine learning approaches: application to link prediction in author citation networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(10), pages 6011-6028, October.
    10. Hamid R. Jamali & Majid Nabavi & Saeid Asadi, 2018. "How video articles are cited, the case of JoVE: Journal of Visualized Experiments," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1821-1839, December.
    11. Witting Antje, 2015. "Measuring the Use of Knowledge in Policy Development," Central European Journal of Public Policy, Sciendo, vol. 9(2), pages 54-62, December.
    12. Błoński Krzysztof, 2023. "Analysis of Citations and Co-Citations of the Term ‘Word of Mouth’ Based on Publications in the Field of Social Sciences," Marketing of Scientific and Research Organizations, Sciendo, vol. 28(2), pages 111-133, June.
    13. Kai Nishikawa, 2023. "How and why are citations between disciplines made? A citation context analysis focusing on natural sciences and social sciences and humanities," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(5), pages 2975-2997, May.
    14. Wang, Shiyun & Mao, Jin & Lu, Kun & Cao, Yujie & Li, Gang, 2021. "Understanding interdisciplinary knowledge integration through citance analysis: A case study on eHealth," Journal of Informetrics, Elsevier, vol. 15(4).
    15. Ali Daud & Min Song & Malik Khizar Hayat & Tehmina Amjad & Rabeeh Ayaz Abbasi & Hassan Dawood & Anwar Ghani, 2020. "Finding rising stars in bibliometric networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 633-661, July.
    16. Frederique Bordignon, 2022. "Critical citations in knowledge construction and citation analysis: from paradox to definition," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 959-972, February.
    17. Bowen Ma & Chengzhi Zhang & Yuzhuo Wang & Sanhong Deng, 2022. "Enhancing identification of structure function of academic articles using contextual information," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(2), pages 885-925, February.
    18. Chen, Lixin, 2017. "Do patent citations indicate knowledge linkage? The evidence from text similarities between patents and their citations," Journal of Informetrics, Elsevier, vol. 11(1), pages 63-79.
    19. Marta Kuc-Czarnecka & Magdalena Olczyk, 2020. "How ethics combine with big data: a bibliometric analysis," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-9, December.
    20. Yadav, Pratyush & Pervin, Nargis, 2022. "Towards efficient navigation in digital libraries: Leveraging popularity, semantics and communities to recommend scholarly articles," Journal of Informetrics, Elsevier, vol. 16(4).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:127:y:2022:i:5:d:10.1007_s11192-022-04339-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.