IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v126y2021i10d10.1007_s11192-021-04117-4.html
   My bibliography  Save this article

Leveraging full-text article exploration for citation analysis

Author

Listed:
  • Moreno La Quatra

    (Politecnico di Torino Corso Duca degli Abruzzi, 24)

  • Luca Cagliero

    (Politecnico di Torino Corso Duca degli Abruzzi, 24)

  • Elena Baralis

    (Politecnico di Torino Corso Duca degli Abruzzi, 24)

Abstract

Scientific articles often include in-text citations quoting from external sources. When the cited source is an article, the citation context can be analyzed by exploring the article full-text. To quickly access the key information, researchers are often interested in identifying the sections of the cited article that are most pertinent to the text surrounding the citation in the citing article. This paper first performs a data-driven analysis of the correlation between the textual content of the sections of the cited article and the text snippet where the citation is placed. The results of the correlation analysis show that the title and abstract of the cited article are likely to include content highly similar to the citing snippet. However, the subsequent sections of the paper often include cited text snippets as well. Hence, there is a need to understand the extent to which an exploration of the full-text of the cited article would be beneficial to gain insights into the citing snippet, considering also the fact that the full-text access could be restricted. To this end, we then propose a classification approach to automatically predicting whether the cited snippets in the full-text of the paper contain a significant amount of new content beyond abstract and title. The proposed approach could support researchers in leveraging full-text article exploration for citation analysis. The experiments conducted on real scientific articles show promising results: the classifier has a 90% chance to correctly distinguish between the full-text exploration and only title and abstract cases.

Suggested Citation

  • Moreno La Quatra & Luca Cagliero & Elena Baralis, 2021. "Leveraging full-text article exploration for citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8275-8293, October.
  • Handle: RePEc:spr:scient:v:126:y:2021:i:10:d:10.1007_s11192-021-04117-4
    DOI: 10.1007/s11192-021-04117-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-021-04117-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-021-04117-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Shutian Ma & Jin Xu & Chengzhi Zhang, 2018. "Automatic identification of cited text spans: a multi-classifier approach over imbalanced dataset," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 1303-1330, August.
    2. Moreno La Quatra & Luca Cagliero & Elena Baralis, 2020. "Exploiting pivot words to classify and summarize discourse facets of scientific papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 3139-3157, December.
    3. Tarek Saier & Michael Färber, 2020. "unarXive: a large scholarly data set with publications’ full-text, annotated in-text citations, and links to metadata," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 3085-3108, December.
    4. Chanwoo Jeong & Sion Jang & Eunjeong Park & Sungchul Choi, 2020. "A context-aware citation recommendation model with BERT and graph convolutional networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(3), pages 1907-1922, September.
    5. Chrysoula Zerva & Minh-Quoc Nghiem & Nhung T. H. Nguyen & Sophia Ananiadou, 2020. "Cited text span identification for scientific summarisation using pre-trained encoders," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 3109-3137, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chanathip Pornprasit & Xin Liu & Pattararat Kiattipadungkul & Natthawut Kertkeidkachorn & Kyoung-Sook Kim & Thanapon Noraset & Saeed-Ul Hassan & Suppawong Tuarob, 2022. "Enhancing citation recommendation using citation network embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 233-264, January.
    2. Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
    3. Naif Radi Aljohani & Ayman Fayoumi & Saeed-Ul Hassan, 2021. "An in-text citation classification predictive model for a scholarly search system," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5509-5529, July.
    4. Antonina Dattolo & Marco Corbatto, 2022. "Assisting researchers in bibliographic tasks: A new usable, real‐time tool for analyzing bibliographies," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(6), pages 757-776, June.
    5. Diego Kozlowski & Jennifer Dusdal & Jun Pang & Andreas Zilian, 2021. "Semantic and relational spaces in science of science: deep learning models for article vectorisation," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5881-5910, July.
    6. Khalid Haruna & Maizatul Akmar Ismail & Atika Qazi & Habeebah Adamu Kakudi & Mohammed Hassan & Sanah Abdullahi Muaz & Haruna Chiroma, 2020. "Research paper recommender system based on public contextual metadata," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 101-114, October.
    7. Choi, Seokkyu & Lee, Hyeonju & Park, Eunjeong & Choi, Sungchul, 2022. "Deep learning for patent landscaping using transformer and graph embedding," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    8. Moreno La Quatra & Luca Cagliero & Elena Baralis, 2020. "Exploiting pivot words to classify and summarize discourse facets of scientific papers," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 3139-3157, December.
    9. Pancheng Wang & Shasha Li & Haifang Zhou & Jintao Tang & Ting Wang, 2019. "Cited text spans identification with an improved balanced ensemble model," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1111-1145, September.
    10. Jialiang Lin & Yao Yu & Jiaxin Song & Xiaodong Shi, 2022. "Detecting and analyzing missing citations to published scientific entities," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2395-2412, May.
    11. Hei-Chia Wang & Jen-Wei Cheng & Che-Tsung Yang, 2022. "SentCite: a sentence-level citation recommender based on the salient similarity among multiple segments," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2521-2546, May.
    12. Yonghe Lu & Meilu Yuan & Jiaxin Liu & Minghong Chen, 2023. "Research on semantic representation and citation recommendation of scientific papers with multiple semantics fusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1367-1393, February.
    13. Wang, Shiyun & Mao, Jin & Lu, Kun & Cao, Yujie & Li, Gang, 2021. "Understanding interdisciplinary knowledge integration through citance analysis: A case study on eHealth," Journal of Informetrics, Elsevier, vol. 15(4).
    14. Tianshuang Qiu & Chuanming Yu & Yunci Zhong & Lu An & Gang Li, 2021. "A scientific citation recommendation model integrating network and text representations," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(11), pages 9199-9221, November.
    15. Iqra Safder & Saeed-Ul Hassan, 2019. "Bibliometric-enhanced information retrieval: a novel deep feature engineering approach for algorithm searching from full-text publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 257-277, April.
    16. Jaewoong Choi & Jiho Lee & Janghyeok Yoon & Sion Jang & Jaeyoung Kim & Sungchul Choi, 2022. "A two-stage deep learning-based system for patent citation recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6615-6636, November.
    17. Sehrish Iqbal & Saeed-Ul Hassan & Naif Radi Aljohani & Salem Alelyani & Raheel Nawaz & Lutz Bornmann, 2021. "A decade of in-text citation analysis based on natural language processing and machine learning techniques: an overview of empirical studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6551-6599, August.
    18. Guillaume Cabanac & Ingo Frommholz & Philipp Mayr, 2018. "Bibliometric-enhanced information retrieval: preface," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 1225-1227, August.
    19. Chaker Jebari & Enrique Herrera-Viedma & Manuel Jesus Cobo, 2023. "Context-aware citation recommendation of scientific papers: comparative study, gaps and trends," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4243-4268, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:126:y:2021:i:10:d:10.1007_s11192-021-04117-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.