IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v128y2023i12d10.1007_s11192-023-04858-4.html
   My bibliography  Save this article

Multi-task learning model for citation intent classification in scientific publications

Author

Listed:
  • Ruihua Qi

    (Dalian University of Foreign Languages
    Dalian University of Foreign Languages)

  • Jia Wei

    (Dalian University of Foreign Languages)

  • Zhen Shao

    (Dalian University of Foreign Languages)

  • Zhengguang Li

    (Dalian University of Foreign Languages)

  • Heng Chen

    (Dalian University of Foreign Languages)

  • Yunhao Sun

    (Dalian University of Foreign Languages)

  • Shaohua Li

    (Dalian University of Foreign Languages)

Abstract

Citations play a significant role in the evaluation of scientific literature and researchers. Citation intent analysis is essential for academic literature understanding. Meanwhile, it is useful for enriching semantic information representation for the citation intent classification task because of the rapid growth of publicly accessible full-text literature. However, some useful information that is readily available in citation context and facilitates citation intent analysis has not been fully explored. Furthermore, some deep learning models may not be able to learn relevant features effectively due to insufficient training samples of citation intent analysis tasks. Multi-task learning aims to exploit useful information between multiple tasks to help improve learning performance and exhibits promising results on many natural language processing tasks. In this paper, we propose a joint semantic representation model, which consists of pretrained language models and heterogeneous features of citation intent texts. Considering the correlation between citation intents, citation section and citation worthiness classification tasks, we build a multi-task citation classification framework with soft parameter sharing constraint and construct independent models for multiple tasks to improve the performance of citation intent classification. The experimental results demonstrate that the heterogeneous features and the multi-task framework with soft parameter sharing constraint proposed in this paper enhance the overall citation intent classification performance.

Suggested Citation

  • Ruihua Qi & Jia Wei & Zhen Shao & Zhengguang Li & Heng Chen & Yunhao Sun & Shaohua Li, 2023. "Multi-task learning model for citation intent classification in scientific publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(12), pages 6335-6355, December.
  • Handle: RePEc:spr:scient:v:128:y:2023:i:12:d:10.1007_s11192-023-04858-4
    DOI: 10.1007/s11192-023-04858-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04858-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04858-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xiaodan Zhu & Peter Turney & Daniel Lemire & André Vellino, 2015. "Measuring academic influence: Not all citations are equal," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 66(2), pages 408-427, February.
    2. Yang Zhang & Rongying Zhao & Yufei Wang & Haihua Chen & Adnan Mahmood & Munazza Zaib & Wei Emma Zhang & Quan Z. Sheng, 2022. "Correction to: Towards employing native information in citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6579-6579, November.
    3. Faiza Qayyum & Muhammad Tanvir Afzal, 2019. "Identification of important citations by exploiting research articles’ metadata and cue-terms from content," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 21-43, January.
    4. Saeed-Ul Hassan & Mubashir Imran & Sehrish Iqbal & Naif Radi Aljohani & Raheel Nawaz, 2018. "Deep context of citations using machine-learning models in scholarly full-text articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1645-1662, December.
    5. Yang Zhang & Rongying Zhao & Yufei Wang & Haihua Chen & Adnan Mahmood & Munazza Zaib & Wei Emma Zhang & Quan Z. Sheng, 2022. "Towards employing native information in citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6557-6577, November.
    6. Xiaorui Jiang & Jingqiang Chen, 2023. "Contextualised segment-wise citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5117-5158, September.
    7. Dongqing Lyu & Xuanmin Ruan & Juan Xie & Ying Cheng, 2021. "The classification of citing motivations: a meta-synthesis," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3243-3264, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Krittin Chatrinan & Thanapon Noraset & Suppawong Tuarob, 2025. "GAN-CITE: leveraging semi-supervised generative adversarial networks for citation function classification with limited data," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(2), pages 679-703, February.
    2. Xiaorui Jiang, 2025. "Ensembling approaches to citation function classification and important citation screening," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(3), pages 1371-1419, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiaorui Jiang & Jingqiang Chen, 2023. "Contextualised segment-wise citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5117-5158, September.
    2. Xiaorui Jiang, 2025. "Ensembling approaches to citation function classification and important citation screening," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(3), pages 1371-1419, March.
    3. Faiza Qayyum & Harun Jamil & Naeem Iqbal & DoHyeun Kim & Muhammad Tanvir Afzal, 2022. "Toward potential hybrid features evaluation using MLP-ANN binary classification model to tackle meaningful citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6471-6499, November.
    4. Naif Radi Aljohani & Ayman Fayoumi & Saeed-Ul Hassan, 2021. "An in-text citation classification predictive model for a scholarly search system," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5509-5529, July.
    5. Xin An & Xin Sun & Shuo Xu, 2022. "Important citations identification with semi-supervised classification model," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6533-6555, November.
    6. Krittin Chatrinan & Thanapon Noraset & Suppawong Tuarob, 2025. "GAN-CITE: leveraging semi-supervised generative adversarial networks for citation function classification with limited data," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(2), pages 679-703, February.
    7. Indra Budi & Yaniasih Yaniasih, 2023. "Understanding the meanings of citations using sentiment, role, and citation function classifications," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 735-759, January.
    8. Yu, Dejian & Yan, Zhaoping, 2023. "Main path analysis considering citation structure and content: Case studies in different domains," Journal of Informetrics, Elsevier, vol. 17(1).
    9. Wang, Yuzhuo & Xiang, Yi & Zhang, Chengzhi, 2024. "Exploring motivations for algorithm mention in the domain of natural language processing: A deep learning approach," Journal of Informetrics, Elsevier, vol. 18(4).
    10. Mingyang Wang & Jiaqi Zhang & Shijia Jiao & Xiangrong Zhang & Na Zhu & Guangsheng Chen, 2020. "Important citation identification by exploiting the syntactic and contextual information of citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2109-2129, December.
    11. Li, Xin & Tang, Xuli & Lu, Wei, 2024. "Investigating clinical links in edge-labeled citation networks of biomedical research: A translational science perspective," Journal of Informetrics, Elsevier, vol. 18(3).
    12. Yi Zhang & Chengzhi Zhang & Philipp Mayr & Arho Suominen, 2022. "An editorial of “AI + informetrics”: multi-disciplinary interactions in the era of big data," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6503-6507, November.
    13. Setio Basuki & Masatoshi Tsuchiya, 2022. "SDCF: semi-automatically structured dataset of citation functions," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4569-4608, August.
    14. Iqra Safder & Momin Ali & Naif Radi Aljohani & Raheel Nawaz & Saeed‐Ul Hassan, 2023. "Neural machine translation for in‐text citation classification," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(10), pages 1229-1240, October.
    15. Sehrish Iqbal & Saeed-Ul Hassan & Naif Radi Aljohani & Salem Alelyani & Raheel Nawaz & Lutz Bornmann, 2021. "A decade of in-text citation analysis based on natural language processing and machine learning techniques: an overview of empirical studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6551-6599, August.
    16. Percia David, Dimitri & Maréchal, Loïc & Lacube, William & Gillard, Sébastien & Tsesmelis, Michael & Maillart, Thomas & Mermoud, Alain, 2023. "Measuring security development in information technologies: A scientometric framework using arXiv e-prints," Technological Forecasting and Social Change, Elsevier, vol. 188(C).
    17. Marc Bertin & Iana Atanassova, 2024. "Linguistic perspectives in deciphering citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(10), pages 6301-6313, October.
    18. Yi Bu & Binglu Wang & Win-bin Huang & Shangkun Che & Yong Huang, 2018. "Using the appearance of citations in full text on author co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(1), pages 275-289, July.
    19. Bowen Song & Chunjuan Luan & Danni Liang, 2023. "Identification of emerging technology topics (ETTs) using BERT-based model and sematic analysis: a perspective of multiple-field characteristics of patented inventions (MFCOPIs)," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(11), pages 5883-5904, November.
    20. Wan Siti Nur Aiza & Liyana Shuib & Norisma Idris & Nur Baiti Afini Normadhi, 2024. "Features, techniques and evaluation in predicting articles’ citations: a review from years 2010–2023," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(1), pages 1-29, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:128:y:2023:i:12:d:10.1007_s11192-023-04858-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.