IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0228928.html
   My bibliography  Save this article

Examining influential factors for acknowledgements classification using supervised learning

Author

Listed:
  • Min Song
  • Keun Young Kang
  • Tatsawan Timakum
  • Xinyuan Zhang

Abstract

Acknowledgements have been examined as important elements in measuring the contributions to and intellectual debts of a scientific publication. Unlike previous studies that were limited in the scope of analysis and manual examination. The present study aimed to conduct the automatic classification of acknowledgements on a large scale of data. To this end, we first created a training dataset for acknowledgements classification by sampling the acknowledgements sections from the entire PubMed Central database. Second, we adopted various supervised learning algorithms to examine which algorithm performed best in what condition. In addition, we observed the factors affecting classification performance. We investigated the effects of the following three main aspects: classification algorithms, categories, and text representations. The CNN+Doc2Vec algorithm achieved the highest performance of 93.58% accuracy in the original dataset and 87.93% in the converted dataset. The experimental results indicated that the characteristics of categories and sentence patterns influenced the performance of classification. Most of the classifiers performed better on the categories of financial, peer interactive communication, and technical support compared to other classes.

Suggested Citation

  • Min Song & Keun Young Kang & Tatsawan Timakum & Xinyuan Zhang, 2020. "Examining influential factors for acknowledgements classification using supervised learning," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-21, February.
  • Handle: RePEc:plo:pone00:0228928
    DOI: 10.1371/journal.pone.0228928
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0228928
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0228928&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0228928?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Blaise Cronin & Debora Shaw & Kathryn La Barre, 2003. "A cast of thousands: Coauthorship and subauthorship collaboration in the 20th century as manifested in the scholarly journal literature of psychology and philosophy," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 54(9), pages 855-871, July.
    2. Cristian Mejia & Yuya Kajikawa, 2018. "Using acknowledgement data to characterize funding organizations by the types of research sponsored: the case of robotics research," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(3), pages 883-904, March.
    3. Blaise Cronin & Debora Shaw & Kathryn La Barre, 2004. "Visible, less visible, and invisible work: Patterns of collaboration in 20th century chemistry," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 55(2), pages 160-168, January.
    4. Costas, Rodrigo & van Leeuwen, Thed N. & van Raan, Anthony F.J., 2013. "Effects of the durability of scientific literature at the group level: Case study of chemistry research groups in the Netherlands," Research Policy, Elsevier, vol. 42(4), pages 886-894.
    5. Adèle Paul-Hus & Nadine Desrochers & Rodrigo Costas, 2016. "Characterization, description, and considerations for the use of funding acknowledgement data in Web of Science," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(1), pages 167-182, July.
    6. Henry Small, 2011. "Interpreting maps of science using citation context sentiments: a preliminary investigation," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(2), pages 373-388, May.
    7. Small, Henry, 2018. "Characterizing highly cited method and non-method papers using citation contexts: The role of uncertainty," Journal of Informetrics, Elsevier, vol. 12(2), pages 461-480.
    8. Abrishami, Ali & Aliakbary, Sadegh, 2019. "Predicting citation counts based on deep neural network learning techniques," Journal of Informetrics, Elsevier, vol. 13(2), pages 485-499.
    9. Ying Ding & Guo Zhang & Tamy Chambers & Min Song & Xiaolong Wang & Chengxiang Zhai, 2014. "Content-based citation analysis: The next generation of citation analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(9), pages 1820-1833, September.
    10. Yubing Nie & Yifan Zhu & Qika Lin & Sifan Zhang & Pengfei Shi & Zhendong Niu, 2019. "Academic rising star prediction via scholar’s evaluation model and machine learning techniques," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 461-476, August.
    11. Ali Gazni & Cassidy R. Sugimoto & Fereshteh Didegah, 2012. "Mapping world scientific collaboration: Authors, institutions, and countries," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(2), pages 323-335, February.
    12. Paul-Hus, Adèle & Mongeon, Philippe & Sainte-Marie, Maxime & Larivière, Vincent, 2017. "The sum of it all: Revealing collaboration patterns by combining authorship and acknowledgements," Journal of Informetrics, Elsevier, vol. 11(1), pages 80-87.
    13. Ali Gazni & Cassidy R. Sugimoto & Fereshteh Didegah, 2012. "Mapping world scientific collaboration: Authors, institutions, and countries," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(2), pages 323-335, February.
    14. Jeong, Yoo Kyung & Song, Min & Ding, Ying, 2014. "Content-based author co-citation analysis," Journal of Informetrics, Elsevier, vol. 8(1), pages 197-211.
    15. Mehmet Ali Abdulhayoglu & Bart Thijs, 2017. "Use of ResearchGate and Google CSE for author name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1965-1985, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wen Lou & Jiangen He & Lingxin Zhang & Zhijie Zhu & Yongjun Zhu, 2023. "Support behind the scenes: the relationship between acknowledgement, coauthor, and citation in Nobel articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5767-5790, October.
    2. Pengfei Jia & Weixi Xie & Guangyao Zhang & Xianwen Wang, 2023. "Do reviewers get their deserved acknowledgments from the authors of manuscripts?," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5687-5703, October.
    3. Nina Smirnova & Philipp Mayr, 2023. "A comprehensive analysis of acknowledgement texts in Web of Science: a case study on four scientific domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 709-734, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Katherine W. McCain, 2018. "Beyond Garfield’s Citation Index: an assessment of some issues in building a personal name Acknowledgments Index," Scientometrics, Springer;Akadémiai Kiadó, vol. 114(2), pages 605-631, February.
    2. Alberto Baccini & Eugenio Petrovich, 2022. "Normative versus strategic accounts of acknowledgment data: The case of the top-five journals of economics," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 603-635, January.
    3. Lili Miao & Vincent Larivi`ere & Feifei Wang & Yong-Yeol Ahn & Cassidy R. Sugimoto, 2023. "Cooperation and interdependence in global science funding," Papers 2308.08630, arXiv.org, revised Feb 2024.
    4. Pengfei Jia & Weixi Xie & Guangyao Zhang & Xianwen Wang, 2023. "Do reviewers get their deserved acknowledgments from the authors of manuscripts?," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5687-5703, October.
    5. Nadine Desrochers & Adèle Paul‐Hus & Jen Pecoskie, 2017. "Five decades of gratitude: A meta‐synthesis of acknowledgments research," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(12), pages 2821-2833, December.
    6. Wen Lou & Jiangen He & Lingxin Zhang & Zhijie Zhu & Yongjun Zhu, 2023. "Support behind the scenes: the relationship between acknowledgement, coauthor, and citation in Nobel articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5767-5790, October.
    7. Weishu Liu & Li Tang & Guangyuan Hu, 2020. "Funding information in Web of Science: an updated overview," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1509-1524, March.
    8. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    9. Chen, Kaihua & Zhang, Yi & Fu, Xiaolan, 2019. "International research collaboration: An emerging domain of innovation studies?," Research Policy, Elsevier, vol. 48(1), pages 149-168.
    10. Shengzhi Huang & Jiajia Qian & Yong Huang & Wei Lu & Yi Bu & Jinqing Yang & Qikai Cheng, 2022. "Disclosing the relationship between citation structure and future impact of a publication," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(7), pages 1025-1042, July.
    11. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    12. Jongwuk Ahn & Dong-hyun Oh & Jeong-Dong Lee, 2014. "The scientific impact and partner selection in collaborative research at Korean universities," Scientometrics, Springer;Akadémiai Kiadó, vol. 100(1), pages 173-188, July.
    13. Thelwall, Mike & Sud, Pardeep, 2014. "No citation advantage for monograph-based collaborations?," Journal of Informetrics, Elsevier, vol. 8(1), pages 276-283.
    14. Wang, Shiyun & Mao, Jin & Lu, Kun & Cao, Yujie & Li, Gang, 2021. "Understanding interdisciplinary knowledge integration through citance analysis: A case study on eHealth," Journal of Informetrics, Elsevier, vol. 15(4).
    15. Sehrish Iqbal & Saeed-Ul Hassan & Naif Radi Aljohani & Salem Alelyani & Raheel Nawaz & Lutz Bornmann, 2021. "A decade of in-text citation analysis based on natural language processing and machine learning techniques: an overview of empirical studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6551-6599, August.
    16. Maryam Yaghtin & Hajar Sotudeh & Mahdieh Mirzabeigi & Seyed Mostafa Fakhrahmad & Mehdi Mohammadi, 2019. "In quest of new document relations: evaluating co-opinion relations between co-citations and its impact on Information retrieval effectiveness," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(2), pages 987-1008, May.
    17. Adèle Paul-Hus & Adrián A Díaz-Faes & Maxime Sainte-Marie & Nadine Desrochers & Rodrigo Costas & Vincent Larivière, 2017. "Beyond funding: Acknowledgement patterns in biomedical, natural and social sciences," PLOS ONE, Public Library of Science, vol. 12(10), pages 1-14, October.
    18. Abramo, Giovanni & D'Angelo, Ciriaco Andrea & Di Costa, Flavia, 2019. "Diversification versus specialization in scientific research: Which strategy pays off?," Technovation, Elsevier, vol. 82, pages 51-57.
    19. Cathelijn J F Waaijer & Benoît Macaluso & Cassidy R Sugimoto & Vincent Larivière, 2016. "Stability and Longevity in the Publication Careers of U.S. Doctorate Recipients," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-15, April.
    20. Constance Poitras & Vincent Larivière, 2023. "Research mobility to the United States: a bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(4), pages 2601-2614, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0228928. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.