
Computer models for identifying instrumental citations in the biomedical literature

Author

Listed:
  • Lawrence D. Fu

    (New York University Medical Center)

  • Yindalon Aphinyanaphongs

    (New York University Medical Center)

  • Constantin F. Aliferis

    (New York University Medical Center)

Abstract

The most popular method for evaluating the quality of a scientific publication is citation count. This metric assumes that a citation is a positive indicator of the quality of the cited work. This assumption does not always hold, since citations serve many purposes. As a result, citation count is an indirect and imprecise measure of impact. If instrumental citations could be reliably distinguished from non-instrumental ones, excluding the non-instrumental citations would improve the performance of existing citation-based metrics. A citation was operationally defined as instrumental if either of the following was true: the hypothesis of the citing work was motivated by the cited work, or the citing work could not have been executed without the cited work. This work investigated the feasibility of developing computer models for automatically classifying citations as instrumental or non-instrumental. Instrumental citations were manually labeled, and machine learning models were trained on a combination of content and bibliometric features. The experimental results indicate that models based on content and bibliometric features are able to automatically classify instrumental citations with high predictivity (AUC = 0.86). Additional experiments using independent hold-out data and prospective validation show that the models are generalizable and can handle unseen cases. This work demonstrates that it is feasible to train computer models to automatically identify instrumental citations.
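
To make the abstract's two central notions concrete, here is a minimal Python sketch, not the authors' implementation. It encodes the operational definition of an instrumental citation (either condition suffices), counts only instrumental citations for a cited work, and computes AUC as the probability that a randomly chosen instrumental citation receives a higher model score than a randomly chosen non-instrumental one. The `Citation` record, its field names, the function names, and all data values are hypothetical and chosen purely for illustration; the paper's actual features and classifiers are not specified in this record.

```python
from dataclasses import dataclass
from typing import Iterable, Sequence


@dataclass
class Citation:
    # Hypothetical record; field names are assumptions, not from the paper.
    citing_id: str
    cited_id: str
    motivated_hypothesis: bool    # cited work motivated the citing work's hypothesis
    required_for_execution: bool  # citing work could not have been done without it


def is_instrumental(c: Citation) -> bool:
    """Operational definition from the abstract: either condition is enough."""
    return c.motivated_hypothesis or c.required_for_execution


def instrumental_citation_count(citations: Iterable[Citation], cited_id: str) -> int:
    """Citation count for one work, excluding non-instrumental citations."""
    return sum(1 for c in citations if c.cited_id == cited_id and is_instrumental(c))


def auc(labels: Sequence[bool], scores: Sequence[float]) -> float:
    """AUC by pairwise comparison of positives and negatives; ties count half."""
    pos = [s for l, s in zip(labels, scores) if l]
    neg = [s for l, s in zip(labels, scores) if not l]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Illustrative, made-up data.
citations = [
    Citation("a", "x", True, False),
    Citation("b", "x", False, False),
    Citation("c", "x", False, True),
    Citation("d", "y", True, True),
]
count_x = instrumental_citation_count(citations, "x")
example_auc = auc([True, False, True, False], [0.9, 0.8, 0.4, 0.1])
```

In this toy data, work "x" has three raw citations but only two count as instrumental, which illustrates how filtering could change a citation-based metric.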

Suggested Citation

  • Lawrence D. Fu & Yindalon Aphinyanaphongs & Constantin F. Aliferis, 2013. "Computer models for identifying instrumental citations in the biomedical literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 871-882, December.
  • Handle: RePEc:spr:scient:v:97:y:2013:i:3:d:10.1007_s11192-013-0983-y
    DOI: 10.1007/s11192-013-0983-y

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-013-0983-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. T. J. Phelan, 1999. "A compendium of issues for citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 45(1), pages 117-136, May.
    2. Lawrence D. Fu & Constantin F. Aliferis, 2010. "Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 257-270, October.

    Citations

    Cited by:

    1. Tian Yu & Guang Yu & Peng-Yu Li & Liang Wang, 2014. "Citation impact prediction for scientific papers using stepwise regression analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1233-1252, November.
    2. Ashkan Ebadi & Andrea Schiffauerova, 2016. "iSEER: an intelligent automatic computer system for scientific evaluation of researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 477-498, May.
    3. Federica Bologna & Angelo Iorio & Silvio Peroni & Francesco Poggi, 2023. "Do open citations give insights on the qualitative peer-review evaluation in research assessments? An analysis of the Italian National Scientific Qualification," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 19-53, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ashkan Ebadi & Andrea Schiffauerova, 2016. "iSEER: an intelligent automatic computer system for scientific evaluation of researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 477-498, May.
    2. Lawrence D. Fu & Constantin F. Aliferis, 2010. "Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 257-270, October.
    3. Lina Xu & Steven Dellaportas & Zhiqiang Yang & Jin Wang, 2023. "More on the relationship between interdisciplinary accounting research and citation impact," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(4), pages 4779-4803, December.
    4. Alexander N. Larcombe & Sasha C. Voss, 2011. "Self-citation: comparison between Radiology, European Radiology and Radiology for 1997–1998," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(2), pages 347-356, May.
    5. Weimao Ke, 2013. "A fitness model for scholarly impact analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 981-998, March.
    6. Corey J A Bradshaw & Justin M Chalker & Stefani A Crabtree & Bart A Eijkelkamp & John A Long & Justine R Smith & Kate Trinajstic & Vera Weisbecker, 2021. "A fairer way to compare researchers at any career stage and in any discipline using open-access citation data," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-15, September.
    7. Tian Yu & Guang Yu & Peng-Yu Li & Liang Wang, 2014. "Citation impact prediction for scientific papers using stepwise regression analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1233-1252, November.
    8. Mladen M. Koljatic & Mónica R. Silva, 2001. "The international publication productivity of Latin American countries in the economics and business administration fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(2), pages 381-394, June.
    9. Wen-Yau Cathy Lin & Mu-Hsuan Huang, 2012. "The relationship between co-authorship, currency of references and author self-citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 343-360, February.
    10. Basma Albanna & Julia Handl & Richard Heeks, 2021. "Publication outperformance among global South researchers: An analysis of individual-level and publication-level predictors of positive deviance," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8375-8431, October.
    11. Zaggl, Michael A., 2017. "Manipulation of explicit reputation in innovation and knowledge exchange communities: The example of referencing in science," Research Policy, Elsevier, vol. 46(5), pages 970-983.
    12. Li Tang & John P. Walsh, 2010. "Bibliometric fingerprints: name disambiguation based on approximate structure equivalence of cognitive maps," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(3), pages 763-784, September.
    13. Christina H. Drew & Kristianna G. Pettibone & Fallis Owen Finch & Douglas Giles & Paul Jordan, 2016. "Automated Research Impact Assessment: a new bibliometrics approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 987-1005, March.
    14. van den Besselaar, Peter & Heyman, Ulf & Sandström, Ulf, 2017. "Perverse effects of output-based research funding? Butler’s Australian case revisited," Journal of Informetrics, Elsevier, vol. 11(3), pages 905-918.
    15. Lina Xu & Steven Dellaportas & Jin Wang, 2022. "A study of interdisciplinary accounting research: analysing the diversity of cited references," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 62(2), pages 2131-2162, June.
    16. Mingyang Wang & Guang Yu & Shuang An & Daren Yu, 2012. "Discovery of factors influencing citation impact based on a soft fuzzy rough set model," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(3), pages 635-644, December.
    17. Georgina Guilera & Juana Gómez-Benito & M. Hidalgo, 2010. "Citation analysis in research on differential item functioning," Quality & Quantity: International Journal of Methodology, Springer, vol. 44(6), pages 1249-1255, October.
    18. Hu, Ya-Han & Tai, Chun-Tien & Liu, Kang Ernest & Cai, Cheng-Fang, 2020. "Identification of highly-cited papers using topic-model-based and bibliometric features: the consideration of keyword popularity," Journal of Informetrics, Elsevier, vol. 14(1).
    19. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    20. Liyue Chen & Jielan Ding & Vincent Larivière, 2022. "Measuring the citation context of national self‐references," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(5), pages 671-686, May.
