IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v85y2010i1d10.1007_s11192-010-0160-5.html
   My bibliography  Save this article

Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature

Author

Listed:
  • Lawrence D. Fu

    (New York University Medical Center)

  • Constantin F. Aliferis

    (New York University Medical Center)

Abstract

The most popular method for judging the impact of biomedical articles is citation count which is the number of citations received. The most significant limitation of citation count is that it cannot evaluate articles at the time of publication since citations accumulate over time. This work presents computer models that accurately predict citation counts of biomedical publications within a deep horizon of 10 years using only predictive information available at publication time. Our experiments show that it is indeed feasible to accurately predict future citation counts with a mixture of content-based and bibliometric features using machine learning methods. The models pave the way for practical prediction of the long-term impact of publication, and their statistical analysis provides greater insight into citation behavior.

Suggested Citation

  • Lawrence D. Fu & Constantin F. Aliferis, 2010. "Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(1), pages 257-270, October.
  • Handle: RePEc:spr:scient:v:85:y:2010:i:1:d:10.1007_s11192-010-0160-5
    DOI: 10.1007/s11192-010-0160-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-010-0160-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-010-0160-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. T. J. Phelan, 1999. "A compendium of issues for citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 45(1), pages 117-136, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lina Xu & Steven Dellaportas & Zhiqiang Yang & Jin Wang, 2023. "More on the relationship between interdisciplinary accounting research and citation impact," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(4), pages 4779-4803, December.
    2. Alexander N. Larcombe & Sasha C. Voss, 2011. "Self-citation: comparison between Radiology, European Radiology and Radiology for 1997–1998," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(2), pages 347-356, May.
    3. Corey J A Bradshaw & Justin M Chalker & Stefani A Crabtree & Bart A Eijkelkamp & John A Long & Justine R Smith & Kate Trinajstic & Vera Weisbecker, 2021. "A fairer way to compare researchers at any career stage and in any discipline using open-access citation data," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-15, September.
    4. Mladen M. Koljatic & Mónica R. Silva, 2001. "The international publication productivity of Latin American countries in the economics and business administration fields," Scientometrics, Springer;Akadémiai Kiadó, vol. 51(2), pages 381-394, June.
    5. Wen-Yau Cathy Lin & Mu-Hsuan Huang, 2012. "The relationship between co-authorship, currency of references and author self-citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 343-360, February.
    6. Zaggl, Michael A., 2017. "Manipulation of explicit reputation in innovation and knowledge exchange communities: The example of referencing in science," Research Policy, Elsevier, vol. 46(5), pages 970-983.
    7. Li Tang & John P. Walsh, 2010. "Bibliometric fingerprints: name disambiguation based on approximate structure equivalence of cognitive maps," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(3), pages 763-784, September.
    8. Christina H. Drew & Kristianna G. Pettibone & Fallis Owen Finch & Douglas Giles & Paul Jordan, 2016. "Automated Research Impact Assessment: a new bibliometrics approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 987-1005, March.
    9. van den Besselaar, Peter & Heyman, Ulf & Sandström, Ulf, 2017. "Perverse effects of output-based research funding? Butler’s Australian case revisited," Journal of Informetrics, Elsevier, vol. 11(3), pages 905-918.
    10. Lina Xu & Steven Dellaportas & Jin Wang, 2022. "A study of interdisciplinary accounting research: analysing the diversity of cited references," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 62(2), pages 2131-2162, June.
    11. Georgina Guilera & Juana Gómez-Benito & M. Hidalgo, 2010. "Citation analysis in research on differential item functioning," Quality & Quantity: International Journal of Methodology, Springer, vol. 44(6), pages 1249-1255, October.
    12. Liyue Chen & Jielan Ding & Vincent Larivière, 2022. "Measuring the citation context of national self‐references," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(5), pages 671-686, May.
    13. Na Liu & Philip Shapira & Xiaoxu Yue, 2021. "Tracking developments in artificial intelligence research: constructing and applying a new search strategy," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3153-3192, April.
    14. Pablo D’Este & Puay Tang & Surya Mahdi & Andy Neely & Mabel Sánchez-Barrioluengo, 2013. "The pursuit of academic excellence and business engagement: is it irreconcilable?," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(2), pages 481-502, May.
    15. Guan Jiancheng & Wang Junxia, 2004. "Evaluation and interpretation of knowledge production efficiency," Scientometrics, Springer;Akadémiai Kiadó, vol. 59(1), pages 131-155, January.
    16. Shaun Goldfinch & Tony Dale & Karl DeRouen, 2003. "Science from the periphery: Collaboration, networks and 'Periphery Effects' in the citation of New Zealand Crown Research Institutes articles, 1995-2000," Scientometrics, Springer;Akadémiai Kiadó, vol. 57(3), pages 321-337, July.
    17. Teodora Diana Corsatea, 2010. "Measuring science: Spatial investigation of academic opportunities in Belgium," Papers in Regional Science, Wiley Blackwell, vol. 89(2), pages 373-387, June.
    18. Carlos R. Azzoni, 2010. "Making a Great Difference: The Influence of Professor Werner Baer on the Economic Literature in Brazil, and on Brazil," Palgrave Macmillan Books, in: Hadi Salehi Esfahani & Giovanni Facchini & Geoffrey J. D. Hewings (ed.), Economic Development in Latin America, chapter 19, pages 295-305, Palgrave Macmillan.
    19. James H. Fowler & Dag W. Aksnes, 2007. "Does self-citation pay?," Scientometrics, Springer;Akadémiai Kiadó, vol. 72(3), pages 427-437, September.
    20. Ritu Arora, 2023. "Intellectual Structure of Parenting Style Research: A Bibliometric Analysis," SAGE Open, , vol. 13(2), pages 21582440231, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:85:y:2010:i:1:d:10.1007_s11192-010-0160-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.