IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0311493.html
   My bibliography  Save this article

An analysis of the effects of sharing research data, code, and preprints on citations

Author

Listed:
  • Giovanni Colavizza
  • Lauren Cadwallader
  • Marcel LaFlamme
  • Grégory Dozot
  • Stéphane Lecorney
  • Daniel Rappo
  • Iain Hrynaszkiewicz

Abstract

Calls to make scientific research more open have gained traction with a range of societal stakeholders. Open Science practices include but are not limited to the early sharing of results via preprints and openly sharing outputs such as data and code to make research more reproducible and extensible. Existing evidence shows that adopting Open Science practices has effects in several domains. In this study, we investigate whether adopting one or more Open Science practices leads to significantly higher citations for an associated publication, which is one form of academic impact. We use a novel dataset known as Open Science Indicators, produced by PLOS and DataSeer, which includes all PLOS publications from 2018 to 2023 as well as a comparison group sampled from the PMC Open Access Subset. In total, we analyze circa 122’000 publications. We calculate publication and author-level citation indicators and use a broad set of control variables to isolate the effect of Open Science Indicators on received citations. We show that Open Science practices are adopted to different degrees across scientific disciplines. We find that the early release of a publication as a preprint correlates with a significant positive citation advantage of about 20.2% (±.7) on average. We also find that sharing data in an online repository correlates with a smaller yet still positive citation advantage of 4.3% (±.8) on average. However, we do not find a significant citation advantage for sharing code. Further research is needed on additional or alternative measures of impact beyond citations. Our results are likely to be of interest to researchers, as well as publishers, research funders, and policymakers.

Suggested Citation

  • Giovanni Colavizza & Lauren Cadwallader & Marcel LaFlamme & Grégory Dozot & Stéphane Lecorney & Daniel Rappo & Iain Hrynaszkiewicz, 2024. "An analysis of the effects of sharing research data, code, and preprints on citations," PLOS ONE, Public Library of Science, vol. 19(10), pages 1-19, October.
  • Handle: RePEc:plo:pone00:0311493
    DOI: 10.1371/journal.pone.0311493
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0311493
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0311493&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0311493?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stylianos Serghiou & Despina G Contopoulos-Ioannidis & Kevin W Boyack & Nico Riedel & Joshua D Wallach & John P A Ioannidis, 2021. "Assessment of transparency indicators across the biomedical literature: How open is open?," PLOS Biology, Public Library of Science, vol. 19(3), pages 1-26, March.
    2. Andreas Strotmann & Dangzhi Zhao, 2012. "Author name disambiguation: What difference does it make in author-based citation analysis?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(9), pages 1820-1833, September.
    3. Wang, Jian & Veugelers, Reinhilde & Stephan, Paula, 2017. "Bias against novelty in science: A cautionary tale for users of bibliometric indicators," Research Policy, Elsevier, vol. 46(8), pages 1416-1436.
    4. Chun-Kai Huang & Cameron Neylon & Lucy Montgomery & Richard Hosking & James P. Diprose & Rebecca N. Handcock & Katie Wilson, 2024. "Open access research outputs receive more diverse citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(2), pages 825-845, February.
    5. Dag W. Aksnes & Liv Langfeldt & Paul Wouters, 2019. "Citations, Citation Indicators, and Research Quality: An Overview of Basic Concepts and Theories," SAGE Open, , vol. 9(1), pages 21582440198, February.
    6. Heather A Piwowar & Roger S Day & Douglas B Fridsma, 2007. "Sharing Detailed Research Data Is Associated with Increased Citation Rate," PLOS ONE, Public Library of Science, vol. 2(3), pages 1-5, March.
    7. Andreas Strotmann & Dangzhi Zhao, 2012. "Author name disambiguation: What difference does it make in author‐based citation analysis?," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(9), pages 1820-1833, September.
    8. Michael J. Fell, 2019. "The Economic Impacts of Open Science: A Rapid Evidence Assessment," Publications, MDPI, vol. 7(3), pages 1-30, July.
    9. Kristijan Armeni & Loek Brinkman & Rickard Carlsson & Anita Eerland & Rianne Fijten & Robin Fondberg & Vera E Heininga & Stephan Heunis & Wei Qi Koh & Maurits Masselink & Niall Moran & Andrew Ó Baoill, 2021. "Towards wide-scale adoption of open science practices: The role of open science communities," Science and Public Policy, Oxford University Press, vol. 48(5), pages 605-611.
    10. Ludo Waltman & Nees Jan van Eck, 2012. "The inconsistency of the h-index," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 63(2), pages 406-415, February.
    11. Ludo Waltman & Nees Jan van Eck, 2012. "The inconsistency of the h‐index," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 63(2), pages 406-415, February.
    12. Yulin Yu & Daniel M. Romero, 2024. "Does the Use of Unusual Combinations of Datasets Contribute to Greater Scientific Impact?," Papers 2402.05024, arXiv.org, revised Sep 2024.
    13. Jinseok Kim & Jana Diesner, 2016. "Distortive effects of initial-based name disambiguation on measurements of large-scale coauthorship networks," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(6), pages 1446-1461, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Brito, Ricardo & Navarro, Alonso Rodríguez, 2021. "The inconsistency of h-index: A mathematical analysis," Journal of Informetrics, Elsevier, vol. 15(1).
    2. Dangzhi Zhao & Andreas Strotmann, 2020. "Telescopic and panoramic views of library and information science research 2011–2018: a comparison of four weighting schemes for author co-citation analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 255-270, July.
    3. Jinseok Kim & Jinmo Kim & Jason Owen-Smith, 2019. "Generating automatically labeled data for author name disambiguation: an iterative clustering method," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 253-280, January.
    4. Lutz Bornmann & Werner Marx, 2014. "How to evaluate individual researchers working in the natural and life sciences meaningfully? A proposal of methods based on percentiles of citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 487-509, January.
    5. Jinseok Kim, 2019. "A fast and integrative algorithm for clustering performance evaluation in author name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 661-681, August.
    6. repec:plo:pone00:0230416 is not listed on IDEAS
    7. Jinseok Kim & Jenna Kim & Jason Owen‐Smith, 2021. "Ethnicity‐based name partitioning for author name disambiguation using supervised machine learning," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(8), pages 979-994, August.
    8. Jinseok Kim, 2018. "Evaluating author name disambiguation for digital libraries: a case of DBLP," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1867-1886, September.
    9. repec:plo:pone00:0189137 is not listed on IDEAS
    10. Mike Thelwall, 2020. "Mid-career field switches reduce gender disparities in academic publishing," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(3), pages 1365-1383, June.
    11. Chengliang Wang & Xiaojiao Chen & Teng Yu & Yidan Liu & Yuhui Jing, 2024. "Education reform and change driven by digital technology: a bibliometric study from a global perspective," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-17, December.
    12. Pantea Kamrani & Isabelle Dorsch & Wolfgang G. Stock, 2021. "Do researchers know what the h-index is? And how do they estimate its importance?," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 5489-5508, July.
    13. Zoltán Krajcsák, 2021. "Researcher Performance in Scopus Articles ( RPSA ) as a New Scientometric Model of Scientific Output: Tested in Business Area of V4 Countries," Publications, MDPI, vol. 9(4), pages 1-23, October.
    14. Kim, Jinseok & Diesner, Jana, 2015. "The effect of data pre-processing on understanding the evolution of collaboration networks," Journal of Informetrics, Elsevier, vol. 9(1), pages 226-236.
    15. Mi Zhou & Biyu Bian & Weiming Zhu & Li Huang, 2021. "A Half Century of Research on Childhood and Adolescent Depression: Science Mapping the Literature, 1970 to 2019," IJERPH, MDPI, vol. 18(18), pages 1-20, September.
    16. Marcin Kozak & Lutz Bornmann, 2012. "A New Family of Cumulative Indexes for Measuring Scientific Performance," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-4, October.
    17. Ciriaco Andrea D’Angelo & Nees Jan Eck, 2020. "Collecting large-scale publication data at the level of individual researchers: a practical proposal for author name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 883-907, May.
    18. Corey J A Bradshaw & Justin M Chalker & Stefani A Crabtree & Bart A Eijkelkamp & John A Long & Justine R Smith & Kate Trinajstic & Vera Weisbecker, 2021. "A fairer way to compare researchers at any career stage and in any discipline using open-access citation data," PLOS ONE, Public Library of Science, vol. 16(9), pages 1-15, September.
    19. Boris Forthmann & Philipp Doebler & Rüdiger Mutz, 2024. "Why summing up bibliometric indicators does not justify a composite indicator," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(12), pages 7475-7499, December.
    20. Liu, Meijun & Hu, Xiao, 2021. "Will collaborators make scientists move? A Generalized Propensity Score analysis," Journal of Informetrics, Elsevier, vol. 15(1).
    21. Eugenio Petrovich, 2022. "Bibliometrics in Press. Representations and uses of bibliometric indicators in the Italian daily newspapers," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2195-2233, May.
    22. Yang, Alex Jie & Wu, Linwei & Zhang, Qi & Wang, Hao & Deng, Sanhong, 2023. "The k-step h-index in citation networks at the paper, author, and institution levels," Journal of Informetrics, Elsevier, vol. 17(4).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0311493. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.