
Is useful research data usually shared? An investigation of genome-wide association study summary statistics

Authors
  • Mike Thelwall
  • Marcus Munafò
  • Amalia Mas-Bleda
  • Emma Stuart
  • Meiko Makita
  • Verena Weigert
  • Chris Keene
  • Nushrat Khan
  • Katie Drax
  • Kayvan Kousha

Abstract

Primary data collected during a research study is often shared and may be reused for new studies. To assess the extent of data sharing in favourable circumstances and whether data sharing checks can be automated, this article investigates summary statistics from primary human genome-wide association studies (GWAS). This type of data is highly suitable for sharing because it is a standard research output, is straightforward to use in future studies (e.g., for secondary analysis), and may be already stored in a standard format for internal sharing within multi-site research projects. Manual checks of 1799 articles from 2010 and 2017 matching a simple PubMed query for molecular epidemiology GWAS were used to identify 314 primary human GWAS papers. Of these, only 13% reported the location of a complete set of GWAS summary data, increasing from 3% in 2010 to 23% in 2017. Whilst information about whether data was shared was typically located clearly within a data availability statement, the exact nature of the shared data was usually unspecified. Thus, data sharing is the exception even in suitable research fields with relatively strong data sharing norms. Moreover, the lack of clear data descriptions within data sharing statements greatly complicates the task of automatically characterising shared data sets.

Suggested Citation

  • Mike Thelwall & Marcus Munafò & Amalia Mas-Bleda & Emma Stuart & Meiko Makita & Verena Weigert & Chris Keene & Nushrat Khan & Katie Drax & Kayvan Kousha, 2020. "Is useful research data usually shared? An investigation of genome-wide association study summary statistics," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-11, February.
  • Handle: RePEc:plo:pone00:0229578
    DOI: 10.1371/journal.pone.0229578

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0229578
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0229578&type=printable
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Barbara McGillivray & Paola Marongiu & Nilo Pedrazzini & Marton Ribary & Mandy Wigdorowitz & Eleonora Zordan, 2022. "Deep Impact: A Study on the Impact of Data Papers and Datasets in the Humanities and Social Sciences," Publications, MDPI, vol. 10(4), pages 1-40, October.
    2. Keiko Kurata & Mamiko Matsubayashi & Shinji Mine, 2017. "Identifying the Complex Position of Research Data and Data Sharing Among Researchers in Natural Science," SAGE Open, vol. 7(3), pages 21582440177, July.
    3. Stefan Reichmann & Thomas Klebel & Ilire Hasani‐Mavriqi & Tony Ross‐Hellauer, 2021. "Between administration and research: Understanding data management practices in an institutional context," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(11), pages 1415-1431, November.
    4. Sixto-Costoya Andrea & Robinson-Garcia Nicolas & Leeuwen Thed & Costas Rodrigo, 2021. "Exploring the relevance of ORCID as a source of study of data sharing activities at the individual-level: a methodological discussion," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 7149-7165, August.
    5. Zeng, Tong & Wu, Longfeng & Bratt, Sarah & Acuna, Daniel E., 2020. "Assigning credit to scientific datasets using article citation networks," Journal of Informetrics, Elsevier, vol. 14(2).
    6. Hyoungjoo Park & Dietmar Wolfram, 2017. "An examination of research data sharing and re-use: implications for data citation practice," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 443-461, April.
    7. Brian Jackson, 2021. "Open Data Policies among Library and Information Science Journals," Publications, MDPI, vol. 9(2), pages 1-12, June.
    8. Josip Strcic & Antonia Civljak & Terezija Glozinic & Rafael Leite Pacheco & Tonci Brkovic & Livia Puljak, 2022. "Open data and data sharing in articles about COVID-19 published in preprint servers medRxiv and bioRxiv," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2791-2802, May.
    9. Malika Ihle & Isabel S. Winney & Anna Krystalli & Michael Croucher, 2017. "Striving for transparent and credible research: practical guidelines for behavioral ecologists," Behavioral Ecology, International Society for Behavioral Ecology, vol. 28(2), pages 348-354.
    10. Kai Li & Jason Rollins & Erjia Yan, 2018. "Web of Science use in published research and review papers 1997–2017: a selective, dynamic, cross-domain, content-based analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 1-20, April.
    11. Bettina Suhr & Johanna Dungl & Alexander Stocker, 2020. "Search, reuse and sharing of research data in materials science and engineering—A qualitative interview study," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-26, September.
    12. Renata Gonçalves Curty & Kevin Crowston & Alison Specht & Bruce W Grant & Elizabeth D Dalton, 2017. "Attitudes and norms affecting scientists’ data reuse," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-22, December.
    13. Eirini Delikoura & Dimitrios Kouis, 2021. "Open Research Data and Open Peer Review: Perceptions of a Medical and Health Sciences Community in Greece," Publications, MDPI, vol. 9(2), pages 1-19, March.
    14. Joshua D. Carrell & Edward Hammill & Thomas C. Edwards, 2022. "Balancing Rare Species Conservation with Extractive Industries," Land, MDPI, vol. 11(11), pages 1-16, November.
    15. Shibayama, Sotaro & Lawson, Cornelia, 2021. "The use of rewards in the sharing of research resources," Research Policy, Elsevier, vol. 50(7).
    16. Joshua Borycz & Robert Olendorf & Alison Specht & Bruce Grant & Kevin Crowston & Carol Tenopir & Suzie Allard & Natalie M. Rice & Rachael Hu & Robert J. Sandusky, 2023. "Perceived benefits of open data are improving but scientists still lack resources, skills, and rewards," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-12, December.
    17. Thu-Mai Christian & Amanda Gooch & Todd Vision & Elizabeth Hull, 2020. "Journal data policies: Exploring how the understanding of editors and authors corresponds to the policies themselves," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-15, March.
    18. Benedikt Fecher & Sascha Friesike & Marcel Hebing, 2015. "What Drives Academic Data Sharing?," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-25, February.
    19. Hannah Fraser & Tim Parker & Shinichi Nakagawa & Ashley Barnett & Fiona Fidler, 2018. "Questionable research practices in ecology and evolution," PLOS ONE, Public Library of Science, vol. 13(7), pages 1-16, July.
    20. Lloyd W. Morrison & Craig C. Young, 2016. "Standardization and Quality Control in Data Collection and Assessment of Threatened Plant Species," Data, MDPI, vol. 1(3), pages 1-11, December.
