IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v127y2022i12d10.1007_s11192-022-04559-4.html
   My bibliography  Save this article

Measuring the impact of health research data in terms of data citations by scientific publications

Author

Listed:
  • Yongmei Bai

    (Peking University Health Science Center, Peking University
    Peking University)

  • Jian Du

    (Peking University)

Abstract

Health is a representative domain data-driven research since health research data are growingly generated at a massive scale. There is an intuitive logic that the degree to which disease burden and the number of data resources align. In order to figure out disease-specific data sharing and reuse level, we took the number of data records and their citations in the scientific literature in the Data Citation Index platform as approximate indicators. The results indicated that only a small percentage (7.5%) of health data records had received documented citations by scientific publications. We find the level of data sharing and reuse varies across diseases. Our study suggested that the more socioeconomic burden and the more research funding, the more likely scientific data for diseases will be produced and made available. But such a correlation could not be observed for the activity of data reuse. Secondary reuse of scientific data is a complex behavior.

Suggested Citation

  • Yongmei Bai & Jian Du, 2022. "Measuring the impact of health research data in terms of data citations by scientific publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(12), pages 6881-6893, December.
  • Handle: RePEc:spr:scient:v:127:y:2022:i:12:d:10.1007_s11192-022-04559-4
    DOI: 10.1007/s11192-022-04559-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-022-04559-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-022-04559-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Lin & ZHAO, Wenjing & Liu, Jianhua & Sivertsen, Gunnar & HUANG, Ying, 2020. "Do national funding organizations properly address the diseases with the highest burden? - Observations from China and the UK," SocArXiv ckpf8, Center for Open Science.
    2. Panagopoulos, George & Tsatsaronis, George & Varlamis, Iraklis, 2017. "Detecting rising stars in dynamic collaborative networks," Journal of Informetrics, Elsevier, vol. 11(1), pages 198-222.
    3. Hyoungjoo Park & Sukjin You & Dietmar Wolfram, 2018. "Informal data citation for data sharing and reuse is more common than formal data citation in biomedical fields," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 69(11), pages 1346-1354, November.
    4. Barend Mons, 2020. "Invest 5% of research funds in ensuring data are reusable," Nature, Nature, vol. 578(7796), pages 491-491, February.
    5. Lin Zhang & Wenjing Zhao & Jianhua Liu & Gunnar Sivertsen & Ying Huang, 2020. "Do national funding organizations properly address the diseases with the highest burden?: Observations from China and the UK," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1733-1761, November.
    6. Hyoungjoo Park & Dietmar Wolfram, 2017. "An examination of research data sharing and re-use: implications for data citation practice," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 443-461, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lutz Bornmann & Robin Haunschild & Vanash M Patel, 2020. "Are papers addressing certain diseases perceived where these diseases are prevalent? The proposal to use Twitter data as social-spatial sensors," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-22, November.
    2. Wenting Yang & Jiantong Zhang & Ruolin Ma, 2020. "The Prediction of Infectious Diseases: A Bibliometric Analysis," IJERPH, MDPI, vol. 17(17), pages 1-19, August.
    3. Li, Kai & Chen, Pei-Ying & Yan, Erjia, 2019. "Challenges of measuring software impact through citations: An examination of the lme4 R package," Journal of Informetrics, Elsevier, vol. 13(1), pages 449-461.
    4. Bai, Xiaomei & Zhang, Fuli & Lee, Ivan, 2019. "Predicting the citations of scholarly paper," Journal of Informetrics, Elsevier, vol. 13(1), pages 407-418.
    5. Yubing Nie & Yifan Zhu & Qika Lin & Sifan Zhang & Pengfei Shi & Zhendong Niu, 2019. "Academic rising star prediction via scholar’s evaluation model and machine learning techniques," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 461-476, August.
    6. Chung, Jaemin & Ko, Namuk & Kim, Hyeonsu & Yoon, Janghyeok, 2021. "Inventor profile mining approach for prospective human resource scouting," Journal of Informetrics, Elsevier, vol. 15(1).
    7. Wanjun Xia & Tianrui Li & Chongshou Li, 2023. "A review of scientific impact prediction: tasks, features and methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 543-585, January.
    8. Barbara McGillivray & Paola Marongiu & Nilo Pedrazzini & Marton Ribary & Mandy Wigdorowitz & Eleonora Zordan, 2022. "Deep Impact: A Study on the Impact of Data Papers and Datasets in the Humanities and Social Sciences," Publications, MDPI, vol. 10(4), pages 1-40, October.
    9. David A. Pendlebury, 2019. "Charting a path between the simple and the false and the complex and unusable: Review of Henk F. Moed, Applied Evaluative Informetrics [in the series Qualitative and Quantitative Analysis of Scientifi," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 549-560, April.
    10. Mike Thelwall & Marcus Munafò & Amalia Mas-Bleda & Emma Stuart & Meiko Makita & Verena Weigert & Chris Keene & Nushrat Khan & Katie Drax & Kayvan Kousha, 2020. "Is useful research data usually shared? An investigation of genome-wide association study summary statistics," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-11, February.
    11. Lin Zhu & Donghua Zhu & Xuefeng Wang & Scott W. Cunningham & Zhinan Wang, 2019. "An integrated solution for detecting rising technology stars in co-inventor networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 137-172, October.
    12. Aftab Nawaz & MSI Malik, 2022. "Rising stars prediction in reviewer network," Electronic Commerce Research, Springer, vol. 22(1), pages 53-75, March.
    13. Libby Hemphill & Amy Pienta & Sara Lafia & Dharma Akmon & David A. Bleckley, 2022. "How do properties of data, their curation, and their funding relate to reuse?," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(10), pages 1432-1444, October.
    14. Lin Zhu & Junjie Zhang & Scott W. Cunningham, 2022. "Domain expertise extraction for finding rising stars," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5475-5495, September.
    15. Ali Daud & Min Song & Malik Khizar Hayat & Tehmina Amjad & Rabeeh Ayaz Abbasi & Hassan Dawood & Anwar Ghani, 2020. "Finding rising stars in bibliometric networks," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 633-661, July.
    16. Nushrat Khan & Mike Thelwall & Kayvan Kousha, 2021. "Measuring the impact of biodiversity datasets: data reuse, citations and altmetrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3621-3639, April.
    17. Matthias Kuppler, 2022. "Predicting the future impact of Computer Science researchers: Is there a gender bias?," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6695-6732, November.
    18. Zeng, Tong & Wu, Longfeng & Bratt, Sarah & Acuna, Daniel E., 2020. "Assigning credit to scientific datasets using article citation networks," Journal of Informetrics, Elsevier, vol. 14(2).
    19. Allahbakhsh, Mohammad & Amintoosi, Haleh & Behkamal, Behshid & Beheshti, Amin & Bertino, Elisa, 2021. "SCiMet: Stable, sCalable and reliable Metric-based framework for quality assessment in collaborative content generation systems," Journal of Informetrics, Elsevier, vol. 15(2).
    20. Irina Gerasimov & Binita KC & Armin Mehrabian & James Acker & Michael P. McGuire, 2024. "Comparison of datasets citation coverage in Google Scholar, Web of Science, Scopus, Crossref, and DataCite," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 3681-3704, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:127:y:2022:i:12:d:10.1007_s11192-022-04559-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.