IDEAS home Printed from https://ideas.repec.org/a/plo/pdig00/0000101.html
   My bibliography  Save this article

Privacy-aware multi-institutional time-to-event studies

Author

Listed:
  • Julian Späth
  • Julian Matschinske
  • Frederick K Kamanu
  • Sabina A Murphy
  • Olga Zolotareva
  • Mohammad Bakhtiari
  • Elliott M Antman
  • Joseph Loscalzo
  • Alissa Brauneck
  • Louisa Schmalhorst
  • Gabriele Buchholtz
  • Jan Baumbach

Abstract

Clinical time-to-event studies are dependent on large sample sizes, often not available at a single institution. However, this is countered by the fact that, particularly in the medical field, individual institutions are often legally unable to share their data, as medical data is subject to strong privacy protection due to its particular sensitivity. But the collection, and especially aggregation into centralized datasets, is also fraught with substantial legal risks and often outright unlawful. Existing solutions using federated learning have already demonstrated considerable potential as an alternative for central data collection. Unfortunately, current approaches are incomplete or not easily applicable in clinical studies owing to the complexity of federated infrastructures. This work presents privacy-aware and federated implementations of the most used time-to-event algorithms (survival curve, cumulative hazard rate, log-rank test, and Cox proportional hazards model) in clinical trials, based on a hybrid approach of federated learning, additive secret sharing, and differential privacy. On several benchmark datasets, we show that all algorithms produce highly similar, or in some cases, even identical results compared to traditional centralized time-to-event algorithms. Furthermore, we were able to reproduce the results of a previous clinical time-to-event study in various federated scenarios. All algorithms are accessible through the intuitive web-app Partea (https://partea.zbh.uni-hamburg.de), offering a graphical user interface for clinicians and non-computational researchers without programming knowledge. Partea removes the high infrastructural hurdles derived from existing federated learning approaches and removes the complexity of execution. Therefore, it is an easy-to-use alternative to central data collection, reducing bureaucratic efforts but also the legal risks associated with the processing of personal data to a minimum.Author summary: Collecting data centrally from different sites in the clinical time-to-event analysis is still challenging due to the high bureaucratic effort and strict data protection laws such as the GDPR. However, huge datasets are needed to extract valuable insights from the data by applying statistical and machine learning approaches. Current approaches are still incomplete: they often do not address privacy issues in any depth, have inaccessible user interfaces, do not cover multiple algorithms, or are not open-source. In contrast, the approach we present in this work is an open-source tool for privacy-aware time-to-event analysis (Partea) that can be intuitively used by clinicians, statisticians, and other researchers. It allows the users to run state-of-the-art privacy-aware time-to-event analysis on data distributed between multiple sites through an easy-to-use interface and solves technical and legal issues for the underlying technologies.

Suggested Citation

  • Julian Späth & Julian Matschinske & Frederick K Kamanu & Sabina A Murphy & Olga Zolotareva & Mohammad Bakhtiari & Elliott M Antman & Joseph Loscalzo & Alissa Brauneck & Louisa Schmalhorst & Gabriele B, 2022. "Privacy-aware multi-institutional time-to-event studies," PLOS Digital Health, Public Library of Science, vol. 1(9), pages 1-16, September.
  • Handle: RePEc:plo:pdig00:0000101
    DOI: 10.1371/journal.pdig.0000101
    as

    Download full text from publisher

    File URL: https://journals.plos.org/digitalhealth/article?id=10.1371/journal.pdig.0000101
    Download Restriction: no

    File URL: https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0000101&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pdig.0000101?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yinghui Wei & Patrick Royston, 2017. "Reconstructing time-to-event data from published Kaplan–Meier curves," Stata Journal, StataCorp LLC, vol. 17(4), pages 786-802, December.
    2. David Froelicher & Juan R. Troncoso-Pastoriza & Jean Louis Raisaro & Michel A. Cuendet & Joao Sa Sousa & Hyunghoon Cho & Bonnie Berger & Jacques Fellay & Jean-Pierre Hubaux, 2021. "Truly privacy-preserving federated analytics for precision medicine with multiparty homomorphic encryption," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    3. David Froelicher & Juan R. Troncoso-Pastoriza & Jean Louis Raisaro & Michel A. Cuendet & Joao Sa Sousa & Hyunghoon Cho & Bonnie Berger & Jacques Fellay & Jean-Pierre Hubaux, 2021. "Author Correction: Truly privacy-preserving federated analytics for precision medicine with multiparty homomorphic encryption," Nature Communications, Nature, vol. 12(1), pages 1-1, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xavier Escriba-Montagut & Yannick Marcon & Augusto Anguita-Ruiz & Demetris Avraam & Jose Urquiza & Andrei S Morgan & Rebecca C Wilson & Paul Burton & Juan R Gonzalez, 2024. "Federated privacy-protected meta- and mega-omics data analysis in multi-center studies with a fully open-source analytic platform," PLOS Computational Biology, Public Library of Science, vol. 20(12), pages 1-22, December.
    2. Helin Yang & Kwok-Yan Lam & Liang Xiao & Zehui Xiong & Hao Hu & Dusit Niyato & H. Vincent Poor, 2022. "Lead federated neuromorphic learning for wireless edge artificial intelligence," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    3. Miran Kim & Xiaoqian Jiang & Kristin Lauter & Elkhan Ismayilzada & Shayan Shams, 2022. "Secure human action recognition by encrypted neural network inference," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    4. Chongliang Luo & Md. Nazmul Islam & Natalie E. Sheils & John Buresh & Jenna Reps & Martijn J. Schuemie & Patrick B. Ryan & Mackenzie Edmondson & Rui Duan & Jiayi Tong & Arielle Marks-Anglin & Jiang Bi, 2022. "DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    5. Tao Qi & Fangzhao Wu & Chuhan Wu & Liang He & Yongfeng Huang & Xing Xie, 2023. "Differentially private knowledge transfer for federated learning," Nature Communications, Nature, vol. 14(1), pages 1-9, December.
    6. Edin Basic & Mathias Kappel & Arpit Misra & Leopold Sellner & Boris A. Ratsch & Dennis A. Ostwald, 2020. "Budget impact analysis of the use of oral and intravenous therapy regimens for the treatment of relapsed or refractory multiple myeloma in Germany," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 21(9), pages 1351-1361, December.
    7. So-Young Ha & Dong-Won Kang & Hye-In Jung & Eui-Kyung Lee & Mi-Hai Park, 2022. "Value-Based Pricing and Budget Impact Analysis for Multi-Indication Drugs: A Case Study of Immunotherapies," IJERPH, MDPI, vol. 19(7), pages 1-10, March.
    8. Lorna Wheaton & Anastasios Papanikos & Anne Thomas & Sylwia Bujkiewicz, 2023. "Using Bayesian Evidence Synthesis Methods to Incorporate Real-World Evidence in Surrogate Endpoint Evaluation," Medical Decision Making, , vol. 43(5), pages 539-552, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pdig00:0000101. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: digitalhealth (email available below). General contact details of provider: https://journals.plos.org/digitalhealth .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.