IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v34y2025i4d10.1007_s10260-025-00800-5.html
   My bibliography  Save this article

A flexible parametric approach to synthetic patients generation using health data

Author

Listed:
  • Marta Cipriani

    (Sapienza University of Rome
    Roma Tre University)

  • Lorenzo Di Rocco

    (Sapienza University of Rome)

  • Maria Puopolo

    (Department of Neuroscience, Istituto Superiore di Sanità)

  • Marco Alfò

    (Sapienza University of Rome)

Abstract

Enhancing reproducibility and data accessibility is essential to scientific research. However, ensuring data privacy while achieving these goals is challenging, especially in the medical field, where sensitive data are often commonplace. One possible solution is to use synthetic data that mimic real-world datasets. This approach may help to streamline therapy evaluation and enable quicker access to innovative treatments. We propose using a method based on sequential conditional regressions, such as in a fully conditional specification (FCS) approach, along with flexible parametric survival models to accurately replicate covariate patterns and survival times. To make our approach available to a wide audience of users, we have developed user-friendly functions in R and Python to implement it. We also provide an example application to registry data on patients affected by Creutzfeld–Jacob disease. The results show the potentialities of the proposed method in mirroring observed multivariate distributions and survival outcomes.

Suggested Citation

  • Marta Cipriani & Lorenzo Di Rocco & Maria Puopolo & Marco Alfò, 2025. "A flexible parametric approach to synthetic patients generation using health data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 34(4), pages 639-662, September.
  • Handle: RePEc:spr:stmapp:v:34:y:2025:i:4:d:10.1007_s10260-025-00800-5
    DOI: 10.1007/s10260-025-00800-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-025-00800-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-025-00800-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:34:y:2025:i:4:d:10.1007_s10260-025-00800-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.