IDEAS home Printed from https://ideas.repec.org/a/sae/medema/v43y2023i3p275-287.html
   My bibliography  Save this article

Mimicking Clinical Trials Using Real-World Data: A Novel Method and Applications

Author

Listed:
  • Wei-Jhih Wang

    (The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, University of Washington, Seattle, WA, USA)

  • Aasthaa Bansal

    (The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, University of Washington, Seattle, WA, USA)

  • Caroline Savage Bennette

    (The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, University of Washington, Seattle, WA, USA)

  • Anirban Basu

    (The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, University of Washington, Seattle, WA, USA
    Department of Health Services, University of Washington, Seattle, WA, USA
    Department of Economics, University of Washington, Seattle, WA, USA)

Abstract

Introduction Simulating individual-level trial data when only summary data are available is often useful for meta-analysis, forming external control arms and calibrating trial results to real-world data (RWD). The joint distribution of baseline characteristics in a trial is usually simulated by combining its summary data with RWD’s correlations. However, RWD correlations may not be a perfect proxy for the trial. A misspecified correlation structure could bias any analysis in which the outcomes generating models are nonlinear or include effect modifiers. Methods We developed an iterative algorithm using copula and resampling, which was based on the estimated propensity score for the likelihood of enrollment in a trial given participants’ characteristics. Validation was performed using Monte Carlo simulations under different scenarios in which the marginal and joint distributions of covariates differ between trial samples and RWD. Two applications were illustrated using an actual trial and the Surveillance, Epidemiology, and End Results–Medicare data. We calculated the standardized mean difference (SMD) to assess the generalizability of the trial and explored the feasibility of generating an external control by applying a parametric Weibull model trained in RWD to predict survival in the simulated trial cohort. Results Across all scenarios, approximated correlations derived from the algorithm were closer to the true correlations than the RWD’s correlations. The algorithm also successfully reproduced the joint distribution of characteristics for the actual trial. A similar SMD was observed using simulated data and individual-level trial data. The 95% confidence intervals were overlapped between adjusted survival estimates from the simulated trial and actual trial Kaplan-Meier estimates. Conclusions The algorithm could be a feasible way to simulate individual-level data when only summary data are available. Further research is needed to validate our approach with larger sample sizes. Highlights The correlation structure is crucial to building the joint distribution of patient characteristics, and a misspecified correlation structure could potentially influence predicted outcomes. An iterative algorithm was developed to approximate a trial’s correlation structure using published summary trial data and real-world data. The algorithm could be a feasible way to simulate individual-level trial data when only trial summary data are available.

Suggested Citation

  • Wei-Jhih Wang & Aasthaa Bansal & Caroline Savage Bennette & Anirban Basu, 2023. "Mimicking Clinical Trials Using Real-World Data: A Novel Method and Applications," Medical Decision Making, , vol. 43(3), pages 275-287, April.
  • Handle: RePEc:sae:medema:v:43:y:2023:i:3:p:275-287
    DOI: 10.1177/0272989X221141381
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X221141381
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0272989X221141381?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alexander J. McNeil & Rüdiger Frey & Paul Embrechts, 2015. "Quantitative Risk Management: Concepts, Techniques and Tools Revised edition," Economics Books, Princeton University Press, edition 2, number 10496.
    2. Basu, A & Polsky, D & Manning, W G, 2008. "Use of propensity scores in non-linear response models: The case for health care expenditures," Health, Econometrics and Data Group (HEDG) Working Papers 08/11, HEDG, c/o Department of Economics, University of York.
    3. Casey Quinn, 2007. "The health-economic applications of copulas: methods in applied econometric research," Health, Econometrics and Data Group (HEDG) Working Papers 07/22, HEDG, c/o Department of Economics, University of York.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noemi Kreif & Richard Grieve & Rosalba Radice & Zia Sadique & Roland Ramsahai & Jasjeet S. Sekhon, 2012. "Methods for Estimating Subgroup Effects in Cost-Effectiveness Analyses That Use Observational Data," Medical Decision Making, , vol. 32(6), pages 750-763, November.
    2. Abduraimova, Kumushoy, 2022. "Contagion and tail risk in complex financial networks," Journal of Banking & Finance, Elsevier, vol. 143(C).
    3. Masahiko Egami & Rusudan Kevkhishvili, 2020. "Time reversal and last passage time of diffusions with applications to credit risk management," Finance and Stochastics, Springer, vol. 24(3), pages 795-825, July.
    4. Avanzi, Benjamin & Taylor, Greg & Wong, Bernard & Yang, Xinda, 2021. "On the modelling of multivariate counts with Cox processes and dependent shot noise intensities," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 9-24.
    5. Pfeifer Dietmar & Mändle Andreas & Ragulina Olena, 2017. "New copulas based on general partitions-of-unity and their applications to risk management (part II)," Dependence Modeling, De Gruyter, vol. 5(1), pages 246-255, October.
    6. Makam, Vaishno Devi & Millossovich, Pietro & Tsanakas, Andreas, 2021. "Sensitivity analysis with χ2-divergences," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 372-383.
    7. Noémi Kreif & Richard Grieve & M. Zia Sadique, 2013. "Statistical Methods For Cost‐Effectiveness Analyses That Use Observational Data: A Critical Appraisal Tool And Review Of Current Practice," Health Economics, John Wiley & Sons, Ltd., vol. 22(4), pages 486-500, April.
    8. Nevrla, Matěj, 2020. "Systemic risk in European financial and energy sectors: Dynamic factor copula approach," Economic Systems, Elsevier, vol. 44(4).
    9. H. Kaibuchi & Y. Kawasaki & G. Stupfler, 2022. "GARCH-UGH: a bias-reduced approach for dynamic extreme Value-at-Risk estimation in financial time series," Quantitative Finance, Taylor & Francis Journals, vol. 22(7), pages 1277-1294, July.
    10. Battulga Gankhuu, 2022. "Merton's Default Risk Model for Private Company," Papers 2208.01974, arXiv.org.
    11. Stephan Schlüter & Fabian Menz & Milena Kojić & Petar Mitić & Aida Hanić, 2022. "A Novel Approach to Generate Hourly Photovoltaic Power Scenarios," Sustainability, MDPI, vol. 14(8), pages 1-16, April.
    12. Borowska, Agnieszka & Hoogerheide, Lennart & Koopman, Siem Jan & van Dijk, Herman K., 2020. "Partially censored posterior for robust and efficient risk evaluation," Journal of Econometrics, Elsevier, vol. 217(2), pages 335-355.
    13. Dietmar Pfeifer & Olena Ragulina, 2018. "Generating VaR Scenarios under Solvency II with Product Beta Distributions," Risks, MDPI, vol. 6(4), pages 1-15, October.
    14. E. Ramos-P'erez & P. J. Alonso-Gonz'alez & J. J. N'u~nez-Vel'azquez, 2020. "Forecasting volatility with a stacked model based on a hybridized Artificial Neural Network," Papers 2006.16383, arXiv.org, revised Aug 2020.
    15. Paolo Gorgi & Siem Jan Koopman, 2020. "Beta observation-driven models with exogenous regressors: a joint analysis of realized correlation and leverage effects," Tinbergen Institute Discussion Papers 20-004/III, Tinbergen Institute.
    16. Dimitris Bertsimas & Agni Orfanoudaki, 2021. "Algorithmic Insurance," Papers 2106.00839, arXiv.org, revised Dec 2022.
    17. Jasjeet Singh Sekhon & Richard D. Grieve, 2012. "A matching method for improving covariate balance in cost‐effectiveness analyses," Health Economics, John Wiley & Sons, Ltd., vol. 21(6), pages 695-714, June.
    18. Xuehai Zhang, 2019. "Value at Risk and Expected Shortfall under General Semi-parametric GARCH models," Working Papers CIE 123, Paderborn University, CIE Center for International Economics.
    19. Chandra Bhat & Ipek Sener, 2009. "A copula-based closed-form binary logit choice model for accommodating spatial correlation across observational units," Journal of Geographical Systems, Springer, vol. 11(3), pages 243-272, September.
    20. Paolo Vanini & Sebastiano Rossi & Ermin Zvizdic & Thomas Domenig, 2023. "Online payment fraud: from anomaly detection to risk management," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-25, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:43:y:2023:i:3:p:275-287. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.