IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v174y2022ics0167947322001086.html
   My bibliography  Save this article

GPU accelerated estimation of a shared random effect joint model for dynamic prediction

Author

Listed:
  • Wang, Shikun
  • Li, Zhao
  • Lan, Lan
  • Zhao, Jieyi
  • Zheng, W. Jim
  • Li, Liang

Abstract

In longitudinal cohort studies, it is often of interest to predict the risk of a terminal clinical event using longitudinal predictor data among subjects at risk by the time of the prediction. The at-risk population changes over time; so does the association between predictors and the outcome, as well as the accumulating longitudinal predictor history. The dynamic nature of this prediction problem has received increasing interest in the literature, but computation often poses a challenge. The widely used joint model of longitudinal and survival data often comes with intensive computation and excessive model fitting time, due to numerical optimization and the analytically intractable high-dimensional integral in the likelihood function. This problem is exacerbated when the model is fit to a large dataset or the model involves multiple longitudinal predictors with nonlinear trajectories. This challenge can be addressed from an algorithmic perspective, by a novel two-stage estimation procedure, and from a computing perspective, by Graphics Processing Unit (GPU) programming. The latter is implemented through PyTorch, an emerging deep learning framework. The numerical studies demonstrate that the proposed algorithm and software can substantially speed up the estimation of the joint model, particularly with large datasets. The numerical studies also concluded that accounting for nonlinearity in longitudinal predictor trajectories can improve the prediction accuracy in comparison to joint modeling that ignore nonlinearity.

Suggested Citation

  • Wang, Shikun & Li, Zhao & Lan, Lan & Zhao, Jieyi & Zheng, W. Jim & Li, Liang, 2022. "GPU accelerated estimation of a shared random effect joint model for dynamic prediction," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
  • Handle: RePEc:eee:csdana:v:174:y:2022:i:c:s0167947322001086
    DOI: 10.1016/j.csda.2022.107528
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322001086
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107528?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dimitris Rizopoulos, 2011. "Dynamic Predictions and Prospective Accuracy in Joint Models for Longitudinal and Time-to-Event Data," Biometrics, The International Biometric Society, vol. 67(3), pages 819-829, September.
    2. Jeremy M. G. Taylor & Yongseok Park & Donna P. Ankerst & Cecile Proust-Lima & Scott Williams & Larry Kestin & Kyoungwha Bae & Tom Pickles & Howard Sandler, 2013. "Real-Time Individual Predictions of Prostate Cancer Recurrence Using Joint Models," Biometrics, The International Biometric Society, vol. 69(1), pages 206-213, March.
    3. Mogensen, Ulla B. & Ishwaran, Hemant & Gerds, Thomas A., 2012. "Evaluating Random Forests for Survival Analysis Using Prediction Error Curves," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 50(i11).
    4. Marlena Maziarz & Patrick Heagerty & Tianxi Cai & Yingye Zheng, 2017. "On longitudinal prediction with time-to-event outcome: Comparison of modeling options," Biometrics, The International Biometric Society, vol. 73(1), pages 83-93, March.
    5. Fushing Hsieh & Yi-Kuan Tseng & Jane-Ling Wang, 2006. "Joint Modeling of Survival and Longitudinal Data: Likelihood Approach Revisited," Biometrics, The International Biometric Society, vol. 62(4), pages 1037-1043, December.
    6. Lei Liu & Xuelin Huang & John O'Quigley, 2008. "Analysis of Longitudinal Data in the Presence of Informative Observational Times and a Dependent Terminal Event, with Application to Medical Cost Data," Biometrics, The International Biometric Society, vol. 64(3), pages 950-958, September.
    7. Liang Li & Sheng Luo & Bo Hu & Tom Greene, 2017. "Dynamic Prediction of Renal Failure Using Longitudinal Biomarkers in a Cohort Study of Chronic Kidney Disease," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 357-378, December.
    8. Charles R. Harris & K. Jarrod Millman & Stéfan J. Walt & Ralf Gommers & Pauli Virtanen & David Cournapeau & Eric Wieser & Julian Taylor & Sebastian Berg & Nathaniel J. Smith & Robert Kern & Matti Picu, 2020. "Array programming with NumPy," Nature, Nature, vol. 585(7825), pages 357-362, September.
    9. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167.
    10. Rizopoulos, Dimitris, 2016. "The R Package JMbayes for Fitting Joint Models for Longitudinal and Time-to-Event Data Using MCMC," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 72(i07).
    11. Michael J. Crowther & Keith R. Abrams & Paul C. Lambert, 2013. "Joint modeling of longitudinal and survival data," Stata Journal, StataCorp LP, vol. 13(1), pages 165-184, March.
    12. Yi-Kuan Tseng & Fushing Hsieh & Jane-Ling Wang, 2005. "Joint modelling of accelerated failure time and longitudinal data," Biometrika, Biometrika Trust, vol. 92(3), pages 587-603, September.
    13. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506.
    14. Rizopoulos, Dimitris, 2010. "JM: An R Package for the Joint Modelling of Longitudinal and Time-to-Event Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 35(i09).
    15. Yingye Zheng & Patrick J. Heagerty, 2005. "Partly Conditional Survival Models for Longitudinal Data," Biometrics, The International Biometric Society, vol. 61(2), pages 379-391, June.
    16. Inyoung Kim & Noah D. Cohen & Raymond J. Carroll, 2003. "Semiparametric Regression Splines in Matched Case-Control Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 1158-1169, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kamaryn T. Tanner & Linda D. Sharples & Rhian M. Daniel & Ruth H. Keogh, 2021. "Dynamic survival prediction combining landmarking with a machine learning ensemble: Methodology and empirical comparison," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 3-30, January.
    2. Liang Li & Sheng Luo & Bo Hu & Tom Greene, 2017. "Dynamic Prediction of Renal Failure Using Longitudinal Biomarkers in a Cohort Study of Chronic Kidney Disease," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 357-378, December.
    3. Lisa M. McCrink & Adele H. Marshall & Karen J. Cairns, 2013. "Advances in Joint Modelling: A Review of Recent Developments with Application to the Survival of End Stage Renal Disease Patients," International Statistical Review, International Statistical Institute, vol. 81(2), pages 249-269, August.
    4. Otto-Sobotka, Fabian & Salvati, Nicola & Ranalli, Maria Giovanna & Kneib, Thomas, 2019. "Adaptive semiparametric M-quantile regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 116-129.
    5. Timothy K.M. Beatty & Erling Røed Larsen, 2005. "Using Engel curves to estimate bias in the Canadian CPI as a cost of living index," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 38(2), pages 482-499, May.
    6. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Econom\'etrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    7. Hyunju Son & Youyi Fong, 2021. "Fast grid search and bootstrap‐based inference for continuous two‐phase polynomial regression models," Environmetrics, John Wiley & Sons, Ltd., vol. 32(3), May.
    8. Michael Wegener & Göran Kauermann, 2017. "Forecasting in nonlinear univariate time series using penalized splines," Statistical Papers, Springer, vol. 58(3), pages 557-576, September.
    9. Dlugosz, Stephan & Mammen, Enno & Wilke, Ralf A., 2017. "Generalized partially linear regression with misclassified data and an application to labour market transitions," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 145-159.
    10. Bernhard Baumgartner & Daniel Guhl & Thomas Kneib & Winfried J. Steiner, 2018. "Flexible estimation of time-varying effects for frequently purchased retail goods: a modeling approach based on household panel data," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 40(4), pages 837-873, October.
    11. Zi Ye & Giles Hooker & Stephen P. Ellner, 2021. "Generalized Single Index Models and Jensen Effects on Reproduction and Survival," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(3), pages 492-512, September.
    12. Ferraccioli, Federico & Sangalli, Laura M. & Finos, Livio, 2022. "Some first inferential tools for spatial regression with differential regularization," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    13. Alexander Dokumentov & Rob J. Hyndman, 2022. "STR: Seasonal-Trend Decomposition Using Regression," INFORMS Joural on Data Science, INFORMS, vol. 1(1), pages 50-62, April.
    14. Kalogridis, Ioannis & Van Aelst, Stefan, 2023. "Robust penalized estimators for functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    15. Krisztin, Tamás, 2018. "Semi-parametric spatial autoregressive models in freight generation modeling," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 114(C), pages 121-143.
    16. Lauren N. Berry & Nathaniel E. Helwig, 2021. "Cross-Validation, Information Theory, or Maximum Likelihood? A Comparison of Tuning Methods for Penalized Splines," Stats, MDPI, vol. 4(3), pages 1-24, September.
    17. Nagler Thomas & Czado Claudia & Schellhase Christian, 2017. "Nonparametric estimation of simplified vine copula models: comparison of methods," Dependence Modeling, De Gruyter, vol. 5(1), pages 99-120, January.
    18. Yukun Zhang & Haocheng Li & Sarah Kozey Keadle & Charles E. Matthews & Raymond J. Carroll, 2019. "A Review of Statistical Analyses on Physical Activity Data Collected from Accelerometers," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(2), pages 465-476, July.
    19. Li, Kan & Luo, Sheng, 2019. "Bayesian functional joint models for multivariate longitudinal and time-to-event data," Computational Statistics & Data Analysis, Elsevier, vol. 129(C), pages 14-29.
    20. Wei Huang & Oliver Linton & Zheng Zhang, 2021. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Papers 2102.08063, arXiv.org, revised Sep 2021.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:174:y:2022:i:c:s0167947322001086. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.