Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i12p3080-3094.html

A cross-validation deletion-substitution-addition model selection algorithm: Application to marginal structural models

Author

Listed:
  • Haight, Thaddeus J.
  • Wang, Yue
  • van der Laan, Mark J.
  • Tager, Ira B.

Abstract

The cross-validation deletion-substitution-addition (cvDSA) algorithm uses data-adaptive estimation methodology to select and estimate marginal structural models (MSMs) for point treatment studies, as well as models for conditional means where the outcome is continuous or binary. The algorithm builds and selects models according to user-defined model selection criteria, and uses a loss function-based estimation procedure to distinguish between different model fits. In addition, the algorithm selects models by cross-validation to avoid over-fitting the data. The cvDSA routine is an R software package available for download. An alternative R package (DSA), based on the same principles as the cvDSA routine (i.e., cross-validation and a loss function) but faster and with additional refinements for selecting and estimating conditional means, is also available for download. Analyses of real and simulated data were conducted to demonstrate the use of these algorithms and to compare MSMs in which the causal effects were assumed (i.e., investigator-defined) with MSMs selected by the cvDSA. The package was also used to select models for the nuisance parameter (treatment) model in order to estimate the MSM parameters by inverse probability of treatment weighted (IPTW) estimation. Other estimation procedures (i.e., G-computation and double robust IPTW) are also available in the package.
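
To make the search strategy described in the abstract concrete, here is a minimal sketch of a deletion/substitution/addition search scored by cross-validated squared-error loss. It is an illustration under simplifying assumptions, not the cvDSA or DSA R package code: candidate models contain main-effect terms only, the search is a plain greedy descent on the cross-validated risk, and every name in it (fit_ols, design, cv_risk, moves, dsa_search, the simulated data) is hypothetical.

```python
import random

def fit_ols(X, y):
    """Least squares via normal equations and Gauss-Jordan elimination.
    Intended only for tiny, well-conditioned illustrative problems."""
    p = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(r[i] * r[j] for r in X) for j in range(p)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(p)]
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))  # partial pivoting
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]

def design(W, terms):
    """Intercept plus the selected covariate columns (main effects only)."""
    cols = sorted(terms)
    return [[1.0] + [w[j] for j in cols] for w in W]

def cv_risk(W, y, terms, folds=5):
    """V-fold cross-validated mean squared error of the model `terms`."""
    n, sse = len(y), 0.0
    for v in range(folds):
        train = [i for i in range(n) if i % folds != v]
        test = [i for i in range(n) if i % folds == v]
        beta = fit_ols(design([W[i] for i in train], terms),
                       [y[i] for i in train])
        for i in test:
            row = design([W[i]], terms)[0]
            sse += (y[i] - sum(b * x for b, x in zip(beta, row))) ** 2
    return sse / n

def moves(terms, p, max_size):
    """All models reachable by one deletion, substitution, or addition."""
    unused = set(range(p)) - terms
    out = [terms - {j} for j in terms]                            # deletion
    out += [(terms - {j}) | {k} for j in terms for k in unused]   # substitution
    if len(terms) < max_size:
        out += [terms | {k} for k in unused]                      # addition
    return out

def dsa_search(W, y, max_size=3):
    """Greedy descent: accept any move that lowers the cross-validated risk,
    and stop when no single move improves on the current model."""
    p = len(W[0])
    best = frozenset()                      # start from the intercept-only model
    best_risk = cv_risk(W, y, best)
    improved = True
    while improved:
        improved = False
        for cand in moves(best, p, max_size):
            risk = cv_risk(W, y, cand)
            if risk < best_risk:
                best, best_risk, improved = frozenset(cand), risk, True
    return sorted(best), best_risk

# Simulated data (an assumption for illustration): only covariates 0 and 2 matter.
random.seed(1)
W = [[random.gauss(0, 1) for _ in range(5)] for _ in range(100)]
y = [2.0 * w[0] - 1.5 * w[2] + random.gauss(0, 0.5) for w in W]

selected, risk = dsa_search(W, y)
print("selected covariates:", selected, "cross-validated risk:", round(risk, 3))
```

The max_size argument plays the role of a user-defined limit on model complexity. Scoring candidates on held-out folds rather than on the training fit is what keeps the search from rewarding over-fitted models.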

Suggested Citation

  • Haight, Thaddeus J. & Wang, Yue & van der Laan, Mark J. & Tager, Ira B., 2010. "A cross-validation deletion-substitution-addition model selection algorithm: Application to marginal structural models," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3080-3094, December.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:12:p:3080-3094

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00051-4
    Download Restriction: Full text for ScienceDirect subscribers only.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Iván Díaz & Elizabeth Colantuoni & Daniel F. Hanley & Michael Rosenblum, 2019. "Improved precision in the analysis of randomized trials with survival outcomes, without assuming proportional hazards," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(3), pages 439-468, July.
    2. Zhiwei Zhang & Zhen Chen & James F. Troendle & Jun Zhang, 2012. "Causal Inference on Quantiles with an Obstetric Application," Biometrics, The International Biometric Society, vol. 68(3), pages 697-706, September.
    3. Chaudhuri, Saraswata & Renault, Eric, 2023. "Efficient estimation of regression models with user-specified parametric model for heteroskedasticity," The Warwick Economics Research Paper Series (TWERPS) 1473, University of Warwick, Department of Economics.
    4. Jie Zhou & Zhiwei Zhang & Zhaohai Li & Jun Zhang, 2015. "Coarsened Propensity Scores and Hybrid Estimators for Missing Data and Causal Inference," International Statistical Review, International Statistical Institute, vol. 83(3), pages 449-471, December.
    5. Sun Hao & Ertefaie Ashkan & Lu Xin & Johnson Brent A., 2020. "Improved Doubly Robust Estimation in Marginal Mean Models for Dynamic Regimes," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 300-314, January.
    6. Jianxuan Liu & Yanyuan Ma & Lan Wang, 2018. "An alternative robust estimator of average treatment effect in causal inference," Biometrics, The International Biometric Society, vol. 74(3), pages 910-923, September.
    7. Porter Kristin E. & Gruber Susan & van der Laan Mark J. & Sekhon Jasjeet S., 2011. "The Relative Performance of Targeted Maximum Likelihood Estimators," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-34, August.
    8. Gruber Susan & van der Laan Mark J., 2012. "Targeted Minimum Loss Based Estimator that Outperforms a given Estimator," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-22, May.
    9. Guo, Xu & Fang, Yun & Zhu, Xuehu & Xu, Wangli & Zhu, Lixing, 2018. "Semiparametric double robust and efficient estimation for mean functionals with response missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 325-339.
    10. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    11. S Ariane Christie & Amanda S Conroy & Rachael A Callcut & Alan E Hubbard & Mitchell J Cohen, 2019. "Dynamic multi-outcome prediction after injury: Applying adaptive machine learning for precision medicine in trauma," PLOS ONE, Public Library of Science, vol. 14(4), pages 1-13, April.
    12. Waverly Wei & Maya Petersen & Mark J van der Laan & Zeyu Zheng & Chong Wu & Jingshen Wang, 2023. "Efficient targeted learning of heterogeneous treatment effects for multiple subgroups," Biometrics, The International Biometric Society, vol. 79(3), pages 1934-1946, September.
    13. Michael Rosenblum & Nicholas P. Jewell & Mark van der Laan & Stephen Shiboski & Ariane van der Straten & Nancy Padian, 2009. "Analysing direct effects in randomized trials with secondary interventions: an application to human immunodeficiency virus prevention trials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(2), pages 443-465, April.
    14. Luo, Yu & Graham, Daniel J. & McCoy, Emma J., 2023. "Semiparametric Bayesian doubly robust causal estimation," LSE Research Online Documents on Economics 117944, London School of Economics and Political Science, LSE Library.
    15. Victor Chernozhukov & Whitney K. Newey & Victor Quintas-Martinez & Vasilis Syrgkanis, 2021. "Automatic Debiased Machine Learning via Riesz Regression," Papers 2104.14737, arXiv.org, revised Mar 2024.
    16. Paul Frédéric Blanche & Anders Holt & Thomas Scheike, 2023. "On logistic regression with right censored data, with or without competing risks, and its use for estimating treatment effects," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(2), pages 441-482, April.
    17. Li Liang & Greene Tom, 2013. "A Weighting Analogue to Pair Matching in Propensity Score Analysis," The International Journal of Biostatistics, De Gruyter, vol. 9(2), pages 215-234, July.
    18. Yiyi Huo & Yingying Fan & Fang Han, 2023. "On the adaptation of causal forests to manifold data," Papers 2311.16486, arXiv.org, revised Dec 2023.
    19. Brian J. Reich & Shu Yang & Yawen Guan & Andrew B. Giffin & Matthew J. Miller & Ana Rappold, 2021. "A Review of Spatial Causal Inference Methods for Environmental and Epidemiological Applications," International Statistical Review, International Statistical Institute, vol. 89(3), pages 605-634, December.
    20. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
