IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v74y2018i1p8-17.html

Covariate selection with group lasso and doubly robust estimation of causal effects

Author

Listed:
  • Brandon Koch
  • David M. Vock
  • Julian Wolfson

Abstract

The efficiency of doubly robust estimators of the average causal effect (ACE) of a treatment can be improved by including in the treatment and outcome models only those covariates which are related to both treatment and outcome (i.e., confounders) or related only to the outcome. However, it is often challenging to identify such covariates among the large number that may be measured in a given study. In this article, we propose GLiDeR (Group Lasso and Doubly Robust Estimation), a novel variable selection technique for identifying confounders and predictors of outcome using an adaptive group lasso approach that simultaneously performs coefficient selection, regularization, and estimation across the treatment and outcome models. The selected variables and corresponding coefficient estimates are used in a standard doubly robust ACE estimator. We provide asymptotic results showing that, for a broad class of data generating mechanisms, GLiDeR yields a consistent estimator of the ACE when either the outcome or treatment model is correctly specified. A comprehensive simulation study shows that GLiDeR is more efficient than doubly robust methods using standard variable selection techniques and has substantial computational advantages over a recently proposed doubly robust Bayesian model averaging method. We illustrate our method by estimating the causal treatment effect of bilateral versus single-lung transplant on forced expiratory volume in one year after transplant using an observational registry.
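
The abstract refers to a "standard doubly robust ACE estimator" without writing it out. As a rough illustration only, the sketch below implements the textbook augmented inverse-probability-weighted (AIPW) form, which is one common doubly robust estimator; whether the article uses exactly this form is an assumption. It does not implement the GLiDeR group lasso selection step: the fitted propensity scores and outcome predictions are taken as given inputs. The function name aipw_ace and its argument names are hypothetical and do not come from the article.

```python
def aipw_ace(y, a, ps, mu1, mu0):
    """Textbook AIPW estimate of the average causal effect E[Y(1)] - E[Y(0)].

    y   : observed outcomes
    a   : treatment indicators (1 = treated, 0 = control)
    ps  : fitted propensity scores P(A = 1 | X), strictly between 0 and 1
    mu1 : outcome-model predictions under treatment, E[Y | A = 1, X]
    mu0 : outcome-model predictions under control,   E[Y | A = 0, X]

    All names are illustrative assumptions, not the article's notation.
    """
    contributions = []
    for yi, ai, pi, m1, m0 in zip(y, a, ps, mu1, mu0):
        # Outcome-model prediction plus an inverse-probability-weighted
        # residual correction, computed separately for each treatment arm.
        treated = m1 + ai * (yi - m1) / pi
        control = m0 + (1 - ai) * (yi - m0) / (1 - pi)
        contributions.append(treated - control)
    return sum(contributions) / len(contributions)


# Toy inputs (made up for illustration): the outcome predictions match the
# observed outcomes in each arm, while the propensity scores are arbitrary.
y = [1.0, 2.0, 3.0, 4.0]
a = [1, 0, 1, 0]
mu1 = [1.0, 3.0, 3.0, 5.0]
mu0 = [0.0, 2.0, 2.0, 4.0]
ps = [0.3, 0.3, 0.3, 0.3]
estimate = aipw_ace(y, a, ps, mu1, mu0)
```

In the toy call the residual corrections vanish because the outcome predictions fit the observed arms exactly, so the estimate depends only on mu1 and mu0 even though the propensity scores are arbitrary. That is the outcome-model half of the double robustness property described in the abstract; with correct propensity scores and a poor outcome model, the weighted residual terms carry the estimate instead.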

Suggested Citation

  • Brandon Koch & David M. Vock & Julian Wolfson, 2018. "Covariate selection with group lasso and doubly robust estimation of causal effects," Biometrics, The International Biometric Society, vol. 74(1), pages 8-17, March.
  • Handle: RePEc:bla:biomet:v:74:y:2018:i:1:p:8-17
    DOI: 10.1111/biom.12736

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12736
    Download Restriction: no


    References listed on IDEAS

    1. van der Laan Mark J. & Gruber Susan, 2010. "Collaborative Double Robust Targeted Maximum Likelihood Estimation," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-71, May.
    2. Chi Wang & Giovanni Parmigiani & Francesca Dominici, 2012. "Bayesian Effect Estimation Accounting for Adjustment Uncertainty," Biometrics, The International Biometric Society, vol. 68(3), pages 661-671, September.
    3. Leeb, Hannes & Potscher, Benedikt M., 2008. "Sparse estimators and the oracle property, or the return of Hodges' estimator," Journal of Econometrics, Elsevier, vol. 142(1), pages 201-211, January.
    4. Tyler J. VanderWeele & Ilya Shpitser, 2011. "A New Criterion for Confounder Selection," Biometrics, The International Biometric Society, vol. 67(4), pages 1406-1413, December.
    5. Chi Wang & Giovanni Parmigiani & Francesca Dominici, 2012. "Rejoinder: Bayesian Effect Estimation Accounting for Adjustment Uncertainty," Biometrics, The International Biometric Society, vol. 68(3), pages 680-686, September.
    6. Xavier De Luna & Ingeborg Waernbaum & Thomas S. Richardson, 2011. "Covariate selection for the nonparametric estimation of an average treatment effect," Biometrika, Biometrika Trust, vol. 98(4), pages 861-875.
    7. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.

    Citations



    Cited by:

    1. Masataka Taguri, 2022. "Discussion of “Akaike Memorial Lecture 2020: Some of the challenges of statistical applications”," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(4), pages 643-647, August.
    2. Roberto Esposti, 2022. "The Coevolution of Policy Support and Farmers' Behaviour. An investigation on Italian agriculture over the 2008-2019 period," Working Papers 464, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    3. Okyere, Charles Yaw & Kornher, Lukas, 2022. "Carbon Farming Training and Welfare: Evidence from Northern Ghana," Discussion Papers 324738, University of Bonn, Center for Development Research (ZEF).
    4. Okyere, Charles Yaw & Kornher, Lukas, 2023. "Carbon farming training and welfare: Evidence from Northern Ghana," Land Use Policy, Elsevier, vol. 134(C).
    5. David Cheng & Abhishek Chakrabortty & Ashwin N. Ananthakrishnan & Tianxi Cai, 2020. "Estimating average treatment effects with a double‐index propensity score," Biometrics, The International Biometric Society, vol. 76(3), pages 767-777, September.
    6. Wonder Agbenyo & Yuansheng Jiang & Xinxin Jia & Jingyi Wang & Gideon Ntim-Amo & Rahman Dunya & Anthony Siaw & Isaac Asare & Martinson Ankrah Twumasi, 2022. "Does the Adoption of Climate-Smart Agricultural Practices Impact Farmers’ Income? Evidence from Ghana," IJERPH, MDPI, vol. 19(7), pages 1-25, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    2. Joseph Antonelli & Matthew Cefalu & Nathan Palmer & Denis Agniel, 2018. "Doubly robust matching estimators for high dimensional confounding adjustment," Biometrics, The International Biometric Society, vol. 74(4), pages 1171-1179, December.
    3. Susan M. Shortreed & Ashkan Ertefaie, 2017. "Outcome‐adaptive lasso: Variable selection for causal inference," Biometrics, The International Biometric Society, vol. 73(4), pages 1111-1122, December.
    4. Matthew Cefalu & Francesca Dominici & Nils Arvold & Giovanni Parmigiani, 2017. "Model averaged double robust estimation," Biometrics, The International Biometric Society, vol. 73(2), pages 410-421, June.
    5. Xun Lu, 2015. "A Covariate Selection Criterion for Estimation of Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(4), pages 506-522, October.
    6. Lefebvre, Geneviève & Atherton, Juli & Talbot, Denis, 2014. "The effect of the prior distribution in the Bayesian Adjustment for Confounding algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 227-240.
    7. Thomas S. Richardson & James M. Robins & Linbo Wang, 2018. "Discussion of “Data‐driven confounder selection via Markov and Bayesian networks” by Häggström," Biometrics, The International Biometric Society, vol. 74(2), pages 403-406, June.
    8. Persson, Emma & Häggström, Jenny & Waernbaum, Ingeborg & de Luna, Xavier, 2017. "Data-driven algorithms for dimension reduction in causal inference," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 280-292.
    9. Tingting Zhou & Michael R. Elliott & Roderick J. A. Little, 2021. "Robust Causal Estimation from Observational Studies Using Penalized Spline of Propensity Score for Treatment Comparison," Stats, MDPI, vol. 4(2), pages 1-21, June.
    10. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    11. David Cheng & Abhishek Chakrabortty & Ashwin N. Ananthakrishnan & Tianxi Cai, 2020. "Estimating average treatment effects with a double‐index propensity score," Biometrics, The International Biometric Society, vol. 76(3), pages 767-777, September.
    12. M.J. Daniels & C. Wang & B.H. Marcus, 2014. "Fully Bayesian inference under ignorable missingness in the presence of auxiliary covariates," Biometrics, The International Biometric Society, vol. 70(1), pages 62-72, March.
    13. Antonelli Joseph & Cefalu Matthew, 2020. "Averaging causal estimators in high dimensions," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 92-107, January.
    14. Paola Berchialla & Veronica Sciannameo & Sara Urru & Corrado Lanera & Danila Azzolina & Dario Gregori & Ileana Baldi, 2021. "Adjustment for Baseline Covariates to Increase Efficiency in RCTs with Binary Endpoint: A Comparison of Bayesian and Frequentist Approaches," IJERPH, MDPI, vol. 18(15), pages 1-9, July.
    15. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    16. Adam A. Szpiro & Lianne Sheppard & Sara D. Adar & Joel D. Kaufman, 2014. "Estimating acute air pollution health effects from cohort study data," Biometrics, The International Biometric Society, vol. 70(1), pages 164-174, March.
    17. Edward H. Kennedy & Sivaraman Balakrishnan, 2018. "Discussion of “Data‐driven confounder selection via Markov and Bayesian networks” by Jenny Häggström," Biometrics, The International Biometric Society, vol. 74(2), pages 399-402, June.
    18. Yongnam Kim, 2019. "The Causal Structure of Suppressor Variables," Journal of Educational and Behavioral Statistics, vol. 44(4), pages 367-389, August.
    19. Jennifer F. Bobb & Maricela F. Cruz & Stephen J. Mooney & Adam Drewnowski & David Arterburn & Andrea J. Cook, 2022. "Accounting for spatial confounding in epidemiological studies with individual‐level exposures: An exposure‐penalized spline approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(3), pages 1271-1293, July.
    20. Chanmin Kim & Mauricio Tec & Corwin Zigler, 2023. "Bayesian nonparametric adjustment of confounding," Biometrics, The International Biometric Society, vol. 79(4), pages 3252-3265, December.
