IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v73y2017i4p1111-1122.html
   My bibliography  Save this article

Outcome‐adaptive lasso: Variable selection for causal inference

Author

Listed:
  • Susan M. Shortreed
  • Ashkan Ertefaie

Abstract

Methodological advancements, including propensity score methods, have resulted in improved unbiased estimation of treatment effects from observational data. Traditionally, a “throw in the kitchen sink” approach has been used to select covariates for inclusion into the propensity score, but recent work shows including unnecessary covariates can impact both the bias and statistical efficiency of propensity score estimators. In particular, the inclusion of covariates that impact exposure but not the outcome, can inflate standard errors without improving bias, while the inclusion of covariates associated with the outcome but unrelated to exposure can improve precision. We propose the outcome‐adaptive lasso for selecting appropriate covariates for inclusion in propensity score models to account for confounding bias and maintaining statistical efficiency. This proposed approach can perform variable selection in the presence of a large number of spurious covariates, that is, covariates unrelated to outcome or exposure. We present theoretical and simulation results indicating that the outcome‐adaptive lasso selects the propensity score model that includes all true confounders and predictors of outcome, while excluding other covariates. We illustrate covariate selection using the outcome‐adaptive lasso, including comparison to alternative approaches, using simulated data and in a survey of patients using opioid therapy to manage chronic pain.

Suggested Citation

  • Susan M. Shortreed & Ashkan Ertefaie, 2017. "Outcome‐adaptive lasso: Variable selection for causal inference," Biometrics, The International Biometric Society, vol. 73(4), pages 1111-1122, December.
  • Handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1111-1122
    DOI: 10.1111/biom.12679
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12679
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12679?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Andrea Rotnitzky & Lingling Li & Xiaochun Li, 2010. "A note on overadjustment in inverse probability weighted estimation," Biometrika, Biometrika Trust, vol. 97(4), pages 997-1001.
    3. Corwin M. Zigler & Krista Watts & Robert W. Yeh & Yun Wang & Brent A. Coull & Francesca Dominici, 2013. "Model Feedback in Bayesian Propensity Score Estimation," Biometrics, The International Biometric Society, vol. 69(1), pages 263-273, March.
    4. Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
    5. Chi Wang & Francesca Dominici & Giovanni Parmigiani & Corwin Matthew Zigler, 2015. "Accounting for uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models," Biometrics, The International Biometric Society, vol. 71(3), pages 654-665, September.
    6. van der Laan Mark J. & Gruber Susan, 2010. "Collaborative Double Robust Targeted Maximum Likelihood Estimation," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-71, May.
    7. Chi Wang & Giovanni Parmigiani & Francesca Dominici, 2012. "Bayesian Effect Estimation Accounting for Adjustment Uncertainty," Biometrics, The International Biometric Society, vol. 68(3), pages 661-671, September.
    8. Leeb, Hannes & Potscher, Benedikt M., 2008. "Sparse estimators and the oracle property, or the return of Hodges' estimator," Journal of Econometrics, Elsevier, vol. 142(1), pages 201-211, January.
    9. Chi Wang & Giovanni Parmigiani & Francesca Dominici, 2012. "Rejoinder: Bayesian Effect Estimation Accounting for Adjustment Uncertainty," Biometrics, The International Biometric Society, vol. 68(3), pages 680-686, September.
    10. Leeb, Hannes & Pötscher, Benedikt M., 2005. "Model Selection And Inference: Facts And Fiction," Econometric Theory, Cambridge University Press, vol. 21(1), pages 21-59, February.
    11. Xavier De Luna & Ingeborg Waernbaum & Thomas S. Richardson, 2011. "Covariate selection for the nonparametric estimation of an average treatment effect," Biometrika, Biometrika Trust, vol. 98(4), pages 861-875.
    12. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    13. Wei Lin & Rui Feng & Hongzhe Li, 2015. "Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 270-288, March.
    14. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    15. Corwin Matthew Zigler & Francesca Dominici, 2014. "Uncertainty in Propensity Score Estimation: Bayesian Methods for Variable Selection and Model-Averaged Causal Effects," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 95-107, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thomas S. Richardson & James M. Robins & Linbo Wang, 2018. "Discussion of “Data†driven confounder selection via Markov and Bayesian networks†by Häggström," Biometrics, The International Biometric Society, vol. 74(2), pages 403-406, June.
    2. Rui Chen & Guanhua Chen & Menggang Yu, 2023. "Entropy balancing for causal generalization with target sample summary information," Biometrics, The International Biometric Society, vol. 79(4), pages 3179-3190, December.
    3. Chanmin Kim & Mauricio Tec & Corwin Zigler, 2023. "Bayesian nonparametric adjustment of confounding," Biometrics, The International Biometric Society, vol. 79(4), pages 3252-3265, December.
    4. Samarth Gupta, 2023. "Model-Selection Inference for Causal Impact of Clusters and Collaboration on MSMEs in India," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 21(3), pages 641-662, September.
    5. Yongnam Kim, 2019. "The Causal Structure of Suppressor Variables," Journal of Educational and Behavioral Statistics, , vol. 44(4), pages 367-389, August.
    6. Uehleke, Reinhard & Petrick, Martin & Hüttel, Silke, 2022. "Evaluations of agri-environmental schemes based on observational farm data: The importance of covariate selection," Land Use Policy, Elsevier, vol. 114(C).
    7. Tingting Zhou & Michael R. Elliott & Roderick J. A. Little, 2021. "Robust Causal Estimation from Observational Studies Using Penalized Spline of Propensity Score for Treatment Comparison," Stats, MDPI, vol. 4(2), pages 1-21, June.
    8. Xu Qin & Jonah Deutsch & Guanglei Hong, 2021. "Unpacking Complex Mediation Mechanisms And Their Heterogeneity Between Sites In A Job Corps Evaluation," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 40(1), pages 158-190, January.
    9. Leonard Henckel & Emilija Perković & Marloes H. Maathuis, 2022. "Graphical criteria for efficient total effect estimation via adjustment in causal linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 579-599, April.
    10. Cai, Xizhen & Zhu, Yeying & Huang, Yuan & Ghosh, Debashis, 2022. "High-dimensional causal mediation analysis based on partial linear structural equation models," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    11. Sean Yiu & Li Su, 2022. "Joint calibrated estimation of inverse probability of treatment and censoring weights for marginal structural models," Biometrics, The International Biometric Society, vol. 78(1), pages 115-127, March.
    12. David Cheng & Abhishek Chakrabortty & Ashwin N. Ananthakrishnan & Tianxi Cai, 2020. "Estimating average treatment effects with a double‐index propensity score," Biometrics, The International Biometric Society, vol. 76(3), pages 767-777, September.
    13. Joseph Antonelli & Georgia Papadogeorgou & Francesca Dominici, 2022. "Causal inference in high dimensions: A marriage between Bayesian modeling and good frequentist properties," Biometrics, The International Biometric Society, vol. 78(1), pages 100-114, March.
    14. Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    15. Antonelli Joseph & Cefalu Matthew, 2020. "Averaging causal estimators in high dimensions," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 92-107, January.
    16. Ertefaie Ashkan & Asgharian Masoud & Stephens David A., 2018. "Variable Selection in Causal Inference using a Simultaneous Penalization Method," Journal of Causal Inference, De Gruyter, vol. 6(1), pages 1-16, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    2. Joseph Antonelli & Matthew Cefalu & Nathan Palmer & Denis Agniel, 2018. "Doubly robust matching estimators for high dimensional confounding adjustment," Biometrics, The International Biometric Society, vol. 74(4), pages 1171-1179, December.
    3. Matthew Cefalu & Francesca Dominici & Nils Arvold & Giovanni Parmigiani, 2017. "Model averaged double robust estimation," Biometrics, The International Biometric Society, vol. 73(2), pages 410-421, June.
    4. Ertefaie Ashkan & Asgharian Masoud & Stephens David A., 2018. "Variable Selection in Causal Inference using a Simultaneous Penalization Method," Journal of Causal Inference, De Gruyter, vol. 6(1), pages 1-16, March.
    5. Dingke Tang & Dehan Kong & Wenliang Pan & Linbo Wang, 2023. "Ultra‐high dimensional variable selection for doubly robust causal inference," Biometrics, The International Biometric Society, vol. 79(2), pages 903-914, June.
    6. Chanmin Kim & Mauricio Tec & Corwin Zigler, 2023. "Bayesian nonparametric adjustment of confounding," Biometrics, The International Biometric Society, vol. 79(4), pages 3252-3265, December.
    7. Ander Wilson & Corwin M. Zigler & Chirag J. Patel & Francesca Dominici, 2018. "Model‐averaged confounder adjustment for estimating multivariate exposure effects with linear regression," Biometrics, The International Biometric Society, vol. 74(3), pages 1034-1044, September.
    8. Brandon Koch & David M. Vock & Julian Wolfson, 2018. "Covariate selection with group lasso and doubly robust estimation of causal effects," Biometrics, The International Biometric Society, vol. 74(1), pages 8-17, March.
    9. Tingting Zhou & Michael R. Elliott & Roderick J. A. Little, 2021. "Robust Causal Estimation from Observational Studies Using Penalized Spline of Propensity Score for Treatment Comparison," Stats, MDPI, vol. 4(2), pages 1-21, June.
    10. Anders Bredahl Kock, 2012. "On the Oracle Property of the Adaptive Lasso in Stationary and Nonstationary Autoregressions," CREATES Research Papers 2012-05, Department of Economics and Business Economics, Aarhus University.
    11. Ander Wilson & Brian J. Reich, 2014. "Confounder selection via penalized credible regions," Biometrics, The International Biometric Society, vol. 70(4), pages 852-861, December.
    12. Xun Lu, 2015. "A Covariate Selection Criterion for Estimation of Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(4), pages 506-522, October.
    13. Antonelli Joseph & Cefalu Matthew, 2020. "Averaging causal estimators in high dimensions," Journal of Causal Inference, De Gruyter, vol. 8(1), pages 92-107, January.
    14. Liao, Zhipeng & Phillips, Peter C. B., 2015. "Automated Estimation Of Vector Error Correction Models," Econometric Theory, Cambridge University Press, vol. 31(3), pages 581-646, June.
    15. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    16. Pötscher, Benedikt M. & Leeb, Hannes, 2009. "On the distribution of penalized maximum likelihood estimators: The LASSO, SCAD, and thresholding," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 2065-2082, October.
    17. Lu, Xun & Su, Liangjun, 2016. "Shrinkage estimation of dynamic panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 190(1), pages 148-175.
    18. Xianyi Wu & Xian Zhou, 2019. "On Hodges’ superefficiency and merits of oracle property in model selection," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(5), pages 1093-1119, October.
    19. Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
    20. Pötscher, Benedikt M., 2007. "Confidence Sets Based on Sparse Estimators Are Necessarily Large," MPRA Paper 5677, University Library of Munich, Germany.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1111-1122. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.