IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v16y2024i1d10.1007_s12561-023-09378-6.html
   My bibliography  Save this article

Doubly Robust Semiparametric Estimation for Multi-group Causal Comparisons

Author

Listed:
  • Anqi Yin

    (Georgetown University)

  • Ao Yuan

    (Georgetown University)

  • Ming T. Tan

    (Georgetown University)

Abstract

In biomedical research, especially in epidemiological studies, we often need to compare nonrandomized groups, as randomization is only sometimes feasible. Even in randomized clinical trials, the actual treatment assignments may not be completely random, or the covariates may be imbalanced between treatment and control groups. It is known that naïve estimates of treatment effects are biased if treatments are not randomized or covariates not balanced. Various causal inference methods have been proposed to debias or correct the bias. The doubly robust (DR) method is a more recent development that aims to ensure the robustness of the causal estimates to model assumptions. However, the DR method may still be significantly biased even when the model is only mildly misspecified. Motivated by the need to increase the robustness of multi-group causal estimates, e.g., treatments in multiarm clinical trials, and the effects of multiple levels of smoking intensity on health outcomes in epidemiological studies, this article proposes a semiparametric doubly robust estimator for multigroup causal effects, in which both the propensity score and the outcome models are specified semiparametrically with link functions estimated by shape-restricted maximum likelihood. The approach is shown to be highly robust against model assumptions, enhancing the double robustness. After deriving the asymptotic properties of the proposed DR estimators, we perform simulations to demonstrate the finite sample properties of the proposed estimators and compare them with parametric and naïve estimators. We then apply the method to analyzing the effect of smoking intensity on health outcomes in the National Epidemiology Follow-up Study.

Suggested Citation

  • Anqi Yin & Ao Yuan & Ming T. Tan, 2024. "Doubly Robust Semiparametric Estimation for Multi-group Causal Comparisons," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(1), pages 45-68, April.
  • Handle: RePEc:spr:stabio:v:16:y:2024:i:1:d:10.1007_s12561-023-09378-6
    DOI: 10.1007/s12561-023-09378-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-023-09378-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-023-09378-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Cattaneo, Matias D., 2010. "Efficient semiparametric estimation of multi-valued treatment effects under ignorability," Journal of Econometrics, Elsevier, vol. 155(2), pages 138-154, April.
    2. van der Laan Mark J., 2006. "Statistical Inference for Variable Importance," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-33, February.
    3. Yang Ning & Grace Yi & Nancy Reid, 2018. "A Class of Weighted Estimating Equations for Semiparametric Transformation Models with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 45(1), pages 87-109, March.
    4. Donglin Zeng & D. Y. Lin, 2006. "Efficient estimation of semiparametric transformation models for counting processes," Biometrika, Biometrika Trust, vol. 93(3), pages 627-640, September.
    5. D. R. Cox, 2004. "A note on pseudolikelihood constructed from marginal densities," Biometrika, Biometrika Trust, vol. 91(3), pages 729-737, September.
    6. Michael Lechner, 2002. "Program Heterogeneity And Propensity Score Matching: An Application To The Evaluation Of Active Labor Market Policies," The Review of Economics and Statistics, MIT Press, vol. 84(2), pages 205-220, May.
    7. Yu, Tao & Li, Pengfei & Chen, Baojiang & Yuan, Ao & Qin, Jing, 2023. "Maximum pairwise-rank-likelihood-based inference for the semiparametric transformation model," Journal of Econometrics, Elsevier, vol. 235(2), pages 454-469.
    8. Kosuke Imai & David A. van Dyk, 2004. "Causal Inference With General Treatment Regimes: Generalizing the Propensity Score," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 854-866, January.
    9. Xiaofei Bai & Anastasios A. Tsiatis & Sean M. O'Brien, 2013. "Doubly-Robust Estimators of Treatment-Specific Survival Distributions in Observational Studies with Stratified Sampling," Biometrics, The International Biometric Society, vol. 69(4), pages 830-839, December.
    10. Ao Yuan & Anqi Yin & Ming T. Tan, 2021. "Enhanced Doubly Robust Procedure for Causal Inference," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(3), pages 454-478, December.
    11. Shu Yang & Guido W. Imbens & Zhanglin Cui & Douglas E. Faries & Zbigniew Kadziola, 2016. "Propensity score matching and subclassification in observational studies with multi‐level treatments," Biometrics, The International Biometric Society, vol. 72(4), pages 1055-1065, December.
    12. Zetterqvist, Johan & Waernbaum, Ingeborg, 2020. "Semi-parametric estimation of multi-valued treatment effects for the treated:estimating equations and sandwich estimators," Working Paper Series 2020:4, IFAU - Institute for Evaluation of Labour Market and Education Policy.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    2. Zetterqvist, Johan & Waernbaum, Ingeborg, 2020. "Semi-parametric estimation of multi-valued treatment effects for the treated:estimating equations and sandwich estimators," Working Paper Series 2020:4, IFAU - Institute for Evaluation of Labour Market and Education Policy.
    3. Carlos A. Flores & Oscar A. Mitnik, 2009. "Evaluating Nonexperimental Estimators for Multiple Treatments: Evidence from Experimental Data," Working Papers 2010-10, University of Miami, Department of Economics.
    4. Carlos A. Flores & Oscar A. Mitnik, 2013. "Comparing Treatments across Labor Markets: An Assessment of Nonexperimental Multiple-Treatment Strategies," The Review of Economics and Statistics, MIT Press, vol. 95(5), pages 1691-1707, December.
    5. Ying-Ying Lee, 2015. "Efficient propensity score regression estimators of multi-valued treatment effects for the treated," Economics Series Working Papers 738, University of Oxford, Department of Economics.
    6. Lee, Ying-Ying, 2018. "Efficient propensity score regression estimators of multivalued treatment effects for the treated," Journal of Econometrics, Elsevier, vol. 204(2), pages 207-222.
    7. Essilfie, Felix Larry, 2018. "Varietal seed technology and household income of maize farmers: An application of the doubly robust model," Technology in Society, Elsevier, vol. 55(C), pages 85-91.
    8. Caloffi, Annalisa & Freo, Marzia & Ghinoi, Stefano & Mariani, Marco & Rossi, Federica, 2022. "Assessing the effects of a deliberate policy mix: The case of technology and innovation advisory services and innovation vouchers," Research Policy, Elsevier, vol. 51(6).
    9. Michael C. Knaus, 2021. "A double machine learning approach to estimate the effects of musical practice on student’s skills," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 282-300, January.
    10. Wei Huang & Oliver Linton & Zheng Zhang, 2022. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1817-1830, October.
    11. W K Newey & S Stouli, 2022. "Heterogeneous coefficients, control variables and identification of multiple treatment effects [Multivalued treatments and decomposition analysis: An application to the WIA program]," Biometrika, Biometrika Trust, vol. 109(3), pages 865-872.
    12. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    13. Ida D'Attoma & Silvia Pacei, 2018. "Evaluating the Effects of Product Innovation on the Performance of European Firms by Using the Generalised Propensity Score," German Economic Review, Verein für Socialpolitik, vol. 19(1), pages 94-112, February.
    14. Sokbae Lee & Bernard Salanié, 2018. "Identifying Effects of Multivalued Treatments," Econometrica, Econometric Society, vol. 86(6), pages 1939-1963, November.
    15. Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Papers 2206.08503, arXiv.org, revised Jan 2025.
    16. Phillip Heiler & Michael C. Knaus, 2021. "Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments," Papers 2110.01427, arXiv.org, revised Aug 2023.
    17. Dahye Kim & Byeong-il Ahn, 2020. "Eating Out and Consumers’ Health: Evidence on Obesity and Balanced Nutrition Intakes," IJERPH, MDPI, vol. 17(2), pages 1-17, January.
    18. Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
    19. Lee, Seong-ho & Ma, Yanyuan & de Luna, Xavier, 2025. "Covariate balancing for causal inference on categorical and continuous treatments," Econometrics and Statistics, Elsevier, vol. 33(C), pages 304-329.
    20. Hajime Seya & Takahiro Yoshida, 2017. "Propensity score matching for multiple treatment levels: A CODA-based contribution," Papers 1710.08558, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:16:y:2024:i:1:d:10.1007_s12561-023-09378-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.