IDEAS home Printed from https://ideas.repec.org/a/wly/navres/v64y2017i4p323-344.html
   My bibliography  Save this article

The role of covariate balance in observational studies

Author

Listed:
  • Jason J. Sauppe
  • Sheldon H. Jacobson

Abstract

Observational data are prevalent in many fields of research, and it is desirable to use this data to make causal inferences. Because this data is nonrandom, additional assumptions are needed in order to construct unbiased estimators for causal effects. The standard assumption is strong ignorability, though it is often impossible to achieve the level of covariate balance that it requires. As such, researchers often settle for lesser balance levels within their datasets. However, these balance levels are generally insufficient to guarantee an unbiased estimate of the treatment effect without further assumptions. This article presents several extensions to the strong ignorability assumption that address this issue. Under these additional assumptions, specific levels of covariate balance are both necessary and sufficient for the treatment effect estimate to be unbiased. There is a trade‐off, however: as balance decreases, stronger assumptions are required to guarantee estimator unbiasedness. These results unify parametric and nonparametric adjustment methods for causal inference and are actualized by the Balance Optimization Subset Selection framework, which identifies the best level of balance that can be achieved within a dataset. © 2017 Wiley Periodicals, Inc. Naval Research Logistics 64: 323–344, 2017

Suggested Citation

  • Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
  • Handle: RePEc:wly:navres:v:64:y:2017:i:4:p:323-344
    DOI: 10.1002/nav.21751
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/nav.21751
    Download Restriction: no

    File URL: https://libkey.io/10.1002/nav.21751?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. LaLonde, Robert J, 1986. "Evaluating the Econometric Evaluations of Training Programs with Experimental Data," American Economic Review, American Economic Association, vol. 76(4), pages 604-620, September.
    2. A. Smith, Jeffrey & E. Todd, Petra, 2005. "Does matching overcome LaLonde's critique of nonexperimental estimators?," Journal of Econometrics, Elsevier, vol. 125(1-2), pages 305-353.
    3. Hainmueller, Jens, 2012. "Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies," Political Analysis, Cambridge University Press, vol. 20(1), pages 25-46, January.
    4. Alexander G. Nikolaev & Sheldon H. Jacobson & Wendy K. Tam Cho & Jason J. Sauppe & Edward C. Sewell, 2013. "Balance Optimization Subset Selection (BOSS): An Alternative Approach for Causal Inference with Observational Data," Operations Research, INFORMS, vol. 61(2), pages 398-412, April.
    5. Justel, Ana & Peña, Daniel & Zamar, Rubén, 1997. "A multivariate Kolmogorov-Smirnov test of goodness of fit," Statistics & Probability Letters, Elsevier, vol. 35(3), pages 251-259, October.
    6. Petra E. Todd & Jeffrey A. Smith, 2001. "Reconciling Conflicting Evidence on the Performance of Propensity-Score Matching Methods," American Economic Review, American Economic Association, vol. 91(2), pages 112-118, May.
    7. Dan Yang & Dylan S. Small & Jeffrey H. Silber & Paul R. Rosenbaum, 2012. "Optimal Matching with Minimal Deviation from Fine Balance in a Study of Obesity and Surgical Outcomes," Biometrics, The International Biometric Society, vol. 68(2), pages 628-636, June.
    8. Kosuke Imai & Marc Ratkovic, 2014. "Covariate balancing propensity score," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 243-263, January.
    9. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2011. "Multivariate Matching Methods That Are Monotonic Imbalance Bounding," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 345-361.
    10. Dehejia, Rajeev, 2005. "Practical propensity score matching: a reply to Smith and Todd," Journal of Econometrics, Elsevier, vol. 125(1-2), pages 355-364.
    11. Samuel D. Pimentel & Rachel R. Kelz & Jeffrey H. Silber & Paul R. Rosenbaum, 2015. "Large, Sparse Optimal Matching With Refined Covariate Balance in an Observational Study of the Health Outcomes Produced by New Surgeons," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 515-527, June.
    12. Alexis Diamond & Jasjeet S. Sekhon, 2013. "Genetic Matching for Estimating Causal Effects: A General Multivariate Matching Method for Achieving Balance in Observational Studies," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 932-945, July.
    13. Dimitris Bertsimas & Mac Johnson & Nathan Kallus, 2015. "The Power of Optimization Over Randomization in Designing Experiments Involving Small Samples," Operations Research, INFORMS, vol. 63(4), pages 868-876, August.
    14. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    15. Wendy K. Tam Cho & Jason J. Sauppe & Alexander G. Nikolaev & Sheldon H. Jacobson & Edward C. Sewell, 2013. "An optimization approach for making causal inferences," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 67(2), pages 211-226, May.
    16. Heller, Ruth & Rosenbaum, Paul R. & Small, Dylan S., 2009. "Split Samples and Design Sensitivity in Observational Studies," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1090-1101.
    17. José R. Zubizarreta, 2015. "Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 910-922, September.
    18. Kosuke Imai & Gary King & Elizabeth A. Stuart, 2008. "Misunderstandings between experimentalists and observationalists about causal inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 171(2), pages 481-502, April.
    19. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2012. "Causal Inference without Balance Checking: Coarsened Exact Matching," Political Analysis, Cambridge University Press, vol. 20(1), pages 1-24, January.
    20. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    21. Ho, Daniel E. & Imai, Kosuke & King, Gary & Stuart, Elizabeth A., 2007. "Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference," Political Analysis, Cambridge University Press, vol. 15(3), pages 199-236, July.
    22. King, Gary & Zeng, Langche, 2006. "The Dangers of Extreme Counterfactuals," Political Analysis, Cambridge University Press, vol. 14(2), pages 131-159, April.
    23. Rosenbaum, Paul R. & Ross, Richard N. & Silber, Jeffrey H., 2007. "Minimum Distance Matched Sampling With Fine Balance in an Observational Study of Treatment for Ovarian Cancer," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 75-83, March.
    24. Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.
    25. José R. Zubizarreta, 2012. "Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure After Surgery," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1360-1371, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
    2. Rui Luo & Lijia Sun & Yin Kuang & Ping Deng & Mengna Lu, 2022. "Research on the Graphical Model Structure Characteristic of Strong Exogeneity Based on Twin Network Method and Its Application in Causal Inference," Mathematics, MDPI, vol. 10(6), pages 1-13, March.
    3. Hochbaum, Dorit S. & Rao, Xu & Sauppe, Jason, 2022. "Network flow methods for the minimum covariate imbalance problem," European Journal of Operational Research, Elsevier, vol. 300(3), pages 827-836.
    4. Hee Youn Kwon & Jason J. Sauppe & Sheldon H. Jacobson, 2019. "Treatment Effect Decomposition and Bootstrap Hypothesis Testing in Observational Studies," Annals of Data Science, Springer, vol. 6(3), pages 491-511, September.
    5. Calic, Goran & Shevchenko, Anton, 2020. "How signal intensity of behavioral orientations affects crowdfunding performance: The role of entrepreneurial orientation in crowdfunding business ventures," Journal of Business Research, Elsevier, vol. 115(C), pages 204-220.
    6. Roberto Esposti, 2022. "Non-Monetary Motivations Of Agroenvironmental Policies Adoption. A Causal Forest Approach," Working Papers 459, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    7. Edoardo Baldoni & Roberto Esposti, 2023. "Estimating The Impact Of Policies Under Spatial Interference. The Case Of Cap Support To Organic Farming," Working Papers 475, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    8. Anton Shevchenko, 2021. "Do financial penalties for environmental violations facilitate improvements in corporate environmental performance? An empirical investigation," Business Strategy and the Environment, Wiley Blackwell, vol. 30(4), pages 1723-1734, May.
    9. Libo Sun & Guodong Lyu & Yugang Yu & Chung‐Piaw Teo, 2020. "Fulfillment by Amazon versus fulfillment by seller: An interpretable risk‐adjusted fulfillment model," Naval Research Logistics (NRL), John Wiley & Sons, vol. 67(8), pages 627-645, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    2. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    3. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    4. Adeola Oyenubi & Martin Wittenberg, 2021. "Does the choice of balance-measure matter under genetic matching?," Empirical Economics, Springer, vol. 61(1), pages 489-502, July.
    5. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    6. María de los Angeles Resa & José R. Zubizarreta, 2020. "Direct and stable weight adjustment in non‐experimental studies with multivalued treatments: analysis of the effect of an earthquake on post‐traumatic stress," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1387-1410, October.
    7. Marco Morucci & Md. Noor-E-Alam & Cynthia Rudin, 2022. "A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 156-171, October.
    8. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    9. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    10. Steven Lehrer & Gregory Kordas, 2013. "Matching using semiparametric propensity scores," Empirical Economics, Springer, vol. 44(1), pages 13-45, February.
    11. Yuri Ostrovsky & Garnett Picot, 2021. "Innovation in immigrant-owned firms," Small Business Economics, Springer, vol. 57(4), pages 1857-1874, December.
    12. Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
    13. Seonho Shin, 2022. "Evaluating the Effect of the Matching Grant Program for Refugees: An Observational Study Using Matching, Weighting, and the Mantel-Haenszel Test," Journal of Labor Research, Springer, vol. 43(1), pages 103-133, March.
    14. Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
    15. Huber, Martin & Lechner, Michael & Wunsch, Conny, 2013. "The performance of estimators based on the propensity score," Journal of Econometrics, Elsevier, vol. 175(1), pages 1-21.
    16. Huber, Martin & Lechner, Michael & Wunsch, Conny, 2010. "How to Control for Many Covariates? Reliable Estimators Based on the Propensity Score," IZA Discussion Papers 5268, Institute of Labor Economics (IZA).
    17. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    18. Tymon Słoczyński, 2015. "The Oaxaca–Blinder Unexplained Component as a Treatment Effects Estimator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 588-604, August.
    19. David McKenzie & John Gibson & Steven Stillman, 2010. "How Important Is Selection? Experimental vs. Non-Experimental Measures of the Income Gains from Migration," Journal of the European Economic Association, MIT Press, vol. 8(4), pages 913-945, June.
    20. McKenzie, David & Gibson, John & Stillman, Steven, 2006. "How important is selection ? Experimental versus non-experimental measures of the income gains from migration," Policy Research Working Paper Series 3906, The World Bank.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:navres:v:64:y:2017:i:4:p:323-344. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://doi.org/10.1002/(ISSN)1520-6750 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.