IDEAS home Printed from https://ideas.repec.org/a/inm/orijds/v1y2022i2p156-171.html
   My bibliography  Save this article

A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference

Author

Listed:
  • Marco Morucci

    (Center for Data Science, New York University, New York 10012)

  • Md. Noor-E-Alam

    (Department of Mechanical and Industrial Engineering, Northeastern University, Boston, Massachusetts 02115)

  • Cynthia Rudin

    (Department of Computer Science, Duke University, Durham, North Carolina 27708)

Abstract

Unquantified sources of uncertainty in observational causal analyses can break the integrity of the results. One would never want another analyst to repeat a calculation with the same data set, using a seemingly identical procedure, only to find a different conclusion. However, as we show in this work, there is a typical source of uncertainty that is essentially never considered in observational causal studies: the choice of match assignment for matched groups—that is, which unit is matched to which other unit before a hypothesis test is conducted. The choice of match assignment is anything but innocuous and can have a surprisingly large influence on the causal conclusions. Given that a vast number of causal inference studies test hypotheses on treatment effects after treatment cases are matched with similar control cases, we should find a way to quantify how much this extra source of uncertainty impacts results. What we would really like to be able to report is that no matter which match assignment is made, as long as the match is sufficiently good, then the hypothesis test results are still informative. In this paper, we provide methodology based on discrete optimization to create robust tests that explicitly account for this possibility. We formulate robust tests for binary and continuous data based on common test statistics as integer linear programs solvable with common methodologies. We study the finite-sample behavior of our test statistic in the discrete-data case. We apply our methods to simulated and real-world data sets and show that they can produce useful results in practical applied settings.

Suggested Citation

  • Marco Morucci & Md. Noor-E-Alam & Cynthia Rudin, 2022. "A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 156-171, October.
  • Handle: RePEc:inm:orijds:v:1:y:2022:i:2:p:156-171
    DOI: 10.1287/ijds.2022.0020
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijds.2022.0020
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijds.2022.0020?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Paul R. Rosenbaum, 2010. "Evidence factors in observational studies," Biometrika, Biometrika Trust, vol. 97(2), pages 333-345.
    2. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2011. "Multivariate Matching Methods That Are Monotonic Imbalance Bounding," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 345-361.
    3. Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
    4. Alexis Diamond & Jasjeet S. Sekhon, 2013. "Genetic Matching for Estimating Causal Effects: A General Multivariate Matching Method for Achieving Balance in Observational Studies," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 932-945, July.
    5. Kosuke Imai & Gary King & Elizabeth A. Stuart, 2008. "Misunderstandings between experimentalists and observationalists about causal inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 171(2), pages 481-502, April.
    6. Iacus, Stefano M. & King, Gary & Porro, Giuseppe, 2012. "Causal Inference without Balance Checking: Coarsened Exact Matching," Political Analysis, Cambridge University Press, vol. 20(1), pages 1-24, January.
    7. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881.
    8. Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.
    9. Beau Coker & Cynthia Rudin & Gary King, 2021. "A Theory of Statistical Inference for Ensuring the Robustness of Scientific Results," Management Science, INFORMS, vol. 67(10), pages 6174-6197, October.
    10. José R. Zubizarreta, 2012. "Using Mixed Integer Programming for Matching in an Observational Study of Kidney Failure After Surgery," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1360-1371, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
    2. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    3. Jan Stede, 2019. "Do Energy Efficiency Networks Save Energy? Evidence from German Plant-Level Data," Discussion Papers of DIW Berlin 1813, DIW Berlin, German Institute for Economic Research.
    4. Takasaki, Yoshito, 2020. "Impacts of disability on poverty: Quasi-experimental evidence from landmine amputees in Cambodia," Journal of Economic Behavior & Organization, Elsevier, vol. 180(C), pages 85-107.
    5. Yuri Ostrovsky & Garnett Picot, 2021. "Innovation in immigrant-owned firms," Small Business Economics, Springer, vol. 57(4), pages 1857-1874, December.
    6. Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
    7. Harsh Parikh & Cynthia Rudin & Alexander Volfovsky, 2018. "MALTS: Matching After Learning to Stretch," Papers 1811.07415, arXiv.org, revised Jun 2023.
    8. Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
    9. Yoshito Takasaki, 2019. "Disability and Poverty: Landmine Amputees in Cambodia," CIRJE F-Series CIRJE-F-1118, CIRJE, Faculty of Economics, University of Tokyo.
    10. Schleich, Joachim & Fleiter, Tobias, 2019. "Effectiveness of energy audits in small business organizations," Resource and Energy Economics, Elsevier, vol. 56(C), pages 59-70.
    11. Jeon, Sung-Hee & Pohl, R. Vincent, 2019. "Medical innovation, education, and labor market outcomes of cancer patients," Journal of Health Economics, Elsevier, vol. 68(C).
    12. Tymon Słoczyński, 2015. "The Oaxaca–Blinder Unexplained Component as a Treatment Effects Estimator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 588-604, August.
    13. De Cesari, Amedeo & Gonenc, Halit & Ozkan, Neslihan, 2016. "The effects of corporate acquisitions on CEO compensation and CEO turnover of family firms," Journal of Corporate Finance, Elsevier, vol. 38(C), pages 294-317.
    14. Richard Friberg & Mark Sanctuary, 2017. "The Effect of Retail Distribution on Sales of Alcoholic Beverages," Marketing Science, INFORMS, vol. 36(4), pages 626-641, July.
    15. Robert J. Johnston & Klaus Moeltner, 2019. "Special Flood Hazard Effects on Coastal and Interior Home Values: One Size Does Not Fit All," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 74(1), pages 181-210, September.
    16. Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Papers 2206.08503, arXiv.org, revised Oct 2022.
    17. Foliano, Francesca & Green, Francis & Sartarelli, Marcello, 2019. "Away from home, better at school. The case of a British boarding school," Economics of Education Review, Elsevier, vol. 73(C).
    18. Fernandez, Linda & Cutter, Bowman & Sharma, Ritu & Scott, Tom, 2018. "Land preservation policy effect or neighborhood dynamics: A repeat sales hedonic matching approach," Journal of Environmental Economics and Management, Elsevier, vol. 88(C), pages 311-326.
    19. María de los Angeles Resa & José R. Zubizarreta, 2020. "Direct and stable weight adjustment in non‐experimental studies with multivalued treatments: analysis of the effect of an earthquake on post‐traumatic stress," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1387-1410, October.
    20. Julia Muschallik & Kerstin Pull, 2016. "Mentoring in higher education: does it enhance mentees’ research productivity?," Education Economics, Taylor & Francis Journals, vol. 24(2), pages 210-223, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:1:y:2022:i:2:p:156-171. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.