This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Matching for Causal Inference Without Balance Checking

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
Stefano Iacus (Department of Economics, Business and Statistics, University of Milan, IT)
Gary King (Institute for Quantitative Social Science, Harvard University)
Giuseppe Porro (Department of Economics and Statistics, University of Trieste)

Additional information is available for the following registered author(s):

Abstract

We address a major discrepancy in matching methods for causal inference in observational data. Since these data are typically plentiful, the goal of matching is to reduce bias and only secondarily to keep variance low. However, most matching methods seem designed for the opposite problem, guaranteeing sample size ex ante but limiting bias by controlling for covariates through reductions in the imbalance between treated and control groups only ex post and only sometimes. (The resulting practical difficulty may explain why many published applications do not check whether imbalance was reduced and so may not even be decreasing bias.) We introduce a new class of ``Monotonic Imbalance Bounding'' (MIB) matching methods that enables one to choose a fixed level of maximum imbalance, or to reduce maximum imbalance for one variable without changing it for the others. We then discuss a specific MIB method called ``Coarsened Exact Matching'' (CEM) which, unlike most existing approaches, also explicitly bounds through ex ante user choice both the degree of model dependence and the causal effect estimation error, eliminates the need for a separate procedure to restrict data to common support, meets the congruence principle, is approximately invariant to measurement error, works well with modern methods of imputation for missing data, is computationally efficient even with massive data sets, and is easy to understand and use. This method can improve causal inferences in a wide range of applications, and may be preferred for simplicity of use even when it is possible to design superior methods for particular problems. We also make available open source software which implements all our suggestions.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://services.bepress.com/unimi/statistics/art36
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Universitá degli Studi di Milano in its series UNIMI - Research Papers in Economics, Business, and Statistics with number 1073.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length:
Date of creation: 29 Jun 2008
Date of revision:
Handle: RePEc:bep:unimip:1073

Note: oai:cdlib1:unimi-1073
Contact details of provider:
Postal: Via Conservatorio 7 - 20122 Milano
Phone: +39 02 50321522
Fax: +39 02 50321505
Web page: http://services.bepress.com/unimi
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords: causal inferences; matching; treatment effect estimation;

This paper has been announced in the following NEP Reports:

Statistics
Access and download statistics

Did you know? Authors can create their own profile with links to their works on the RePEc Author Service.

This page was last updated on 2009-12-17.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.