IDEAS home Printed from
MyIDEAS: Log in (now much improved!) to save this paper

When to Control for Covariates? Panel-Asymptotic Results for Estimates of Treatment Effects

  • Joshua D. Angrist
  • Jinyong Hahn

The problem of how to control for covariates is endemic in evaluation research. Covariate-matching provides an appealing control strategy, but with continuous or high-dimensional covariate vectors, exact matching may be impossible or involve small cells. Matching observations that have the same propensity score produces unbiased estimates of causal effects whenever covariate-matching does, and also has an attractive dimension-reducing property. On the other hand, conventional asymptotic arguments show that covariate-matching is (asymptotically) more efficient that propensity score-matching. This is because the usual asymptotic sequence has cell sizes growing to infinity, with no benefit from reducing the number of cells. Here, we approximate the large sample behavior of difference matching estimators using a panel-style asymptotic sequence with fixed cell sizes and the number of cells increasing to infinity. Exact calculations in simple examples and Monte Carlo evidence suggests this generates a substantially improved approximation to actual finite-sample distributions. Under this sequence, propensity-score-matching is most likely to dominate exact matching when cell sizes are small, the explanatory power of the covariates conditional on the propensity score is low, and/or the probability of treatment is close to zero or one. Finally, we introduce a random-effects type combination estimator that provides finite-sample efficiency gains over both covariate-matching and propensity-score-matching.

If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL:
Download Restriction: no

Paper provided by National Bureau of Economic Research, Inc in its series NBER Technical Working Papers with number 0241.

in new window

Date of creation: May 1999
Date of revision:
Handle: RePEc:nbr:nberte:0241
Contact details of provider: Postal:
National Bureau of Economic Research, 1050 Massachusetts Avenue Cambridge, MA 02138, U.S.A.

Phone: 617-868-3900
Web page:

More information through EDIRC

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:

as in new window
  1. Joshua D. Angrist, 1998. "Estimating the Labor Market Impact of Voluntary Military Service Using Social Security Data on Military Applicants," Econometrica, Econometric Society, vol. 66(2), pages 249-288, March.
  2. Douglas Staiger & James H. Stock, 1997. "Instrumental Variables Regression with Weak Instruments," Econometrica, Econometric Society, vol. 65(3), pages 557-586, May.
  3. Rajeev H. Dehejia & Sadek Wahba, 2002. "Propensity Score-Matching Methods For Nonexperimental Causal Studies," The Review of Economics and Statistics, MIT Press, vol. 84(1), pages 151-161, February.
  4. David Card & Daniel Sullivan, 1987. "Measuring the Effect of Subsidized Training Programs on Movements In andOut of Employment," NBER Working Papers 2173, National Bureau of Economic Research, Inc.
  5. Hausman, Jerry A. & Taylor, William E., 1981. "Panel data and unobservable individual effects," Journal of Econometrics, Elsevier, vol. 16(1), pages 155-155, May.
  6. Orley Ashenfelter & David Card, 1984. "Using the Longitudinal Structure of Earnings to Estimate the Effect of Training Programs," NBER Working Papers 1489, National Bureau of Economic Research, Inc.
  7. Deaton, A. & Paxson, C., 1997. "Economies of Scale, Household Size, and the Demand for Food," Papers 178, Princeton, Woodrow Wilson School - Development Studies.
  8. Chamberlain, Gary & Griliches, Zvi, 1975. "Unobservables with a Variance-Components Structure: Ability, Schooling, and the Economic Success of Brothers," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 16(2), pages 422-49, June.
  9. Swamy, P A V B, 1970. "Efficient Inference in a Random Coefficient Regression Model," Econometrica, Econometric Society, vol. 38(2), pages 311-23, March.
  10. Joshua D. Angrist & Alan B. Krueger, 1995. "Split Sample Instrumental Variables," NBER Technical Working Papers 0150, National Bureau of Economic Research, Inc.
  11. James J. Heckman & Hidehiko Ichimura & Petra Todd, 1998. "Matching As An Econometric Evaluation Estimator," Review of Economic Studies, Oxford University Press, vol. 65(2), pages 261-294.
  12. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
  13. Gary Chamberlain & Guido W. Imbens, 1996. "Hierarchical Bayes Models with Many Instrumental Variables," NBER Technical Working Papers 0204, National Bureau of Economic Research, Inc.
  14. James J. Heckman & Hidehiko Ichimura & Petra E. Todd, 1997. "Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme," Review of Economic Studies, Oxford University Press, vol. 64(4), pages 605-654.
  15. Mundlak, Yair, 1978. "On the Pooling of Time Series and Cross Section Data," Econometrica, Econometric Society, vol. 46(1), pages 69-85, January.
  16. Maddala, G S, 1971. "The Use of Variance Components Models in Pooling Cross Section and Time Series Data," Econometrica, Econometric Society, vol. 39(2), pages 341-58, March.
  17. Bekker, Paul A, 1994. "Alternative Approximations to the Distributions of Instrumental Variable Estimators," Econometrica, Econometric Society, vol. 62(3), pages 657-81, May.
  18. Chamberlain, Gary, 1984. "Panel data," Handbook of Econometrics, in: Z. Griliches† & M. D. Intriligator (ed.), Handbook of Econometrics, edition 1, volume 2, chapter 22, pages 1247-1318 Elsevier.
  19. Angrist, Joshua D & Krueger, Alan B, 1995. "Split-Sample Instrumental Variables Estimates of the Return to Schooling," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(2), pages 225-35, April.
  20. Chamberlain, Gary, 1992. "Efficiency Bounds for Semiparametric Regression," Econometrica, Econometric Society, vol. 60(3), pages 567-96, May.
Full references (including those not matched with items on IDEAS)

This item is not listed on Wikipedia, on a reading list or among the top items on IDEAS.

When requesting a correction, please mention this item's handle: RePEc:nbr:nberte:0241. See general information about how to correct material in RePEc.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ()

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If references are entirely missing, you can add them using this form.

If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.