IDEAS home Printed from https://ideas.repec.org/p/qub/charms/1801.html
   My bibliography  Save this paper

Inference with difference-in-differences with a small number of groups: a review, simulation study and empirical application using SHARE data

Author

Listed:
  • Slawa Rokicki
  • Jessica Cohen
  • Gunther Fink
  • Joshua Salomon
  • Mary Beth Landrum

Abstract

Difference-in-differences (DID) estimation has become increasingly popular as an approach to evaluate the effect of a group-level policy on individual-level outcomes. Several statistical methodologies have been proposed to correct for the within-group correlation of model errors resulting from the clustering of data. Little is known about how well these corrections perform with the often small number of groups observed in health research using longitudinal data. First, we review the most commonly used modelling solutions in DID estimation for panel data, including generalized estimating equations (GEE), permutation tests, clustered standard errors (CSE), wild cluster bootstrapping, and aggregation. Second, we compare the empirical coverage rates and power of these methods using a Monte Carlo simulation study in scenarios in which we vary the degree of error correlation, the group size balance, and the proportion of treated groups. Third, we provide an empirical example using the Survey of Health, Ageing and Retirement in Europe (SHARE). When the number of groups is small, CSE are systematically biased downwards in scenarios when data are unbalanced or when there is a low proportion of treated groups. This can result in over-rejection of the null even when data are composed of up to 50 groups. Aggregation, permutation tests, bias-adjusted GEE and wild cluster bootstrap produce coverage rates close to the nominal rate for almost all scenarios, though GEE may suffer from low power. In DID estimation with a small number of groups, analysis using aggregation, permutation tests, wild cluster bootstrap, or bias-adjusted GEE is recommended.

Suggested Citation

  • Slawa Rokicki & Jessica Cohen & Gunther Fink & Joshua Salomon & Mary Beth Landrum, 2018. "Inference with difference-in-differences with a small number of groups: a review, simulation study and empirical application using SHARE data," CHaRMS Working Papers 18-01, Centre for HeAlth Research at the Management School (CHaRMS).
  • Handle: RePEc:qub:charms:1801
    as

    Download full text from publisher

    File URL: ftp://ftp.qub.ac.uk/pub/users/repec/qub/charms/MS_WPS_CHARMS_18_01.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Jessica Cohen & Pascaline Dupas, 2008. "Free Distribution or Cost-Sharing? Evidence from a Malaria Prevention Experiment," NBER Working Papers 14406, National Bureau of Economic Research, Inc.
    2. Guido W. Imbens & Michal Kolesár, 2016. "Robust Standard Errors in Small Samples: Some Practical Advice," The Review of Economics and Statistics, MIT Press, vol. 98(4), pages 701-712, October.
    3. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    4. Avendano, Mauricio & Berkman, Lisa F. & Brugiavini, Agar & Pasini, Giacomo, 2015. "The long-run effect of maternity leave benefits on mental health: Evidence from European countries," Social Science & Medicine, Elsevier, vol. 132(C), pages 45-53.
    5. Joan Costa‐Font & Martin Karlsson & Henning Øien, 2016. "Careful in the Crisis? Determinants of Older People's Informal Care Receipt in Crisis‐Struck European Countries," Health Economics, John Wiley & Sons, Ltd., vol. 25(S2), pages 25-42, November.
    6. Brewer Mike & Crossley Thomas F. & Joyce Robert, 2018. "Inference with Difference-in-Differences Revisited," Journal of Econometric Methods, De Gruyter, vol. 7(1), pages 1-16, January.
    7. Lloyd A. Mancl & Timothy A. DeRouen, 2001. "A Covariance Estimator for GEE with Improved Small‐Sample Properties," Biometrics, The International Biometric Society, vol. 57(1), pages 126-134, March.
    8. Andrew V. Carter & Kevin T. Schnepel & Douglas G. Steigerwald, 2017. "Asymptotic Behavior of a t -Test Robust to Cluster Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 99(4), pages 698-709, July.
    9. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
    10. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 107, University of California, Davis, Department of Economics.
    11. Stephen G. Donald & Kevin Lang, 2007. "Inference with Difference-in-Differences and Other Panel Data," The Review of Economics and Statistics, MIT Press, vol. 89(2), pages 221-233, May.
    12. repec:wly:hlthec:v:25:y:2016:i::p:25-42 is not listed on IDEAS
    13. Michael P. Fay & Barry I. Graubard, 2001. "Small-Sample Adjustments for Wald-Type Tests Using Sandwich Estimators," Biometrics, The International Biometric Society, vol. 57(4), pages 1198-1206, December.
    14. A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner’s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
    15. Marianne Bertrand & Esther Duflo & Sendhil Mullainathan, 2004. "How Much Should We Trust Differences-In-Differences Estimates?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(1), pages 249-275.
    16. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    17. Mark E. McGovern & Kobus Herbst & Frank Tanser & Tinofa Mutevedzi & David Canning & Dickman Gareta & Deenan Pillay & Till Bärnighausen, 2016. "Do Gifts Increase Consent to Home-based HIV Testing? A Difference-in-Differences Study in Rural KwaZulu-Natal, South Africa," CHaRMS Working Papers 16-05, Centre for HeAlth Research at the Management School (CHaRMS).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. McGovern, Mark E. & Rokicki, Slawa & Reichman, Nancy E., 2022. "Maternal depression and economic well-being: A quasi-experimental approach," Social Science & Medicine, Elsevier, vol. 305(C).
    2. Rokicki, Slawa, 2021. "Impact of family law reform on adolescent reproductive health in Ethiopia: A quasi-experimental study," World Development, Elsevier, vol. 144(C).
    3. Gehrsitz, Markus & Saffer, Henry & Grossman, Michael, 2021. "The effect of changes in alcohol tax differentials on alcohol consumption," Journal of Public Economics, Elsevier, vol. 204(C).
    4. Walsh, Brendan & Wren, Maev-Ann & Smith, Samantha & Lyons, Seán & Eighan, James & Morgenroth, Edgar, 2019. "An analysis of the effects on Irish hospital care of the supply of care inside and outside the hospital," Research Series, Economic and Social Research Institute (ESRI), number RS91, June.
    5. Igor Francetic & Günther Fink & Fabrizio Tediosi, 2021. "Impact of social accountability monitoring on health facility performance: Evidence from Tanzania," Health Economics, John Wiley & Sons, Ltd., vol. 30(4), pages 766-785, April.
    6. Walsh, Brendan & Nolan, Anne & Brick, Aoife & Keegan, Conor, 2019. "Did the expansion of free GP care impact demand for Emergency Department attendances? A difference-in-differences analysis," Social Science & Medicine, Elsevier, vol. 222(C), pages 101-111.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew D. Webb, 2023. "Reworking wild bootstrap‐based inference for clustered errors," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 56(3), pages 839-858, August.
    2. Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
    3. James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
    4. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.
    5. Bruno Ferman & Cristine Pinto, 2019. "Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity," The Review of Economics and Statistics, MIT Press, vol. 101(3), pages 452-467, July.
    6. James G. MacKinnon, 2019. "How cluster‐robust inference is changing applied econometrics," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 52(3), pages 851-881, August.
    7. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Testing for the appropriate level of clustering in linear regression models," Journal of Econometrics, Elsevier, vol. 235(2), pages 2027-2056.
    8. Entorf, Horst & Sattarova, Liliya, 2016. "The Analysis of Prison-Prisoner Data Using Cluster-Sample Econometrics: Prison Conditions and Prisoners' Assessments of the Future," IZA Discussion Papers 10209, Institute of Labor Economics (IZA).
    9. Jeffrey D. Michler & Anna Josephson, 2022. "Recent developments in inference: practicalities for applied economics," Chapters, in: A Modern Guide to Food Economics, chapter 11, pages 235-268, Edward Elgar Publishing.
    10. repec:fgv:eesptd:411 is not listed on IDEAS
    11. Andreas Hagemann, 2019. "Permutation inference with a finite number of heterogeneous clusters," Papers 1907.01049, arXiv.org, revised Feb 2023.
    12. Hagemann, Andreas, 2019. "Placebo inference on treatment effects when the number of clusters is small," Journal of Econometrics, Elsevier, vol. 213(1), pages 190-209.
    13. James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2023. "Fast and reliable jackknife and bootstrap methods for cluster‐robust inference," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(5), pages 671-694, August.
    14. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    15. James G. MacKinnon & Matthew D. Webb, 2017. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(2), pages 233-254, March.
    16. Brewer, Mike & Crossley, Thomas F. & Zilio, Federico, 2019. "What Do We Really Know about the Employment Effects of the UK's National Minimum Wage?," IZA Discussion Papers 12369, Institute of Labor Economics (IZA).
    17. Antoine A. Djogbenou & James G. MacKinnon & Morten Ø. Nielsen, 2017. "Validity Of Wild Bootstrap Inference With Clustered Errors," Working Paper 1383, Economics Department, Queen's University.
    18. Tom Boot & Gianmaria Niccodemi & Tom Wansbeek, 2023. "Unbiased estimation of the OLS covariance matrix when the errors are clustered," Empirical Economics, Springer, vol. 64(6), pages 2511-2533, June.
    19. Tobias Pfaff & Johannes Hirata, 2013. "Testing the Easterlin Hypothesis with Panel Data: The Dynamic Relationship between Life Satisfaction and Economic Growth in Germany and the UK," SOEPpapers on Multidisciplinary Panel Data Research 554, DIW Berlin, The German Socio-Economic Panel (SOEP).
    20. Pfaff, Tobias & Hirata, Johannes, 2013. "Testing the Easterlin hypothesis with panel data: The dynamic relationship between life satisfaction and economic growth in Germany and in the UK," CIW Discussion Papers 4/2013, University of Münster, Center for Interdisciplinary Economics (CIW).
    21. James G. MacKinnon & Matthew D. Webb, 2017. "Pitfalls When Estimating Treatment Effects Using Clustered Data," Working Paper 1387, Economics Department, Queen's University.

    More about this item

    Keywords

    Difference-in-differences; Clustered standard errors; Inference; Monte Carlo simulation; GEE;
    All these keywords.

    JEL classification:

    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • I10 - Health, Education, and Welfare - - Health - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:qub:charms:1801. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mark McGovern (email available below). General contact details of provider: https://edirc.repec.org/data/dequbuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.