IDEAS home Printed from https://ideas.repec.org/a/uwp/jhriss/v50y2015i2p317-372.html
   My bibliography  Save this article

A Practitioner’s Guide to Cluster-Robust Inference

Author

Listed:
  • A. Colin Cameron
  • Douglas L. Miller

Abstract

We consider statistical inference for regression when data are grouped into clusters, with regression model errors independent across clusters but correlated within clusters. Examples include data on individuals with clustering on village or region or other category such as industry, and state-year differences-in-differences studies with clustering on state. In such settings, default standard errors can greatly overstate estimator precision. Instead, if the number of clusters is large, statistical inference after OLS should be based on cluster-robust standard errors. We outline the basic method as well as many complications that can arise in practice. These include cluster-specific fixed effects, few clusters, multiway clustering, and estimators other than OLS.

Suggested Citation

  • A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner’s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
  • Handle: RePEc:uwp:jhriss:v:50:y:2015:i:2:p:317-372
    as

    Download full text from publisher

    File URL: http://jhr.uwpress.org/cgi/reprint/50/2/317
    Download Restriction: A subscripton is required to access pdf files. Pay per article is available.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Keith Finlay & Leandro M. Magnusson, 2009. "Implementing weak-instrument robust tests for a general class of instrumental-variables models," Stata Journal, StataCorp LP, vol. 9(3), pages 398-421, September.
    2. James H. Stock & Mark W. Watson, 2008. "Heteroskedasticity-Robust Standard Errors for Fixed Effects Panel Data Regression," Econometrica, Econometric Society, vol. 76(1), pages 155-174, January.
    3. Guido W. Imbens & Michal Kolesár, 2016. "Robust Standard Errors in Small Samples: Some Practical Advice," The Review of Economics and Statistics, MIT Press, vol. 98(4), pages 701-712, October.
    4. Gary Solon & Steven J. Haider & Jeffrey M. Wooldridge, 2015. "What Are We Weighting For?," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 301-316.
    5. Lara Shore-Sheppard, 1996. "The Precision of Instrumental Variables Estimates With Grouped Data," Working Papers 753, Princeton University, Department of Economics, Industrial Relations Section..
    6. Kleibergen, Frank & Paap, Richard, 2006. "Generalized reduced rank tests using the singular value decomposition," Journal of Econometrics, Elsevier, vol. 133(1), pages 97-126, July.
    7. Timothy G. Conley & Christopher R. Taber, 2011. "Inference with "Difference in Differences" with a Small Number of Policy Changes," The Review of Economics and Statistics, MIT Press, vol. 93(1), pages 113-125, February.
    8. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    9. Fitzsimons, Emla & Malde, Bansi & Mesnard, Alice & Vera-Hernández, Marcos, 2012. "Household Responses to Information on Child Nutrition: Experimental Evidence from Malawi," CEPR Discussion Papers 8915, C.E.P.R. Discussion Papers.
    10. Joshua D. Angrist & Victor Lavy, 2002. "The Effect of High School Matriculation Awards: Evidence from Randomized Trials," NBER Working Papers 9389, National Bureau of Economic Research, Inc.
    11. Chernozhukov, Victor & Hansen, Christian, 2008. "The reduced form: A simple approach to inference with weak instruments," Economics Letters, Elsevier, vol. 100(1), pages 68-71, July.
    12. Matthew D. Webb, 2014. "Reworking Wild Bootstrap Based Inference For Clustered Errors," Working Paper 1315, Economics Department, Queen's University.
    13. Kline Patrick & Santos Andres, 2012. "A Score Based Approach to Wild Bootstrap Inference," Journal of Econometric Methods, De Gruyter, vol. 1(1), pages 23-41, August.
    14. Andrews,Donald W. K. & Stock,James H. (ed.), 2005. "Identification and Inference for Econometric Models," Cambridge Books, Cambridge University Press, number 9780521844413, December.
    15. Blundell,Richard & Newey,Whitney K. & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521871532, December.
    16. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, December.
    17. Newey, Whitney & West, Kenneth, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    18. Thomas Barrios & Rebecca Diamond & Guido W. Imbens & Michal Kolesár, 2012. "Clustering, Spatial Correlations, and Randomization Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 578-591, June.
    19. Davis, Peter, 2002. "Estimating multi-way error components models with unbalanced data structures," Journal of Econometrics, Elsevier, vol. 106(1), pages 67-95, January.
    20. MacKinnon, James G. & White, Halbert, 1985. "Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties," Journal of Econometrics, Elsevier, vol. 29(3), pages 305-325, September.
    21. Jeffrey M. Wooldridge, 2003. "Cluster-Sample Methods in Applied Econometrics," American Economic Review, American Economic Association, vol. 93(2), pages 133-138, May.
    22. Hausman, Jerry & Kuersteiner, Guido, 2008. "Difference in difference meets generalized least squares: Higher order properties of hypotheses tests," Journal of Econometrics, Elsevier, vol. 144(2), pages 371-391, June.
    23. Andrew V. Carter & Kevin T. Schnepel & Douglas G. Steigerwald, 2017. "Asymptotic Behavior of a t -Test Robust to Cluster Heterogeneity," The Review of Economics and Statistics, MIT Press, vol. 99(4), pages 698-709, July.
    24. Bester, C. Alan & Conley, Timothy G. & Hansen, Christian B., 2011. "Inference with dependent data using cluster covariance estimators," Journal of Econometrics, Elsevier, vol. 165(2), pages 137-151.
    25. John C. Driscoll & Aart C. Kraay, 1998. "Consistent Covariance Matrix Estimation With Spatially Dependent Panel Data," The Review of Economics and Statistics, MIT Press, vol. 80(4), pages 549-560, November.
    26. Joshua D. Angrist & Jörn-Steffen Pischke, 2009. "Mostly Harmless Econometrics: An Empiricist's Companion," Economics Books, Princeton University Press, edition 1, number 8769.
    27. Bhattacharya, Debopam, 2005. "Asymptotic inference from multi-stage samples," Journal of Econometrics, Elsevier, vol. 126(1), pages 145-171, May.
    28. Blundell,Richard & Newey,Whitney & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521871549, December.
    29. Marianne Bertrand & Esther Duflo & Sendhil Mullainathan, 2004. "How Much Should We Trust Differences-In-Differences Estimates?," The Quarterly Journal of Economics, Oxford University Press, vol. 119(1), pages 249-275.
    30. Blundell,Richard & Newey,Whitney & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521692106, December.
    31. James G. MacKinnon & Matthew D. Webb, 2017. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(2), pages 233-254, March.
    32. Greenwald, Bruce C., 1983. "A general analysis of bias in the estimated standard errors of least squares coefficients," Journal of Econometrics, Elsevier, vol. 22(3), pages 323-338, August.
    33. Abadie, Alberto & Diamond, Alexis & Hainmueller, Jens, 2010. "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 493-505.
    34. Moulton, Brent R, 1990. "An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit," The Review of Economics and Statistics, MIT Press, vol. 72(2), pages 334-338, May.
    35. Blundell,Richard & Newey,Whitney K. & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521692090, December.
    36. Kloek, T, 1981. "OLS Estimation in a Model Where a Microvariable Is Explained by Aggregates and Contemporaneous Disturbances Are Equicorrelated," Econometrica, Econometric Society, vol. 49(1), pages 205-207, January.
    37. Karla Hemming & Jen Marsh, 2013. "A menu-driven facility for sample-size calculations in cluster randomized controlled trials," Stata Journal, StataCorp LP, vol. 13(1), pages 114-135, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
    2. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
    3. Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
    4. James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
    5. P. Dorian Owen, 2017. "Evaluating Ingenious Instruments for Fundamental Determinants of Long-Run Economic Growth and Development," Econometrics, MDPI, vol. 5(3), pages 1-33, September.
    6. James G. MacKinnon, 2019. "How cluster‐robust inference is changing applied econometrics," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 52(3), pages 851-881, August.
    7. Pakel, Cavit, 2019. "Bias reduction in nonlinear and dynamic panels in the presence of cross-section dependence," Journal of Econometrics, Elsevier, vol. 213(2), pages 459-492.
    8. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.
    9. Brewer Mike & Crossley Thomas F. & Joyce Robert, 2018. "Inference with Difference-in-Differences Revisited," Journal of Econometric Methods, De Gruyter, vol. 7(1), pages 1-16, January.
    10. repec:fgv:eesptd:411 is not listed on IDEAS
    11. Matthew D. Webb, 2014. "Reworking Wild Bootstrap Based Inference For Clustered Errors," Working Paper 1315, Economics Department, Queen's University.
    12. David Roodman & James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2019. "Fast and wild: Bootstrap inference in Stata using boottest," Stata Journal, StataCorp LP, vol. 19(1), pages 4-60, March.
    13. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    14. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    15. James G. MacKinnon & Matthew D. Webb, 2017. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(2), pages 233-254, March.
    16. Bruno Ferman & Cristine Pinto, 2019. "Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity," The Review of Economics and Statistics, MIT Press, vol. 101(3), pages 452-467, July.
    17. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    18. James G. MacKinnon & Matthew D. Webb, 2017. "Pitfalls When Estimating Treatment Effects Using Clustered Data," Working Paper 1387, Economics Department, Queen's University.
    19. James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2022. "Testing for the appropriate level of clustering in linear regression models," Working Paper 1428, Economics Department, Queen's University.
    20. Andreas Hagemann, 2019. "Permutation inference with a finite number of heterogeneous clusters," Papers 1907.01049, arXiv.org, revised Feb 2023.
    21. James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2022. "Fast and Reliable Jackknife and Bootstrap Methods for Cluster-Robust Inference," Working Paper 1485, Economics Department, Queen's University.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:uwp:jhriss:v:50:y:2015:i:2:p:317-372. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: . General contact details of provider: http://jhr.uwpress.org/ .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (email available below). General contact details of provider: http://jhr.uwpress.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.