Clustering, Spatial Correlations and Randomization Inference

Clustering, Spatial Correlations and Randomization Inference

Author

Listed:

Thomas Barrios
Rebecca Diamond
Guido W. Imbens
Michal Kolesár

Abstract

It is standard practice in empirical work to allow for clustering in the error covariance matrix if the explanatory variables of interest vary at a more aggregate level than the units of observation. Often, however, the structure of the error covariance matrix is more complex, with correlations varying in magnitude within clusters, and not vanishing between clusters. Here we explore the implications of such correlations for the actual and estimated precision of least squares estimators. We show that with equal sized clusters, if the covariate of interest is randomly assigned at the cluster level, only accounting for non-zero covariances at the cluster level, and ignoring correlations between clusters, leads to valid standard errors and confidence intervals. However, in many cases this may not suffice. For example, state policies exhibit substantial spatial correlations. As a result, ignoring spatial correlations in outcomes beyond that accounted for by the clustering at the state level, may well bias standard errors. We illustrate our findings using the 5% public use census data. Based on these results we recommend researchers assess the extent of spatial correlations in explanatory variables beyond state level clustering, and if such correlations are present, take into account spatial correlations beyond the clustering correlations typically accounted for.

Suggested Citation

Thomas Barrios & Rebecca Diamond & Guido W. Imbens & Michal Kolesár, 2010. "Clustering, Spatial Correlations and Randomization Inference," NBER Working Papers 15760, National Bureau of Economic Research, Inc.

Handle: RePEc:nbr:nberwo:15760
Note: LS

Download full text from publisher

Other versions of this item:

Thomas Barrios & Rebecca Diamond & Guido W. Imbens & Michal Kolesár, 2012. "Clustering, Spatial Correlations, and Randomization Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 578-591, June.

References listed on IDEAS

Sandra E. Black, 1999. "Do Better Schools Matter? Parental Valuation of Elementary Education," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 577-599.
- Sandra E. Black, 1997. "Do better schools matter? Parental valuation of elementary education," Research Paper 9729, Federal Reserve Bank of New York.
David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs In Economics," Working Papers 1118, Princeton University, Department of Economics, Industrial Relations Section..
David S. Lee & Thomas Lemieux, 2010. "Regression Discontinuity Designs in Economics," Journal of Economic Literature, American Economic Association, vol. 48(2), pages 281-355, June.
- David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs in Economics," Working Papers 1118, Princeton University, Department of Economics, Industrial Relations Section..
- David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs in Economics," NBER Working Papers 14723, National Bureau of Economic Research, Inc.
Stephen G. Donald & Kevin Lang, 2007. "Inference with Difference-in-Differences and Other Panel Data," The Review of Economics and Statistics, MIT Press, vol. 89(2), pages 221-233, May.
Joshua D. Angrist & Jörn-Steffen Pischke, 2009. "Mostly Harmless Econometrics: An Empiricist's Companion," Economics Books, Princeton University Press, edition 1, number 8769, December.
Greenwald, Bruce C., 1983. "A general analysis of bias in the estimated standard errors of least squares coefficients," Journal of Econometrics, Elsevier, vol. 22(3), pages 323-338, August.
Small, Dylan S. & Ten Have, Thomas R. & Rosenbaum, Paul R., 2008. "Randomization Inference in a GroupRandomized Trial of Treatments for Depression: Covariate Adjustment, Noncompliance, and Quantile Effects," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 271-279, March.
Moulton, Brent R., 1986. "Random group effects and the precision of regression estimates," Journal of Econometrics, Elsevier, vol. 32(3), pages 385-397, August.
Imbens, Guido W. & Lemieux, Thomas, 2008. "Regression discontinuity designs: A guide to practice," Journal of Econometrics, Elsevier, vol. 142(2), pages 615-635, February.
- Guido Imbens & Thomas Lemieux, 2007. "Regression Discontinuity Designs: A Guide to Practice," NBER Working Papers 13039, National Bureau of Economic Research, Inc.
- Guido Imbens & Thomas Lemieux, 2007. "Regression Discontinuity Designs: A Guide to Practice," NBER Technical Working Papers 0337, National Bureau of Economic Research, Inc.
Kloek, T, 1981. "OLS Estimation in a Model Where a Microvariable Is Explained by Aggregates and Contemporaneous Disturbances Are Equicorrelated," Econometrica, Econometric Society, vol. 49(1), pages 205-207, January.
Bester, C. Alan & Conley, Timothy G. & Hansen, Christian B., 2011. "Inference with dependent data using cluster covariance estimators," Journal of Econometrics, Elsevier, vol. 165(2), pages 137-151.
Conley, T. G., 1999. "GMM estimation with cross sectional dependence," Journal of Econometrics, Elsevier, vol. 92(1), pages 1-45, September.
Moulton, Brent R, 1990. "An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit," The Review of Economics and Statistics, MIT Press, vol. 72(2), pages 334-338, May.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
- Colin Cameron, 2011. "Robust inference with clustered data," Mexican Stata Users' Group Meetings 2011 07, Stata Users Group.
- A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 107, University of California, Davis, Department of Economics.
A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
- Colin Cameron, 2011. "Robust inference with clustered data," Mexican Stata Users' Group Meetings 2011 07, Stata Users Group.
- A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 316, University of California, Davis, Department of Economics.
Baum-Snow, Nathaniel & Ferreira, Fernando, 2015. "Causal Inference in Urban and Regional Economics," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 3-68, Elsevier.
- Nathaniel Baum-Snow & Fernando Ferreira, 2014. "Causal Inference in Urban and Regional Economics," NBER Working Papers 20535, National Bureau of Economic Research, Inc.
James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
Anil Kumar, 2018. "Do Restrictions on Home Equity Extraction Contribute to Lower Mortgage Defaults? Evidence from a Policy Discontinuity at the Texas Border," American Economic Journal: Economic Policy, American Economic Association, vol. 10(1), pages 268-297, February.
- Anil Kumar, 2014. "Do restrictions on home equity extraction contribute to lower mortgage defaults? evidence from a policy discontinuity at the Texas border," Working Papers 1410, Federal Reserve Bank of Dallas.
Alberto Abadie & Susan Athey & Guido W Imbens & Jeffrey M Wooldridge, 2023. "When Should You Adjust Standard Errors for Clustering?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 138(1), pages 1-35.
- Alberto Abadie & Susan Athey & Guido Imbens & Jeffrey Wooldridge, 2017. "When Should You Adjust Standard Errors for Clustering?," Papers 1710.02926, arXiv.org, revised Sep 2022.
- Alberto Abadie & Susan Athey & Guido W. Imbens & Jeffrey Wooldridge, 2017. "When Should You Adjust Standard Errors for Clustering?," NBER Working Papers 24003, National Bureau of Economic Research, Inc.
- Abadie, Alberto & Athey, Susan & Imbens, Guido W. & Wooldridge, Jeffrey, 2017. "When Should You Adjust Standard Errors for Clustering?," Research Papers repec:ecl:stabus:3596, Stanford University, Graduate School of Business.
Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
- Bruce E. Hansen & Seojeong Jay Lee, 2017. "Asymptotic Theory for Clustered Samples," Discussion Papers 2017-18, School of Economics, The University of New South Wales.
- Bruce E. Hansen & Seojeong Lee, 2019. "Asymptotic Theory for Clustered Samples," Papers 1902.01497, arXiv.org.
Vikström, Johan, 2009. "Cluster sample inference using sensitivity analysis: the case with few groups," Working Paper Series 2009:15, IFAU - Institute for Evaluation of Labour Market and Education Policy.
James G. MacKinnon & Matthew D. Webb, 2017. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(2), pages 233-254, March.
- MacKinnon, James G. & Webb, Matthew D., 2015. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Queen's Economics Department Working Papers 274639, Queen's University - Department of Economics.
- James G. MacKinnon & Matthew D. Webb, 2015. "Wild Bootstrap Inference For Wildly Different Cluster Sizes," Working Paper 1314, Economics Department, Queen's University.
Adrien Montalbo, 2019. "Education and economic development. The influence of primary schooling on municipalities in nineteenth-century France," Working Papers halshs-02286126, HAL.
A. Colin Cameron & Douglas L. Miller, 2015. "A Practitionerâ€™s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
Thomas K. Bauer & Tanja Kasten & Lars-H. R. Siemers, 2017. "Business Taxation and Wages: Redistribution and Asymmetric Effects," MAGKS Papers on Economics 201732, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
- Thomas K. Bauer & Tanja Kasten & Lars-H. R. Siemers, 2017. "Business Taxation and Wages: Redistribution and Asymmetric Effects," Volkswirtschaftliche Diskussionsbeiträge 182-17, Universität Siegen, Fakultät Wirtschaftswissenschaften, Wirtschaftsinformatik und Wirtschaftsrecht.
Raffaello Bronzini & Eleonora Iachini, 2014. "Are Incentives for R&D Effective? Evidence from a Regression Discontinuity Approach," American Economic Journal: Economic Policy, American Economic Association, vol. 6(4), pages 100-134, November.
- Raffaello Bronzini & Eleonora Iachini, 2011. "Are incentives for R&D effective? Evidence from a regression discontinuity approach," Temi di discussione (Economic working papers) 791, Bank of Italy, Economic Research and International Relations Area.
- Raffaello Bronzini & Eleonora Iachini, 2012. "Are Incentives For R&D Effective? Evidence From A Regression Discontinuity Approach," ERSA conference papers ersa12p848, European Regional Science Association.
A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
- Jonah B. Gelbach & Doug Miller & A. Colin Cameron, 2006. "Bootstrap-Based Improvements for Inference with Clustered Errors," Working Papers 621, University of California, Davis, Department of Economics.
- A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2007. "Bootstrap-Based Improvements for Inference with Clustered Errors," NBER Technical Working Papers 0344, National Bureau of Economic Research, Inc.
Daniel Mejía & Pascual Restrepo & Sandra V. Rozo, 2017. "On the Effects of Enforcement on Illegal Markets: Evidence from a Quasi-Experiment in Colombia," The World Bank Economic Review, World Bank, vol. 31(2), pages 570-594.
- Mejía,Daniel & Restrepo,Pascual & Rozo,Sandra V., 2015. "On the effects of enforcement on illegal markets : evidence from a quasi-experiment in Colombia," Policy Research Working Paper Series 7409, The World Bank.
Adrien Montalbo, 2019. "Education and economic development. The influence of primary schooling on municipalities in nineteenth-century France," PSE Working Papers halshs-02286126, HAL.
Michael Pollmann, 2020. "Causal Inference for Spatial Treatments," Papers 2011.00373, arXiv.org, revised Apr 2026.
Koster, Hans R.A. & van Ommeren, Jos & Volkhausen, Nicolas, 2021. "Short-term rentals and the housing market: Quasi-experimental evidence from Airbnb in Los Angeles," Journal of Urban Economics, Elsevier, vol. 124(C).
- Koster, Hans & van Ommeren, Jos & Volhausen, Nicolas, 2018. "Short-term rentals and the housing market: Quasi-experimental evidence from Airbnb in Los Angeles," CEPR Discussion Papers 13094, Centre for Economic Policy Research.
David G. Blanchflower & Andrew Oswald, 1995. "International Wage Curves," NBER Chapters, in: Differences and Changes in Wage Structures, pages 145-174, National Bureau of Economic Research, Inc.
- David G. Blanchflower & Andrew J. Oswald, 1992. "International Wage Curves," NBER Working Papers 4200, National Bureau of Economic Research, Inc.
- David Blanchflower & A Oswald, 1993. "International Wage Curve," CEP Discussion Papers dp0116, Centre for Economic Performance, LSE.
Hagemann, Andreas, 2019. "Placebo inference on treatment effects when the number of clusters is small," Journal of Econometrics, Elsevier, vol. 213(1), pages 190-209.

More about this item

JEL classification:

C01 - Mathematical and Quantitative Methods - - General - - - Econometrics
C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models

NEP fields

This paper has been announced in the following NEP Reports:

NEP-ECM-2010-03-06 (Econometrics)
NEP-GEO-2010-03-06 (Economic Geography)
NEP-URE-2010-03-06 (Urban and Real Estate Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:15760. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Clustering, Spatial Correlations and Randomization Inference

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

JEL classification:

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data