Clustering, Spatial Correlations, and Randomization Inference

Author

Listed:

Thomas Barrios
Rebecca Diamond
Guido W. Imbens
Michal Kolesár

Abstract

It is a standard practice in regression analyses to allow for clustering in the error covariance matrix if the explanatory variable of interest varies at a more aggregate level (e.g., the state level) than the units of observation (e.g., individuals). Often, however, the structure of the error covariance matrix is more complex, with correlations not vanishing for units in different clusters. Here, we explore the implications of such correlations for the actual and estimated precision of least squares estimators. Our main theoretical result is that with equal-sized clusters, if the covariate of interest is randomly assigned at the cluster level, only accounting for nonzero covariances at the cluster level, and ignoring correlations between clusters as well as differences in within-cluster correlations, leads to valid confidence intervals. However, in the absence of random assignment of the covariates, ignoring general correlation structures may lead to biases in standard errors. We illustrate our findings using the 5% public-use census data. Based on these results, we recommend that researchers, as a matter of routine, explore the extent of spatial correlations in explanatory variables beyond state-level clustering.
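
As an illustration of the baseline adjustment the abstract refers to, here is a minimal sketch of a cluster-robust standard error for a covariate assigned at random at the cluster level with equal-sized clusters. It is not the authors' code. The function names `ols_cluster_se` and `simulate` are hypothetical, and the data-generating process (an independent shock per cluster plus individual noise, with no correlation across clusters) is an assumption chosen only to keep the example short. The variance uses the standard sandwich form without small-sample corrections.

```python
import random


def ols_cluster_se(y, w, cluster):
    """Regress y on an intercept and w by OLS.

    Returns (slope, cluster-robust standard error of the slope), using the
    sandwich form (X'X)^-1 [sum_c X_c' e_c e_c' X_c] (X'X)^-1 with no
    small-sample correction.
    """
    n = float(len(y))
    sw = sum(w)
    sww = sum(wi * wi for wi in w)
    det = n * sww - sw * sw
    # Inverse of X'X for the design X = [1, w].
    inv = [[sww / det, -sw / det], [-sw / det, n / det]]
    sy = sum(y)
    swy = sum(wi * yi for wi, yi in zip(w, y))
    b0 = inv[0][0] * sy + inv[0][1] * swy
    b1 = inv[1][0] * sy + inv[1][1] * swy

    # Within each cluster, sum the scores x_i * e_i (regressor times residual).
    scores = {}
    for yi, wi, ci in zip(y, w, cluster):
        e = yi - b0 - b1 * wi
        s = scores.setdefault(ci, [0.0, 0.0])
        s[0] += e
        s[1] += wi * e

    # Slope variance: second row of inv applied to each cluster score,
    # squared and summed over clusters.
    a, b = inv[1][0], inv[1][1]
    var_b1 = sum((a * s[0] + b * s[1]) ** 2 for s in scores.values())
    return b1, var_b1 ** 0.5


def simulate(num_clusters=40, size=25, effect=1.0, seed=0):
    """Assumed design: half of the equal-sized clusters are treated at random."""
    rng = random.Random(seed)
    treated = set(rng.sample(range(num_clusters), num_clusters // 2))
    y, w, cluster = [], [], []
    for c in range(num_clusters):
        shock = rng.gauss(0.0, 1.0)  # error component shared within a cluster
        wi = 1.0 if c in treated else 0.0
        for _ in range(size):
            y.append(effect * wi + shock + rng.gauss(0.0, 1.0))
            w.append(wi)
            cluster.append(c)
    return y, w, cluster


if __name__ == "__main__":
    y, w, cluster = simulate()
    slope, se = ols_cluster_se(y, w, cluster)
    print(f"slope = {slope:.3f}, cluster-robust SE = {se:.3f}")
```

With a binary cluster-level covariate the OLS slope equals the difference in mean outcomes between treated and untreated units. Studying the paper's question would require extending the assumed error process so that shocks are also correlated across neighboring clusters.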

Suggested Citation

Thomas Barrios & Rebecca Diamond & Guido W. Imbens & Michal Kolesár, 2012. "Clustering, Spatial Correlations, and Randomization Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 578-591, June.

Handle: RePEc:taf:jnlasa:v:107:y:2012:i:498:p:578-591
DOI: 10.1080/01621459.2012.682524

Other versions of this item:

Thomas Barrios & Rebecca Diamond & Guido W. Imbens & Michal Kolesar, 2010. "Clustering, Spatial Correlations and Randomization Inference," NBER Working Papers 15760, National Bureau of Economic Research, Inc.

References listed on IDEAS

Sandra E. Black, 1999. "Do Better Schools Matter? Parental Valuation of Elementary Education," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 577-599.
- Sandra E. Black, 1997. "Do better schools matter? Parental valuation of elementary education," Research Paper 9729, Federal Reserve Bank of New York.
David S. Lee & Thomas Lemieux, 2010. "Regression Discontinuity Designs in Economics," Journal of Economic Literature, American Economic Association, vol. 48(2), pages 281-355, June.
- David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs in Economics," Working Papers 1118, Princeton University, Department of Economics, Industrial Relations Section.
- David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs in Economics," NBER Working Papers 14723, National Bureau of Economic Research, Inc.
Stephen G. Donald & Kevin Lang, 2007. "Inference with Difference-in-Differences and Other Panel Data," The Review of Economics and Statistics, MIT Press, vol. 89(2), pages 221-233, May.
Joshua D. Angrist & Jörn-Steffen Pischke, 2009. "Mostly Harmless Econometrics: An Empiricist's Companion," Economics Books, Princeton University Press, edition 1, number 8769.
Greenwald, Bruce C., 1983. "A general analysis of bias in the estimated standard errors of least squares coefficients," Journal of Econometrics, Elsevier, vol. 22(3), pages 323-338, August.
Small, Dylan S. & Ten Have, Thomas R. & Rosenbaum, Paul R., 2008. "Randomization Inference in a Group-Randomized Trial of Treatments for Depression: Covariate Adjustment, Noncompliance, and Quantile Effects," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 271-279, March.
Moulton, Brent R., 1986. "Random group effects and the precision of regression estimates," Journal of Econometrics, Elsevier, vol. 32(3), pages 385-397, August.
Imbens, Guido W. & Lemieux, Thomas, 2008. "Regression discontinuity designs: A guide to practice," Journal of Econometrics, Elsevier, vol. 142(2), pages 615-635, February.
- Guido Imbens & Thomas Lemieux, 2007. "Regression Discontinuity Designs: A Guide to Practice," NBER Working Papers 13039, National Bureau of Economic Research, Inc.
- Guido Imbens & Thomas Lemieux, 2007. "Regression Discontinuity Designs: A Guide to Practice," NBER Technical Working Papers 0337, National Bureau of Economic Research, Inc.
Kloek, T, 1981. "OLS Estimation in a Model Where a Microvariable Is Explained by Aggregates and Contemporaneous Disturbances Are Equicorrelated," Econometrica, Econometric Society, vol. 49(1), pages 205-207, January.
Bester, C. Alan & Conley, Timothy G. & Hansen, Christian B., 2011. "Inference with dependent data using cluster covariance estimators," Journal of Econometrics, Elsevier, vol. 165(2), pages 137-151.
Conley, T. G., 1999. "GMM estimation with cross sectional dependence," Journal of Econometrics, Elsevier, vol. 92(1), pages 1-45, September.
Moulton, Brent R, 1990. "An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit," The Review of Economics and Statistics, MIT Press, vol. 72(2), pages 334-338, May.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
- Colin Cameron, 2011. "Robust inference with clustered data," Mexican Stata Users' Group Meetings 2011 07, Stata Users Group.
- A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 107, University of California, Davis, Department of Economics.
A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
- Colin Cameron, 2011. "Robust inference with clustered data," Mexican Stata Users' Group Meetings 2011 07, Stata Users Group.
- A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 316, University of California, Davis, Department of Economics.
Baum-Snow, Nathaniel & Ferreira, Fernando, 2015. "Causal Inference in Urban and Regional Economics," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 3-68, Elsevier.
- Nathaniel Baum-Snow & Fernando Ferreira, 2014. "Causal Inference in Urban and Regional Economics," NBER Working Papers 20535, National Bureau of Economic Research, Inc.
James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
Anil Kumar, 2018. "Do Restrictions on Home Equity Extraction Contribute to Lower Mortgage Defaults? Evidence from a Policy Discontinuity at the Texas Border," American Economic Journal: Economic Policy, American Economic Association, vol. 10(1), pages 268-297, February.
- Anil Kumar, 2014. "Do restrictions on home equity extraction contribute to lower mortgage defaults? evidence from a policy discontinuity at the Texas border," Working Papers 1410, Federal Reserve Bank of Dallas.
Koster, Hans R.A. & van Ommeren, Jos & Volkhausen, Nicolas, 2021. "Short-term rentals and the housing market: Quasi-experimental evidence from Airbnb in Los Angeles," Journal of Urban Economics, Elsevier, vol. 124(C).
- Koster, Hans & van Ommeren, Jos & Volkhausen, Nicolas, 2018. "Short-term rentals and the housing market: Quasi-experimental evidence from Airbnb in Los Angeles," CEPR Discussion Papers 13094, C.E.P.R. Discussion Papers.
Alberto Abadie & Susan Athey & Guido W Imbens & Jeffrey M Wooldridge, 2023. "When Should You Adjust Standard Errors for Clustering?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 138(1), pages 1-35.
- Alberto Abadie & Susan Athey & Guido Imbens & Jeffrey Wooldridge, 2017. "When Should You Adjust Standard Errors for Clustering?," Papers 1710.02926, arXiv.org, revised Sep 2022.
- Alberto Abadie & Susan Athey & Guido W. Imbens & Jeffrey Wooldridge, 2017. "When Should You Adjust Standard Errors for Clustering?," NBER Working Papers 24003, National Bureau of Economic Research, Inc.
- Abadie, Alberto & Athey, Susan & Imbens, Guido W. & Wooldridge, Jeffrey, 2017. "When Should You Adjust Standard Errors for Clustering?," Research Papers repec:ecl:stabus:3596, Stanford University, Graduate School of Business.
Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
- Bruce E. Hansen & Seojeong Jay Lee, 2017. "Asymptotic Theory for Clustered Samples," Discussion Papers 2017-18, School of Economics, The University of New South Wales.
- Bruce E. Hansen & Seojeong Lee, 2019. "Asymptotic Theory for Clustered Samples," Papers 1902.01497, arXiv.org.
A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
- Jonah B. Gelbach & Doug Miller & A. Colin Cameron, 2006. "Bootstrap-Based Improvements for Inference with Clustered Errors," Working Papers 128, University of California, Davis, Department of Economics.
- A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2007. "Bootstrap-Based Improvements for Inference with Clustered Errors," NBER Technical Working Papers 0344, National Bureau of Economic Research, Inc.
Vikström, Johan, 2009. "Cluster sample inference using sensitivity analysis: the case with few groups," Working Paper Series 2009:15, IFAU - Institute for Evaluation of Labour Market and Education Policy.
James G. MacKinnon & Matthew D. Webb, 2017. "Wild Bootstrap Inference for Wildly Different Cluster Sizes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(2), pages 233-254, March.
- James G. MacKinnon & Matthew D. Webb, 2015. "Wild Bootstrap Inference For Wildly Different Cluster Sizes," Working Paper 1314, Economics Department, Queen's University.
Adrien Montalbo, 2019. "Education and economic development. The influence of primary schooling on municipalities in nineteenth-century France," Working Papers halshs-02286126, HAL.
Daniel Mejía & Pascual Restrepo & Sandra V. Rozo, 2017. "On the Effects of Enforcement on Illegal Markets: Evidence from a Quasi-Experiment in Colombia," The World Bank Economic Review, World Bank, vol. 31(2), pages 570-594.
- Mejía,Daniel & Restrepo,Pascual & Rozo,Sandra V., 2015. "On the effects of enforcement on illegal markets : evidence from a quasi-experiment in Colombia," Policy Research Working Paper Series 7409, The World Bank.
A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner's Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
Thomas K. Bauer & Tanja Kasten & Lars-H. R. Siemers, 2017. "Business Taxation and Wages: Redistribution and Asymmetric Effects," MAGKS Papers on Economics 201732, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
- Thomas K. Bauer & Tanja Kasten & Lars-H. R. Siemers, 2017. "Business Taxation and Wages: Redistribution and Asymmetric Effects," Volkswirtschaftliche Diskussionsbeiträge 182-17, Universität Siegen, Fakultät Wirtschaftswissenschaften, Wirtschaftsinformatik und Wirtschaftsrecht.
Raffaello Bronzini & Eleonora Iachini, 2014. "Are Incentives for R&D Effective? Evidence from a Regression Discontinuity Approach," American Economic Journal: Economic Policy, American Economic Association, vol. 6(4), pages 100-134, November.
- Raffaello Bronzini & Eleonora Iachini, 2011. "Are incentives for R&D effective? Evidence from a regression discontinuity approach," Temi di discussione (Economic working papers) 791, Bank of Italy, Economic Research and International Relations Area.
- Raffaello Bronzini & Eleonora Iachini, 2012. "Are Incentives For R&D Effective? Evidence From A Regression Discontinuity Approach," ERSA conference papers ersa12p848, European Regional Science Association.
A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
- Jonah B. Gelbach & Doug Miller & A. Colin Cameron, 2006. "Bootstrap-Based Improvements for Inference with Clustered Errors," Working Papers 621, University of California, Davis, Department of Economics.
- A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2007. "Bootstrap-Based Improvements for Inference with Clustered Errors," NBER Technical Working Papers 0344, National Bureau of Economic Research, Inc.
Adrien Montalbo, 2019. "Education and economic development. The influence of primary schooling on municipalities in nineteenth-century France," PSE Working Papers halshs-02286126, HAL.
Michael Pollmann, 2020. "Causal Inference for Spatial Treatments," Papers 2011.00373, arXiv.org, revised Jan 2023.
David G. Blanchflower & Andrew Oswald, 1995. "International Wage Curves," NBER Chapters, in: Differences and Changes in Wage Structures, pages 145-174, National Bureau of Economic Research, Inc.
- David G. Blanchflower & Andrew J. Oswald, 1992. "International Wage Curves," NBER Working Papers 4200, National Bureau of Economic Research, Inc.
- David Blanchflower & A Oswald, 1993. "International Wage Curve," CEP Discussion Papers dp0116, Centre for Economic Performance, LSE.

More about this item

JEL classification:

C01 - Mathematical and Quantitative Methods - - General - - - Econometrics
C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models
