Andrew Gelman (Department of Statistics, Columbia University) Iain Pardoe (School of Business, University of Oregon)
Abstract
Explained variance (R^2) is a familiar summary of the fit of a linear regression and has been generalized in various ways to multilevel (hierarchical) models. The multilevel models we consider in this paper are characterized by hierarchical data structures in which individuals are grouped into units (which themselves might be further grouped into larger units), and there are variables measured on individuals and each grouping unit. The models are based on regression relationships at different levels, with the first level corresponding to the individual data, and subsequent levels corresponding to between-group regressions of individual predictor effects on grouping unit variables. We present an approach to defining R^2 at each level of the multilevel model, rather than attempting to create a single summary measure of fit. Our method is based on comparing variances in a single fitted model rather than comparing to a null model. In simple regression, our measure generalizes the classical adjusted R^2. We also discuss a related variance comparison to summarize the degree to which estimates at each level of the model are pooled together based on the level-specific regression relationship, rather than estimated separately. This pooling factor is related to the concept of shrinkage in simple hierarchical models. We illustrate the methods on a dataset of radon in houses within counties using a series of models ranging from a simple linear regression model to a multilevel varying-intercept, varying-slope model.
Download Info
To download:
If you experience problems downloading a file, check if you have the
proper application to
view it first. Information about this may be contained
in the File-Format links below. In case of further problems read
the IDEAS help
page. Note that these files are not on the IDEAS
site. Please be patient as the files may be large.
Publisher Info
Paper provided by EconWPA in its series Econometrics with number
0404002.
Find related papers by JEL classification: C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: General C2 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables C3 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics C5 - Mathematical and Quantitative Methods - - Econometric Modeling C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
This paper has been announced in the following NEP Reports: