Regression with imputed covariates: A generalized missing-indicator approach
A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.
|Date of creation:||19 Apr 2011|
|Date of revision:|
|Publication status:||Published, Journal of Econometrics, 2011|
|Note:||View the original document on HAL open archive server: http://hal.archives-ouvertes.fr/hal-00815561|
|Contact details of provider:|| Web page: http://hal.archives-ouvertes.fr/|
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Shapiro, Jesse & Glaeser, Edward & Cutler, David, 2003.
"Why Have Americans Become More Obese,"
2640583, Harvard University Department of Economics.
- David M. Cutler & Edward L. Glaeser & Jesse M. Shapiro, 2003. "Why Have Americans Become More Obese?," Harvard Institute of Economic Research Working Papers 1994, Harvard - Institute of Economic Research.
- David Cutler & Edward Glaeser & Jesse Shapiro, 2003. "Why Have Americans Become More Obese?," NBER Working Papers 9446, National Bureau of Economic Research, Inc.
- Jaume Garcia & Climent Quintana, 2008.
"Income and Body Mass Index in Europe,"
- Xavier Sala-I-Martin & Gernot Doppelhofer & Ronald I. Miller, 2004.
"Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach,"
American Economic Review,
American Economic Association, vol. 94(4), pages 813-835, September.
- Gernot Doppelhofer & Ronald I. Miller & Xavier Sala-i-Martin, 2000. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," NBER Working Papers 7750, National Bureau of Economic Research, Inc.
- Gernot Doppelhofer & Ronald I. Miller & Xavier Sala-i-Martin, 2000. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (Bace) Approach," OECD Economics Department Working Papers 266, OECD Publishing.
- Sanz-de-Galdeano, Anna, 2005.
"The Obesity Epidemic in Europe,"
IZA Discussion Papers
1814, Institute for the Study of Labor (IZA).
- Jan R. Magnus & J. Durbin, 1999. "Estimation of Regression Coefficients of Interest When Other Regression Coefficients Are of No Interest," Econometrica, Econometric Society, vol. 67(3), pages 639-644, May.
- Tomas Philipson & Richard Posner, 2008. "Is the Obesity Epidemic a Public Health Problem? A Decade of Research on the Economics of Obesity," NBER Working Papers 14010, National Bureau of Economic Research, Inc.
- John Cawley & John Moran & Kosali Simon, 2010.
"The impact of income on the weight of elderly Americans,"
John Wiley & Sons, Ltd., vol. 19(8), pages 979-993, August.
- John Cawley & John R. Moran & Kosali I. Simon, 2008. "The Impact of Income on the Weight of Elderly Americans," NBER Working Papers 14104, National Bureau of Economic Research, Inc.
- Julia Campos & Neil R. Ericsson & David F. Hendry, 2005. "General-to-specific modeling: an overview and selected bibliography," International Finance Discussion Papers 838, Board of Governors of the Federal Reserve System (U.S.).
- Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
- repec:dgr:kubcen:200839 is not listed on IDEAS
- Magnus, Jan R. & Powell, Owen & Prüfer, Patricia, 2010. "A comparison of two model averaging techniques with an application to growth empirics," Journal of Econometrics, Elsevier, vol. 154(2), pages 139-153, February.
- Danilov, Dmitry & Magnus, J.R.Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.
When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-00815561. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (CCSD)
If references are entirely missing, you can add them using this form.