Regression with imputed covariates: A generalized missing-indicator approach
A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.
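The equivalence claimed in the abstract can be checked numerically in the simplest case. The sketch below is an illustration under stated assumptions, not the paper's exact construction: it assumes a single covariate with missing values (one missing-data pattern), and it takes the auxiliary variables to be the missing-data indicator together with its interactions with every regressor. The data-generating process, the noisy imputation rule, and the names `simulate`, `ols`, `x_filled`, and `X_aug` are all hypothetical.

```python
import numpy as np


def simulate(n=500, seed=0):
    """Simulated data (hypothetical): y depends on x and z; x is sometimes missing."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)
    x = 0.5 * z + rng.normal(size=n)
    y = 1.0 + 2.0 * x - 1.0 * z + rng.normal(size=n)
    missing = rng.random(n) < 0.3              # about 30% of x values are missing
    # Assumed imputation rule: a noisy prediction of x from z.
    x_imputed = 0.5 * z + rng.normal(size=n)
    x_filled = np.where(missing, x_imputed, x)  # observed value, or imputation
    return y, x_filled, z, missing


def ols(X, y):
    """Ordinary least squares coefficients via a least-squares solve."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


y, x_filled, z, missing = simulate()
X = np.column_stack([np.ones_like(z), x_filled, z])  # intercept, x, z
d = missing.astype(float)                            # missing-data indicator

# Complete-case analysis: use only rows where x was observed.
beta_cc = ols(X[~missing], y[~missing])

# Fill-in analysis: all rows, imputations treated as if they were data.
beta_fill = ols(X, y)

# Augmented model: add the indicator and its interactions with each regressor.
X_aug = np.column_stack([X, d[:, None] * X])
beta_aug = ols(X_aug, y)[:3]                         # coefficients of interest

print("complete-case:", beta_cc)
print("fill-in:      ", beta_fill)
print("augmented:    ", beta_aug)
```

In this sketch the first three coefficients of the augmented regression coincide with the complete-case estimates, because the interaction terms absorb everything specific to the imputed rows. Dropping all auxiliary columns gives the fill-in regression, so keeping only some of them, or averaging over such choices, is where the bias-precision trade-off is decided.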
Date of creation: 19 Apr 2011
Publication status: Published in Journal of Econometrics, Elsevier, 2011, doi:10.1016/j.jeconom.2011.02.005
Note: View the original document on HAL open archive server: https://hal.archives-ouvertes.fr/hal-00815561
Contact details of provider: Web page: https://hal.archives-ouvertes.fr/