Regression with imputed covariates: A generalized missing-indicator approach
Abstract
A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.
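The equivalence stated in the abstract can be checked numerically. The sketch below is not the authors' code: it uses simulated data, a single missing-data pattern, and a made-up imputation rule, and it assumes the auxiliary variables are the missing indicator interacted with every column of the filled-in design matrix. All variable names (`x1_filled`, `beta_aug`, and so on) are hypothetical. Under those assumptions, ordinary least squares on the augmented model reproduces the complete-case coefficients on the original regressors.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Simulated data (assumption): x1 is sometimes missing, x2 is always observed.
x2 = rng.normal(size=n)
x1 = 0.5 * x2 + rng.normal(size=n)
y = 1.0 + 2.0 * x1 - 1.0 * x2 + rng.normal(size=n)

# Missingness indicator and a hypothetical, deliberately noisy imputation.
missing = rng.random(n) < 0.3
x1_imputed = 0.5 * x2 + rng.normal(size=n)
x1_filled = np.where(missing, x1_imputed, x1)


def ols(X, y):
    """Least-squares coefficients of y on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


# Complete-case analysis: drop every row where x1 is missing.
X_cc = np.column_stack([np.ones(n), x1, x2])[~missing]
beta_cc = ols(X_cc, y[~missing])

# Filled-in design matrix W, using imputed values where x1 is missing.
W = np.column_stack([np.ones(n), x1_filled, x2])

# Auxiliary variables: the missing indicator times every column of W.
Z = W * missing[:, None]

# Augmented model: the first three coefficients belong to the original regressors.
beta_aug = ols(np.column_stack([W, Z]), y)[:3]

# Plain fill-in regression without auxiliary variables, for comparison.
beta_fill = ols(W, y)

print("complete-case:", beta_cc)
print("augmented    :", beta_aug)
print("fill-in only :", beta_fill)
```

Dropping some or all columns of `Z` moves the estimator away from complete-case analysis and towards the plain fill-in regression, which is where the model reduction and model averaging steps mentioned in the abstract would come in; neither is implemented here.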
Bibliographic Info
Paper provided by HAL in its series Post-Print with number peer-00815561.
Date of creation: 19 Apr 2011
Publication status: Published, Journal of Econometrics, 2011
Note: View the original document on HAL open archive server: http://peer.ccsd.cnrs.fr/peer-00815561
Contact details of provider:
Web page: http://hal.archives-ouvertes.fr/
Keywords: C12; C13; C19; Missing covariates; Imputations; Bias-precision trade-off; Model reduction; Model averaging; BMI and income
Other versions of this item:
- Dardanoni, Valentino & Modica, Salvatore & Peracchi, Franco, 2011. "Regression with imputed covariates: A generalized missing-indicator approach," Journal of Econometrics, Elsevier, vol. 162(2), pages 362-368, June.
- Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2011. "Regression with imputed covariates: A generalized missing-indicator approach," Post-Print hal-00815561, HAL.
- Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2009. "Regression with Imputed Covariates: a Generalized Missing Indicator Approach," CEIS Research Paper 150, Tor Vergata University, CEIS, revised 08 Oct 2009.
JEL classification:
- C12 - Mathematical and Quantitative Methods - Econometric and Statistical Methods and Methodology: General - Hypothesis Testing: General
- C13 - Mathematical and Quantitative Methods - Econometric and Statistical Methods and Methodology: General - Estimation: General
- C19 - Mathematical and Quantitative Methods - Econometric and Statistical Methods and Methodology: General - Other
References:
- Danilov, Dmitry & Magnus, Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.
- John Cawley & John R. Moran & Kosali I. Simon, 2008. "The Impact of Income on the Weight of Elderly Americans," NBER Working Papers 14104, National Bureau of Economic Research, Inc.
- John Cawley & John Moran & Kosali Simon, 2010. "The impact of income on the weight of elderly Americans," Health Economics, John Wiley & Sons, Ltd., vol. 19(8), pages 979-993, August.
- Jaume Garcia Villar & Climent Quintana-Domeque, 2006. "Income and body mass index in Europe," Economics Working Papers 1001, Department of Economics and Business, Universitat Pompeu Fabra, revised May 2008.
- Magnus, Jan R. & Powell, Owen & Prüfer, Patricia, 2010. "A comparison of two model averaging techniques with an application to growth empirics," Journal of Econometrics, Elsevier, vol. 154(2), pages 139-153, February.
- Gernot Doppelhofer & Ronald I. Miller & Xavier Sala-i-Martin, 2000. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," OECD Economics Department Working Papers 266, OECD Publishing.
- Xavier Sala-I-Martin & Gernot Doppelhofer & Ronald I. Miller, 2004. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," American Economic Review, American Economic Association, vol. 94(4), pages 813-835, September.
- Gernot Doppelhofer & Ronald I. Miller & Xavier Sala-i-Martin, 2000. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," NBER Working Papers 7750, National Bureau of Economic Research, Inc.
- Anna Sanz De Galdeano, 2005. "The Obesity Epidemic in Europe," CSEF Working Papers 143, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- Magnus, J.R. & Powell, O.R. & Prüfer, P., 2008. "A Comparison of Two Averaging Techniques with an Application to Growth Empirics," Discussion Paper 2008-39, Tilburg University, Center for Economic Research.
- Shapiro, Jesse & Glaeser, Edward & Cutler, David, 2003. "Why Have Americans Become More Obese," Scholarly Articles 2640583, Harvard University Department of Economics.
- Jan R. Magnus & J. Durbin, 1999. "Estimation of Regression Coefficients of Interest When Other Regression Coefficients Are of No Interest," Econometrica, Econometric Society, vol. 67(3), pages 639-644, May.
- Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
- David M. Cutler & Edward L. Glaeser & Jesse M. Shapiro, 2003. "Why Have Americans Become More Obese?," Journal of Economic Perspectives, American Economic Association, vol. 17(3), pages 93-118, Summer.
- David M. Cutler & Edward L. Glaeser & Jesse M. Shapiro, 2003. "Why Have Americans Become More Obese?," Harvard Institute of Economic Research Working Papers 1994, Harvard - Institute of Economic Research.
- David Cutler & Edward Glaeser & Jesse Shapiro, 2003. "Why Have Americans Become More Obese?," NBER Working Papers 9446, National Bureau of Economic Research, Inc.
- Julia Campos & Neil R. Ericsson & David F. Hendry, 2005. "General-to-specific modeling: an overview and selected bibliography," International Finance Discussion Papers 838, Board of Governors of the Federal Reserve System (U.S.).
- Tomas Philipson & Richard Posner, 2008. "Is the Obesity Epidemic a Public Health Problem? A Decade of Research on the Economics of Obesity," NBER Working Papers 14010, National Bureau of Economic Research, Inc.
Blog mentions
As found by EconAcademics.org, the blog aggregator for Economics research:
- "Gli esperti di valutazione all'italiana," by Alberto Baccini in ROARS - Return on Academic Research on 2011-12-16 15:45:50
Cited by:
- De Luca, G. & Magnus, J.R., 2011. "Bayesian Model Averaging and Weighted Average Least Squares: Equivariance, Stability, and Numerical Issues," Discussion Paper 2011-082, Tilburg University, Center for Economic Research.
- Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
- Zhang, Xinyu, 2013. "Model averaging with covariates that are missing completely at random," Economics Letters, Elsevier, vol. 121(3), pages 360-363.
- Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2012. "A generalized missing-indicator approach to regression with imputed covariates," Stata Journal, StataCorp LP, vol. 12(4), pages 575-604, December.
- Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2011. "A Generalized Missing-Indicator Approach to Regression with Imputed Covariates," EIEF Working Papers Series 1111, Einaudi Institute for Economics and Finance (EIEF), revised May 2011.
- Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2013. "Bayesian Model Averaging for Generalized Linear Models with Missing Covariates," EIEF Working Papers Series 1311, Einaudi Institute for Economics and Finance (EIEF), revised May 2013.