A generalized missing-indicator approach to regression with imputed covariates
We consider estimation of a linear regression model using data where some covariate values are missing but imputations are available to fill in the miss- ing values. This situation generates a tradeoff between bias and precision when estimating the regression parameters of interest. Using only the subsample of complete observations does not cause bias but may imply a substantial loss of precision because the complete cases may be too few. On the other hand, filling in the missing values with imputations may cause bias. We provide the new Stata command gmi, which handles such tradeoff by using either model reduction or Bayesian model averaging techniques in the context of the generalized missing- indicator approach recently proposed by Dardanoni, Modica, and Peracchi (2011, Journal of Econometrics 162: 362–368). If multiple imputations are available, gmi can also be combined with the built-in Stata prefix mi estimate to account for extra variability due to imputation. We illustrate the use of gmi with an empirical application in the health domain, where item nonresponse is substantial. Copyright 2012 by StataCorp LP.
Volume (Year): 12 (2012)
Issue (Month): 4 (December)
|Note:||to access software from within Stata, net describe http://www.stata-journal.com/software/sj12-4/st0273/|
|Contact details of provider:|| Web page: http://www.stata-journal.com/|
|Order Information:||Web: http://www.stata-journal.com/subscription.html|
References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
- Einmahl, J.H.J. & Magnus, J.R. & Kumar, K., 2011. "On the Choice of Prior in Bayesian Model Averaging," Discussion Paper 2011-003, Tilburg University, Center for Economic Research.
- Dimitrios Christelis, 2011. "Imputation of Missing Data in Waves 1 and 2 of SHARE," CSEF Working Papers 278, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- De Luca, G. & Magnus, J.R., 2011.
"Bayesian Model Averaging and Weighted Average Least Squares : Equivariance, Stability, and Numerical Issues,"
2011-082, Tilburg University, Center for Economic Research.
- Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
- Jan R. Magnus, 2002. "Estimation of the mean of a univariate normal distribution with known variance," Econometrics Journal, Royal Economic Society, vol. 5(1), pages 225-236, June.
- Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2011.
"Regression with imputed covariates: A generalized missing-indicator approach,"
- Dardanoni, Valentino & Modica, Salvatore & Peracchi, Franco, 2011. "Regression with imputed covariates: A generalized missing-indicator approach," Journal of Econometrics, Elsevier, vol. 162(2), pages 362-368, June.
- Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2009. "Regression with Imputed Covariates:a Generalized Missing Indicator Approach," CEIS Research Paper 150, Tor Vergata University, CEIS, revised 08 Oct 2009.
- repec:hal:journl:peer-00815561 is not listed on IDEAS
- Agar Brugiavini & Tullio Jappelli & Guglielmo Weber, 2002. "The Survey on Health, Aging and Wealth," CSEF Working Papers 86, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- Magnus, Jan R. & Powell, Owen & Prüfer, Patricia, 2010. "A comparison of two model averaging techniques with an application to growth empirics," Journal of Econometrics, Elsevier, vol. 154(2), pages 139-153, February.
- Magnus, Jan R. & Wan, Alan T.K. & Zhang, Xinyu, 2011. "Weighted average least squares estimation with nonspherical disturbances and an application to the Hong Kong housing market," Computational Statistics & Data Analysis, Elsevier, vol. 55(3), pages 1331-1341, March.
- Charles Lindsey & Simon Sheather, 2010. "Variable selection in linear regression," Stata Journal, StataCorp LP, vol. 10(4), pages 650-669, December.
- Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:12:y:2012:i:4:p:575-604. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Christopher F. Baum)or (Lisa Gilmore)
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If references are entirely missing, you can add them using this form.
If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.
Please note that corrections may take a couple of weeks to filter through the various RePEc services.