IDEAS home Printed from https://ideas.repec.org/p/rtv/ceisrp/150.html
   My bibliography  Save this paper

Regression with Imputed Covariates:a Generalized Missing Indicator Approach

Author

Abstract

A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.

Suggested Citation

  • Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2009. "Regression with Imputed Covariates:a Generalized Missing Indicator Approach," CEIS Research Paper 150, Tor Vergata University, CEIS, revised 08 Oct 2009.
  • Handle: RePEc:rtv:ceisrp:150
    as

    Download full text from publisher

    File URL: https://ceistorvergata.it/RePEc/rpaper/RP150.pdf
    File Function: Main text
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
    2. Magnus, J.R. & Powell, O.R. & Prüfer, P., 2008. "A Comparison of Two Averaging Techniques with an Application to Growth Empirics," Other publications TiSEM 0392dffa-51e0-4bc9-9644-f, Tilburg University, School of Economics and Management.
    3. John Cawley & John Moran & Kosali Simon, 2010. "The impact of income on the weight of elderly Americans," Health Economics, John Wiley & Sons, Ltd., vol. 19(8), pages 979-993, August.
    4. David M. Cutler & Edward L. Glaeser & Jesse M. Shapiro, 2003. "Why Have Americans Become More Obese?," Journal of Economic Perspectives, American Economic Association, vol. 17(3), pages 93-118, Summer.
    5. García Villar, Jaume & Quintana-Domeque, Climent, 2009. "Income and body mass index in Europe," Economics & Human Biology, Elsevier, vol. 7(1), pages 73-83, March.
    6. Sanz-de-Galdeano, Anna, 2005. "The Obesity Epidemic in Europe," IZA Discussion Papers 1814, Institute of Labor Economics (IZA).
    7. Magnus, Jan R. & Powell, Owen & Prüfer, Patricia, 2010. "A comparison of two model averaging techniques with an application to growth empirics," Journal of Econometrics, Elsevier, vol. 154(2), pages 139-153, February.
    8. Tomas Philipson & Richard Posner, 2008. "Is the Obesity Epidemic a Public Health Problem? A Decade of Research on the Economics of Obesity," NBER Working Papers 14010, National Bureau of Economic Research, Inc.
    9. Jan R. Magnus & J. Durbin, 1999. "Estimation of Regression Coefficients of Interest When Other Regression Coefficients Are of No Interest," Econometrica, Econometric Society, vol. 67(3), pages 639-644, May.
    10. Xavier Sala-I-Martin & Gernot Doppelhofer & Ronald I. Miller, 2004. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," American Economic Review, American Economic Association, vol. 94(4), pages 813-835, September.
    11. Danilov, Dmitry & Magnus, J.R.Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.
    12. Julia Campos & Neil R. Ericsson & David F. Hendry, 2005. "General-to-specific modeling: an overview and selected bibliography," International Finance Discussion Papers 838, Board of Governors of the Federal Reserve System (U.S.).
    Full references (including those not matched with items on IDEAS)

    Citations

    Blog mentions

    As found by EconAcademics.org, the blog aggregator for Economics research:
    1. Gli esperti di valutazione all’italiana
      by Alberto Baccini in ROARS - Return on Academic Research on 2011-12-16 21:45:50

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Aedın Doris & Donal O’Neill & Olive Sweetman, 2011. "GMM estimation of the covariance structure of longitudinal data on earnings," Stata Journal, StataCorp LP, vol. 11(3), pages 439-459, September.
    2. Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
    3. Giuseppe De Luca & Jan R. Magnus & Franco Peracchi, 2022. "Asymptotic properties of the weighted-average least squares (WALS) estimator," EIEF Working Papers Series 2203, Einaudi Institute for Economics and Finance (EIEF), revised Mar 2022.
    4. World Bank, 2015. "Tanzania Poverty Assessment," World Bank Publications - Reports 21871, The World Bank Group.
    5. McDonough, Ian K. & Millimet, Daniel L., 2017. "Missing data, imputation, and endogeneity," Journal of Econometrics, Elsevier, vol. 199(2), pages 141-155.
    6. Chris Muris, 2020. "Efficient GMM Estimation with Incomplete Data," The Review of Economics and Statistics, MIT Press, vol. 102(3), pages 518-530, July.
    7. Djavad Salehi-Isfahani & Nadia Hassine & Ragui Assaad, 2014. "Equality of opportunity in educational achievement in the Middle East and North Africa," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(4), pages 489-515, December.
    8. Giuseppe Luca & Jan R. Magnus & Franco Peracchi, 2023. "Weighted-Average Least Squares (WALS): Confidence and Prediction Intervals," Computational Economics, Springer;Society for Computational Economics, vol. 61(4), pages 1637-1664, April.
    9. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2012. "A generalized missing-indicator approach to regression with imputed covariates," Stata Journal, StataCorp LP, vol. 12(4), pages 575-604, December.
    10. Yuan, Chaoxia & Fang, Fang & Ni, Lyu, 2022. "Mallows model averaging with effective model size in fragmentary data prediction," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    11. Jiti Gao & Bin Peng & Zhao Ren & Xiaohui Zhang, 2015. "Variable Selection for a Categorical Varying-Coefficient Model with Identifications for Determinants of Body Mass Index," Monash Econometrics and Business Statistics Working Papers 21/15, Monash University, Department of Econometrics and Business Statistics.
    12. Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Ignoring Non-ignorable Missingness," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 31-50, March.
    13. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2013. "Bayesian Model Averaging for Generalized Linear Models with Missing Covariates," EIEF Working Papers Series 1311, Einaudi Institute for Economics and Finance (EIEF), revised May 2013.
    14. Laszlo Goerke & Sabrina Jeworrek & Markus Pannenberg, 2015. "Trade union membership and paid vacation in Germany," IZA Journal of Labor Economics, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 4(1), pages 1-26, December.
    15. Zhang, Xinyu, 2013. "Model averaging with covariates that are missing completely at random," Economics Letters, Elsevier, vol. 121(3), pages 360-363.
    16. Dardanoni, Valentino & De Luca, Giuseppe & Modica, Salvatore & Peracchi, Franco, 2015. "Model averaging estimation of generalized linear models with imputed covariates," Journal of Econometrics, Elsevier, vol. 184(2), pages 452-463.
    17. Francesco Bartolucci & Fulvia Pennoni & Giorgio Vittadini, 2023. "A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development," Journal of Educational and Behavioral Statistics, , vol. 48(4), pages 387-419, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:hal:journl:peer-00815561 is not listed on IDEAS
    2. Judith Anne Clarke, 2017. "Model Averaging OLS and 2SLS: An Application of the WALS Procedure," Econometrics Working Papers 1701, Department of Economics, University of Victoria.
    3. Jan R. Magnus & Wendun Wang & Xinyu Zhang, 2016. "Weighted-Average Least Squares Prediction," Econometric Reviews, Taylor & Francis Journals, vol. 35(6), pages 1040-1074, June.
    4. Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
    5. Aiyar, Shekhar & Duval, Romain & Puy, Damien & Wu, Yiqun & Zhang, Longmei, 2018. "Growth slowdowns and the middle-income trap," Japan and the World Economy, Elsevier, vol. 48(C), pages 22-37.
    6. De Luca, Giuseppe & Magnus, Jan R. & Peracchi, Franco, 2018. "Weighted-average least squares estimation of generalized linear models," Journal of Econometrics, Elsevier, vol. 204(1), pages 1-17.
    7. García Villar, Jaume & Quintana-Domeque, Climent, 2009. "Income and body mass index in Europe," Economics & Human Biology, Elsevier, vol. 7(1), pages 73-83, March.
    8. Salmasi, Luca & Celidoni, Martina, 2017. "Investigating the poverty-obesity paradox in Europe," Economics & Human Biology, Elsevier, vol. 26(C), pages 70-85.
    9. Dardanoni, Valentino & De Luca, Giuseppe & Modica, Salvatore & Peracchi, Franco, 2015. "Model averaging estimation of generalized linear models with imputed covariates," Journal of Econometrics, Elsevier, vol. 184(2), pages 452-463.
    10. Tumala, Mohammed M & Olubusoye, Olusanya E & Yaaba, Baba N & Yaya, OlaOluwa S & Akanbi, Olawale B, 2017. "Investigating Predictors of Inflation in Nigeria: BMA and WALS Techniques," MPRA Paper 88773, University Library of Munich, Germany, revised Feb 2018.
    11. Poghosyan, K., 2012. "Structural and reduced-form modeling and forecasting with application to Armenia," Other publications TiSEM ad1a24c3-15e6-4f04-b338-3, Tilburg University, School of Economics and Management.
    12. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2013. "Bayesian Model Averaging for Generalized Linear Models with Missing Covariates," EIEF Working Papers Series 1311, Einaudi Institute for Economics and Finance (EIEF), revised May 2013.
    13. Srdelić, Leonarda & Dávila-Fernández, Marwil J., 2024. "International trade and economic growth in Croatia," Structural Change and Economic Dynamics, Elsevier, vol. 68(C), pages 240-258.
    14. Aedın Doris & Donal O’Neill & Olive Sweetman, 2011. "GMM estimation of the covariance structure of longitudinal data on earnings," Stata Journal, StataCorp LP, vol. 11(3), pages 439-459, September.
    15. Sachs, Andreas, 2010. "A Bayesian approach to determine the impact of institutions on the unemployment rate," ZEW Discussion Papers 10-058, ZEW - Leibniz Centre for European Economic Research.
    16. Becker William & Paruolo Paolo & Saltelli Andrea, 2021. "Variable Selection in Regression Models Using Global Sensitivity Analysis," Journal of Time Series Econometrics, De Gruyter, vol. 13(2), pages 187-233, July.
    17. John W. Galbraith & Victoria Zinde-Walsh, 2011. "Partially Dimension-Reduced Regressions with Potentially Infinite-Dimensional Processes," CIRANO Working Papers 2011s-57, CIRANO.
    18. Afonso, António & Tovar Jalles, João, 2019. "Quantitative easing and sovereign yield spreads: Euro-area time-varying evidence," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 58(C), pages 208-224.
    19. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    20. Nicolas End, 2020. "Rousseau's social contract or Machiavelli's virtue? A measure of fiscal credibility," Working Papers halshs-03078704, HAL.
    21. Tafreschi, Darjusch, 2015. "The income body weight gradients in the developing economy of China," Economics & Human Biology, Elsevier, vol. 16(C), pages 115-134.

    More about this item

    Keywords

    Missing covariates; Imputations; Bias-precision trade-off; Model reduction; Model averaging; BMI and income.;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:rtv:ceisrp:150. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Barbara Piazzi (email available below). General contact details of provider: https://edirc.repec.org/data/csrotit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.