IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v42y2013i4p483-530.html
   My bibliography  Save this article

Multiple Imputation for Combined-survey Estimation With Incomplete Regressors in One but Not Both Surveys

Author

Listed:
  • Michael S. Rendall
  • Bonnie Ghosh-Dastidar
  • Margaret M. Weden
  • Elizabeth H. Baker
  • Zafar Nazarov

Abstract

Within-survey multiple imputation (MI) methods are adapted to pooled-survey regression estimation where one survey has more regressors, but typically fewer observations, than the other. This adaptation is achieved through (1) larger numbers of imputations to compensate for the higher fraction of missing values, (2) model-fit statistics to check the assumption that the two surveys sample from a common universe, and (3) specifying the analysis model completely from variables present in the survey with the larger set of regressors, thereby excluding variables never jointly observed. In contrast to the typical within-survey MI context, cross-survey missingness is monotonic and easily satisfies the missing at random assumption needed for unbiased MI. Large efficiency gains and substantial reduction in omitted variable bias are demonstrated in an application to sociodemographic differences in the risk of child obesity estimated from two nationally representative cohort surveys.

Suggested Citation

  • Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Elizabeth H. Baker & Zafar Nazarov, 2013. "Multiple Imputation for Combined-survey Estimation With Incomplete Regressors in One but Not Both Surveys," Sociological Methods & Research, , vol. 42(4), pages 483-530, November.
  • Handle: RePEc:sae:somere:v:42:y:2013:i:4:p:483-530
    DOI: 10.1177/0049124113502947
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124113502947
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124113502947?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Judith K. Hellerstein & Guido W. Imbens, 1999. "Imposing Moment Restrictions From Auxiliary Data By Weighting," The Review of Economics and Statistics, MIT Press, vol. 81(1), pages 1-14, February.
    2. John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 251-299.
    3. Mark Handcock & Sami Huovilainen & Michael Rendall, 2000. "Combining registration-system and survey data to estimate birth probabilities," Demography, Springer;Population Association of America (PAA), vol. 37(2), pages 187-192, May.
    4. Rodgers, Willard L, 1984. "An Evaluation of Statistical Matching," Journal of Business & Economic Statistics, American Statistical Association, vol. 2(1), pages 91-102, January.
    5. McCloskey, Donald N, 1985. "The Loss Function Has Been Mislaid: The Rhetoric of Significance Tests," American Economic Review, American Economic Association, vol. 75(2), pages 201-205, May.
    6. Michael Rendall & Ryan Admiraal & Alessandra DeRose & Paola DiGiulio & Mark Handcock & Filomena Racioppi, 2008. "Population constraints on pooled surveys in demographic hazard modeling," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 17(4), pages 519-539, October.
    7. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 655-680.
    8. Moriarity, Chris & Scheuren, Fritz, 2003. "A Note on Rubin's Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 21(1), pages 65-73, January.
    9. Ai, Chunrong & Norton, Edward C., 2003. "Interaction terms in logit and probit models," Economics Letters, Elsevier, vol. 80(1), pages 123-129, July.
    10. Michael Rendall & Mark Handcock & Stefan Jonsson, 2009. "Bayesian estimation of hispanic fertility hazards from survey and population data," Demography, Springer;Population Association of America (PAA), vol. 46(1), pages 65-83, February.
    11. Pfeffermann, Danny & Sverchkov, Michail, 2007. "Small-Area Estimation Under Informative Probability Sampling of Areas and Within the Selected Areas," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1427-1439, December.
    12. Ridder, Geert & Moffitt, Robert, 2007. "The Econometrics of Data Combination," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 75, Elsevier.
    13. Rubin, Donald B, 1986. "Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 4(1), pages 87-94, January.
    14. Patricia M. Anderson & Kristin F. Butcher, 2006. "Reading, Writing, and Refreshments: Are School Finances Contributing to Children’s Obesity?," Journal of Human Resources, University of Wisconsin Press, vol. 41(3).
    15. Michael Rendall & Margaret Weden & Melissa Favreault & Hilary Waldron, 2011. "The Protective Effect of Marriage for Survival: A Review and Update," Demography, Springer;Population Association of America (PAA), vol. 48(2), pages 481-506, May.
    16. Renato Assunção & Carl Schmertmann & Joseph Potter & Suzana Cavenaghi, 2005. "Empirical bayes estimation of demographic schedules for small areas," Demography, Springer;Population Association of America (PAA), vol. 42(3), pages 537-558, August.
    17. Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Zafar Nazarov, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers WR-887-1, RAND Corporation.
    18. Vicki Freedman & Douglas Wolf, 1995. "A case study on the use of multiple imputation," Demography, Springer;Population Association of America (PAA), vol. 32(3), pages 459-470, August.
    19. Rendall, Michael S. & Ghosh-Dastidar, Bonnie & Weden, Margaret M. & Nazarov, Zafar, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers 887-1, RAND Corporation.
    20. Kimbro, R.T. & Brooks-Gunn, J. & McLanahan, S., 2007. "Racial and ethnic differentials in overweight and obesity among 3-year-old children," American Journal of Public Health, American Public Health Association, vol. 97(2), pages 298-305.
    21. Classen, Timothy & Hokayem, Charles, 2005. "Childhood influences on youth obesity," Economics & Human Biology, Elsevier, vol. 3(2), pages 165-187, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Angela Greulich & Michael Rendall, 2014. "Multiple imputation for demographic hazard models with left-censored predictor variables," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01298942, HAL.
    2. Angela Greulich & Michael Rendall, 2014. "Multiple imputation for demographic hazard models with left-censored predictor variables," Working Papers hal-01298942, HAL.
    3. Catalina Amuedo-Dorantes & Mary J. Lopez, 2018. "Impeding or Accelerating Assimilation? Immigration Enforcement and Its Impact on Naturalization Patterns," RF Berlin - CReAM Discussion Paper Series 1814, Rockwool Foundation Berlin (RF Berlin) - Centre for Research and Analysis of Migration (CReAM).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Zafar Nazarov, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers WR-887-1, RAND Corporation.
    2. Michael S. Rendall & Mark S. Handcock & Stefan H. Jonsson, 2007. "Bayesian Estimation of Hispanic Fertility Hazards from Survey and Population Data," Working Papers WR-496, RAND Corporation.
    3. Michael Rendall & Ryan Admiraal & Alessandra DeRose & Paola DiGiulio & Mark Handcock & Filomena Racioppi, 2008. "Population constraints on pooled surveys in demographic hazard modeling," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 17(4), pages 519-539, October.
    4. Kiesl, Hans & Rässler, Susanne, 2006. "How valid can data fusion be?," IAB-Discussion Paper 200615, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    5. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    6. F Bravo, 2008. "Effcient M-estimators with auxiliary information," Discussion Papers 08/26, Department of Economics, University of York.
    7. Michael S. Rendall & Ryan Admiraal & Alessandra De Rose & Paola Di Giulio & Mark S. Handcock & Filomena Racioppi, 2006. "Population constraints on pooled surveys in demographic hazard modeling," MPIDR Working Papers WP-2006-039, Max Planck Institute for Demographic Research, Rostock, Germany.
    8. Esmeralda A. Ramalho & Richard J. Smith, 2013. "Discrete Choice Non-Response," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 80(1), pages 343-364.
    9. Esmeralda A. Ramalho & Joaquim J. S. Ramalho & Rui Evangelista, 2017. "Combining micro and macro data in hedonic price indexes," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(2), pages 317-332, June.
    10. Devereux, Paul J. & Tripathi, Gautam, 2009. "Optimally combining censored and uncensored datasets," Journal of Econometrics, Elsevier, vol. 151(1), pages 17-32, July.
    11. Liu, Tianqing & Yuan, Xiaohui, 2012. "Combining quasi and empirical likelihoods in generalized linear models with missing responses," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 39-58.
    12. Hirukawa, Masayuki & Prokhorov, Artem, 2018. "Consistent estimation of linear regression models using matched data," Journal of Econometrics, Elsevier, vol. 203(2), pages 344-358.
    13. Ahfock, Daniel & Pyne, Saumyadipta & Lee, Sharon X. & McLachlan, Geoffrey J., 2016. "Partial identification in the statistical matching problem," Computational Statistics & Data Analysis, Elsevier, vol. 104(C), pages 79-90.
    14. Michael S. Rendall & Mark S. Handcock & Stefan H. Jonsson, 2007. "Bayesian Estimation of Hispanic Fertility Hazards from Survey and Population Data," Working Papers 496, RAND Corporation.
    15. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    16. Ryosuke Igari & Takahiro Hoshino, 2018. "A Bayesian Gamma Frailty Model Using the Sum of Independent Random Variables: Application of the Estimation of an Interpurchase Timing Model," Keio-IES Discussion Paper Series 2018-021, Institute for Economics Studies, Keio University.
    17. Bayram, Deniz & Dayé, Modeste, 2014. "Asymptotic Properties of the Weighted Least Squares Estimator Under Moments Restriction," MPRA Paper 60465, University Library of Munich, Germany.
    18. Jennifer M. Mellor, 2011. "Do cigarette taxes affect children's body mass index? The effect of household environment on health," Health Economics, John Wiley & Sons, Ltd., vol. 20(4), pages 417-431, April.
    19. Peter ven de Ven & Anne Harrison & Barbara Fraumeni & Dennis Fixler & David Johnson & Andrew Craig & Kevin Furlong, 2017. "A Consistent Data Series to Evaluate Growth and Inequality in the National Accounts Note: The views expressed in this research, including those related to statistical, methodological, technical, or op," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 63, pages 437-459, December.
    20. Kaili Wang & Sanjana Hossain & Khandker Nurul Habib, 2022. "A hybrid data fusion methodology for household travel surveys to reduce proxy biases and under-representation of specific sub-group of population," Transportation, Springer, vol. 49(6), pages 1801-1836, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:42:y:2013:i:4:p:483-530. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.