IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v46y2017i4p864-897.html
   My bibliography  Save this article

Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study

Author

Listed:
  • Christian Aßmann
  • Ariane Würbach
  • Solange Goßmann
  • Ferdinand Geissler
  • Anika Bela

Abstract

Large-scale surveys typically exhibit data structures characterized by rich mutual dependencies between surveyed variables and individual-specific skip patterns. Despite high efforts in fieldwork and questionnaire design, missing values inevitably occur. One approach for handling missing values is to provide multiply imputed data sets, thus enhancing the analytical potential of the surveyed data. To preserve possible nonlinear relationships among variables and incorporate skip patterns that make the full conditional distributions individual specific, we adapt a full conditional multiple imputation approach based on sequential classification and regression trees. Individual-specific skip patterns and constraints are handled within imputation in a way ensuring the consistency of the sequence of full conditional distributions. The suggested approach is illustrated in the context of income imputation in the adult cohort of the National Educational Panel Study.

Suggested Citation

  • Christian Aßmann & Ariane Würbach & Solange Goßmann & Ferdinand Geissler & Anika Bela, 2017. "Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study," Sociological Methods & Research, , vol. 46(4), pages 864-897, November.
  • Handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897
    DOI: 10.1177/0049124115610346
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124115610346
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124115610346?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Doove, L.L. & Van Buuren, S. & Dusseldorp, E., 2014. "Recursive partitioning for missing data imputation in the presence of interaction effects," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 92-104.
    2. Regina Riphahn & Oliver Serfling, 2005. "Item non-response on income and wealth questions," Empirical Economics, Springer, vol. 30(2), pages 521-538, September.
    3. Frick, Joachim R. & Grabka, Markus M., 2007. "Item Non-Response and Imputation of Annual Labor Income in Panel Surveys from a Cross-National Perspective," IZA Discussion Papers 3043, Institute of Labor Economics (IZA).
    4. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 287-296, July.
    5. Schenker, Nathaniel & Raghunathan, Trivellore E. & Chiu, Pei-Lu & Makuc, Diane M. & Zhang, Guangyu & Cohen, Alan J., 2006. "Multiple Imputation of Missing Income Data in the National Health Interview Survey," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 924-933, September.
    6. Jörg Drechsler, 2011. "Multiple imputation in practice—a case study using a complex German establishment survey," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(1), pages 1-26, March.
    7. Hapfelmeier, A. & Hothorn, T. & Ulm, K., 2012. "Recursive partitioning on incomplete data using surrogate decisions and multiple imputation," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1552-1565.
    8. Rubin, Donald B., 2004. "The Design of a General and Flexible System for Handling Nonresponse in Sample Surveys," The American Statistician, American Statistical Association, vol. 58, pages 298-302, November.
    9. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys: Reply," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 300-301, July.
    10. P. Jenkins, Stephen, 2010. "The British Household Panel Survey and its income data," ISER Working Paper Series 2010-33, Institute for Social and Economic Research.
    11. Lane F. Burgette & Jerome P. Reiter, 2012. "Nonparametric Bayesian Multiple Imputation for Missing Data Due to Mid-Study Switching of Measurement Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 439-449, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin, Eisele & Zhu, Junyi, 2013. "Multiple imputation in a complex household survey - the German Panel on Household Finances (PHF): challenges and solutions," MPRA Paper 57666, University Library of Munich, Germany.
    2. Juana Sanchez & Sydney Noelle Kahmann, 2017. "R&D, Attrition and Multiple Imputation in BRDIS," Working Papers 17-13, Center for Economic Studies, U.S. Census Bureau.
    3. Zachary H. Seeskin, 2016. "Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes," CARRA Working Papers 2016-06, Center for Economic Studies, U.S. Census Bureau.
    4. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    5. Youngjoo Cho & Debashis Ghosh, 2021. "Quantile-Based Subgroup Identification for Randomized Clinical Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(1), pages 90-128, April.
    6. A. R. Linero, 2017. "Bayesian nonparametric analysis of longitudinal studies in the presence of informative missingness," Biometrika, Biometrika Trust, vol. 104(2), pages 327-341.
    7. Daniel Schunk, 2006. "The German SAVE Survey: Documentation and Methodology," MEA discussion paper series 06109, Munich Center for the Economics of Aging (MEA) at the Max Planck Institute for Social Law and Social Policy.
    8. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    9. Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
    10. Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Incomplete panels and selection bias : A survey," Discussion Paper 1992-7, Tilburg University, Center for Economic Research.
    11. Hai Zhong, 2010. "The impact of missing data in the estimation of concentration index: a potential source of bias," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 11(3), pages 255-266, June.
    12. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    13. Dang, Hai-Anh & Carletto, Calogero, 2022. "Recall Bias Revisited: Measure Farm Labor Using Mixed-Mode Surveys and Multiple Imputation," IZA Discussion Papers 14997, Institute of Labor Economics (IZA).
    14. Frick, Joachim R. & Grabka, Markus M. & Groh-Samberg, Olaf, 2012. "Dealing With Incomplete Household Panel Data in Inequality Research," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 41(1), pages 89-123.
    15. Daniel Schunk, 2007. "A Markov Chain Monte Carlo Multiple Imputation Procedure for Dealing with Item Nonresponse in the German SAVE Survey," MEA discussion paper series 07121, Munich Center for the Economics of Aging (MEA) at the Max Planck Institute for Social Law and Social Policy.
    16. Brownstone, David, 1997. "Multiple Imputation Methodology for Missing Data, Non-Random Response, and Panel Attrition," University of California Transportation Center, Working Papers qt2zd6w6hh, University of California Transportation Center.
    17. F. Di Lascio & Simone Giannerini & Alessandra Reale, 2015. "Exploring copulas for the imputation of complex dependent data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(1), pages 159-175, March.
    18. Ankita Patnaik & Jeffrey Hemmeter & Arif Mamun, "undated". "Promoting Readiness of Minors with Autism Spectrum Disorder: Evidence from a Randomized Controlled Trial," Mathematica Policy Research Reports a74c93d9bdce40709ad81cdbc, Mathematica Policy Research.
    19. Joachim R. Frick & Markus M. Grabka, 2007. "Item Non-response and Imputation of Annual Labor Income in Panel Surveys from a Cross-National Perspective," Discussion Papers of DIW Berlin 736, DIW Berlin, German Institute for Economic Research.
    20. Ahfock, Daniel & Pyne, Saumyadipta & McLachlan, Geoffrey J., 2022. "Statistical file-matching of non-Gaussian data: A game theoretic approach," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.