IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v46y2017i4p864-897.html

Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study

Author

Listed:
  • Christian Aßmann
  • Ariane Würbach
  • Solange Goßmann
  • Ferdinand Geissler
  • Anika Bela

Abstract

Large-scale surveys typically exhibit data structures characterized by rich mutual dependencies between surveyed variables and individual-specific skip patterns. Despite high efforts in fieldwork and questionnaire design, missing values inevitably occur. One approach for handling missing values is to provide multiply imputed data sets, thus enhancing the analytical potential of the surveyed data. To preserve possible nonlinear relationships among variables and incorporate skip patterns that make the full conditional distributions individual specific, we adapt a full conditional multiple imputation approach based on sequential classification and regression trees. Individual-specific skip patterns and constraints are handled within imputation in a way ensuring the consistency of the sequence of full conditional distributions. The suggested approach is illustrated in the context of income imputation in the adult cohort of the National Educational Panel Study.

Suggested Citation

  • Christian Aßmann & Ariane Würbach & Solange Goßmann & Ferdinand Geissler & Anika Bela, 2017. "Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study," Sociological Methods & Research, , vol. 46(4), pages 864-897, November.
  • Handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897
    DOI: 10.1177/0049124115610346
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124115610346
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124115610346?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 287-296, July.
    2. Joachim R. Frick & Markus M. Grabka, 2007. "Item Non-response and Imputation of Annual Labor Income in Panel Surveys from a Cross-National Perspective," SOEPpapers on Multidisciplinary Panel Data Research 49, DIW Berlin, The German Socio-Economic Panel (SOEP).
    3. Hapfelmeier, A. & Hothorn, T. & Ulm, K., 2012. "Recursive partitioning on incomplete data using surrogate decisions and multiple imputation," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1552-1565.
    4. Schenker, Nathaniel & Raghunathan, Trivellore E. & Chiu, Pei-Lu & Makuc, Diane M. & Zhang, Guangyu & Cohen, Alan J., 2006. "Multiple Imputation of Missing Income Data in the National Health Interview Survey," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 924-933, September.
    5. Rubin, Donald B., 2004. "The Design of a General and Flexible System for Handling Nonresponse in Sample Surveys," The American Statistician, American Statistical Association, vol. 58, pages 298-302, November.
    6. Jörg Drechsler, 2011. "Multiple imputation in practice—a case study using a complex German establishment survey," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(1), pages 1-26, March.
    7. P. Jenkins, Stephen, 2010. "The British Household Panel Survey and its income data," ISER Working Paper Series 2010-33, Institute for Social and Economic Research.
    8. Doove, L.L. & Van Buuren, S. & Dusseldorp, E., 2014. "Recursive partitioning for missing data imputation in the presence of interaction effects," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 92-104.
    9. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys: Reply," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 300-301, July.
    10. Regina Riphahn & Oliver Serfling, 2005. "Item non-response on income and wealth questions," Empirical Economics, Springer, vol. 30(2), pages 521-538, September.
    11. Lane F. Burgette & Jerome P. Reiter, 2012. "Nonparametric Bayesian Multiple Imputation for Missing Data Due to Mid-Study Switching of Measurement Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 439-449, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eisele, Martin & Zhu, Junyi, 2013. "Multiple imputation in a complex household survey - the German Panel on Household Finances (PHF): challenges and solutions," EconStor Preprints 100007, ZBW - Leibniz Information Centre for Economics.
    2. Juana Sanchez & Sydney Noelle Kahmann, 2017. "R&D, Attrition and Multiple Imputation in BRDIS," Working Papers 17-13, Center for Economic Studies, U.S. Census Bureau.
    3. Zachary H. Seeskin, 2016. "Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes," CARRA Working Papers 2016-06, Center for Economic Studies, U.S. Census Bureau.
    4. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    5. Youngjoo Cho & Debashis Ghosh, 2021. "Quantile-Based Subgroup Identification for Randomized Clinical Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(1), pages 90-128, April.
    6. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    7. A. R. Linero, 2017. "Bayesian nonparametric analysis of longitudinal studies in the presence of informative missingness," Biometrika, Biometrika Trust, vol. 104(2), pages 327-341.
    8. Gerko Vink & Stef van Buuren, 2013. "Multiple Imputation of Squared Terms," Sociological Methods & Research, , vol. 42(4), pages 598-607, November.
    9. Valaei, Naser & Rezaei, Sajad & Ismail, Wan Khairuzzaman Wan, 2017. "Examining learning strategies, creativity, and innovation at SMEs using fuzzy set Qualitative Comparative Analysis and PLS path modeling," Journal of Business Research, Elsevier, vol. 70(C), pages 224-233.
    10. Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
    11. Renate S M Buisman & Katharina Pittner & Marieke S Tollenaar & Jolanda Lindenberg & Lisa J M van den Berg & Laura H C G Compier-de Block & Joost R van Ginkel & Lenneke R A Alink & Marian J Bakermans-K, 2020. "Intergenerational transmission of child maltreatment using a multi-informant multi-generation family design," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-23, March.
    12. Chenyang Gu & Roee Gutman, 2017. "Combining item response theory with multiple imputation to equate health assessment questionnaires," Biometrics, The International Biometric Society, vol. 73(3), pages 990-998, September.
    13. Hai Zhong, 2010. "The impact of missing data in the estimation of concentration index: a potential source of bias," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 11(3), pages 255-266, June.
    14. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    15. Marco Di Zio & Ugo Guarnera & Roberta Varriale, 2016. "Estimation of the main variables of the economic account of small and medium enterprises based on administrative sources," Rivista di statistica ufficiale, ISTAT - Italian National Institute of Statistics - (Rome, ITALY), vol. 18(1), pages 71-81.
    16. Giuseppe Arbia & Giuseppe Espa & Diego Giuliani, 2016. "Dirty spatial econometrics," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 56(1), pages 177-189, January.
    17. Brad R. Humphreys & Yang Seung Lee & Brian P. Soebbing, 2010. "Consumer behaviour in lottery: the double hurdle approach and zeros in gambling survey data," International Gambling Studies, Taylor & Francis Journals, vol. 10(2), pages 165-176, August.
    18. Stockdale, Susan E. & Wells, Kenneth B. & Tang, Lingqi & Belin, Thomas R. & Zhang, Lily & Sherbourne, Cathy D., 2007. "The importance of social context: Neighborhood stressors, stress-buffering mechanisms, and alcohol, drug, and mental health disorders," Social Science & Medicine, Elsevier, vol. 65(9), pages 1867-1881, November.
    19. Bilal, Muhammad & Zhang, Yunfeng & Cai, Shukai & Akram, Umair & Halibas, Alrence, 2024. "Artificial intelligence is the magic wand making customer-centric a reality! An investigation into the relationship between consumer purchase intention and consumer engagement through affective attach," Journal of Retailing and Consumer Services, Elsevier, vol. 77(C).
    20. Robert J. Batt & Christian Terwiesch, 2015. "Waiting Patiently: An Empirical Study of Queue Abandonment in an Emergency Department," Management Science, INFORMS, vol. 61(1), pages 39-59, January.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.