IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v54y2025i2p448-499.html

High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods

Author

Listed:
  • Edoardo Costantini
  • Kyle M. Lang
  • Tim Reeskens
  • Klaas Sijtsma

Abstract

Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been limited research on their relative performance. In this study, we investigated a wide range of extant high-dimensional MI techniques that can handle a large number of predictors in the imputation models and general missing data patterns. We assessed the relative performance of seven high-dimensional MI methods with a Monte Carlo simulation study and a resampling study based on real survey data. The performance of the methods was defined by the degree to which they facilitate unbiased and confidence-valid estimates of the parameters of complete data analysis models. We found that using lasso penalty or forward selection to select the predictors used in the MI model and using principal component analysis to reduce the dimensionality of auxiliary data produce the best results.

Suggested Citation

  • Edoardo Costantini & Kyle M. Lang & Tim Reeskens & Klaas Sijtsma, 2025. "High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods," Sociological Methods & Research, , vol. 54(2), pages 448-499, May.
  • Handle: RePEc:sae:somere:v:54:y:2025:i:2:p:448-499
    DOI: 10.1177/00491241231200194
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/00491241231200194
    Download Restriction: no

    File URL: https://libkey.io/10.1177/00491241231200194?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Douglas S. Massey & Roger Tourangeau, 2013. "Where Do We Go from Here? Nonresponse and Social Measurement," The ANNALS of the American Academy of Political and Social Science, , vol. 645(1), pages 222-236, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Salvatore, Camilla & Lugtig, Peter & Struminskaya, Bella, 2025. "Youth Nonresponse in Panel Surveys: Investigating the Impact of Life Events," SocArXiv br9ng_v1, Center for Open Science.
    2. Winnie S Chow & Jan Schmidtke & Adrian Loerbroks & Thomas Muth & Peter Angerer, 2018. "The Relationship between Personality Traits with Depressive Symptoms and Suicidal Ideation among Medical Students: A Cross-Sectional Study at One Medical School in Germany," IJERPH, MDPI, vol. 15(7), pages 1-11, July.
    3. Celhay, Pablo & Meyer, Bruce D. & Mittag, Nikolas, 2024. "What leads to measurement errors? Evidence from reports of program participation in three surveys," Journal of Econometrics, Elsevier, vol. 238(2).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:54:y:2025:i:2:p:448-499. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.