
Comparison of imputation methods for univariate categorical longitudinal data

Author

Listed:
  • Kevin Emery

    (Swiss Centre of Expertise in Life Course Research LIVES
    University of Geneva)

  • Matthias Studer

    (Swiss Centre of Expertise in Life Course Research LIVES
    University of Geneva)

  • André Berchtold

    (University of Lausanne)

Abstract

The life course paradigm emphasizes the need to study not only the situation at a given point in time, but also its evolution over the life course in the medium and long term. These trajectories are often represented by categorical data. This article aims to provide a comprehensive review of the multiple imputation methods proposed so far in the context of univariate categorical data and to assess their practical relevance through a simulation study based on real data. The primary goal is to provide clear methodological guidelines and improve the handling of missing data in life course research. In parallel, we develop the MICT-timing algorithm, which is an extension of the MICT algorithm. This innovative multiple imputation method improves the quality of imputation in trajectories subject to time-varying transition rates, a situation often encountered in life course data.
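To make the idea of time-varying transition rates concrete, here is a minimal sketch of multiple imputation for a single categorical trajectory. It is not the MICT or MICT-timing algorithm described in the article. It swaps in a simple first-order Markov scheme in which transition probabilities are estimated only from donor observations close to the time point being imputed. All names (`transition_probs`, `impute_once`, `multiple_imputation`, `window`, `smoothing`) are hypothetical, missing values are assumed to be coded as `None`, and gaps are filled left to right, so the next state is used only when it is observed.

```python
import random
from collections import Counter


def transition_probs(sequences, t, states, window=1, smoothing=0.5):
    """Estimate P(x_u = b | x_{u-1} = a) from transitions into positions u
    lying within `window` of t, so that the rates can vary over time."""
    counts = {a: Counter() for a in states}
    for seq in sequences:
        first = max(1, t - window)
        last = min(len(seq) - 1, t + window)
        for u in range(first, last + 1):
            a, b = seq[u - 1], seq[u]
            if a is not None and b is not None:
                counts[a][b] += 1
    probs = {}
    for a in states:
        # Additive smoothing keeps every probability strictly positive.
        total = sum(counts[a].values()) + smoothing * len(states)
        probs[a] = {b: (counts[a][b] + smoothing) / total for b in states}
    return probs


def impute_once(seq, sequences, states, rng, window=1):
    """Fill each missing position by sampling a state, weighting by the
    time-specific transition from the previous state and, when the next
    state is observed, the transition into it."""
    out = list(seq)
    for t, value in enumerate(out):
        if value is not None:
            continue
        weights = {s: 1.0 for s in states}
        if t > 0 and out[t - 1] is not None:
            p_in = transition_probs(sequences, t, states, window)
            for s in states:
                weights[s] *= p_in[out[t - 1]][s]
        if t + 1 < len(out) and out[t + 1] is not None:
            p_out = transition_probs(sequences, t + 1, states, window)
            for s in states:
                weights[s] *= p_out[s][out[t + 1]]
        out[t] = rng.choices(states, weights=[weights[s] for s in states])[0]
    return out


def multiple_imputation(seq, sequences, states, m=5, seed=0):
    """Return m independently imputed copies of `seq`."""
    rng = random.Random(seed)
    return [impute_once(seq, sequences, states, rng) for _ in range(m)]
```

Calling `multiple_imputation(["E", None, "W", "W", "W"], donors, ["E", "W"])` with a list of complete donor trajectories returns five completed copies of the trajectory. An analysis would then be run on each copy and the results pooled, which is the general logic of multiple imputation.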

Suggested Citation

  • Kevin Emery & Matthias Studer & André Berchtold, 2025. "Comparison of imputation methods for univariate categorical longitudinal data," Quality & Quantity: International Journal of Methodology, Springer, vol. 59(2), pages 1767-1791, April.
  • Handle: RePEc:spr:qualqt:v:59:y:2025:i:2:d:10.1007_s11135-024-02028-z
    DOI: 10.1007/s11135-024-02028-z

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11135-024-02028-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11135-024-02028-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Estelle McLean & Amelia C Crampin & Rebecca Sear & Maria Sironi & Emma Slaymaker & Albert Dube, 2024. "Transitions to adulthood in men and women in rural Malawi in the 21st century using sequence analysis: Some evidence of delay," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 51(14), pages 459-500.
    2. Erofili Grapsa & Dorrit Posel, 2016. "Sequencing the real time of the elderly: Evidence from South Africa," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 35(25), pages 711-744.
    3. Liao, Tim F. & Bolano, Danilo & Brzinsky-Fay, Christian & Cornwell, Benjamin & Fasang, Anette Eva & Helske, Satu & Piccarreta, Raffaella & Raab, Marcel & Ritschard, Gilbert & Struffolino, Emanuela & S, 2022. "Sequence analysis: Its past, present, and future," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 107, pages 1-1.
    4. Montorsi, Carlotta & Fusco, Alessio & Van Kerm, Philippe & Bordas, Stéphane P.A., 2024. "Predicting depression in old age: Combining life course data with machine learning," Economics & Human Biology, Elsevier, vol. 52(C).
    5. repec:osf:socarx:3mcfp_v1 is not listed on IDEAS
    6. Piccarreta, Raffaella & Bonetti, Marco, 2019. "Assessing and comparing models for sequence data by microsimulation (with Supplementary Material)," SocArXiv 3mcfp, Center for Open Science.
    7. Keefe Murphy & T. Brendan Murphy & Raffaella Piccarreta & I. Claire Gormley, 2021. "Clustering longitudinal life‐course sequences using mixtures of exponential‐distance models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1414-1451, October.
    8. Dolores Sesma Carlos & Jan Kok & Michel Oris, 2022. "Coping with ageing: An historical longitudinal study of internal return migrations later in life in the Netherlands," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(27), pages 767-808.
    9. Marc A. Scott & Jean-Marie Goff & Jacques-Antoine Gauthier, 2024. "History matters: the statistical modelling of the life course," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 445-469, February.
    10. Marco Raffaella Piccarreta & Marco Bonetti & Stefano Lombardi, 2018. "Comparing models for sequence data: prediction and dissimilarities," Working Papers 113, "Carlo F. Dondena" Centre for Research on Social Dynamics (DONDENA), Università Commerciale Luigi Bocconi.
    11. Struffolino, Emanuela, 2019. "Navigating the early career: The social stratification of young workers’ employment trajectories in Italy," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 63, pages 1-17.
    12. Hou, Lei & Elsworth, Derek & Zhang, Fengshou & Wang, Zhiyuan & Zhang, Jianbo, 2023. "Evaluation of proppant injection based on a data-driven approach integrating numerical and ensemble learning models," Energy, Elsevier, vol. 264(C).
    13. Marcel Raab & Emanuela Struffolino, 2020. "The Heterogeneity of Partnership Trajectories to Childlessness in Germany," European Journal of Population, Springer;European Association for Population Studies, vol. 36(1), pages 53-70, March.
    14. Ma, Zhikai & Huo, Qian & Wang, Wei & Zhang, Tao, 2023. "Voltage-temperature aware thermal runaway alarming framework for electric vehicles via deep learning with attention mechanism in time-frequency domain," Energy, Elsevier, vol. 278(C).
    15. Patrick Krennmair & Timo Schmid, 2022. "Flexible domain prediction using mixed effects random forests," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1865-1894, November.
    16. Julia Mikolai & Hill Kulu, 2019. "Union dissolution and housing trajectories in Britain," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(7), pages 161-196.
    17. repec:jss:jstsof:40:i04 is not listed on IDEAS
    18. Jie Shi & Arno P. J. M. Siebes & Siamak Mehrkanoon, 2023. "TransCORALNet: A Two-Stream Transformer CORAL Networks for Supply Chain Credit Assessment Cold Start," Papers 2311.18749, arXiv.org.
    19. Zachary Van Winkle & Anette Fasang, 2021. "The complexity of employment and family life courses across 20th century Europe: More evidence for larger cross-national differences but little change across 1916‒1966 birth cohorts," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 44(32), pages 775-810.
    20. Babette Bühler & Katja Möhring & Andreas P. Weiland, 2022. "Assessing dissimilarity of employment history information from survey and administrative data using sequence analysis techniques," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4747-4774, December.
    21. Marcantonio Caltabiano & Silvia Meggiolaro & Valentina Tocchioni, 2023. "The impact of parental separation on the pattern of transition to adulthood in Italy," Econometrics Working Papers Archive 2023_07, Universita' degli Studi di Firenze, Dipartimento di Statistica, Informatica, Applicazioni "G. Parenti".
    22. Bourdouxhe, Axel & Wibail, Lionel & Claessens, Hugues & Dufrêne, Marc, 2023. "Modeling potential natural vegetation: A new light on an old concept to guide nature conservation in fragmented and degraded landscapes," Ecological Modelling, Elsevier, vol. 481(C).
