IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v183y2020i3p1231-1251.html
   My bibliography  Save this article

Model‐based clustering and analysis of life history data

Author

Listed:
  • Marc A. Scott
  • Kaushik Mohan
  • Jacques‐Antoine Gauthier

Abstract

Methods and models for longitudinal data with categorical, multi‐dimensional outcomes are quite limited, but they are essential to the study of life histories. For example, in the Swiss Household Panel, information on the co‐residence and professional status of several thousand individuals is available through to age 45 years. Interest centres on the time and order of life course events such as having children and working full or part time and the duration of the phases that they delineate. With data of this type, optimal matching and clustering algorithms relying on a distance metric or parametric models of duration in a competing risks framework are used; the appropriateness of each derives from competing goals and orientation. We prefer model‐based approaches when certain goals are paramount: simulation of individual trajectories; adjusting for time‐dependent covariates; handling multistate trajectories and missing outcomes. Several of these goals are particularly challenging when the number of states is of moderate size, and many transitions are infrequent and/or time inhomogeneous. Using the Swiss Household Panel, we demonstrate the appropriateness of latent class growth curve models for analysing sequence data. In particular, models including heterogeneous dependence structure provide new techniques for assessing goodness of fit as well as yield insights into social processes.

Suggested Citation

  • Marc A. Scott & Kaushik Mohan & Jacques‐Antoine Gauthier, 2020. "Model‐based clustering and analysis of life history data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1231-1251, June.
  • Handle: RePEc:bla:jorssa:v:183:y:2020:i:3:p:1231-1251
    DOI: 10.1111/rssa.12575
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12575
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12575?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Matthias Studer & Gilbert Ritschard, 2016. "What matters in differences between life trajectories: a comparative review of sequence dissimilarity measures," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 481-511, February.
    2. Marc A. Scott, 2011. "Affinity models for career sequences," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(3), pages 417-436, May.
    3. Leisch, Friedrich, 2004. "FlexMix: A General Framework for Finite Mixture Models and Latent Class Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i08).
    4. Nicola Barban & Francesco C. Billari, 2012. "Classifying life course trajectories: a comparison of latent class and sequence analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 61(5), pages 765-784, November.
    5. Grün, Bettina & Leisch, Friedrich, 2008. "FlexMix Version 2: Finite Mixtures with Concomitant Variables and Varying and Constant Parameters," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i04).
    6. Paas, L.J. & Vermunt, J.K. & Bijmolt, T.H.A., 2007. "Discrete-time discrete-state latent Markov modelling for assessing and predicting household acquisitions of financial products," Other publications TiSEM 5781ab33-6687-4ad5-b57a-3, Tilburg University, School of Economics and Management.
    7. Gabadinho, Alexis & Ritschard, Gilbert & Müller, Nicolas S & Studer, Matthias, 2011. "Analyzing and Visualizing State Sequences in R with TraMineR," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i04).
    8. Leonard J. Paas & Jeroen K. Vermunt & Tammo H. A. Bijmolt, 2007. "Discrete time, discrete state latent Markov modelling for assessing and predicting household acquisitions of financial products," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 955-974, October.
    9. Dehnert, M. & Helm, W.E. & Hütt, M.-Th., 2003. "A discrete autoregressive process as a model for short-range correlations in DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 327(3), pages 535-553.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marc A. Scott & Jean-Marie Goff & Jacques-Antoine Gauthier, 2024. "History matters: the statistical modelling of the life course," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 445-469, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marc A. Scott & Jean-Marie Goff & Jacques-Antoine Gauthier, 2024. "History matters: the statistical modelling of the life course," Quality & Quantity: International Journal of Methodology, Springer, vol. 58(1), pages 445-469, February.
    2. Liao, Tim F. & Bolano, Danilo & Brzinsky-Fay, Christian & Cornwell, Benjamin & Fasang, Anette Eva & Helske, Satu & Piccarreta, Raffaella & Raab, Marcel & Ritschard, Gilbert & Struffolino, Emanuela & S, 2022. "Sequence analysis: Its past, present, and future," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 107, pages 1-1.
    3. Marcel Raab & Emanuela Struffolino, 2020. "The Heterogeneity of Partnership Trajectories to Childlessness in Germany," European Journal of Population, Springer;European Association for Population Studies, vol. 36(1), pages 53-70, March.
    4. Júlia Mikolai & Hill Kulu, 2019. "Union dissolution and housing trajectories in Britain," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(7), pages 161-196.
    5. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    6. Lebret, Rémi & Iovleff, Serge & Langrognet, Florent & Biernacki, Christophe & Celeux, Gilles & Govaert, Gérard, 2015. "Rmixmod: The R Package of the Model-Based Unsupervised, Supervised, and Semi-Supervised Classification Mixmod Library," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i06).
    7. Grün, Bettina & Kosmidis, Ioannis & Zeileis, Achim, 2012. "Extended Beta Regression in R: Shaken, Stirred, Mixed, and Partitioned," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i11).
    8. Babette Bühler & Katja Möhring & Andreas P. Weiland, 2022. "Assessing dissimilarity of employment history information from survey and administrative data using sequence analysis techniques," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4747-4774, December.
    9. Devillanova, Carlo & Raitano, Michele & Struffolino, Emanuela, 2019. "Longitudinal employment trajectories and health in middle life: Insights from linked administrative and survey data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, pages 1375-1412.
    10. Frick, Hannah & Strobl, Carolin & Leisch, Friedrich & Zeileis, Achim, 2012. "Flexible Rasch Mixture Models with Package psychomix," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i07).
    11. Lisa Toczek & Hans Bosma & Richard Peter, 2022. "Early retirement intentions: the impact of employment biographies, work stress and health among a baby-boomer generation," European Journal of Ageing, Springer, vol. 19(4), pages 1479-1491, December.
    12. Michal Engelman & Heide Jackson, 2019. "Gradual Change, Homeostasis, and Punctuated Equilibrium: Reconsidering Patterns of Health in Later Life," Demography, Springer;Population Association of America (PAA), vol. 56(6), pages 2323-2347, December.
    13. Kandt, Jens & Leak, Alistair, 2019. "Examining inclusive mobility through smartcard data: What shall we make of senior citizens' declining bus patronage in the West Midlands?," Journal of Transport Geography, Elsevier, vol. 79(C), pages 1-1.
    14. Arthur Kaboth & Lena Hünefeld & Ralf Himmelreicher, 2023. "Employment trajectories of workers in low-skilled jobs in Western Germany," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 57(1), pages 1-17, December.
    15. Maik Dehnert & Josephine Schumann, 2022. "Uncovering the digitalization impact on consumer decision-making for checking accounts in banking," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(3), pages 1503-1528, September.
    16. Solomon Zena Walelign & Mariève Pouliot & Helle Overgaard Larsen & Carsten Smith-Hall, 2015. "A novel approach to dynamic livelihood clustering: Empirical evidence from Nepal," IFRO Working Paper 2015/09, University of Copenhagen, Department of Food and Resource Economics.
    17. Prates, Marcos Oliveira & Lachos, Victor Hugo & Barbosa Cabral, Celso Rômulo, 2013. "mixsmsn: Fitting Finite Mixture of Scale Mixture of Skew-Normal Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 54(i12).
    18. Papastamoulis, Panagiotis & Martin-Magniette, Marie-Laure & Maugis-Rabusseau, Cathy, 2016. "On the estimation of mixtures of Poisson regression models with large number of components," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 97-106.
    19. Madero-Cabib, Ignacio & Biehl, Andres, 2021. "Lifetime employment–coresidential trajectories and extended working life in Chile," The Journal of the Economics of Ageing, Elsevier, vol. 19(C).
    20. Salvatore Ingrassia & Antonio Punzo, 2020. "Cluster Validation for Mixtures of Regressions via the Total Sum of Squares Decomposition," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 526-547, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:183:y:2020:i:3:p:1231-1251. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.