IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/kj8d5.html
   My bibliography  Save this paper

Typologies in Sequence Analysis: Practical Guidelines for Identifying Robust Cluster Solutions

Author

Listed:
  • Andrade, Stefan B.
  • Fasang, Anette Eva
  • Helske, Satu
  • Karhula, Aleksi

Abstract

Sequence analysis in the social sciences heavily relies on cluster techniques to identify typologies. Clustering techniques and statistical cluster cut-off criteria for selecting the optimal number of clusters have greatly improved. In contrast, we lack a systematic assessment of how data features, such as the sequence sample size, the number of time points in the sequences, and the number of distinct states in the sequence alphabet might systematically impact the identification of sequence typologies. Drawing on both simulated data from mixture Markov models and real data from the German Family Panel survey, we provide best-practice guidelines for applied researchers to gauge whether their data is sufficient for extracting robust sequence typologies, if they empirically exist. Sequence typologies are most robust for samples with at least 500 sequences, sequence lengths greater than 10 time points, and state alphabets that have at least as many states as the “true” number of clusters.

Suggested Citation

  • Andrade, Stefan B. & Fasang, Anette Eva & Helske, Satu & Karhula, Aleksi, 2023. "Typologies in Sequence Analysis: Practical Guidelines for Identifying Robust Cluster Solutions," SocArXiv kj8d5, Center for Open Science.
  • Handle: RePEc:osf:socarx:kj8d5
    DOI: 10.31219/osf.io/kj8d5
    as

    Download full text from publisher

    File URL: https://osf.io/download/65252d329b0cf30107786f89/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/kj8d5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Piccarreta, Raffaella & Struffolino, Emanuela, 2019. "An Integrated Heuristic for Validation in Sequence Analysis," SocArXiv v7mj8, Center for Open Science.
    2. Matthias Studer & Gilbert Ritschard, 2016. "What matters in differences between life trajectories: a comparative review of sequence dissimilarity measures," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 481-511, February.
    3. Marcel Raab & Anette Fasang & Aleksi Karhula & Jani Erola, 2014. "Sibling Similarity in Family Formation," Demography, Springer;Population Association of America (PAA), vol. 51(6), pages 2127-2154, December.
    4. Raab, Marcel & Fasang, Anette Eva & Karhula, Aleksi & Erola, Jani, 2014. "Sibling Similarity in Family Formation," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 51(6), pages 2127-2154.
    5. Nicola Barban & Francesco C. Billari, 2012. "Classifying life course trajectories: a comparison of latent class and sequence analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 61(5), pages 765-784, November.
    6. Rannveig Kaldager Hart, 2019. "Union Histories of Dissolution: What Can They Say About Childlessness?," European Journal of Population, Springer;European Association for Population Studies, vol. 35(1), pages 101-131, February.
    7. Cees H. Elzinga & Aart C. Liefbroer, 2007. "De-standardization of Family-Life Trajectories of Young Adults: A Cross-National Comparison Using Sequence Analysis," European Journal of Population, Springer;European Association for Population Studies, vol. 23(3), pages 225-250, October.
    8. Karhula, Aleksi & Erola, Jani & Raab, Marcel & Fasang, Anette Eva, 2019. "Destination as a process: Sibling similarity in early socioeconomic trajectories," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 40, pages 85-98.
    9. Raffaella Piccarreta & Francesco C. Billari, 2007. "Clustering work and family trajectories by using a divisive algorithm," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 1061-1078, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liao, Tim F. & Bolano, Danilo & Brzinsky-Fay, Christian & Cornwell, Benjamin & Fasang, Anette Eva & Helske, Satu & Piccarreta, Raffaella & Raab, Marcel & Ritschard, Gilbert & Struffolino, Emanuela & S, 2022. "Sequence analysis: Its past, present, and future," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 107, pages 1-1.
    2. Nicola Barban, 2013. "Family Trajectories and Health: A Life Course Perspective [Trajectoires familiales et santé: une approche sous l’angle de parcours de vie]," European Journal of Population, Springer;European Association for Population Studies, vol. 29(4), pages 357-385, November.
    3. Mitrofanova, Ekaterina S. & Artamonova, Alyona V., 2016. "Studying Family Formation Trajectories’ Deinstitutionalization in Russia Using Sequence Analysis," MPRA Paper 82877, University Library of Munich, Germany.
    4. Marcel Raab & Emanuela Struffolino, 2020. "The Heterogeneity of Partnership Trajectories to Childlessness in Germany," European Journal of Population, Springer;European Association for Population Studies, vol. 36(1), pages 53-70, March.
    5. Júlia Mikolai & Hill Kulu, 2019. "Union dissolution and housing trajectories in Britain," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(7), pages 161-196.
    6. Marc A. Scott & Kaushik Mohan & Jacques‐Antoine Gauthier, 2020. "Model‐based clustering and analysis of life history data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1231-1251, June.
    7. Devillanova, Carlo & Raitano, Michele & Struffolino, Emanuela, 2019. "Longitudinal employment trajectories and health in middle life: Insights from linked administrative and survey data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 40, pages 1375-1412.
    8. Cees H. Elzinga & Matthias Studer, 2019. "Normalization of Distance and Similarity in Sequence Analysis," Sociological Methods & Research, , vol. 48(4), pages 877-904, November.
    9. Okka Zimmermann & Nicole Hameister, 2019. "Stable cohabitational unions increase quality of life: Retrospective analysis of partnership histories also reveals gender differences," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 40(24), pages 657-692.
    10. Luca Maria Pesando & Nicola Barban & Maria Sironi & Frank F. Furstenberg, 2021. "A Sequence‐Analysis Approach to the Study of the Transition to Adulthood in Low‐ and Middle‐Income Countries," Population and Development Review, The Population Council, Inc., vol. 47(3), pages 719-747, September.
    11. Arthur Kaboth & Lena Hünefeld & Ralf Himmelreicher, 2023. "Employment trajectories of workers in low-skilled jobs in Western Germany," Journal for Labour Market Research, Springer;Institute for Employment Research/ Institut für Arbeitsmarkt- und Berufsforschung (IAB), vol. 57(1), pages 1-17, December.
    12. Zachary Winkle, 2018. "Family Trajectories Across Time and Space: Increasing Complexity in Family Life Courses in Europe?," Demography, Springer;Population Association of America (PAA), vol. 55(1), pages 135-164, February.
    13. N. Barban & X. de Luna & E. Lundholm & I. Svensson & F. C. Billari, 2020. "Causal Effects of the Timing of Life-course Events: Age at Retirement and Subsequent Health," Sociological Methods & Research, , vol. 49(1), pages 216-249, February.
    14. Lídia Montero & Lucía Mejía-Dorantes & Jaume Barceló, 2023. "Applying Data Analytics to Analyze Activity Sequences for an Assessment of Fragmentation in Daily Travel Patterns: A Case Study of the Metropolitan Region of Barcelona," Sustainability, MDPI, vol. 15(19), pages 1-22, September.
    15. Lídia Montero & Lucía Mejía-Dorantes & Jaume Barceló, 2024. "Land Use, Travel Patterns and Gender in Barcelona: A Sequence Analysis Approach," Sustainability, MDPI, vol. 16(20), pages 1-26, October.
    16. Sara Kalucza & Sergi Vidal & Karina Nilsson, 2021. "Intergenerational persistence of family formation trajectories among teenage-mothers and -fathers in Sweden," Journal of Population Research, Springer, vol. 38(3), pages 259-282, September.
    17. Piccarreta, Raffaella & Bonetti, Marco, 2019. "Assessing and comparing models for sequence data by microsimulation (with Supplementary Material)," SocArXiv 3mcfp, Center for Open Science.
    18. Pujadas-Mora, Joana-Maria & Brea-Martinez, Gabriel, 2020. "The increasing influence of siblings in social mobility. A long-term historical view (Barcelona area, 16th-19th centuries)," SocArXiv sf6vj, Center for Open Science.
    19. Brienna Perelli-Harris & Laura Bernardi, 2015. "Exploring social norms around cohabitation: The life course, individualization, and culture," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 33(25), pages 701-732.
    20. Maria Sironi & Nicola Barban & Roberto Impiacciatore, 2013. "The Role of Parental Social Class in the Transition to Adulthood: A Sequence Analysis Approach in Italy and the United States," Working Papers 059, "Carlo F. Dondena" Centre for Research on Social Dynamics (DONDENA), Università Commerciale Luigi Bocconi.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:kj8d5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.