IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v33y2016i1d10.1007_s00357-016-9195-5.html
   My bibliography  Save this article

Divisive Latent Class Modeling as a Density Estimation Method for Categorical Data

Author

Listed:
  • Daniël W. Palm

    (Cito)

  • L. Andries Ark

    (University of Amsterdam)

  • Jeroen K. Vermunt

    (Tilburg University)

Abstract

Traditionally latent class (LC) analysis is used by applied researchers as a tool for identifying substantively meaningful clusters. More recently, LC models have also been used as a density estimation tool for categorical variables. We introduce a divisive LC (DLC) model as a density estimation tool that may offer several advantages in comparison to a standard LC model. When using an LC model for density estimation, a considerable number of increasingly large LC models may have to be estimated before sufficient model-fit is achieved. A DLC model consists of a sequence of small LC models. Therefore, a DLC model can be estimated much faster and can easily utilize multiple processor cores, meaning that this model is more widely applicable and practical. In this study we describe the algorithm of fitting a DLC model, and discuss the various settings that indirectly influence the precision of a DLC model as a density estimation tool. These settings are illustrated using a synthetic data example, and the best performing algorithm is applied to a real-data example. The generated data example showed that, using specific decision rules, a DLC model is able to correctly model complex associations amongst categorical variables.

Suggested Citation

  • Daniël W. Palm & L. Andries Ark & Jeroen K. Vermunt, 2016. "Divisive Latent Class Modeling as a Density Estimation Method for Categorical Data," Journal of Classification, Springer;The Classification Society, vol. 33(1), pages 52-72, April.
  • Handle: RePEc:spr:jclass:v:33:y:2016:i:1:d:10.1007_s00357-016-9195-5
    DOI: 10.1007/s00357-016-9195-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00357-016-9195-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00357-016-9195-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Herbert Hoijtink & Annelise Notenboom, 2004. "Model based clustering of large data sets: Tracing the development of spelling ability," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 481-498, September.
    2. Linzer, Drew A., 2011. "Reliable Inference in Highly Stratified Contingency Tables: Using Latent Class Models as Density Estimators," Political Analysis, Cambridge University Press, vol. 19(2), pages 173-187, April.
    3. Pascal Hattum & Herbert Hoijtink, 2009. "Market Segmentation Using Brand Strategy Research: Bayesian Inference with Respect to Mixtures of Log-Linear Models," Journal of Classification, Springer;The Classification Society, vol. 26(3), pages 297-328, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Boeschoten Laura & Oberski Daniel & de Waal Ton, 2017. "Estimating Classification Errors Under Edit Restrictions in Composite Survey-Register Data Using Multiple Imputation Latent Class Modelling (MILC)," Journal of Official Statistics, Sciendo, vol. 33(4), pages 921-962, December.
    2. Douglas L. Steinley, 2018. "Editorial," Journal of Classification, Springer;The Classification Society, vol. 35(1), pages 1-4, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Davide Vidotto & Jeroen K. Vermunt & Katrijn van Deun, 2018. "Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data," Journal of Educational and Behavioral Statistics, , vol. 43(5), pages 511-539, October.
    2. van der Ark, L. Andries, 2012. "New Developments in Mokken Scale Analysis in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i05).
    3. Linzer, Drew A. & Lewis, Jeffrey B., 2011. "poLCA: An R Package for Polytomous Variable Latent Class Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i10).
    4. Matthieu Marbac & Christophe Biernacki & Vincent Vandewalle, 2015. "Model-Based Clustering for Conditionally Correlated Categorical Data," Journal of Classification, Springer;The Classification Society, vol. 32(2), pages 145-175, July.
    5. Pascal Hattum & Herbert Hoijtink, 2009. "Market Segmentation Using Brand Strategy Research: Bayesian Inference with Respect to Mixtures of Log-Linear Models," Journal of Classification, Springer;The Classification Society, vol. 26(3), pages 297-328, December.
    6. Matthieu Marbac & Christophe Biernacki & Vincent Vandewalle, 2016. "Latent class model with conditional dependency per modes to cluster categorical data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(2), pages 183-207, June.
    7. Chiheb-Eddine N’Cir & Nadia Essoussi & Mohamed Limam, 2015. "Kernel-Based Methods to Identify Overlapping Clusters with Linear and Nonlinear Boundaries," Journal of Classification, Springer;The Classification Society, vol. 32(2), pages 176-211, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:33:y:2016:i:1:d:10.1007_s00357-016-9195-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.