IDEAS home Printed from https://ideas.repec.org/a/spr/alstar/v102y2018i1d10.1007_s10182-017-0294-3.html
   My bibliography  Save this article

A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications

Author

Listed:
  • Leila Amiri

    (Shahid Beheshti University)

  • Mojtaba Khazaei

    (Shahid Beheshti University)

  • Mojtaba Ganjali

    (Shahid Beheshti University)

Abstract

Latent variable models are widely used for jointly modeling of mixed data including nominal, ordinal, count and continuous data. In this paper, we consider a latent variable model for jointly modeling relationships between mixed binary, count and continuous variables with some observed covariates. We assume that, given a latent variable, mixed variables of interest are independent and count and continuous variables have Poisson distribution and normal distribution, respectively. As such data may be extracted from different subpopulations, consideration of an unobserved heterogeneity has to be taken into account. A mixture distribution is considered (for the distribution of the latent variable) which accounts the heterogeneity. The generalized EM algorithm which uses the Newton–Raphson algorithm inside the EM algorithm is used to compute the maximum likelihood estimates of parameters. The standard errors of the maximum likelihood estimates are computed by using the supplemented EM algorithm. Analysis of the primary biliary cirrhosis data is presented as an application of the proposed model.

Suggested Citation

  • Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2018. "A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(1), pages 95-115, January.
  • Handle: RePEc:spr:alstar:v:102:y:2018:i:1:d:10.1007_s10182-017-0294-3
    DOI: 10.1007/s10182-017-0294-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10182-017-0294-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10182-017-0294-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Irini Moustaki & Martin Knott, 2000. "Generalized latent trait models," Psychometrika, Springer;The Psychometric Society, vol. 65(3), pages 391-411, September.
    2. Yiu-Fai Yung, 1997. "Finite mixtures in confirmatory factor-analysis models," Psychometrika, Springer;The Psychometric Society, vol. 62(3), pages 297-330, September.
    3. Liu, Xuefeng & Daniels, Michael J. & Marcus, Bess, 2009. "Joint Models for the Association of Longitudinal Binary and Continuous Processes With Application to a Smoking Cessation Trial," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 429-438.
    4. de Leon, A.R., 2005. "Pairwise likelihood approach to grouped continuous model and its extension," Statistics & Probability Letters, Elsevier, vol. 75(1), pages 49-57, November.
    5. Zhang, Xiao & Boscardin, W. John & Belin, Thomas R. & Wan, Xiaohai & He, Yulei & Zhang, Kui, 2015. "A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 43-58.
    6. Mary Dupuis Sammel & Louise M. Ryan & Julie M. Legler, 1997. "Latent Variable Models for Mixed Discrete and Continuous Outcomes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(3), pages 667-678.
    7. M. Jamshidian & R. I. Jennrich, 2000. "Standard errors for EM estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(2), pages 257-270.
    8. Cai, Jing-Heng & Song, Xin-Yuan & Lam, Kwok-Hap & Ip, Edward Hak-Sing, 2011. "A mixture of generalized latent variable models for mixed mode and heterogeneous data," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2889-2907, November.
    9. Silvia Cagnone & Cinzia Viroli, 2014. "A factor mixture model for analyzing heterogeneity and cognitive structure of dementia," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(1), pages 1-20, January.
    10. Wai-Yin Poon & Sik-Yum Lee, 1987. "Maximum likelihood estimation of multivariate polyserial and polychoric correlation coefficients," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 409-430, September.
    11. Philippe Huber & Elvezio Ronchetti & Maria‐Pia Victoria‐Feser, 2004. "Estimation of generalized linear latent variable models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(4), pages 893-908, November.
    12. D. B. Dunson, 2000. "Bayesian latent variable models for clustered mixed outcomes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(2), pages 355-366.
    13. Bengt Muthén & Kerby Shedden, 1999. "Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm," Biometrics, The International Biometric Society, vol. 55(2), pages 463-469, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cai, Jing-Heng & Song, Xin-Yuan & Lam, Kwok-Hap & Ip, Edward Hak-Sing, 2011. "A mixture of generalized latent variable models for mixed mode and heterogeneous data," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2889-2907, November.
    2. Hao Bai & Yuan Zhong & Xin Gao & Wei Xu, 2020. "Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method," Stats, MDPI, vol. 3(3), pages 1-18, July.
    3. Zhang, Q. & Ip, E.H., 2014. "Variable assessment in latent class models," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 146-156.
    4. Zhang, Xiao & Boscardin, W. John & Belin, Thomas R. & Wan, Xiaohai & He, Yulei & Zhang, Kui, 2015. "A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 43-58.
    5. Silvia Cagnone & Cinzia Viroli, 2014. "A factor mixture model for analyzing heterogeneity and cognitive structure of dementia," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(1), pages 1-20, January.
    6. Jenni Niku & David I. Warton & Francis K. C. Hui & Sara Taskinen, 2017. "Generalized Linear Latent Variable Models for Multivariate Count and Biomass Data in Ecology," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(4), pages 498-522, December.
    7. Fokoué, Ernest, 2005. "Mixtures of factor analyzers: an extension with covariates," Journal of Multivariate Analysis, Elsevier, vol. 95(2), pages 370-384, August.
    8. Ling Zhou & Huazhen Lin & Yi-Chen Lin, 2016. "Education, Intelligence, and Well-Being: Evidence from a Semiparametric Latent Variable Transformation Model for Multiple Outcomes of Mixed Types," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 125(3), pages 1011-1033, February.
    9. Yang Lu, 2019. "Flexible (panel) regression models for bivariate count–continuous data with an insurance application," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1503-1521, October.
    10. Yanyuan Ma & Marc G. Genton, 2010. "Explicit estimating equations for semiparametric generalized linear latent variable models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(4), pages 475-495, September.
    11. Emilio Augusto Coelho-Barros & Jorge Alberto Achcar & Josmar Mazucheli, 2010. "Longitudinal Poisson modeling: an application for CD4 counting in HIV-infected patients," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(5), pages 865-880.
    12. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    13. Zhou, Xingcai & Liu, Xinsheng, 2008. "The EM algorithm for the extended finite mixture of the factor analyzers model," Computational Statistics & Data Analysis, Elsevier, vol. 52(8), pages 3939-3953, April.
    14. Luo, Chongliang & Liang, Jian & Li, Gen & Wang, Fei & Zhang, Changshui & Dey, Dipak K. & Chen, Kun, 2018. "Leveraging mixed and incomplete outcomes via reduced-rank modeling," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 378-394.
    15. Hoshino, Takahiro, 2008. "A Bayesian propensity score adjustment for latent variable modeling and MCMC algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1413-1429, January.
    16. David B. Dunson & M. Watson & Jack A. Taylor, 2003. "Bayesian Latent Variable Models for Median Regression on Multiple Outcomes," Biometrics, The International Biometric Society, vol. 59(2), pages 296-304, June.
    17. Silvia Cagnone & Cinzia Viroli, 2018. "Multivariate latent variable transition models of longitudinal mixed data: an analysis on alcohol use disorder," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(5), pages 1399-1418, November.
    18. Florian Schuberth & Jörg Henseler & Theo K. Dijkstra, 2018. "Partial least squares path modeling using ordinal categorical indicators," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(1), pages 9-35, January.
    19. Katsikatsou, Myrsini & Moustaki, Irini & Md Jamil, Haziq, 2022. "Pairwise likelihood estimation for confirmatory factor analysis models with categorical variables and data that are missing at random," LSE Research Online Documents on Economics 108933, London School of Economics and Political Science, LSE Library.
    20. Robin Fuchs & Denys Pommeret & Cinzia Viroli, 2022. "Mixed Deep Gaussian Mixture Model: a clustering model for mixed datasets," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(1), pages 31-53, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:alstar:v:102:y:2018:i:1:d:10.1007_s10182-017-0294-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.