IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v30y2021i2d10.1007_s11749-020-00727-x.html
   My bibliography  Save this article

Dimension reduction for longitudinal multivariate data by optimizing class separation of projected latent Markov models

Author

Listed:
  • Alessio Farcomeni

    (University of Rome “Tor Vergata”)

  • Monia Ranalli

    (Sapienza - University of Rome)

  • Sara Viviani

    (Viale delle Terme di Caracalla)

Abstract

We present a method for dimension reduction of multivariate longitudinal data, where new variables are assumed to follow a latent Markov model. New variables are obtained as linear combinations of the multivariate outcome as usual. Weights of each linear combination maximize a measure of separation of the latent intercepts, subject to orthogonality constraints. We evaluate our proposal in a simulation study and illustrate it using an EU-level data set on income and living conditions, where dimension reduction leads to an optimal scoring system for material deprivation. An R implementation of our approach can be downloaded from https://github.com/afarcome/LMdim.

Suggested Citation

  • Alessio Farcomeni & Monia Ranalli & Sara Viviani, 2021. "Dimension reduction for longitudinal multivariate data by optimizing class separation of projected latent Markov models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 462-480, June.
  • Handle: RePEc:spr:testjl:v:30:y:2021:i:2:d:10.1007_s11749-020-00727-x
    DOI: 10.1007/s11749-020-00727-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-020-00727-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-020-00727-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jung, Robert C. & Liesenfeld, Roman & Richard, Jean-François, 2011. "Dynamic Factor Models for Multivariate Count Data: An Application to Stock-Market Trading Activity," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 73-85.
    2. Robert C. Jung & Roman Liesenfeld & Jean-François Richard, 2011. "Dynamic Factor Models for Multivariate Count Data: An Application to Stock-Market Trading Activity," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 73-85, January.
    3. Xinyuan Song & Yemao Xia & Hongtu Zhu, 2017. "Hidden Markov latent variable models with multivariate longitudinal data," Biometrics, The International Biometric Society, vol. 73(1), pages 313-323, March.
    4. Giulia Barbati & Alessio Farcomeni, 2018. "Prognostic assessment of repeatedly measured time-dependent biomarkers, with application to dilated cardiomyopathy," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 545-557, August.
    5. Francesco Dotto & Alessio Farcomeni & Maria Grazia Pittau & Roberto Zelli, 2019. "A dynamic inhomogeneous latent state model for measuring material deprivation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(2), pages 495-516, February.
    6. Douglas Steinley & Robert Henson, 2005. "OCLUS: An Analytic Method for Generating Clusters with Known Overlap," Journal of Classification, Springer;The Classification Society, vol. 22(2), pages 221-250, September.
    7. Hong, Yili, 2013. "On computing the distribution function for the Poisson binomial distribution," Computational Statistics & Data Analysis, Elsevier, vol. 59(C), pages 41-51.
    8. de Leeuw, Jan, 2006. "Principal component analysis of binary data by iterated singular value decomposition," Computational Statistics & Data Analysis, Elsevier, vol. 50(1), pages 21-39, January.
    9. Scrucca, Luca, 2013. "GA: A Package for Genetic Algorithms in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i04).
    10. Hector E. Najera Catalan, 2017. "Multiple Deprivation, Severity and Latent Sub-Groups: Advantages of Factor Mixture Modelling for Analysing Material Deprivation," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 131(2), pages 681-700, March.
    11. Andrade, Dalton F. & Tavares, Heliton R., 2005. "Item response theory for longitudinal data: population parameter estimation," Journal of Multivariate Analysis, Elsevier, vol. 95(1), pages 1-22, July.
    12. Maria Marino & Marco Alfó, 2015. "Latent drop-out based transitions in linear quantile hidden Markov models for longitudinal responses with attrition," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 9(4), pages 483-502, December.
    13. Francesco Bartolucci & Alessio Farcomeni, 2015. "A discrete time event-history approach to informative drop-out in mixed latent Markov models with covariates," Biometrics, The International Biometric Society, vol. 71(1), pages 80-89, March.
    14. Deheuvels, Paul & Puri, Madan L. & Ralescu, Stefan S., 1989. "Asymptotic expansions for sums of nonidentically distributed Bernoulli random variables," Journal of Multivariate Analysis, Elsevier, vol. 28(2), pages 282-303, February.
    15. Yingye Zheng & Patrick Heagerty, 2004. "Semiparametric Estimation of Time-Dependent: ROC Curves for Longitudinal Marker Data," UW Biostatistics Working Paper Series 1052, Berkeley Electronic Press.
    16. F. Bartolucci & A. Farcomeni & F. Pennoni, 2014. "Rejoinder on: Latent Markov models: a review of a general framework for the analysis of longitudinal data with covariates," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(3), pages 484-486, September.
    17. Tomohiro Ando & Jushan Bai, 2017. "Clustering Huge Number of Financial Time Series: A Panel Data Approach With High-Dimensional Predictors and Factor Structures," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1182-1198, July.
    18. F. Bartolucci & A. Farcomeni & F. Pennoni, 2014. "Latent Markov models: a review of a general framework for the analysis of longitudinal data with covariates," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(3), pages 433-465, September.
    19. Xia, Ye-Mao & Tang, Nian-Sheng & Gou, Jian-Wei, 2016. "Generalized linear latent models for multivariate longitudinal measurements mixed with hidden Markov models," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 259-275.
    20. Dias, José G. & Vermunt, Jeroen K. & Ramos, Sofia, 2015. "Clustering financial time series: New insights from an extended hidden Markov model," European Journal of Operational Research, Elsevier, vol. 243(3), pages 852-864.
    21. A. Atkinson, 2003. "Multidimensional Deprivation: Contrasting Social Welfare and Counting Approaches," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 1(1), pages 51-65, April.
    22. Antonello Maruotti, 2015. "Handling non-ignorable dropouts in longitudinal data: a conditional model based on a latent Markov heterogeneity structure," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 84-109, March.
    23. Alessio Farcomeni, 2015. "Generalized Linear Mixed Models Based on Latent Markov Heterogeneity Structures," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(4), pages 1127-1135, December.
    24. Jushan Bai & Peng Wang, 2015. "Identification and Bayesian Estimation of Dynamic Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(2), pages 221-240, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Esther Acquah & Lorenzo Carbonari & Alessio Farcomeni & Giovanni Trovato, 2023. "Institutions and economic development: new measurements and evidence," Empirical Economics, Springer, vol. 65(4), pages 1693-1728, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francesco Dotto & Alessio Farcomeni & Maria Grazia Pittau & Roberto Zelli, 2019. "A dynamic inhomogeneous latent state model for measuring material deprivation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(2), pages 495-516, February.
    2. Esther Acquah & Lorenzo Carbonari & Alessio Farcomeni & Giovanni Trovato, 2023. "Institutions and economic development: new measurements and evidence," Empirical Economics, Springer, vol. 65(4), pages 1693-1728, October.
    3. Gordon Anderson & Alessio Farcomeni & Maria Grazia Pittau & Roberto Zelli, 2019. "Rectangular latent Markov models for time‐specific clustering, with an analysis of the wellbeing of nations," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(3), pages 603-621, April.
    4. Francesco Bartolucci & Alessio Farcomeni, 2022. "A hidden Markov space–time model for mapping the dynamics of global access to food," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 246-266, January.
    5. Marino, Maria Francesca & Alfó, Marco, 2016. "Gaussian quadrature approximations in mixed hidden Markov models for longitudinal data: A simulation study," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 193-209.
    6. Fokianos, Konstantinos & Fried, Roland & Kharin, Yuriy & Voloshko, Valeriy, 2022. "Statistical analysis of multivariate discrete-valued time series," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    7. Tullio, Federico & Bartolucci, Francesco, 2019. "Evaluating time-varying treatment effects in latent Markov models: An application to the effect of remittances on poverty dynamics," MPRA Paper 91459, University Library of Munich, Germany.
    8. Gordon Anderson & Alessio Farcomeni & Grazia Pittau & Roberto Zelli, 2017. "Rectangular latent Markov models for time-specific clustering," Working Papers tecipa-589, University of Toronto, Department of Economics.
    9. Gordon Anderson, Alessio Farcomeni, Maria Grazia Pittau and Roberto Zelli, 2019. "Multidimensional Nation Wellbeing, More Equal yet More Polarized: An Analysis of the Progress of Human Development Since 1990," Journal of Economic Development, Chung-Ang Unviersity, Department of Economics, vol. 44(1), pages 1-22, March.
    10. Fulvia Pennoni & Ewa Genge, 2020. "Analysing the course of public trust via hidden Markov models: a focus on the Polish society," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(2), pages 399-425, June.
    11. Alessio Farcomeni, 2015. "Generalized Linear Mixed Models Based on Latent Markov Heterogeneity Structures," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(4), pages 1127-1135, December.
    12. Younghoon Kim & Zachary F. Fisher & Vladas Pipiras, 2023. "Latent Gaussian dynamic factor modeling and forecasting for multivariate count time series," Papers 2307.10454, arXiv.org.
    13. Roberto Mari & Antonello Maruotti, 2022. "A two-step estimator for generalized linear models for longitudinal data with time-varying measurement error," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 273-300, June.
    14. Fulvia Pennoni & Beata Bal-Domańska, 2022. "NEETs and Youth Unemployment: A Longitudinal Comparison Across European Countries," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 162(2), pages 739-761, July.
    15. Dimpfl, Thomas, 2014. "A note on cointegration of international stock market indices," International Review of Financial Analysis, Elsevier, vol. 33(C), pages 10-16.
    16. Falk Bräuning & Siem Jan Koopman, 2016. "The dynamic factor network model with an application to global credit risk," Working Papers 16-13, Federal Reserve Bank of Boston.
    17. Aknouche, Abdelhakim & Dimitrakopoulos, Stefanos, 2020. "On an integer-valued stochastic intensity model for time series of counts," MPRA Paper 105406, University Library of Munich, Germany.
    18. Antonello Maruotti & Jan Bulla & Tanya Mark, 2019. "Assessing the influence of marketing activities on customer behaviors: a dynamic clustering approach," METRON, Springer;Sapienza Università di Roma, vol. 77(1), pages 19-42, April.
    19. Dag Tjøstheim, 2012. "Some recent theory for autoregressive count time series," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 21(3), pages 413-438, September.
    20. Siem Jan Koopman & Rutger Lit & Thuy Minh Nguyen, 2012. "Fast Efficient Importance Sampling by State Space Methods," Tinbergen Institute Discussion Papers 12-008/4, Tinbergen Institute, revised 16 Oct 2014.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:30:y:2021:i:2:d:10.1007_s11749-020-00727-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.