Printed from https://ideas.repec.org/a/eee/csdana/v103y2016icp206-228.html

A variational Expectation–Maximization algorithm for temporal data clustering

Author

Listed:
  • El Assaad, Hani
  • Samé, Allou
  • Govaert, Gérard
  • Aknin, Patrice

Abstract

The problem of temporal data clustering is addressed using a dynamic Gaussian mixture model. In addition to the missing cluster labels used in the classical Gaussian mixture model, the proposed approach assumes that the means of the Gaussian densities are latent variables distributed according to random walks. The parameters of the proposed model are estimated by the maximum likelihood approach. However, the EM algorithm cannot be applied directly because of the complex structure of the model, and some approximations are required. Using a variational approximation, an algorithm called VEM-DyMix is proposed to estimate the model parameters. Experiments on simulated data demonstrate that the proposed approach estimates the parameters accurately, and that VEM-DyMix outperforms other state-of-the-art algorithms in terms of clustering and estimation accuracy. Experiments on real-world data from two fields of application (railway condition monitoring and object tracking from videos) show the strong potential of the proposed algorithm.
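As an illustration of the kind of model the abstract describes, the following is a minimal sketch that simulates a one-dimensional series from a Gaussian mixture whose component means drift as random walks. It is not the authors' VEM-DyMix algorithm and performs no estimation. The function name, the parameter names (weights, start_means, walk_std, obs_std), and the choice of a single shared observation variance are assumptions made for this example.

```python
import random

def simulate_dynamic_mixture(n_steps, weights, start_means, walk_std, obs_std, seed=0):
    """Draw a 1-D series from a Gaussian mixture whose means follow random walks.

    Illustrative sketch only: all names and the shared observation
    standard deviation are assumptions, not taken from the paper.
    """
    rng = random.Random(seed)
    means = list(start_means)
    observations, labels, mean_paths = [], [], []
    for _ in range(n_steps):
        # Latent means: each component mean takes one Gaussian random-walk step.
        means = [m + rng.gauss(0.0, walk_std) for m in means]
        # Missing cluster label: drawn from the mixing proportions.
        k = rng.choices(range(len(weights)), weights=weights)[0]
        # Observation: Gaussian around the current mean of the chosen cluster.
        observations.append(rng.gauss(means[k], obs_std))
        labels.append(k)
        mean_paths.append(list(means))
    return observations, labels, mean_paths

# Two clusters starting at -5 and +5, drifting slowly over 200 time steps.
xs, zs, paths = simulate_dynamic_mixture(
    n_steps=200, weights=[0.5, 0.5], start_means=[-5.0, 5.0],
    walk_std=0.1, obs_std=0.5,
)
print(len(xs), sorted(set(zs)))
```

In this sketch both the labels in zs and the mean trajectories in paths are hidden from an observer who sees only xs, which is why, as the abstract notes, plain EM does not apply directly and an approximation is needed.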

Suggested Citation

  • El Assaad, Hani & Samé, Allou & Govaert, Gérard & Aknin, Patrice, 2016. "A variational Expectation–Maximization algorithm for temporal data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 206-228.
  • Handle: RePEc:eee:csdana:v:103:y:2016:i:c:p:206-228
    DOI: 10.1016/j.csda.2016.05.007

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947316301098
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.05.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Avanzi, Benjamin & Taylor, Greg & Vu, Phuong Anh & Wong, Bernard, 2020. "A multivariate evolutionary generalised linear model framework with adaptive estimation for claims reserving," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 50-71.
    2. Guido Bulligan & Lorenzo Burlon & Davide Delle Monache & Andrea Silvestrini, 2019. "Real and financial cycles: estimates using unobserved component models for the Italian economy," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 541-569, September.
    3. Alptekin, Aynur & Broadstock, David C. & Chen, Xiaoqi & Wang, Dong, 2019. "Time-varying parameter energy demand functions: Benchmarking state-space methods against rolling-regressions," Energy Economics, Elsevier, vol. 82(C), pages 26-41.
    4. Zirogiannis, Nikolaos & Tripodis, Yorghos, "undated". "A Generalized Dynamic Factor Model for Panel Data: Estimation with a Two-Cycle Conditional Expectation-Maximization Algorithm," Working Paper Series 142752, University of Massachusetts, Amherst, Department of Resource Economics.
    5. Tobias Hartl & Roland Jucknewitz, 2022. "Approximate state space modelling of unobserved fractional components," Econometric Reviews, Taylor & Francis Journals, vol. 41(1), pages 75-98, January.
    6. S. Boragan Aruoba & Francis X. Diebold, 2010. "Real-Time Macroeconomic Monitoring: Real Activity, Inflation, and Interactions," American Economic Review, American Economic Association, vol. 100(2), pages 20-24, May.
    7. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    8. Martínez-Zarzoso, Inmaculada & Maruotti, Antonello, 2011. "The impact of urbanization on CO2 emissions: Evidence from developing countries," Ecological Economics, Elsevier, vol. 70(7), pages 1344-1353, May.
    9. Pennings, Joost M.E. & Garcia, Philip & Irwin, Scott H. & Good, Darrel L., 2003. "How To Group Market Participants? Heterogeneity In Hedging Behavior," 2003 Annual meeting, July 27-30, Montreal, Canada 21963, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    10. Obryan Poyser, 2017. "Exploring the determinants of Bitcoin's price: an application of Bayesian Structural Time Series," Papers 1706.01437, arXiv.org.
    11. Nalan Basturk & Cem Cakmakli & S. Pinar Ceyhan & Herman K. van Dijk, 2014. "On the Rise of Bayesian Econometrics after Cowles Foundation Monographs 10, 14," Tinbergen Institute Discussion Papers 14-085/III, Tinbergen Institute, revised 04 Sep 2014.
    12. Rob Luginbuhl, 2020. "Estimation of the Financial Cycle with a Rank-Reduced Multivariate State-Space Model," CPB Discussion Paper 409, CPB Netherlands Bureau for Economic Policy Analysis.
    13. Kristóf Németh & Dániel Hadházi, 2023. "GDP nowcasting with artificial neural networks: How much does long-term memory matter?," Papers 2304.05805, arXiv.org, revised Jan 2025.
    14. Heimberger, Philipp & Kapeller, Jakob & Schütz, Bernhard, 2017. "The NAIRU determinants: What’s structural about unemployment in Europe?," Journal of Policy Modeling, Elsevier, vol. 39(5), pages 883-908.
    15. Heungsun Hwang & Marc Tomiuk, 2010. "Fuzzy clusterwise quasi-likelihood generalized linear models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(4), pages 255-270, December.
    16. Bernardi, Mauro & Catania, Leopoldo, 2018. "Portfolio optimisation under flexible dynamic dependence modelling," Journal of Empirical Finance, Elsevier, vol. 48(C), pages 1-18.
    17. Kristóf Németh & Dániel Hadházi, 2024. "Generating density nowcasts for U.S. GDP growth with deep learning: Bayes by Backprop and Monte Carlo dropout," Papers 2405.15579, arXiv.org.
    18. Ralf Dewenter & Ulrich Heimeshoff, 2017. "Predicting Advertising Volumes Using Structural Time Series Models: A Case Study," Economics Bulletin, AccessEcon, vol. 37(3), pages 1644-1652.
    19. Mellár, Tamás & Németh, Kristóf, 2018. "A kibocsátási rés becslése többváltozós állapottérmodellekben. Szuperhiszterézis és további empirikus eredmények [Estimating output gap in multivariate state space models. Super-hysteresis and furt," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(6), pages 557-591.
    20. Davide Delle Monache & Stefano Grassi & Paolo Santucci de Magistris, 2017. "Does the ARFIMA really shift?," CREATES Research Papers 2017-16, Department of Economics and Business Economics, Aarhus University.
