IDEAS home Printed from https://ideas.repec.org/a/bla/jtsera/v43y2022i6p918-937.html
   My bibliography  Save this article

Autoregressive mixture models for clustering time series

Author

Listed:
  • Benny Ren
  • Ian Barnett

Abstract

Clustering time series into similar groups can improve models by combining information across like time series. While there is a well developed body of literature for clustering of time series, these approaches tend to generate clusters independently of model training, which can lead to poor model fit. We propose a novel distributed approach that simultaneously clusters and fits autoregression models for groups of similar individuals. We apply a Wishart mixture model so as to cluster individuals while modelling the corresponding autocovariance matrices at the same time. The fitted Wishart scale matrices map to cluster‐level autoregressive coefficients through the Yule–Walker equations, fitting robust parsimonious autoregressive mixture models. This approach is able to discern differences in underlying autocorrelation variation of time series in settings with large heterogeneous datasets. We prove consistency of our cluster membership estimator, asymptotic distributions of coefficients and compare our approach against competing methods through simulation as well as by fitting a COVID‐19 forecast model.

Suggested Citation

  • Benny Ren & Ian Barnett, 2022. "Autoregressive mixture models for clustering time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(6), pages 918-937, November.
  • Handle: RePEc:bla:jtsera:v:43:y:2022:i:6:p:918-937
    DOI: 10.1111/jtsa.12644
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/jtsa.12644
    Download Restriction: no

    File URL: https://libkey.io/10.1111/jtsa.12644?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    2. Bodnar, Taras & Okhrin, Yarema, 2008. "Properties of the singular, inverse and generalized inverse partitioned Wishart distributions," Journal of Multivariate Analysis, Elsevier, vol. 99(10), pages 2389-2405, November.
    3. Geoffrey Coke & Min Tsao, 2010. "Random effects mixture models for clustering electrical load series," Journal of Time Series Analysis, Wiley Blackwell, vol. 31(6), pages 451-464, November.
    4. Domenico Piccolo, 1990. "A Distance Measure For Classifying Arima Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 11(2), pages 153-164, March.
    5. Qiu, D. & Shao, Q. & Yang, L., 2013. "Efficient inference for autoregressive coefficients in the presence of trends," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 40-53.
    6. Montero, Pablo & Vilar, José A., 2014. "TSclust: An R Package for Time Series Clustering," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i01).
    7. Gourieroux, C. & Jasiak, J. & Sufana, R., 2009. "The Wishart Autoregressive process of multivariate stochastic volatility," Journal of Econometrics, Elsevier, vol. 150(2), pages 167-181, June.
    8. Michael I. Jordan & Jason D. Lee & Yun Yang, 2019. "Communication-Efficient Distributed Statistical Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 668-681, April.
    9. Carolina Euán & Hernando Ombao & Joaquín Ortega, 2018. "The Hierarchical Spectral Merger Algorithm: A New Time Series Clustering Procedure," Journal of Classification, Springer;The Classification Society, vol. 35(1), pages 71-99, April.
    10. Genolini, Christophe & Alacoque, Xavier & Sentenac, Mariane & Arnaud, Catherine, 2015. "kml and kml3d: R Packages to Cluster Longitudinal Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 65(i04).
    11. Q. Shao & L. J. Yang, 2011. "Autoregressive coefficient estimation in nonparametric analysis," Journal of Time Series Analysis, Wiley Blackwell, vol. 32(6), pages 587-597, November.
    12. Galeano, Pedro & Peña, Daniel, 2001. "Multivariate analysis in vector time series," DES - Working Papers. Statistics and Econometrics. WS ws012415, Universidad Carlos III de Madrid. Departamento de Estadística.
    13. Konstantinos Fokianos & Vasilis J. Promponas, 2012. "Biological applications of time series frequency domain clustering," Journal of Time Series Analysis, Wiley Blackwell, vol. 33(5), pages 744-756, September.
    14. Datta Gupta, Syamantak & Mazumdar, Ravi R. & Glynn, Peter, 2013. "On the convergence of the spectrum of finite order approximations of stationary time series," Journal of Multivariate Analysis, Elsevier, vol. 121(C), pages 1-21.
    15. Xu Gao & Weining Shen & Liwen Zhang & Jianhua Hu & Norbert J. Fortin & Ron D. Frostig & Hernando Ombao, 2021. "Regularized matrix data clustering and its application to image analysis," Biometrics, The International Biometric Society, vol. 77(3), pages 890-902, September.
    16. Yongning Wang & Ruey S. Tsay, 2019. "Clustering Multiple Time Series with Structural Breaks," Journal of Time Series Analysis, Wiley Blackwell, vol. 40(2), pages 182-202, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sipan Aslan & Ceylan Yozgatligil & Cem Iyigun, 2018. "Temporal clustering of time series via threshold autoregressive models: application to commodity prices," Annals of Operations Research, Springer, vol. 260(1), pages 51-77, January.
    2. Asai, Manabu & McAleer, Michael, 2015. "Leverage and feedback effects on multifactor Wishart stochastic volatility for option pricing," Journal of Econometrics, Elsevier, vol. 187(2), pages 436-446.
    3. Beibei Zhang & Rong Chen, 2018. "Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic," Journal of Classification, Springer;The Classification Society, vol. 35(3), pages 394-421, October.
    4. Manabu Asai & Michael McAleer, 2017. "A fractionally integrated Wishart stochastic volatility model," Econometric Reviews, Taylor & Francis Journals, vol. 36(1-3), pages 42-59, March.
    5. Sonia Díaz & José Vilar, 2010. "Comparing Several Parametric and Nonparametric Approaches to Time Series Clustering: A Simulation Study," Journal of Classification, Springer;The Classification Society, vol. 27(3), pages 333-362, November.
    6. Qin Shao & Lijian Yang, 2017. "Oracally efficient estimation and consistent model selection for auto-regressive moving average time series with trend," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 507-524, March.
    7. L. Tang & Q. Shao, 2014. "Efficient Estimation For Periodic Autoregressive Coefficients Via Residuals," Journal of Time Series Analysis, Wiley Blackwell, vol. 35(4), pages 378-389, July.
    8. Alfelt, Gustav & Bodnar, Taras & Javed, Farrukh & Tyrcha, Joanna, 2020. "Singular conditional autoregressive Wishart model for realized covariance matrices," Working Papers 2021:1, Örebro University, School of Business.
    9. Corduas, Marcella & Piccolo, Domenico, 2008. "Time series clustering and classification by the autoregressive metric," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 1860-1872, January.
    10. Tianbo Chen & Ying Sun & Carolina Euan & Hernando Ombao, 2021. "Clustering Brain Signals: a Robust Approach Using Functional Data Ranking," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 425-442, October.
    11. Alonso, A.M. & Berrendero, J.R. & Hernandez, A. & Justel, A., 2006. "Time series clustering based on forecast densities," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 762-776, November.
    12. Lucio Palazzo & Riccardo Ievoli, 2023. "Detecting Regional Differences in Italian Health Services during Five COVID-19 Waves," Stats, MDPI, vol. 6(2), pages 1-13, April.
    13. Caiado, Jorge & Crato, Nuno & Pena, Daniel, 2006. "A periodogram-based metric for time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2668-2684, June.
    14. Druica, Elena & Goschin, Zizi, 2016. "Does Economic Status Matter for the Regional Variation of Malnutrition-Related Diabetes in Romania? Temporal Clustering and Spatial Analyses," MPRA Paper 88831, University Library of Munich, Germany.
    15. Margherita Gerolimetto & Stefano Magrini, 2022. "Weighting in clustering time series: an application to Covid-19 data," RIEDS - Rivista Italiana di Economia, Demografia e Statistica - The Italian Journal of Economic, Demographic and Statistical Studies, SIEDS Societa' Italiana di Economia Demografia e Statistica, vol. 76(4), pages 4-12, October-D.
    16. B. Lafuente-Rego & P. D’Urso & J. A. Vilar, 2020. "Robust fuzzy clustering based on quantile autocovariances," Statistical Papers, Springer, vol. 61(6), pages 2393-2448, December.
    17. Gagliardini, Patrick & Gouriéroux, Christian, 2019. "Identification by Laplace transforms in nonlinear time series and panel models with unobserved stochastic dynamic effects," Journal of Econometrics, Elsevier, vol. 208(2), pages 613-637.
    18. Hillebrand, Eric & Schnabl, Gunther & Ulu, Yasemin, 2009. "Japanese foreign exchange intervention and the yen-to-dollar exchange rate: A simultaneous equations approach using realized volatility," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 19(3), pages 490-505, July.
    19. Frederico Belo & Chen Xue & Lu Zhang, 2010. "Cross-sectional Tobin's Q," NBER Working Papers 16336, National Bureau of Economic Research, Inc.
    20. Bansal, Ravi & Kiku, Dana & Yaron, Amir, 2016. "Risks for the long run: Estimation with time aggregation," Journal of Monetary Economics, Elsevier, vol. 82(C), pages 52-69.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jtsera:v:43:y:2022:i:6:p:918-937. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0143-9782 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.