IDEAS home Printed from https://ideas.repec.org/p/com/wpaper/028.html
   My bibliography  Save this paper

Fuzzy clustering of univariate and multivariate time series by genetic multiobjective optimization

Author

Listed:
  • S. Bandyopadhyay
  • R. Baragona
  • U. Maulik

Abstract

Given a set of time series, it is of interest to discover subsets that share similar properties. For instance, this may be useful for identifying and estimating a single model that may fit conveniently several time series, instead of performing the usual identification and estimation steps for each one. On the other hand time series in the same cluster are related with respect to the measures assumed for cluster analysis and are suitable for building multivariate time series models. Though many approaches to clustering time series exist, in this view the most effective method seems to have to rely on choosing some features relevant for the problem at hand and seeking for clusters according to their measurements, for instance the autoregressive coe±cients, spectral measures or the eigenvectors of the covariance matrix. Some new indexes based on goodnessof-fit criteria will be proposed in this paper for fuzzy clustering of multivariate time series. A general purpose fuzzy clustering algorithm may be used to estimate the proper cluster structure according to some internal criteria of cluster validity. Such indexes are known to measure actually definite often conflicting cluster properties, compactness or connectedness, for instance, or distribution, orientation, size and shape. It is argued that the multiobjective optimization supported by genetic algorithms is a most effective choice in such a di±cult context. In this paper we use the Xie-Beni index and the C-means functional as objective functions to evaluate the cluster validity in a multiobjective optimization framework. The concept of Pareto optimality in multiobjective genetic algorithms is used to evolve a set of potential solutions towards a set of optimal non-dominated solutions. Genetic algorithms are well suited for implementing di±cult optimization problems where objective functions do not usually have good mathematical properties such as continuity, differentiability or convexity. In addition the genetic algorithms, as population based methods, may yield a complete Pareto front at each step of the iterative evolutionary procedure. The method is illustrated by means of a set of real data and an artificial multivariate time series data set.

Suggested Citation

  • S. Bandyopadhyay & R. Baragona & U. Maulik, 2010. "Fuzzy clustering of univariate and multivariate time series by genetic multiobjective optimization," Working Papers 028, COMISEF.
  • Handle: RePEc:com:wpaper:028
    as

    Download full text from publisher

    File URL: http://comisef.eu/files/wps028.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Domenico Piccolo, 1990. "A Distance Measure For Classifying Arima Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 11(2), pages 153-164, March.
    2. Roberto Baragona & Francesco Battaglia & Claudio Calzini, 2001. "Clustering of time series with genetic algorithms," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1-2), pages 111-128.
    3. Winker, Peter & Gilli, Manfred, 2004. "Applications of optimization heuristics to estimation and modelling problems," Computational Statistics & Data Analysis, Elsevier, vol. 47(2), pages 211-223, September.
    4. Hsiao-Yun Huang & Hernando Ombao & David S. Stoffer, 2004. "Discrimination and Classification of Nonstationary Time Series Using the SLEX Model," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 763-774, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Corduas, Marcella & Piccolo, Domenico, 2008. "Time series clustering and classification by the autoregressive metric," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 1860-1872, January.
    2. Alessandro Casini & Pierre Perron, 2021. "Change-Point Analysis of Time Series with Evolutionary Spectra," Papers 2106.02031, arXiv.org, revised Jun 2021.
    3. Giancarlo Bruno & Edoardo Otranto, 2006. "The choice of time interval in seasonal adjustment: A heuristic approach," Statistical Papers, Springer, vol. 47(3), pages 393-417, June.
    4. Blueschke-Nikolaeva, V. & Blueschke, D. & Neck, R., 2012. "Optimal control of nonlinear dynamic econometric models: An algorithm and an application," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3230-3240.
    5. Umberto Triacca, 2016. "Measuring the Distance between Sets of ARMA Models," Econometrics, MDPI, vol. 4(3), pages 1-11, July.
    6. Otranto, Edoardo, 2010. "Identifying financial time series with similar dynamic conditional correlation," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 1-15, January.
    7. Giorgio Fagiolo & Mattia Guerini & Francesco Lamperti & Alessio Moneta & Andrea Roventini, 2017. "Validation of Agent-Based Models in Economics and Finance," LEM Papers Series 2017/23, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    8. Juan Vilar & José Vilar & Sonia Pértega, 2009. "Classifying Time Series Data: A Nonparametric Approach," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 3-28, April.
    9. Manfred Gilli & Enrico Schumann, 2012. "Heuristic optimisation in financial modelling," Annals of Operations Research, Springer, vol. 193(1), pages 129-158, March.
    10. Zhelin Huang & Ngai Hang Chan, 2020. "Walsh Fourier Transform of Locally Stationary Time Series," Journal of Time Series Analysis, Wiley Blackwell, vol. 41(2), pages 312-340, March.
    11. Mahmoudi, Mohammad Reza, 2021. "A computational technique to classify several fractional Brownian motion processes," Chaos, Solitons & Fractals, Elsevier, vol. 150(C).
    12. De Gregorio, Alessandro & Maria Iacus, Stefano, 2010. "Clustering of discretely observed diffusion processes," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 598-606, February.
    13. S.-C. Horng & S.-Y. Lin, 2009. "Ordinal Optimization of G/G/1/K Polling Systems with k-Limited Service Discipline," Journal of Optimization Theory and Applications, Springer, vol. 140(2), pages 213-231, February.
    14. Sonia Díaz & José Vilar, 2010. "Comparing Several Parametric and Nonparametric Approaches to Time Series Clustering: A Simulation Study," Journal of Classification, Springer;The Classification Society, vol. 27(3), pages 333-362, November.
    15. Luis Lorenzo & Javier Arroyo, 2023. "Online risk-based portfolio allocation on subsets of crypto assets applying a prototype-based clustering algorithm," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-40, December.
    16. Di Iorio, Francesca & Triacca, Umberto, 2013. "Testing for Granger non-causality using the autoregressive metric," Economic Modelling, Elsevier, vol. 33(C), pages 120-125.
    17. Lin, D.K.J. & Sharpe, C. & Winker, P., 2010. "Optimized U-type designs on flexible regions," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1505-1515, June.
    18. Savin, Ivan & Egbetokun, Abiodun, 2016. "Emergence of innovation networks from R&D cooperation with endogenous absorptive capacity," Journal of Economic Dynamics and Control, Elsevier, vol. 64(C), pages 82-103.
    19. Paloma Taltavull de La Paz, 2021. "Predicting housing prices. A long term housing price path for Spanish regions," LARES lares-2021-4dra, Latin American Real Estate Society (LARES).
    20. Pierpaolo D’Urso & Livia Giovanni & Riccardo Massari & Dario Lallo, 2013. "Noise fuzzy clustering of time series by autoregressive metric," METRON, Springer;Sapienza Università di Roma, vol. 71(3), pages 217-243, November.

    More about this item

    Keywords

    Fuzzy clustering; Internal criteria of cluster validity; Genetic algorithms; Multiobjective optimization; Time series; Pareto optimality;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:com:wpaper:028. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Anil Khuman (email available below). General contact details of provider: http://www.comisef.eu .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.