IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v39y2024i4d10.1007_s00180-023-01440-7.html
   My bibliography  Save this article

Dimension reduction and visualization of multiple time series data: a symbolic data analysis approach

Author

Listed:
  • Emily Chia-Yu Su

    (Taipei Medical University)

  • Han-Ming Wu

    (National Chengchi University)

Abstract

Exploratory analysis and visualization of multiple time series data are essential for discovering the underlying dynamics of a series before attempting modeling and forecasting. This study extends two dimension reduction methods - principal component analysis (PCA) and sliced inverse regression (SIR) - to multiple time series data. This is achieved through the innovative path point approach, a new addition to the symbolic data analysis framework. By transforming multiple time series data into time-dependent intervals marked by starting and ending values, each series is geometrically represented as successive directed segments with unique path points. These path points serve as the foundation of our novel representation approach. PCA and SIR are then applied to the data table formed by the coordinates of these path points, enabling visualization of temporal trajectories of objects within a reduced-dimensional subspace. Empirical studies encompassing simulations, microarray time series data from a yeast cell cycle, and financial data confirm the effectiveness of our path point approach in revealing the structure and behavior of objects within a 2D factorial plane. Comparative analyses with existing methods, such as the applied vector approach for PCA and SIR on time-dependent interval data, further underscore the strength and versatility of our path point representation in the realm of time series data.

Suggested Citation

  • Emily Chia-Yu Su & Han-Ming Wu, 2024. "Dimension reduction and visualization of multiple time series data: a symbolic data analysis approach," Computational Statistics, Springer, vol. 39(4), pages 1937-1969, June.
  • Handle: RePEc:spr:compst:v:39:y:2024:i:4:d:10.1007_s00180-023-01440-7
    DOI: 10.1007/s00180-023-01440-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-023-01440-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-023-01440-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Giordani, Paolo & Kiers, Henk A.L., 2006. "A comparison of three methods for principal component analysis of fuzzy interval data," Computational Statistics & Data Analysis, Elsevier, vol. 51(1), pages 379-397, November.
    2. Paulo Teles & Paula Brito, 2015. "Modeling Interval Time Series with Space–Time Processes," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 44(17), pages 3599-3627, September.
    3. Wenhua Li & Junpeng Guo & Ying Chen & Minglu Wang, 2016. "A New Representation of Interval Symbolic Data and Its Application in Dynamic Clustering," Journal of Classification, Springer;The Classification Society, vol. 33(1), pages 149-165, April.
    4. Federica Gioia & Carlo Lauro, 2006. "Principal component analysis on interval data," Computational Statistics, Springer, vol. 21(2), pages 343-363, June.
    5. Benoît Liquet & Jérôme Saracco, 2012. "A graphical tool for selecting the number of slices and the dimension of the model in SIR and SAVE approaches," Computational Statistics, Springer, vol. 27(1), pages 103-125, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Greenacre & Patrick J. F Groenen & Trevor Hastie & Alfonso Iodice d’Enza & Angelos Markos & Elena Tuzhilina, 2023. "Principal component analysis," Economics Working Papers 1856, Department of Economics and Business, Universitat Pompeu Fabra.
    2. Drago, Carlo & Gatto, Andrea, 2022. "Policy, regulation effectiveness, and sustainability in the energy sector: A worldwide interval-based composite indicator," Energy Policy, Elsevier, vol. 167(C).
    3. Marie Chavent & Stéphane Girard & Vanessa Kuentz-Simonet & Benoit Liquet & Thi Nguyen & Jérôme Saracco, 2014. "A sliced inverse regression approach for data stream," Computational Statistics, Springer, vol. 29(5), pages 1129-1152, October.
    4. Blanco-Fernández, Angela & Corral, Norberto & González-Rodríguez, Gil, 2011. "Estimation of a flexible simple linear model for interval data based on set arithmetic," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2568-2578, September.
    5. Huiwen Wang & Liying Shangguan & Rong Guan & Lynne Billard, 2015. "Principal component analysis for compositional data vectors," Computational Statistics, Springer, vol. 30(4), pages 1079-1096, December.
    6. Lian, Heng & Li, Gaorong, 2014. "Series expansion for functional sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 150-165.
    7. Sun, Yuying & Zhang, Xinyu & Wan, Alan T.K. & Wang, Shouyang, 2022. "Model averaging for interval-valued data," European Journal of Operational Research, Elsevier, vol. 301(2), pages 772-784.
    8. Prendergast, Luke A. & Smith, Jodie A., 2022. "Influence functions for linear discriminant analysis: Sensitivity analysis and efficient influence diagnostics," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    9. Anuradha Roy, 2014. "A two-stage principal component analysis of symbolic data using equicorrelated and jointly equicorrelated covariance structures," Working Papers 0164mss, College of Business, University of Texas at San Antonio.
    10. Karel Hron & Paula Brito & Peter Filzmoser, 2017. "Exploratory data analysis for interval compositional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(2), pages 223-241, June.
    11. Roy, Falguni & K. Gupta, Dharmendra., 2018. "Sufficient regularity conditions for complex interval matrices and approximations of eigenvalues sets," Applied Mathematics and Computation, Elsevier, vol. 317(C), pages 193-209.
    12. Liu, Bingsheng & Shen, Yinghua & Zhang, Wei & Chen, Xiaohong & Wang, Xueqing, 2015. "An interval-valued intuitionistic fuzzy principal component analysis model-based method for complex multi-attribute large-group decision-making," European Journal of Operational Research, Elsevier, vol. 245(1), pages 209-225.
    13. Pierpaolo D’Urso & María Ángeles Gil, 2017. "Fuzzy data analysis and classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(4), pages 645-657, December.
    14. Coppi, Renato & Gil, Maria A. & Kiers, Henk A.L., 2006. "The fuzzy approach to statistical analysis," Computational Statistics & Data Analysis, Elsevier, vol. 51(1), pages 1-14, November.
    15. Wenyang Huang & Huiwen Wang & Shanshan Wang, 2021. "Dimension reduction of open-high-low-close data in candlestick chart based on pseudo-PCA," Papers 2103.16908, arXiv.org.
    16. Drago, Carlo & Gatto, Andrea, 2023. "Gauging energy poverty in developing countries with a composite metric of electricity access," Utilities Policy, Elsevier, vol. 81(C).
    17. Coudret, R. & Girard, S. & Saracco, J., 2014. "A new sliced inverse regression method for multivariate response," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 285-299.
    18. M. Rosário Oliveira & Margarida Azeitona & António Pacheco & Rui Valadas, 2022. "Association measures for interval variables," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 491-520, September.
    19. Jaromír Antoch & Miroslav Brzezina & Rafaelle Miele, 2010. "A note on variability of interval data," Computational Statistics, Springer, vol. 25(1), pages 143-153, March.
    20. Grażyna Dehnel & Marek Walesiak, 2019. "A Comparative Analysis Of Economic Efficiency Of Medium-Sized Manufacturing Enterprises In Districts Of Wielkopolska Province Using The Hybrid Approach With Metric And Interval-Valued Data," Statistics in Transition New Series, Polish Statistical Association, vol. 20(2), pages 49-68, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:39:y:2024:i:4:d:10.1007_s00180-023-01440-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.