IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v70y2014icp345-361.html
   My bibliography  Save this article

Polarization of forecast densities: A new approach to time series classification

Author

Listed:
  • Liu, Shen
  • Maharaj, Elizabeth Ann
  • Inder, Brett

Abstract

Time series classification has been extensively explored in many fields of study. Most methods are based on the historical or current information extracted from data. However, if interest is in a specific future time period, methods that directly relate to forecasts of time series are much more appropriate. An approach to time series classification is proposed based on a polarization measure of forecast densities of time series. By fitting autoregressive models, forecast replicates of each time series are obtained via the bias-corrected bootstrap, and a stationarity correction is considered when necessary. Kernel estimators are then employed to approximate forecast densities, and discrepancies of forecast densities of pairs of time series are estimated by a polarization measure, which evaluates the extent to which two densities overlap. Following the distributional properties of the polarization measure, a discriminant rule and a clustering method are proposed to conduct the supervised and unsupervised classification, respectively. The proposed methodology is applied to both simulated and real data sets, and the results show desirable properties.

Suggested Citation

  • Liu, Shen & Maharaj, Elizabeth Ann & Inder, Brett, 2014. "Polarization of forecast densities: A new approach to time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 345-361.
  • Handle: RePEc:eee:csdana:v:70:y:2014:i:c:p:345-361
    DOI: 10.1016/j.csda.2013.10.008
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947313003617
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2013.10.008?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Basalto, N. & Bellotti, R. & De Carlo, F. & Facchi, P. & Pascazio, S., 2005. "Clustering stock market companies via chaotic map synchronization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 345(1), pages 196-206.
    2. Liu, Xueli & Lee, Sheng-Chien & Casella, George & Peter, Gary F., 2008. "Assessing agreement of clustering methods with gene expression microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5356-5366, August.
    3. Alonso, Andrés M. & Casado, David & Romo, Juan, 2012. "Supervised classification for functional data: A weighted distance approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2334-2346.
    4. Jae H. Kim, 2004. "Bias-corrected bootstrap prediction regions for vector autoregression," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(2), pages 141-154.
    5. Esteban, Joan & Ray, Debraj, 1994. "On the Measurement of Polarization," Econometrica, Econometric Society, vol. 62(4), pages 819-851, July.
    6. Kim, Jae H, 2001. "Bootstrap-after-Bootstrap Prediction Intervals for Autoregressive Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(1), pages 117-128, January.
    7. Kim, Jae H. & Wong, Kevin & Athanasopoulos, George & Liu, Shen, 2011. "Beyond point forecasting: Evaluation of alternative prediction intervals for tourist arrivals," International Journal of Forecasting, Elsevier, vol. 27(3), pages 887-901.
    8. Volant, Stevenn & Martin Magniette, Marie-Laure & Robin, Stéphane, 2012. "Variational Bayes approach for model aggregation in unsupervised classification with Markovian dependency," Computational Statistics & Data Analysis, Elsevier, vol. 56(8), pages 2375-2387.
    9. Salcedo, Gladys E. & Porto, Rogério F. & Morettin, Pedro A., 2012. "Comparing non-stationary and irregularly spaced time series," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 3921-3934.
    10. Kim, Jae H., 2004. "Bootstrap prediction intervals for autoregression using asymptotically mean-unbiased estimators," International Journal of Forecasting, Elsevier, vol. 20(1), pages 85-97.
    11. Liu, Shen & Maharaj, Elizabeth Ann, 2013. "A hypothesis test using bias-adjusted AR estimators for classifying time series in small samples," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 32-49.
    12. Jean-Yves Duclos & Joan Esteban & Debraj Ray, 2004. "Polarization: Concepts, Measurement, Estimation," Econometrica, Econometric Society, vol. 72(6), pages 1737-1772, November.
    13. Scrucca, Luca, 2007. "Class prediction and gene selection for DNA microarrays using regularized sliced inverse regression," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 438-451, September.
    14. Harvill, Jane L. & Ravishanker, Nalini & Ray, Bonnie K., 2013. "Bispectral-based methods for clustering time series," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 113-131.
    15. Maharaj, Elizabeth A. & Alonso, Andres M., 2007. "Discrimination of locally stationary time series using wavelets," Computational Statistics & Data Analysis, Elsevier, vol. 52(2), pages 879-895, October.
    16. Maharaj, Elizabeth Ann & D’Urso, Pierpaolo, 2010. "A coherence-based approach for the pattern recognition of time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(17), pages 3516-3537.
    17. Dose, Christian & Cincotti, Silvano, 2005. "Clustering of financial time series with application to index and enhanced index tracking portfolio," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 355(1), pages 145-151.
    18. Park, Changyi & Koo, Ja-Yong & Kim, Sujong & Sohn, Insuk & Lee, Jae Won, 2008. "Classification of gene functions using support vector machine for time-course gene expression data," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2578-2587, January.
    19. Anderson, Gordon & Linton, Oliver & Whang, Yoon-Jae, 2012. "Nonparametric estimation and inference about the overlap of two distributions," Journal of Econometrics, Elsevier, vol. 171(1), pages 1-23.
    20. Ausloos, M. & Lambiotte, R., 2007. "Clusters or networks of economies? A macroeconomy study through Gross Domestic Product," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 382(1), pages 16-21.
    21. Alonso, A.M. & Berrendero, J.R. & Hernandez, A. & Justel, A., 2006. "Time series clustering based on forecast densities," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 762-776, November.
    22. Lorenzo Pascual & Juan Romo & Esther Ruiz, 2004. "Bootstrap predictive inference for ARIMA processes," Journal of Time Series Analysis, Wiley Blackwell, vol. 25(4), pages 449-465, July.
    23. Liang, Faming, 2007. "Use of SVD-based probit transformation in clustering gene expression profiles," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6355-6366, August.
    24. Lutz Kilian, 1998. "Small-Sample Confidence Intervals For Impulse Response Functions," The Review of Economics and Statistics, MIT Press, vol. 80(2), pages 218-230, May.
    25. Douzal-Chouakria, Ahlame & Diallo, Alpha & Giroud, Françoise, 2009. "Adaptive clustering for time series: Application for identifying cell cycle expressed genes," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1414-1426, February.
    26. Pattarin, Francesco & Paterlini, Sandra & Minerva, Tommaso, 2004. "Clustering financial time series: an application to mutual funds style analysis," Computational Statistics & Data Analysis, Elsevier, vol. 47(2), pages 353-372, September.
    27. Slaets, Leen & Claeskens, Gerda & Hubert, Mia, 2012. "Phase and amplitude-based clustering for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2360-2374.
    28. Anderson, Gordon, 2004. "Toward an empirical analysis of polarization," Journal of Econometrics, Elsevier, vol. 122(1), pages 1-26, September.
    29. Miśkiewicz, Janusz & Ausloos, Marcel, 2008. "Correlation measure to detect time series distances, whence economy globalization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(26), pages 6584-6594.
    30. Corduas, Marcella & Piccolo, Domenico, 2008. "Time series clustering and classification by the autoregressive metric," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 1860-1872, January.
    31. Clements, Michael P. & Kim, Jae H., 2007. "Bootstrap prediction intervals for autoregressive time series," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3580-3594, April.
    32. Shumway, Robert H., 2003. "Time-frequency clustering and discriminant analysis," Statistics & Probability Letters, Elsevier, vol. 63(3), pages 307-314, July.
    33. Vilar, J.A. & Alonso, A.M. & Vilar, J.M., 2010. "Non-linear time series clustering based on non-parametric forecast densities," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2850-2865, November.
    34. Lutz Kilian, 1998. "Accounting for Lag Order Uncertainty in Autoregressions: the Endogenous Lag Order Bootstrap Algorithm," Journal of Time Series Analysis, Wiley Blackwell, vol. 19(5), pages 531-548, September.
    35. Kim, Yongdai & Kwon, Sunghoon & Heun Song, Seuck, 2006. "Multiclass sparse logistic regression for classification of multiple cancer types using gene expression data," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 1643-1655, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Shen & Maharaj, Elizabeth Ann, 2013. "A hypothesis test using bias-adjusted AR estimators for classifying time series in small samples," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 32-49.
    2. João Henrique Gonçalves Mazzeu & Esther Ruiz & Helena Veiga, 2018. "Uncertainty And Density Forecasts Of Arma Models: Comparison Of Asymptotic, Bayesian, And Bootstrap Procedures," Journal of Economic Surveys, Wiley Blackwell, vol. 32(2), pages 388-419, April.
    3. Douzal-Chouakria, Ahlame & Diallo, Alpha & Giroud, Françoise, 2009. "Adaptive clustering for time series: Application for identifying cell cycle expressed genes," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1414-1426, February.
    4. Fresoli, Diego & Ruiz, Esther & Pascual, Lorenzo, 2015. "Bootstrap multi-step forecasts of non-Gaussian VAR models," International Journal of Forecasting, Elsevier, vol. 31(3), pages 834-848.
    5. Gonçalves Mazzeu, Joao Henrique & Ruiz Ortega, Esther & Veiga, Helena, 2015. "Model uncertainty and the forecast accuracy of ARMA models: A survey," DES - Working Papers. Statistics and Econometrics. WS ws1508, Universidad Carlos III de Madrid. Departamento de Estadística.
    6. Diego Fresoli, 2022. "Bootstrap VAR forecasts: The effect of model uncertainties," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(2), pages 279-293, March.
    7. Anna Staszewska-Bystrova, 2009. "Bootstrap Confidence Bands for Forecast Paths," Working Papers 024, COMISEF.
    8. Giovanni De Luca & Paola Zuccolotto, 2011. "A tail dependence-based dissimilarity measure for financial time series clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 5(4), pages 323-340, December.
    9. De Luca Giovanni & Zuccolotto Paola, 2017. "A double clustering algorithm for financial time series based on extreme events," Statistics & Risk Modeling, De Gruyter, vol. 34(1-2), pages 1-12, June.
    10. Antonis A. Michis, 2021. "Wavelet Multidimensional Scaling Analysis of European Economic Sentiment Indicators," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 443-480, October.
    11. Maharaj, Elizabeth Ann & D’Urso, Pierpaolo, 2010. "A coherence-based approach for the pattern recognition of time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(17), pages 3516-3537.
    12. Allison, David B. & Visscher, Peter M. & Rosa, Guilherme J.M. & Amos, Christopher I., 2009. "Statistical genetics & statistical genomics: Where biology, epistemology, statistics, and computation collide," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1531-1534, March.
    13. B. Lafuente-Rego & P. D’Urso & J. A. Vilar, 2020. "Robust fuzzy clustering based on quantile autocovariances," Statistical Papers, Springer, vol. 61(6), pages 2393-2448, December.
    14. Anna Staszewska‐Bystrova, 2011. "Bootstrap prediction bands for forecast paths from vector autoregressive models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 30(8), pages 721-735, December.
    15. Daniel J. Henderson & Christopher F. Parmeter & R. Robert Russell, 2008. "Modes, weighted modes, and calibrated modes: evidence of clustering using modality tests," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(5), pages 607-638.
    16. Anna Staszewska-Bystrova & Peter Winker, 2016. "Improved bootstrap prediction intervals for SETAR models," Statistical Papers, Springer, vol. 57(1), pages 89-98, March.
    17. Daniel Grabowski & Anna Staszewska-Bystrova & Peter Winker, 2020. "Skewness-adjusted bootstrap confidence intervals and confidence bands for impulse response functions," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(1), pages 5-32, March.
    18. Gordon Anderson & Oliver Linton & Yoon-Jae Wang, 2009. "Non Parametric Estimation of a Polarization Measure," Working Papers tecipa-363, University of Toronto, Department of Economics.
    19. Beibei Zhang & Rong Chen, 2018. "Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic," Journal of Classification, Springer;The Classification Society, vol. 35(3), pages 394-421, October.
    20. Staszewska-Bystrova, Anna & Winker, Peter, 2013. "Constructing narrowest pathwise bootstrap prediction bands using threshold accepting," International Journal of Forecasting, Elsevier, vol. 29(2), pages 221-233.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:70:y:2014:i:c:p:345-361. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.