A Run Length Transformation for Discriminating Between Auto Regressive Time Series

Author

Anthony Bagnall
Gareth Janacek

Abstract

We describe a simple time series transformation to detect differences in series that can be accurately modelled as stationary autoregressive (AR) processes. The transformation involves forming the histogram of above and below the mean run lengths. The run length (RL) transformation has the benefits of being very fast, compact and updatable for new data in constant time. Furthermore, it can be generated directly from data that has already been highly compressed. We first establish the theoretical asymptotic relationship between run length distributions and AR models through consideration of the zero crossing probability and the distribution of runs. We benchmark our transformation against two alternatives: the truncated Autocorrelation function (ACF) transform and the AR transformation, which involves the standard method of fitting the partial autocorrelation coefficients with the Durbin-Levinson recursions and using the Akaike Information Criterion stopping procedure. Whilst optimal in the idealized scenario, representing the data in these ways is time consuming and the representation cannot be updated online for new data. We show that for classification problems the accuracy obtained through using the run length distribution tends towards that obtained from using the full fitted models. We then propose three alternative distance measures for run length distributions based on Gower’s general similarity coefficient, the likelihood ratio and dynamic time warping (DTW). Through simulated classification experiments we show that a nearest neighbour distance based on DTW converges to the optimal faster than classifiers based on Euclidean distance, Gower’s coefficient and the likelihood ratio. 
We experiment with a variety of classifiers and demonstrate that although the RL transform requires more data than the best performing classifier to achieve the same accuracy as AR or ACF, this factor is at worst non-increasing with the series length, m, whereas the relative time taken to fit AR and ACF increases with m. We conclude that if the data is stationary and can be suitably modelled by an AR series, and if time is an important factor in reaching a discriminatory decision, then the run length distribution transform is a simple and effective transformation to use.

Copyright Springer Science+Business Media New York 2014
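The run length transformation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `run_length_histogram` and `OnlineRunLengths`, the optional `max_run` cap, the rule that values equal to the mean count as "below", and the use of a fixed threshold for the online update are all assumptions made here for illustration.

```python
from collections import Counter
from itertools import groupby
from statistics import fmean


def run_length_histogram(series, max_run=None):
    """Count runs above and below the series mean, keyed by run length.

    Assumption: a value equal to the mean is treated as "below".
    """
    mean = fmean(series)
    above, below = Counter(), Counter()
    # groupby collapses consecutive observations on the same side of the mean.
    for is_above, group in groupby(series, key=lambda x: x > mean):
        length = sum(1 for _ in group)
        if max_run is not None:
            length = min(length, max_run)  # pool long runs into the last bin
        (above if is_above else below)[length] += 1
    return above, below


class OnlineRunLengths:
    """Hypothetical streaming variant: each update does a constant amount of work.

    Assumption: the threshold (for example a known process mean) is fixed
    in advance rather than re-estimated as data arrive.
    """

    def __init__(self, threshold):
        self.threshold = threshold
        self.above, self.below = Counter(), Counter()
        self._side = None   # side of the threshold for the run in progress
        self._length = 0    # length of the run in progress

    def update(self, x):
        side = x > self.threshold
        if self._length and side == self._side:
            self._length += 1  # the current run continues
        else:
            if self._length:
                # The previous run has ended: record its length.
                (self.above if self._side else self.below)[self._length] += 1
            self._side, self._length = side, 1

    def histograms(self):
        """Return copies of both histograms, including the run in progress."""
        above, below = self.above.copy(), self.below.copy()
        if self._length:
            (above if self._side else below)[self._length] += 1
        return above, below
```

Calling `run_length_histogram(series)` on a whole series and feeding the same values one at a time to `OnlineRunLengths` with the series mean as threshold should give the same pair of histograms. Either pair can then be passed to whichever distance measure is being compared.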

Suggested Citation

Anthony Bagnall & Gareth Janacek, 2014. "A Run Length Transformation for Discriminating Between Auto Regressive Time Series," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 154-178, July.

Handle: RePEc:spr:jclass:v:31:y:2014:i:2:p:154-178
DOI: 10.1007/s00357-013-9135-6

Citations

Cited by:

Patrick Toman & Nalini Ravishanker & Sanguthevar Rajasekaran & Nathan Lally, 2023. "Online Evidential Nearest Neighbour Classification for Internet of Things Time Series," International Statistical Review, International Statistical Institute, vol. 91(3), pages 395-426, December.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Umberto Triacca, 2016. "Measuring the Distance between Sets of ARMA Models," Econometrics, MDPI, vol. 4(3), pages 1-11, July.
Otranto, Edoardo, 2010. "Identifying financial time series with similar dynamic conditional correlation," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 1-15, January.
- E. Otranto, 2008. "Identifying Financial Time Series with Similar Dynamic Conditional Correlation," Working Paper CRENoS 200817, Centre for North South Economic Research, University of Cagliari and Sassari, Sardinia.
Beibei Zhang & Rong Chen, 2018. "Nonlinear Time Series Clustering Based on Kolmogorov-Smirnov 2D Statistic," Journal of Classification, Springer;The Classification Society, vol. 35(3), pages 394-421, October.
Sonia Díaz & José Vilar, 2010. "Comparing Several Parametric and Nonparametric Approaches to Time Series Clustering: A Simulation Study," Journal of Classification, Springer;The Classification Society, vol. 27(3), pages 333-362, November.
Di Iorio, Francesca & Triacca, Umberto, 2013. "Testing for Granger non-causality using the autoregressive metric," Economic Modelling, Elsevier, vol. 33(C), pages 120-125.
Pierpaolo D’Urso & Livia Giovanni & Riccardo Massari & Dario Lallo, 2013. "Noise fuzzy clustering of time series by autoregressive metric," METRON, Springer;Sapienza Università di Roma, vol. 71(3), pages 217-243, November.
E. Otranto, 2011. "Classification of Volatility in Presence of Changes in Model Parameters," Working Paper CRENoS 201113, Centre for North South Economic Research, University of Cagliari and Sassari, Sardinia.
Liu, Shen & Maharaj, Elizabeth Ann, 2013. "A hypothesis test using bias-adjusted AR estimators for classifying time series in small samples," Computational Statistics & Data Analysis, Elsevier, vol. 60(C), pages 32-49.
Otranto, Edoardo, 2008. "Clustering heteroskedastic time series by model-based procedures," Computational Statistics & Data Analysis, Elsevier, vol. 52(10), pages 4685-4698, June.
- E. Otranto, 2008. "Clustering Heteroskedastic Time Series by Model-Based Procedures," Working Paper CRENoS 200801, Centre for North South Economic Research, University of Cagliari and Sassari, Sardinia.
Francesca Di Iorio & Umberto Triacca, 2014. "Testing for A Set of Linear Restrictions in VARMA Models Using Autoregressive Metric: An Application to Granger Causality Test," Econometrics, MDPI, vol. 2(4), pages 1-14, December.
Vilar, J.A. & Alonso, A.M. & Vilar, J.M., 2010. "Non-linear time series clustering based on non-parametric forecast densities," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2850-2865, November.
Pacifico, Antonio, 2020. "Bayesian Fuzzy Clustering with Robust Weighted Distance for Multiple ARIMA and Multivariate Time-Series," MPRA Paper 104379, University Library of Munich, Germany.
João A. Bastos & Jorge Caiado, 2014. "Clustering financial time series with variance ratio statistics," Quantitative Finance, Taylor & Francis Journals, vol. 14(12), pages 2121-2133, December.
- Joao A. Bastos & Jorge Caiado, 2009. "Clustering financial time series with variance ratio statistics," CEMAPRE Working Papers 0904, Centre for Applied Mathematics and Economics (CEMAPRE), School of Economics and Management (ISEG), Technical University of Lisbon.
Liu, Shen & Maharaj, Elizabeth Ann & Inder, Brett, 2014. "Polarization of forecast densities: A new approach to time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 345-361.
Juan Vilar & José Vilar & Sonia Pértega, 2009. "Classifying Time Series Data: A Nonparametric Approach," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 3-28, April.
Bob Walrave, 2016. "Determining intervention thresholds that change output behavior patterns," System Dynamics Review, System Dynamics Society, vol. 32(3-4), pages 261-278, July.
De Gregorio, Alessandro & Maria Iacus, Stefano, 2010. "Clustering of discretely observed diffusion processes," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 598-606, February.
- Alessandro De Gregorio & Stefano Iacus, 2008. "Clustering of discretely observed diffusion processes," UNIMI - Research Papers in Economics, Business, and Statistics unimi-1077, Universitá degli Studi di Milano.
- Alessandro De Gregorio & Stefano Maria Iacus, 2008. "Clustering of discretely observed diffusion processes," Papers 0809.3902, arXiv.org.
Roy Cerqueti & Antonio Iovanella & Raffaele Mattera, 2024. "Clustering networked funded European research activities through rank-size laws," Annals of Operations Research, Springer, vol. 342(3), pages 1707-1735, November.
Paloma Taltavull de La Paz, 2021. "Predicting housing prices. A long term housing price path for Spanish regions," LARES lares-2021-4dra, Latin American Real Estate Society (LARES).
Leijiao Ge & Tianshuo Du & Changlu Li & Yuanliang Li & Jun Yan & Muhammad Umer Rafiq, 2022. "Virtual Collection for Distributed Photovoltaic Data: Challenges, Methodologies, and Applications," Energies, MDPI, vol. 15(23), pages 1-24, November.
