IDEAS home Printed from https://ideas.repec.org/p/ise/remwps/wp01852021.html
   My bibliography  Save this paper

On the classification of financial data with domain agnostic features

Author

Listed:
  • João A. Bastos
  • Jorge Caiado

Abstract

We compare a data-driven domain agnostic set of canonical features with a smaller collection of features that capture well-known stylized facts about financial asset returns. We show that these facts discriminate better different asset types than general-purpose features. Therefore, financial time series analysis is a domain where well-informed expert knowledge may not be disregarded in favor of agnosticrepresentations of the data.

Suggested Citation

  • João A. Bastos & Jorge Caiado, 2021. "On the classification of financial data with domain agnostic features," Working Papers REM 2021/0185, ISEG - Lisbon School of Economics and Management, REM, Universidade de Lisboa.
  • Handle: RePEc:ise:remwps:wp01852021
    as

    Download full text from publisher

    File URL: https://rem.rc.iseg.ulisboa.pt/wps/pdf/REM_WP_0185_2021.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jorge Caiado & Nuno Crato, 2010. "Identifying common dynamic features in stock returns," Quantitative Finance, Taylor & Francis Journals, vol. 10(7), pages 797-807.
    2. Ding, Zhuanxin & Granger, Clive W. J. & Engle, Robert F., 1993. "A long memory property of stock market returns and a new model," Journal of Empirical Finance, Elsevier, vol. 1(1), pages 83-106, June.
    3. Caiado, Jorge & Crato, Nuno, 2007. "A GARCH-based method for clustering of financial time series: International stock markets evidence," MPRA Paper 2074, University Library of Munich, Germany.
    4. Harvey, Campbell R, 1995. "Predictable Risk and Returns in Emerging Markets," The Review of Financial Studies, Society for Financial Studies, vol. 8(3), pages 773-816.
    5. Caiado, Jorge & Crato, Nuno & Pena, Daniel, 2006. "A periodogram-based metric for time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2668-2684, June.
    6. Clarida, Richard & Gali, Jordi, 1994. "Sources of real exchange-rate fluctuations: How important are nominal shocks?," Carnegie-Rochester Conference Series on Public Policy, Elsevier, vol. 41(1), pages 1-56, December.
    7. Granger, Clive W. J. & Ding, Zhuanxin, 1996. "Varieties of long memory models," Journal of Econometrics, Elsevier, vol. 73(1), pages 61-77, July.
    8. Bekaert, Geert & Harvey, Campbell R., 1997. "Emerging equity market volatility," Journal of Financial Economics, Elsevier, vol. 43(1), pages 29-77, January.
    9. Franses,Philip Hans & Dijk,Dick van, 2000. "Non-Linear Time Series Models in Empirical Finance," Cambridge Books, Cambridge University Press, number 9780521770415.
    10. Jorge Caiado & Nuno Crato & Pilar Poncela, 2020. "A fragmented-periodogram approach for clustering big data time series," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(1), pages 117-146, March.
    11. Maharaj, Elizabeth Ann & D’Urso, Pierpaolo, 2010. "A coherence-based approach for the pattern recognition of time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(17), pages 3516-3537.
    12. Zakoian, Jean-Michel, 1994. "Threshold heteroskedastic models," Journal of Economic Dynamics and Control, Elsevier, vol. 18(5), pages 931-955, September.
    13. Alonso, Andres M. & Maharaj, Elizabeth A., 2006. "Comparison of time series using subsampling," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2589-2599, June.
    14. Otranto, Edoardo, 2008. "Clustering heteroskedastic time series by model-based procedures," Computational Statistics & Data Analysis, Elsevier, vol. 52(10), pages 4685-4698, June.
    15. Domenico Piccolo, 1990. "A Distance Measure For Classifying Arima Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 11(2), pages 153-164, March.
    16. Bollerslev, Tim, 1986. "Generalized autoregressive conditional heteroskedasticity," Journal of Econometrics, Elsevier, vol. 31(3), pages 307-327, April.
    17. Pena D. & Prieto F.J., 2001. "Cluster Identification Using Projections," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1433-1445, December.
    18. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    19. Sánchez Granero, M.A. & Trinidad Segovia, J.E. & García Pérez, J., 2008. "Some comments on Hurst exponent and the long memory processes on capital markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(22), pages 5543-5551.
    20. Caiado, Jorge & Crato, Nuno & Peña, Daniel, 2009. "Comparison of time series with unequal length in the frequency domain," MPRA Paper 15310, University Library of Munich, Germany.
    21. Benoit Mandelbrot, 2015. "The Variation of Certain Speculative Prices," World Scientific Book Chapters, in: Anastasios G Malliaris & William T Ziemba (ed.), THE WORLD SCIENTIFIC HANDBOOK OF FUTURES MARKETS, chapter 3, pages 39-78, World Scientific Publishing Co. Pte. Ltd..
    22. João A. Bastos & Jorge Caiado, 2014. "Clustering financial time series with variance ratio statistics," Quantitative Finance, Taylor & Francis Journals, vol. 14(12), pages 2121-2133, December.
    23. Kraus, Alan & Litzenberger, Robert H, 1976. "Skewness Preference and the Valuation of Risk Assets," Journal of Finance, American Finance Association, vol. 31(4), pages 1085-1100, September.
    24. Galeano, Pedro & Pena, Daniel & Tsay, Ruey S., 2006. "Outlier Detection in Multivariate Time Series by Projection Pursuit," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 654-669, June.
    25. Engle, Robert F, 1982. "Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation," Econometrica, Econometric Society, vol. 50(4), pages 987-1007, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lúcio, Francisco & Caiado, Jorge, 2022. "COVID-19 and Stock Market Volatility: A Clustering Approach for S&P 500 Industry Indices," Finance Research Letters, Elsevier, vol. 49(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lúcio, Francisco & Caiado, Jorge, 2022. "COVID-19 and Stock Market Volatility: A Clustering Approach for S&P 500 Industry Indices," Finance Research Letters, Elsevier, vol. 49(C).
    2. B. Lafuente-Rego & P. D’Urso & J. A. Vilar, 2020. "Robust fuzzy clustering based on quantile autocovariances," Statistical Papers, Springer, vol. 61(6), pages 2393-2448, December.
    3. D’Urso, Pierpaolo & Cappelli, Carmela & Di Lallo, Dario & Massari, Riccardo, 2013. "Clustering of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 392(9), pages 2114-2129.
    4. Chebbi, Ali & Hedhli, Amel, 2022. "Revisiting the accuracy of standard VaR methods for risk assessment: Using the Copula–EVT multidimensional approach for stock markets in the MENA region," The Quarterly Review of Economics and Finance, Elsevier, vol. 84(C), pages 430-445.
    5. Assaf, Ata, 2015. "Value-at-Risk analysis in the MENA equity markets: Fat tails and conditional asymmetries in return distributions," Journal of Multinational Financial Management, Elsevier, vol. 29(C), pages 30-45.
    6. Zou, Yongjie & Li, Honggang, 2014. "Time spans between price maxima and price minima in stock markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 395(C), pages 303-309.
    7. Charles, Amélie, 2010. "The day-of-the-week effects on the volatility: The role of the asymmetry," European Journal of Operational Research, Elsevier, vol. 202(1), pages 143-152, April.
    8. Alistair Mees & Berndt Pilgram, 2000. "Non-Linear Markov Modelling Using Canonical Variate Analysis: Forecasting Exchange Rate Volatility," Econometric Society World Congress 2000 Contributed Papers 1162, Econometric Society.
    9. Otranto, Edoardo, 2010. "Identifying financial time series with similar dynamic conditional correlation," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 1-15, January.
    10. De Santis, Giorgio & imrohoroglu, Selahattin, 1997. "Stock returns and volatility in emerging financial markets," Journal of International Money and Finance, Elsevier, vol. 16(4), pages 561-579, August.
    11. Rui Pedro Brito & Hélder Sebastião & Pedro Godinho, 2015. "Portfolio Management With Higher Moments: The Cardinality Impact," GEMF Working Papers 2015-15, GEMF, Faculty of Economics, University of Coimbra.
    12. S. M. Abdullah & Salina Siddiqua & Muhammad Shahadat Hossain Siddiquee & Nazmul Hossain, 2017. "Modeling and forecasting exchange rate volatility in Bangladesh using GARCH models: a comparison based on normal and Student’s t-error distribution," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 3(1), pages 1-19, December.
    13. Giulia Di Nunno & Kk{e}stutis Kubilius & Yuliya Mishura & Anton Yurchenko-Tytarenko, 2023. "From constant to rough: A survey of continuous volatility modeling," Papers 2309.01033, arXiv.org, revised Sep 2023.
    14. Michael McKenzie & Heather Mitchell & Robert Brooks & Robert Faff, 2001. "Power ARCH modelling of commodity futures data on the London Metal Exchange," The European Journal of Finance, Taylor & Francis Journals, vol. 7(1), pages 22-38.
    15. Kin‐Yip Ho & Zhaoyong Zhang, 2012. "Dynamic Linkages among Financial Markets in the Greater China Region: A Multivariate Asymmetric Approach," The World Economy, Wiley Blackwell, vol. 35(4), pages 500-523, April.
    16. Gürtler, Marc & Rauh, Ronald, 2012. "Challenging traditional risk models by a non-stationary approach with nonparametric heteroscedasticity," Working Papers IF41V1, Technische Universität Braunschweig, Institute of Finance.
    17. Carnero, María Ángeles & Peña, Daniel & Ruiz Ortega, Esther, 2001. "Outliers and conditional autoregressive heteroscedasticity in time series," DES - Working Papers. Statistics and Econometrics. WS ws010704, Universidad Carlos III de Madrid. Departamento de Estadística.
    18. Jorge Caiado & Nuno Crato & Pilar Poncela, 2020. "A fragmented-periodogram approach for clustering big data time series," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(1), pages 117-146, March.
    19. Geoffrey F. Loudon & Wing H. Watt & Pradeep K. Yadav, 2000. "An empirical analysis of alternative parametric ARCH models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(2), pages 117-136.
    20. Amare Wubishet Ayele & Emmanuel Gabreyohannes & Yohannes Yebabe Tesfay, 2017. "Macroeconomic Determinants of Volatility for the Gold Price in Ethiopia: The Application of GARCH and EWMA Volatility Models," Global Business Review, International Management Institute, vol. 18(2), pages 308-326, April.

    More about this item

    Keywords

    Financial economics; Time series; Clustering; Classification; Machine learning;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ise:remwps:wp01852021. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sandra Araújo (email available below). General contact details of provider: https://rem.rc.iseg.ulisboa.pt/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.