IDEAS home Printed from https://ideas.repec.org/p/arx/papers/0706.0168.html
   My bibliography  Save this paper

Kullback-Leibler distance as a measure of the information filtered from multivariate data

Author

Listed:
  • Michele Tumminello
  • Fabrizio Lillo
  • Rosario Nunzio Mantegna

Abstract

We show that the Kullback-Leibler distance is a good measure of the statistical uncertainty of correlation matrices estimated by using a finite set of data. For correlation matrices of multivariate Gaussian variables we analytically determine the expected values of the Kullback-Leibler distance of a sample correlation matrix from a reference model and we show that the expected values are known also when the specific model is unknown. We propose to make use of the Kullback-Leibler distance to estimate the information extracted from a correlation matrix by correlation filtering procedures. We also show how to use this distance to measure the stability of filtering procedures with respect to statistical uncertainty. We explain the effectiveness of our method by comparing four filtering procedures, two of them being based on spectral analysis and the other two on hierarchical clustering. We compare these techniques as applied both to simulations of factor models and empirical data. We investigate the ability of these filtering procedures in recovering the correlation matrix of models from simulations. We discuss such an ability in terms of both the heterogeneity of model parameters and the length of data series. We also show that the two spectral techniques are typically more informative about the sample correlation matrix than techniques based on hierarchical clustering, whereas the latter are more stable with respect to statistical uncertainty.

Suggested Citation

  • Michele Tumminello & Fabrizio Lillo & Rosario Nunzio Mantegna, 2007. "Kullback-Leibler distance as a measure of the information filtered from multivariate data," Papers 0706.0168, arXiv.org.
  • Handle: RePEc:arx:papers:0706.0168
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/0706.0168
    File Function: Latest version
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jovanovic, Franck & Mantegna, Rosario N. & Schinckus, Christophe, 2019. "When financial economics influences physics: The role of Econophysics," International Review of Financial Analysis, Elsevier, vol. 65(C).
    2. Huang, Wei-Qiang & Yao, Shuang & Zhuang, Xin-Tian & Yuan, Ying, 2017. "Dynamic asset trees in the US stock market: Structure variation and market phenomena," Chaos, Solitons & Fractals, Elsevier, vol. 94(C), pages 44-53.
    3. Gautier Marti & S'ebastien Andler & Frank Nielsen & Philippe Donnat, 2016. "Clustering Financial Time Series: How Long is Enough?," Papers 1603.04017, arXiv.org, revised Apr 2016.
    4. Giacomo Bormetti & Maria Elena De Giuli & Danilo Delpini & Claudia Tarantola, 2008. "Bayesian Analysis of Value-at-Risk with Product Partition Models," Papers 0809.0241, arXiv.org, revised May 2009.
    5. Sandoval, Leonidas & Franca, Italo De Paula, 2012. "Correlation of financial markets in times of crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(1), pages 187-208.
    6. Joel Bun & Jean-Philippe Bouchaud & Marc Potters, 2016. "Cleaning large correlation matrices: tools from random matrix theory," Papers 1610.08104, arXiv.org.
    7. Musmeci, Nicoló & Aste, Tomaso & Di Matteo, T., 2015. "Relation between financial market structure and the real economy: comparison between clustering methods," LSE Research Online Documents on Economics 61644, London School of Economics and Political Science, LSE Library.
    8. Wu Zhu & Jian-an Fang & Yang Tang & Wenbing Zhang & Wei Du, 2012. "Digital IIR Filters Design Using Differential Evolution Algorithm with a Controllable Probabilistic Population Size," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-9, July.
    9. Sensoy, Ahmet & Tabak, Benjamin M., 2014. "Dynamic spanning trees in stock market networks: The case of Asia-Pacific," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 414(C), pages 387-402.
    10. Contreras-Reyes, Javier E., 2014. "Asymptotic form of the Kullback–Leibler divergence for multivariate asymmetric heavy-tailed distributions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 395(C), pages 200-208.
    11. Gautier Marti & Frank Nielsen & Miko{l}aj Bi'nkowski & Philippe Donnat, 2017. "A review of two decades of correlations, hierarchies, networks and clustering in financial markets," Papers 1703.00485, arXiv.org, revised Nov 2020.
    12. Tumminello, Michele & Lillo, Fabrizio & Mantegna, Rosario N., 2010. "Correlation, hierarchies, and networks in financial markets," Journal of Economic Behavior & Organization, Elsevier, vol. 75(1), pages 40-58, July.
    13. Nicolo Musmeci & Tomaso Aste & Tiziana Di Matteo, 2014. "Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods," Papers 1406.0496, arXiv.org, revised Jan 2015.
    14. Giacomo Bormetti & Maria Elena De Giuli & Danilo Delpini & Claudia Tarantola, 2012. "Bayesian Value-at-Risk with product partition models," Quantitative Finance, Taylor & Francis Journals, vol. 12(5), pages 769-780, November.
    15. Christian Bongiorno & Marco Berritta, 2023. "Optimal Covariance Cleaning for Heavy-Tailed Distributions: Insights from Information Theory," Papers 2304.14098, arXiv.org, revised Apr 2023.
    16. Nicoló Musmeci & Tomaso Aste & T Di Matteo, 2015. "Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-24, March.
    17. Zhang, Xin & Podobnik, Boris & Kenett, Dror Y. & Eugene Stanley, H., 2014. "Systemic risk and causality dynamics of the world international shipping market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 415(C), pages 43-53.
    18. Gautier Marti & Sébastien Andler & Frank Nielsen & Philippe Donnat, 2016. "Clustering Financial Time Series: How Long is Enough?," Post-Print hal-01400395, HAL.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:0706.0168. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.