IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v186y2021ics0047259x21000610.html
   My bibliography  Save this article

Sequential estimation of Spearman rank correlation using Hermite series estimators

Author

Listed:
  • Stephanou, Michael
  • Varughese, Melvin

Abstract

In this article we describe a new Hermite series based sequential estimator for the Spearman rank correlation coefficient and provide algorithms applicable in both the stationary and non-stationary settings. To treat the non-stationary setting, we introduce a novel, exponentially weighted estimator for the Spearman rank correlation, which allows the local nonparametric correlation of a bivariate data stream to be tracked. To the best of our knowledge this is the first algorithm to be proposed for estimating a time varying Spearman rank correlation that does not rely on a moving window approach. We explore the practical effectiveness of the Hermite series based estimators through real data and simulation studies demonstrating good practical performance. The simulation studies in particular reveal competitive performance compared to an existing algorithm. The potential applications of this work are manifold. The Hermite series based Spearman rank correlation estimator can be applied to fast and robust online calculation of correlation which may vary over time. Possible machine learning applications include, amongst others, fast feature selection and hierarchical clustering on massive data sets.

Suggested Citation

  • Stephanou, Michael & Varughese, Melvin, 2021. "Sequential estimation of Spearman rank correlation using Hermite series estimators," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
  • Handle: RePEc:eee:jmvana:v:186:y:2021:i:c:s0047259x21000610
    DOI: 10.1016/j.jmva.2021.104783
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X21000610
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2021.104783?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Greblicki, Wlodzimierz & Pawlak, Miroslaw, 1985. "Pointwise consistency of the hermite series density estimate," Statistics & Probability Letters, Elsevier, vol. 3(2), pages 65-69, April.
    2. E. Liebscher, 1990. "Hermite series estimators for probability densities," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 37(1), pages 321-343, December.
    3. Asma Jmaei & Yousri Slaoui & Wassima Dellagi, 2017. "Recursive distribution estimator defined by stochastic approximation method using Bernstein polynomials," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(4), pages 792-805, October.
    4. Christophe Croux & Catherine Dehon, 2010. "Influence functions of the Spearman and Kendall correlation measures," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 19(4), pages 497-515, November.
    5. Philippe Pébay & Timothy B. Terriberry & Hemanth Kolla & Janine Bennett, 2016. "Numerically stable, scalable formulas for parallel and online computation of higher-order multivariate central moments with arbitrary weights," Computational Statistics, Springer, vol. 31(4), pages 1305-1325, December.
    6. Aït-Sahalia, Yacine & Fan, Jianqing & Xiu, Dacheng, 2010. "High-Frequency Covariance Estimates With Noisy and Asynchronous Financial Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1504-1517.
    7. Michael Stephanou & Melvin Varughese, 2021. "On the properties of hermite series based distribution function estimators," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 535-559, May.
    8. Greblicki, W?odzimierz & Pawlak, Miros?aw, 1984. "Hermite series estimates of a probability density and its derivatives," Journal of Multivariate Analysis, Elsevier, vol. 15(2), pages 174-182, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Stephanou & Melvin Varughese, 2021. "On the properties of hermite series based distribution function estimators," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(4), pages 535-559, May.
    2. Foster, Joshua, 2022. "Semi-nonparametric estimation of secret reserve prices in auctions," Economics Letters, Elsevier, vol. 220(C).
    3. repec:cte:wsrepe:es142416 is not listed on IDEAS
    4. Harry-Paul Vander Elst & David Veredas, 2014. "Disentangled Jump-Robust Realized Covariances and Correlations with Non-Synchronous Prices," Working Papers ECARES ECARES 2014-35, ULB -- Universite Libre de Bruxelles.
    5. Donelli, Nicola & Peluso, Stefano & Mira, Antonietta, 2021. "A Bayesian semiparametric vector Multiplicative Error Model," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    6. Katerina Papagiannouli, 2022. "A Lepskiĭ-type stopping rule for the covariance estimation of multi-dimensional Lévy processes," Statistical Inference for Stochastic Processes, Springer, vol. 25(3), pages 505-535, October.
    7. Altmeyer, Randolf & Bibinger, Markus, 2015. "Functional stable limit theorems for quasi-efficient spectral covolatility estimators," Stochastic Processes and their Applications, Elsevier, vol. 125(12), pages 4556-4600.
    8. Pablo Aragonés‐Beltrán & Mª. Carmen González‐Cruz & Astrid León‐Camargo & Rosario Viñoles‐Cebolla, 2023. "Assessment of regional development needs according to criteria based on the Sustainable Development Goals in the Meta Region (Colombia)," Sustainable Development, John Wiley & Sons, Ltd., vol. 31(2), pages 1101-1121, April.
    9. Peter Reinhard Hansen & Guillaume Horel & Asger Lunde & Ilya Archakov, 2015. "A Markov Chain Estimator of Multivariate Volatility from High Frequency Data," CREATES Research Papers 2015-19, Department of Economics and Business Economics, Aarhus University.
    10. Kim, Donggyu & Fan, Jianqing, 2019. "Factor GARCH-Itô models for high-frequency data with application to large volatility matrix prediction," Journal of Econometrics, Elsevier, vol. 208(2), pages 395-417.
    11. Zhang, Zhengjun & Zhu, Bin, 2016. "Copula structured M4 processes with application to high-frequency financial data," Journal of Econometrics, Elsevier, vol. 194(2), pages 231-241.
    12. Liao, Yin & Anderson, Heather M., 2019. "Testing for cojumps in high-frequency financial data: An approach based on first-high-low-last prices," Journal of Banking & Finance, Elsevier, vol. 99(C), pages 252-274.
    13. Carlo Campajola & Fabrizio Lillo & Daniele Tantari, 2019. "Unveiling the relation between herding and liquidity with trader lead-lag networks," Papers 1909.10807, arXiv.org, revised Mar 2020.
    14. Patrick Chang & Roger Bukuru & Tim Gebbie, 2019. "Revisiting the Epps effect using volume time averaging: An exercise in R," Papers 1912.02416, arXiv.org, revised Feb 2020.
    15. Fulvio Corsi & Stefano Peluso & Francesco Audrino, 2015. "Missing in Asynchronicity: A Kalman‐em Approach for Multivariate Realized Covariance Estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(3), pages 377-397, April.
    16. Liang Wu & Lin Guan & Feng Li & Qi Zhao & Yingjun Zhuo & Peng Chen & Yaotang Lv, 2018. "Optimal Dynamic Reactive Power Reserve for Wind Farms Addressing Short-Term Voltage Issues Caused by Wind Turbines Tripping," Energies, MDPI, vol. 11(7), pages 1-15, July.
    17. Yiqi Liu & Qiang Liu & Zhi Liu & Deng Ding, 2017. "Determining the integrated volatility via limit order books with multiple records," Quantitative Finance, Taylor & Francis Journals, vol. 17(11), pages 1697-1714, November.
    18. Umut Asan & Ayberk Soyer, 2022. "A Weighted Bonferroni-OWA Operator Based Cumulative Belief Degree Approach to Personnel Selection Based on Automated Video Interview Assessment Data," Mathematics, MDPI, vol. 10(9), pages 1-33, May.
    19. Haugom, Erik & Lien, Gudbrand & Veka, Steinar & Westgaard, Sjur, 2014. "Covariance estimation using high-frequency data: Sensitivities of estimation methods," Economic Modelling, Elsevier, vol. 43(C), pages 416-425.
    20. Michael Pfarrhofer, 2020. "Forecasts with Bayesian vector autoregressions under real time conditions," Papers 2004.04984, arXiv.org.
    21. Grønborg, Niels S. & Lunde, Asger & Olesen, Kasper V. & Vander Elst, Harry, 2022. "Realizing correlations across asset classes," Journal of Financial Markets, Elsevier, vol. 59(PA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:186:y:2021:i:c:s0047259x21000610. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.