IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v66y2025i3d10.1007_s00362-025-01680-3.html
   My bibliography  Save this article

Spectral Clustering Algorithm for the Allometric Extension Model

Author

Listed:
  • Kohei Kawamoto

    (Kyushu University)

  • Yuichi Goto

    (Kyushu University)

  • Koji Tsukuda

    (Kyushu University)

Abstract

The spectral clustering algorithm is often used as a binary clustering method for unclassified data by applying the principal component analysis. When investigating the theoretical properties of the spectral clustering algorithm, existing studies have tended to invoke the assumption of conditional homoscedasticity. However, this assumption is restrictive and, in practice, often unrealistic. Therefore, in this paper, we consider the allometric extension model in which the directions of the first eigenvectors of two covariance matrices and the direction of the difference of two mean vectors coincide. We derive a non-asymptotic bound for the error probability of the spectral clustering algorithm under this allometric extension model. As a byproduct of this result, we demonstrate that the clustering method is consistent in high-dimensional settings.

Suggested Citation

  • Kohei Kawamoto & Yuichi Goto & Koji Tsukuda, 2025. "Spectral Clustering Algorithm for the Allometric Extension Model," Statistical Papers, Springer, vol. 66(3), pages 1-32, April.
  • Handle: RePEc:spr:stpapr:v:66:y:2025:i:3:d:10.1007_s00362-025-01680-3
    DOI: 10.1007/s00362-025-01680-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-025-01680-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-025-01680-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kurata, Hiroshi & Hoshino, Takahiro & Fujikoshi, Yasunori, 2008. "Allometric extension model for conditional distributions," Journal of Multivariate Analysis, Elsevier, vol. 99(9), pages 1985-1998, October.
    2. Stefania Bartoletti & Bernard D. Flury & Daan G. Nel, 1999. "Allometric Extension," Biometrics, The International Biometric Society, vol. 55(4), pages 1210-1214, December.
    3. Borysov, Petro & Hannig, Jan & Marron, J.S., 2014. "Asymptotics of hierarchical clustering for growing dimension," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 465-479.
    4. Shun Matsuura & Hiroshi Kurata, 2014. "Principal points for an allometric extension model," Statistical Papers, Springer, vol. 55(3), pages 853-870, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Egashira, Kento & Yata, Kazuyoshi & Aoshima, Makoto, 2024. "Asymptotic properties of hierarchical clustering in high-dimensional settings," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    2. Audoly, Richard & McGee, Rory & Ocampo Díaz, Sergio & Paz Pardo, Gonzalo, 2024. "The life-cycle dynamics of wealth mobility," CLEF Working Paper Series 68, Canadian Labour Economics Forum (CLEF), University of Waterloo.
    3. Gautier Marti & Frank Nielsen & Philippe Donnat & S'ebastien Andler, 2016. "On clustering financial time series: a need for distances between dependent random variables," Papers 1603.07822, arXiv.org.
    4. Long-Hao Xu & Kai-Tai Fang & Ping He, 2022. "Properties and generation of representative points of the exponential distribution," Statistical Papers, Springer, vol. 63(1), pages 197-223, February.
    5. Gautier Marti & Sébastien Andler & Frank Nielsen & Philippe Donnat, 2016. "Clustering Financial Time Series: How Long is Enough?," Post-Print hal-01400395, HAL.
    6. Santanu Chakraborty & Mrinal Kanti Roychowdhury & Josef Sifuentes, 2021. "High Precision Numerical Computation of Principal Points for Univariate Distributions," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 558-584, November.
    7. Gautier Marti & Frank Nielsen & Miko{l}aj Bi'nkowski & Philippe Donnat, 2017. "A review of two decades of correlations, hierarchies, networks and clustering in financial markets," Papers 1703.00485, arXiv.org, revised Nov 2020.
    8. Shun Matsuura & Thaddeus Tarpey, 2020. "Optimal principal points estimators of multivariate distributions of location-scale and location-scale-rotation families," Statistical Papers, Springer, vol. 61(4), pages 1629-1643, August.
    9. Shun Matsuura & Hiroshi Kurata, 2014. "Principal points for an allometric extension model," Statistical Papers, Springer, vol. 55(3), pages 853-870, August.
    10. Luis Lorenzo & Javier Arroyo, 2023. "Online risk-based portfolio allocation on subsets of crypto assets applying a prototype-based clustering algorithm," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-40, December.
    11. Patrick K. Kimes & Yufeng Liu & David Neil Hayes & James Stephen Marron, 2017. "Statistical significance for hierarchical clustering," Biometrics, The International Biometric Society, vol. 73(3), pages 811-821, September.
    12. Lin Xiao & Arash Sioofy Khoojine, 2024. "Dynamic Anomaly Detection in the Chinese Energy Market During Financial Turbulence Using Ratio Mutual Information and Crude Oil Price Movements," Energies, MDPI, vol. 17(23), pages 1-22, November.
    13. Yang, Jun & He, Ping & Fang, Kai-Tai, 2022. "Three kinds of discrete approximations of statistical multivariate distributions and their applications," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    14. Nakayama, Yugo & Yata, Kazuyoshi & Aoshima, Makoto, 2021. "Clustering by principal component analysis with Gaussian kernel in high-dimension, low-sample-size settings," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    15. Kazuyoshi Yata & Makoto Aoshima, 2020. "Geometric consistency of principal component scores for high‐dimensional mixture models and its application," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(3), pages 899-921, September.
    16. Gautier Marti & S'ebastien Andler & Frank Nielsen & Philippe Donnat, 2016. "Clustering Financial Time Series: How Long is Enough?," Papers 1603.04017, arXiv.org, revised Apr 2016.
    17. Modarres, Reza, 2022. "A high dimensional dissimilarity measure," Computational Statistics & Data Analysis, Elsevier, vol. 175(C).
    18. Gautier Marti & Philippe Very & Philippe Donnat & Frank Nielsen, 2015. "A proposal of a methodological framework with experimental guidelines to investigate clustering stability on financial time series," Papers 1509.05475, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:66:y:2025:i:3:d:10.1007_s00362-025-01680-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.