IDEAS home Printed from https://ideas.repec.org/a/eee/spapps/v192y2026ics0304414925002406.html

Clustering functional data sets by law

Author

Listed:
  • Galves, Antonio
  • Najman, Fernando A.
  • Svarc, Marcela
  • Vargas, Claudia D.

Abstract

We introduce a new clustering procedure for functional data analysis which can classify independent sets of functional samples by their probabilistic law, i.e. that aims to assign data sets to the same cluster if and only if the data were generated with the same underlying distribution. This method has the virtue of being non-supervised and non-parametric, allowing for exploratory investigation with few assumptions about the data. We also present rigorous finite bounds that give us the effect of the number of samples in each dataset on the classification. We also provide an objective heuristic that consistently selects the best partition in a data-driven manner. We show the performance of the method by clustering simulated datasets generated with different distributions.

Suggested Citation

  • Galves, Antonio & Najman, Fernando A. & Svarc, Marcela & Vargas, Claudia D., 2026. "Clustering functional data sets by law," Stochastic Processes and their Applications, Elsevier, vol. 192(C).
  • Handle: RePEc:eee:spapps:v:192:y:2026:i:c:s0304414925002406
    DOI: 10.1016/j.spa.2025.104796
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304414925002406
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.spa.2025.104796?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Yingqiu Zhu & Qiong Deng & Danyang Huang & Bingyi Jing & Bo Zhang, 2021. "Clustering based on Kolmogorov–Smirnov statistic with application to bank card transaction data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 558-578, June.
    2. R'emy Chicheportiche & Jean-Philippe Bouchaud, 2012. "Weighted Kolmogorov-Smirnov test: Accounting for the tails," Papers 1207.7308, arXiv.org, revised Oct 2012.
    3. Fernando A Najman & Antonio Galves & Marcela Svarc & Claudia D Vargas, 2025. "Extracting the fingerprints of sequences of random rhythmic auditory stimuli from electrophysiological data," PLOS Computational Biology, Public Library of Science, vol. 21(1), pages 1-18, January.
    4. Aline Duarte & Ricardo Fraiman & Antonio Galves & Guilherme Ost & Claudia D. Vargas, 2019. "Retrieving a Context Tree from EEG Data," Mathematics, MDPI, vol. 7(5), pages 1-13, May.
    5. Naaman, Michael, 2021. "On the tight constant in the multivariate Dvoretzky–Kiefer–Wolfowitz inequality," Statistics & Probability Letters, Elsevier, vol. 173(C).
    6. Wei, Fan & Dudley, Richard M., 2012. "Two-sample Dvoretzky–Kiefer–Wolfowitz inequalities," Statistics & Probability Letters, Elsevier, vol. 82(3), pages 636-644.
    7. Juan Antonio Cuesta-Albertos & Ricardo Fraiman & Thomas Ransford, 2007. "A Sharp Form of the Cramér–Wold Theorem," Journal of Theoretical Probability, Springer, vol. 20(2), pages 201-209, June.
    8. González–Rodríguez, Gil & Colubi, Ana & González–Manteiga, Wenceslao & Febrero–Bande, Manuel, 2024. "A consistent test of equality of distributions for Hilbert-valued random elements," Journal of Multivariate Analysis, Elsevier, vol. 202(C).
    9. Garivier, A. & Leonardi, F., 2011. "Context tree selection: A unifying view," Stochastic Processes and their Applications, Elsevier, vol. 121(11), pages 2488-2506, November.
    10. J. P. Verma, 2013. "Cluster Analysis: For Segmenting the Population," Springer Books, in: Data Analysis in Management with SPSS Software, edition 127, chapter 0, pages 317-358, Springer.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sloot Henrik, 2022. "Implementing Markovian models for extendible Marshall–Olkin distributions," Dependence Modeling, De Gruyter, vol. 10(1), pages 308-343, January.
    2. Fraiman, Ricardo & Moreno, Leonardo & Ransford, Thomas, 2023. "A Cramér–Wold theorem for elliptical distributions," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    3. Ioannis Kontoyiannis & Lambros Mertzanis & Athina Panotopoulou & Ioannis Papageorgiou & Maria Skoularidou, 2022. "Bayesian context trees: Modelling and exact inference for discrete time series," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1287-1323, September.
    4. Thao Nguyen‐Trang & Tai Vo‐Van & Ha Che‐Ngoc, 2024. "An efficient automatic clustering algorithm for probability density functions and its applications in surface material classification," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 78(1), pages 244-260, February.
    5. Ricardo Fraiman & Leonardo Moreno & Thomas Ransford, 2024. "Application of the Cramér–Wold theorem to testing for invariance under group actions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 33(2), pages 379-399, June.
    6. Aline Duarte & Ricardo Fraiman & Antonio Galves & Guilherme Ost & Claudia D. Vargas, 2019. "Retrieving a Context Tree from EEG Data," Mathematics, MDPI, vol. 7(5), pages 1-13, May.
    7. Goldman, Matt & Kaplan, David M., 2018. "Comparing distributions by multiple testing across quantiles or CDF values," Journal of Econometrics, Elsevier, vol. 206(1), pages 143-166.
    8. Ji, Lanpeng & Liu, Peng & Robert, Stephan, 2019. "Tail asymptotic behavior of the supremum of a class of chi-square processes," Statistics & Probability Letters, Elsevier, vol. 154(C), pages 1-1.
    9. Antoine Mandel & Vipin P. Veetil, 2025. "Do granular shocks generate sizeable aggregate volatility?," Journal of Evolutionary Economics, Springer, vol. 35(1), pages 71-94, January.
    10. Brzezinski, Michal, 2014. "Do wealth distributions follow power laws? Evidence from ‘rich lists’," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 406(C), pages 155-162.
    11. Cárcamo, Javier & Cuevas, Antonio & Rodríguez, Luis-Alberto, 2024. "A uniform kernel trick for high and infinite-dimensional two-sample problems," Journal of Multivariate Analysis, Elsevier, vol. 202(C).
    12. Young Shin Kim & Hyun-Gyoon Kim, 2024. "Quanto Option Pricing on a Multivariate Levy Process Model with a Generative Artificial Intelligence," Papers 2402.17919, arXiv.org, revised Mar 2024.
    13. Angelini, Daniele & Bianchi, Sergio, 2023. "Nonlinear biases in the roughness of a Fractional Stochastic Regularity Model," Chaos, Solitons & Fractals, Elsevier, vol. 172(C).
    14. Ryan Cumings-Menon, 2022. "Differentially Private Estimation via Statistical Depth," Papers 2207.12602, arXiv.org.
    15. Jiafei Huan & Li Deng & Yue Zhu & Shangguang Jiang & Fei Qi, 2024. "Short-to-Medium-Term Wind Power Forecasting through Enhanced Transformer and Improved EMD Integration," Energies, MDPI, vol. 17(10), pages 1-22, May.
    16. Nicol`o Cesa-Bianchi & Tommaso Cesari & Roberto Colomboni & Federico Fusco & Stefano Leonardi, 2021. "Bilateral Trade: A Regret Minimization Perspective," Papers 2109.12974, arXiv.org.
    17. Bernhard G. Bodmann & Martin Ehler & Manuel Gräf, 2018. "From Low- to High-Dimensional Moments Without Magic," Journal of Theoretical Probability, Springer, vol. 31(4), pages 2167-2193, December.
    18. Kylie-Anne Richards & Gareth W. Peters & William Dunsmuir, 2015. "Heavy-tailed features and dependence in limit order book volume profiles in futures markets," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 2(03), pages 1-56.
    19. Greene, Evan & Wellner, Jon A., 2016. "Finite sampling inequalities: An application to two-sample Kolmogorov–Smirnov statistics," Stochastic Processes and their Applications, Elsevier, vol. 126(12), pages 3701-3715.
    20. R'emy Chicheportiche & Jean-Philippe Bouchaud, 2013. "Some applications of first-passage ideas to finance," Papers 1306.3110, arXiv.org.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:spapps:v:192:y:2026:i:c:s0304414925002406. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/505572/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.