IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v99y2016icp204-222.html
   My bibliography  Save this article

Classification methods for Hilbert data based on surrogate density

Author

Listed:
  • Bongiorno, Enea G.
  • Goia, Aldo

Abstract

An unsupervised and a supervised classification approach for Hilbert random curves are studied. Both rest on the use of a surrogate of the probability density which is defined, in a distribution-free mixture context, from an asymptotic factorization of the small-ball probability. That surrogate density is estimated by a kernel approach from the principal components of the data. The focus is on the illustration of the classification algorithms and the computational implications, with particular attention to the tuning of the parameters involved. Some asymptotic results are sketched. Applications on simulated and real datasets show how the proposed methods work.

Suggested Citation

  • Bongiorno, Enea G. & Goia, Aldo, 2016. "Classification methods for Hilbert data based on surrogate density," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 204-222.
  • Handle: RePEc:eee:csdana:v:99:y:2016:i:c:p:204-222
    DOI: 10.1016/j.csda.2016.01.019
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947316300056
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.01.019?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard & Langrognet, Florent, 2006. "Model-based cluster and discriminant analysis with the MIXMOD software," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 587-600, November.
    2. Dabo-Niang, Sophie & Ferraty, Frederic & Vieu, Philippe, 2007. "On the using of modal curves for radar waveforms classification," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4878-4890, June.
    3. A. Delaigle & P. Hall & N. Bathia, 2012. "Componentwise classification and clustering of functional data," Biometrika, Biometrika Trust, vol. 99(2), pages 299-313.
    4. Shin, Hyejin, 2008. "An extension of Fisher's discriminant analysis for stochastic processes," Journal of Multivariate Analysis, Elsevier, vol. 99(6), pages 1191-1216, July.
    5. Th. Gasser & P. Hall & B. Presnell, 1998. "Nonparametric estimation of the mode of a distribution of random curves," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(4), pages 681-691.
    6. Jacques, Julien & Preda, Cristian, 2014. "Model-based clustering for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 92-106.
    7. Frédéric Ferraty & Nadia Kudraszow & Philippe Vieu, 2012. "Nonparametric estimation of a surrogate density function in infinite-dimensional spaces," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(2), pages 447-464.
    8. Tarn Duong & Martin L. Hazelton, 2005. "Cross‐validation Bandwidth Matrices for Multivariate Kernel Density Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 32(3), pages 485-506, September.
    9. Ferraty, F. & Vieu, P., 2003. "Curves discrimination: a nonparametric functional approach," Computational Statistics & Data Analysis, Elsevier, vol. 44(1-2), pages 161-173, October.
    10. Goia, Aldo & May, Caterina & Fusai, Gianluca, 2010. "Functional clustering and linear regression for peak load forecasting," International Journal of Forecasting, Elsevier, vol. 26(4), pages 700-711, October.
    11. Goia, Aldo, 2012. "A functional linear model for time series prediction with exogenous variables," Statistics & Probability Letters, Elsevier, vol. 82(5), pages 1005-1011.
    12. Gareth M. James & Trevor J. Hastie, 2001. "Functional linear discriminant analysis for irregularly sampled curves," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(3), pages 533-550.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Adriano Zanin Zambom & Julian A. A. Collazos & Ronaldo Dias, 2019. "Functional data clustering via hypothesis testing k-means," Computational Statistics, Springer, vol. 34(2), pages 527-549, June.
    2. Amandine Schmutz & Julien Jacques & Charles Bouveyron & Laurence Chèze & Pauline Martin, 2020. "Clustering multivariate functional data in group-specific functional subspaces," Computational Statistics, Springer, vol. 35(3), pages 1101-1131, September.
    3. Simon Nanty & Céline Helbert & Amandine Marrel & Nadia Pérot & Clémentine Prieur, 2017. "Uncertainty quantification for functional dependent random variables," Computational Statistics, Springer, vol. 32(2), pages 559-583, June.
    4. Daniel Kosiorowski & Jerzy P. Rydlewski, 2020. "Centrality-oriented causality. A study of EU agricultural subsidies and digital developement in Poland," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 30(3), pages 47-63.
    5. Aubin, Jean-Baptiste & Bongiorno, Enea G. & Goia, Aldo, 2022. "The correction term in a small-ball probability factorization for random curves," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Zhang, Yi-Chen & Sakhanenko, Lyudmila, 2019. "The naive Bayes classifier for functional data," Statistics & Probability Letters, Elsevier, vol. 152(C), pages 137-146.
    7. Kosiorowski Daniel & Jerzy P. Rydlewski, 2019. "Centrality-oriented Causality -- A Study of EU Agricultural Subsidies and Digital Developement in Poland," Papers 1908.11099, arXiv.org, revised Sep 2019.
    8. Rhoden, Imke & Weller, Daniel & Voit, Ann-Katrin, 2021. "Spatio-temporal dynamics of European innovation: An exploratory approach via multivariate functional data cluster analysis," Ruhr Economic Papers 926, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    9. Enea G. Bongiorno & Aldo Goia & Philippe Vieu, 2019. "Modeling functional data: a test procedure," Computational Statistics, Springer, vol. 34(2), pages 451-468, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olusola Samuel Makinde, 2019. "Classification rules based on distribution functions of functional depth," Statistical Papers, Springer, vol. 60(3), pages 629-640, June.
    2. Joseph, Esdras & Galeano San Miguel, Pedro & Lillo Rodríguez, Rosa Elvira, 2013. "The Mahalanobis distance for functional data with applications to classification," DES - Working Papers. Statistics and Econometrics. WS ws131312, Universidad Carlos III de Madrid. Departamento de Estadística.
    3. Zhang, Yi-Chen & Sakhanenko, Lyudmila, 2019. "The naive Bayes classifier for functional data," Statistics & Probability Letters, Elsevier, vol. 152(C), pages 137-146.
    4. Karl Mosler & Pavlo Mozharovskyi, 2017. "Fast DD-classification of functional data," Statistical Papers, Springer, vol. 58(4), pages 1055-1089, December.
    5. Aubin, Jean-Baptiste & Bongiorno, Enea G. & Goia, Aldo, 2022. "The correction term in a small-ball probability factorization for random curves," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Tengteng Xu & Riquan Zhang & Xiuzhen Zhang, 2023. "Estimation of spatial-functional based-line logit model for multivariate longitudinal data," Computational Statistics, Springer, vol. 38(1), pages 79-99, March.
    7. Maria Ruiz-Medina & Rosa Espejo & Elvira Romano, 2014. "Spatial functional normal mixed effect approach for curve classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 257-285, September.
    8. Guochang Wang & Xinyuan Song, 2018. "Functional Sufficient Dimension Reduction for Functional Data Classification," Journal of Classification, Springer;The Classification Society, vol. 35(2), pages 250-272, July.
    9. Hongxiao Zhu & Philip J. Brown & Jeffrey S. Morris, 2012. "Robust Classification of Functional and Quantitative Image Data Using Functional Mixed Models," Biometrics, The International Biometric Society, vol. 68(4), pages 1260-1268, December.
    10. Mojirsheibani, Majid & Shaw, Crystal, 2018. "Classification with incomplete functional covariates," Statistics & Probability Letters, Elsevier, vol. 139(C), pages 40-46.
    11. Carlo Sguera & Pedro Galeano & Rosa Lillo, 2014. "Spatial depth-based classification for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(4), pages 725-750, December.
    12. Alonso Fernández, Andrés Modesto & Casado, David & Romo, Juan, 2009. "Classification of functional data: a weighted distance approach," DES - Working Papers. Statistics and Econometrics. WS ws093915, Universidad Carlos III de Madrid. Departamento de Estadística.
    13. Llop, P. & Forzani, L. & Fraiman, R., 2011. "On local times, density estimation and supervised classification from functional data," Journal of Multivariate Analysis, Elsevier, vol. 102(1), pages 73-86, January.
    14. Fabrizio Maturo & Rosanna Verde, 2023. "Supervised classification of curves via a combined use of functional data analysis and tree-based methods," Computational Statistics, Springer, vol. 38(1), pages 419-459, March.
    15. Mohamed Chaouch & Salah Khardani, 2015. "Randomly censored quantile regression estimation using functional stationary ergodic data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(1), pages 65-87, March.
    16. Górecki Tomasz & Krzyśko Mirosław & Wołyński Waldemar, 2015. "Classification Problems Based on Regression Models for Multi-Dimensional Functional Data," Statistics in Transition New Series, Polish Statistical Association, vol. 16(1), pages 97-110, March.
    17. Andrés Alonso & David Casado & Sara López-Pintado & Juan Romo, 2014. "Robust Functional Supervised Classification for Time Series," Journal of Classification, Springer;The Classification Society, vol. 31(3), pages 325-350, October.
    18. Park, Yeonjoo & Simpson, Douglas G., 2019. "Robust probabilistic classification applicable to irregularly sampled functional data," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 37-49.
    19. Marco Stefanucci & Laura M. Sangalli & Pierpaolo Brutti, 2018. "PCA‐based discrimination of partially observed functional data, with an application to AneuRisk65 data set," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 72(3), pages 246-264, August.
    20. Górecki Tomasz & Krzyśko Mirosław & ński Waldemar Woły, 2019. "Variable Selection In Multivariate Functional Data Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 20(2), pages 123-138, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:99:y:2016:i:c:p:204-222. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.