Printed from https://ideas.repec.org/a/spr/jclass/v42y2025i3d10.1007_s00357-025-09503-8.html

Curve Clustering via Pairwise Directions Estimation

Author

  • Heng-Hui Lue (Tunghai University)

Abstract

This article concerns the cluster analysis of curve response data with multi-dimensional covariates. It introduces a novel clustering approach, based on dimension reduction, that groups curves with similar patterns without requiring a prespecified parametric model. The proposed method can be applied to regularly or irregularly observed curve data. Instead of being driven by cost optimization, the clustering problem is recast as an exploration of the mean functions and basis patterns in the data from a geometric viewpoint. To implement a data-driven function search, the method of pairwise directions estimation (PDE) (Lue, 2019, Journal of Statistical Computation and Simulation 89, 776-794) is applied. The benefit of using geometric information from the PDE is highlighted. The proposed method uses the squared prediction error to achieve optimal cluster membership prediction. The proposal not only achieves higher cluster quality but also enhances the interpretation of the cluster structure. Several simulation examples are conducted, and comparisons are made with nine methods. Applications to two real datasets are also presented for illustration.
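
As a rough illustration of cluster membership prediction by squared prediction error, the sketch below reassigns each curve to the cluster whose mean curve predicts it with the smallest squared error. It is not the PDE procedure, whose details are not given in this abstract. It assumes regularly observed curves on a common grid and estimates each cluster mean by a plain pointwise average; the function names `assign_by_squared_error` and `cluster_curves` are hypothetical.

```python
import numpy as np


def assign_by_squared_error(curves, means):
    """Label each curve with the index of the mean curve that gives the
    smallest squared prediction error (sum of squares over the grid)."""
    # curves: (n, T), means: (K, T) -> sse: (n, K) via broadcasting
    sse = ((curves[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
    return sse.argmin(axis=1)


def cluster_curves(curves, init_labels, n_clusters, n_iter=20):
    """Alternate between estimating cluster means and reassigning curves.

    Assumption: pointwise averages stand in for the mean-function
    estimates; the article's own estimator is not reproduced here.
    """
    labels = np.asarray(init_labels).copy()
    means = np.zeros((n_clusters, curves.shape[1]))
    for _ in range(n_iter):
        for j in range(n_clusters):
            members = curves[labels == j]
            if len(members) > 0:  # an empty cluster keeps its previous mean
                means[j] = members.mean(axis=0)
        new_labels = assign_by_squared_error(curves, means)
        if np.array_equal(new_labels, labels):
            break  # memberships are stable
        labels = new_labels
    return labels, means
```

Given an `(n, T)` array of curves and an initial labelling, `cluster_curves(curves, init_labels, n_clusters=2)` returns the final labels together with the estimated mean curves. Irregularly observed curves would first need smoothing or interpolation onto a common grid, which this sketch does not attempt.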

Suggested Citation

  • Heng-Hui Lue, 2025. "Curve Clustering via Pairwise Directions Estimation," Journal of Classification, Springer;The Classification Society, vol. 42(3), pages 565-595, November.
  • Handle: RePEc:spr:jclass:v:42:y:2025:i:3:d:10.1007_s00357-025-09503-8
    DOI: 10.1007/s00357-025-09503-8

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00357-025-09503-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    2. Zhongnan Jin & Jie Min & Yili Hong & Pang Du & Qingyu Yang, 2024. "Multivariate Functional Clustering with Variable Selection and Application to Sensor Data from Engineering Systems," INFORMS Journal on Data Science, INFORMS, vol. 3(2), pages 203-218, October.
    3. Fang, Kuangnan & Chen, Yuanxing & Ma, Shuangge & Zhang, Qingzhao, 2022. "Biclustering analysis of functionals via penalized fusion," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    4. Kim, Joonpyo & Oh, Hee-Seok, 2020. "Pseudo-quantile functional data clustering," Journal of Multivariate Analysis, Elsevier, vol. 178(C).
    5. Julien Jacques & Cristian Preda, 2014. "Functional data clustering: a survey," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 231-255, September.
    6. Zhu, Hanbing & Li, Rui & Zhang, Riquan & Lian, Heng, 2020. "Nonlinear functional canonical correlation analysis via distance covariance," Journal of Multivariate Analysis, Elsevier, vol. 180(C).
    7. Adriano Zanin Zambom & Julian A. A. Collazos & Ronaldo Dias, 2019. "Functional data clustering via hypothesis testing k-means," Computational Statistics, Springer, vol. 34(2), pages 527-549, June.
    8. Amovin-Assagba, Martial & Gannaz, Irène & Jacques, Julien, 2022. "Outlier detection in multivariate functional data through a contaminated mixture model," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    9. Yaeji Lim & Hee-Seok Oh & Ying Kuen Cheung, 2019. "Multiscale Clustering for Functional Data," Journal of Classification, Springer;The Classification Society, vol. 36(2), pages 368-391, July.
    10. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers 06/15, Institute for Fiscal Studies.
    11. Jim Q. Smith & Paul E. Anderson & Silvia Liverani, 2008. "Separation measures and the geometry of Bayes factor selection for classification," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 957-980, November.
    12. Michael Vogt & Oliver Linton, 2017. "Classification of non-parametric regression functions in longitudinal data models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 5-27, January.
    13. Susanna Levantesi & Andrea Nigri & Gabriella Piscopo & Alessandro Spelta, 2023. "Multi-country clustering-based forecasting of healthy life expectancy," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(2), pages 189-215, December.
    14. Aurore Delaigle & Peter Hall & Tung Pham, 2019. "Clustering functional data into groups by using projections," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 271-304, April.
    15. Christoph Hellmayr & Alan E. Gelfand, 2021. "A Partition Dirichlet Process Model for Functional Data Analysis," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 30-65, May.
    16. Giovanna Apicella & Emilia Di Lorenzo & Gabriella Piscopo & Marilena Sibillo, 2025. "Lee–Carter model: assessing the potential to capture gender-related mortality dynamics," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 48(2), pages 1065-1092, December.
    17. Cho, Haeran & Goude, Yannig & Brossat, Xavier & Yao, Qiwei, 2013. "Modeling and forecasting daily electricity load curves: a hybrid approach," LSE Research Online Documents on Economics 49634, London School of Economics and Political Science, LSE Library.
    18. Susana Conde & Shahin Tavakoli & Daphne Ezer, 2024. "Functional regression clustering with multiple functional gene expressions," PLOS ONE, Public Library of Science, vol. 19(11), pages 1-23, November.
    19. Ja‐Yoon Jang & Hee‐Seok Oh & Yaeji Lim & Ying Kuen Cheung, 2021. "Ensemble clustering for step data via binning," Biometrics, The International Biometric Society, vol. 77(1), pages 293-304, March.
    20. Tin Lok James Ng & Thomas Brendan Murphy, 2021. "Model-based Clustering of Count Processes," Journal of Classification, Springer;The Classification Society, vol. 38(2), pages 188-211, July.
