IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i23p3074-d691113.html
   My bibliography  Save this article

Categorical Functional Data Analysis. The cfda R Package

Author

Listed:
  • Cristian Preda

    (UMR CNRS 8524—Laboratoire Paul Painlevé, University of Lille, 59000 Lille, France
    Institute of Statistics and Applied Mathematics of the Romanian Academy, 050711 Bucharest, Romania
    Inria Lille Nord-Europe, MODAL, 59655 Villeneuve d’Ascq, France)

  • Quentin Grimonprez

    (DiagRAMS Technologies, 59000 Lille, France)

  • Vincent Vandewalle

    (Inria Lille Nord-Europe, MODAL, 59655 Villeneuve d’Ascq, France
    Biostatistics Department, University of Lille, CHU Lille—ULR 2694 METRICS, 59000 Lille, France)

Abstract

Categorical functional data represented by paths of a stochastic jump process with continuous time and a finite set of states are considered. As an extension of the multiple correspondence analysis to an infinite set of variables, optimal encodings of states over time are approximated using an arbitrary finite basis of functions. This allows dimension reduction, optimal representation, and visualisation of data in lower dimensional spaces. The methodology is implemented in the cfda R package and is illustrated using a real data set in the clustering framework.

Suggested Citation

  • Cristian Preda & Quentin Grimonprez & Vincent Vandewalle, 2021. "Categorical Functional Data Analysis. The cfda R Package," Mathematics, MDPI, vol. 9(23), pages 1-31, November.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:23:p:3074-:d:691113
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/23/3074/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/23/3074/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Nath, Ravinder & Pavur, Robert, 1985. "A new statistic in the one-way multivariate analysis of variance," Computational Statistics & Data Analysis, Elsevier, vol. 2(4), pages 297-315, February.
    2. Dai, Wenlin & Mrkvička, Tomáš & Sun, Ying & Genton, Marc G., 2020. "Functional outlier detection and taxonomy by sequential transformations," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    3. Deville, J. -C. & Saporta, G., 1983. "Correspondence analysis, with an extension towards nominal time series," Journal of Econometrics, Elsevier, vol. 22(1-2), pages 169-189.
    4. Jackson, Christopher, 2011. "Multi-State Models for Panel Data: The msm Package for R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 38(i08).
    5. Scholz, Michael, 2016. "R Package clickstream: Analyzing Clickstream Data with Markov Chains," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i04).
    6. Hervé Cardot & Guillaume Lecuelle & Pascal Schlich & Michel Visalli, 2019. "Estimating finite mixtures of semi‐Markov chains: an application to the segmentation of temporal sensory data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(5), pages 1281-1303, November.
    7. Melnykov, Volodymyr, 2016. "ClickClust: An R Package for Model-Based Clustering of Categorical Sequences," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i09).
    8. Byron J. Idrovo-Aguirre & Francisco J. Lozano & Javier E. Contreras-Reyes, 2021. "Prosperity or Real Estate Bubble? Exuberance Probability Index of Real Housing Prices in Chile," IJFS, MDPI, vol. 9(3), pages 1-24, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qiu, Qinjing & Kawai, Reiichiro, 2022. "A decoupling principle for Markov-modulated chains," Statistics & Probability Letters, Elsevier, vol. 182(C).
    2. Jackson, Christopher, 2016. "flexsurv: A Platform for Parametric Survival Modeling in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(i08).
    3. Vernon T. Farewell & Li Su & Christopher Jackson, 2019. "Partially hidden multi-state modelling of a prolonged disease state defined by a composite outcome," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(4), pages 696-711, October.
    4. repec:jss:jstsof:40:i04 is not listed on IDEAS
    5. Gaffney, Edward & McCann, Fergal, 2019. "The cyclicality in SICR: mortgage modelling under IFRS 9," ESRB Working Paper Series 92, European Systemic Risk Board.
    6. Cindy Frascolla & Guillaume Lecuelle & Pascal Schlich & Hervé Cardot, 2022. "Two sample tests for Semi-Markov processes with parametric sojourn time distributions: an application in sensory analysis," Computational Statistics, Springer, vol. 37(5), pages 2553-2580, November.
    7. Biagini, Francesca & Groll, Andreas & Widenmann, Jan, 2013. "Intensity-based premium evaluation for unemployment insurance products," Insurance: Mathematics and Economics, Elsevier, vol. 53(1), pages 302-316.
    8. Touraine, Célia & Gerds, Thomas A. & Joly, Pierre, 2017. "SmoothHazard: An R Package for Fitting Regression Models to Interval-Censored Observations of Illness-Death Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 79(i07).
    9. Sharples, Linda D., 2018. "The role of statistics in the era of big data: Electronic health records for healthcare research," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 105-110.
    10. Alex Bottle & Chiara Maria Ventura & Kumar Dharmarajan & Paul Aylin & Francesca Ieva & Anna Maria Paganoni, 2018. "Regional variation in hospitalisation and mortality in heart failure: comparison of England and Lombardy using multistate modelling," Health Care Management Science, Springer, vol. 21(2), pages 292-304, June.
    11. Wildhaber, Mark L. & Albers, Janice L. & Green, Nicholas S. & Moran, Edward H., 2017. "A fully-stochasticized, age-structured population model for population viability analysis of fish: Lower Missouri River endangered pallid sturgeon example," Ecological Modelling, Elsevier, vol. 359(C), pages 434-448.
    12. Gabadinho, Alexis & Ritschard, Gilbert & Müller, Nicolas S & Studer, Matthias, 2011. "Analyzing and Visualizing State Sequences in R with TraMineR," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i04).
    13. Alexandra Grand & Regina Dittrich & Brian Francis, 2015. "Markov models of dependence in longitudinal paired comparisons: an application to course design," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 99(2), pages 237-257, April.
    14. Budhi Surya, 2021. "A new class of conditional Markov jump processes with regime switching and path dependence: properties and maximum likelihood estimation," Papers 2107.07026, arXiv.org.
    15. Linda Möstel & Marius Pfeuffer & Matthias Fischer, 2020. "Statistical inference for Markov chains with applications to credit risk," Computational Statistics, Springer, vol. 35(4), pages 1659-1684, December.
    16. Alejandra Marroig, 2023. "Transitions across states with and without difficulties in performing activities of daily living and death: a longitudinal comparison of ten European countries," European Journal of Ageing, Springer, vol. 20(1), pages 1-12, December.
    17. Emily B Dennis & Byron J T Morgan & Stephen N Freeman & Martin S Ridout & Tom M Brereton & Richard Fox & Gary D Powney & David B Roy, 2017. "Efficient occupancy model-fitting for extensive citizen-science data," PLOS ONE, Public Library of Science, vol. 12(3), pages 1-17, March.
    18. Lozano Navarro, Francisco-Javier & Idrovo Aguirre, Byron, 2023. "Shocks regulatorios al mercado inmobiliario de Chile: ¿Cuánto del IVA a la vivienda se transfiere a precio de venta? [Regulatory shocks to housing market: How much of the Chilean VAT on housing sal," MPRA Paper 120017, University Library of Munich, Germany.
    19. Tsiropoulos, Vasilis, 2018. "A Vulnerability Analysis for Mortgaged Irish Households," Financial Stability Notes 02-18, Central Bank of Ireland.
    20. Todorov, Valentin & Filzmoser, Peter, 2010. "Robust statistic for the one-way MANOVA," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 37-48, January.
    21. Sarrias, Mauricio, 2021. "A two recursive equation model to correct for endogeneity in latent class binary probit models," Journal of choice modelling, Elsevier, vol. 40(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:23:p:3074-:d:691113. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.