IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i1p401-420.html
   My bibliography  Save this article

Dimensionality reduction when data are density functions

Author

Listed:
  • Delicado, P.

Abstract

Functional Data Analysis deals with samples where a whole function is observed for each individual. A relevant case of FDA is when the observed functions are density functions. Among the particular characteristics of density functions, the most of the fact that they are an example of infinite dimensional compositional data (parts of some whole which only carry relative information) is made. Several dimensionality reduction methods for this particular type of data are compared: functional principal components analysis with or without a previous data transformation, and multidimensional scaling for different inter-density distances, one of them taking into account the compositional nature of density functions. The emphasis is on the steps previous and posterior to the application of a particular dimensionality reduction method: care must be taken in choosing the right density function transformation and/or the appropriate distance between densities before performing dimensionality reduction; subsequently the graphical representation of dimensionality reduction results must take into account that the observed objects are density functions. The different methods are applied1 to artificial and real data (population pyramids for 223 countries in year 2000). As a global conclusion, the use of multidimensional scaling based on compositional distance is recommended.

Suggested Citation

  • Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:401-420
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00195-7
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Manteiga, Wenceslao Gonzalez & Vieu, Philippe, 2007. "Statistics for Functional Data," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4788-4792, June.
    2. Mariano Valderrama, 2007. "An overview to modelling functional data," Computational Statistics, Springer, vol. 22(3), pages 331-334, September.
    3. Kneip A. & Utikal K. J, 2001. "Inference for Density Families Using Functional Principal Component Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 519-542, June.
    4. Pedro Delicado & Mario Huerta, 2003. "Principal Curves of Oriented Points: theoretical and computational improvements," Computational Statistics, Springer, vol. 18(2), pages 293-315, July.
    5. van der Linde, Angelika, 2008. "Variational Bayesian functional PCA," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 517-533, December.
    6. Nerini, David & Ghattas, Badih, 2007. "Classifying densities using functional regression trees: Applications in oceanology," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4984-4993, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    2. Won-Ki Seo, 2020. "Functional Principal Component Analysis for Cointegrated Functional Time Series," Papers 2011.12781, arXiv.org, revised Apr 2023.
    3. Tadao Hoshino, 2024. "Functional Spatial Autoregressive Models," Papers 2402.14763, arXiv.org.
    4. Karel Hron & Jitka Machalová & Alessandra Menafoglio, 2023. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation," Statistical Papers, Springer, vol. 64(5), pages 1629-1667, October.
    5. J. Machalová & K. Hron & G.S. Monti, 2016. "Preprocessing of centred logratio transformed density functions using smoothing splines," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(8), pages 1419-1435, June.
    6. Martínez-Camblor, Pablo & Corral, Norberto, 2011. "Repeated measures analysis for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3244-3256, December.
    7. Delicado, Pedro & Vieu, Philippe, 2015. "Optimal level sets for bivariate density representation," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 1-18.
    8. Jitka Machalová & Renáta Talská & Karel Hron & Aleš Gába, 2021. "Compositional splines for representation of density functions," Computational Statistics, Springer, vol. 36(2), pages 1031-1064, June.
    9. Kokoszka, Piotr & Miao, Hong & Petersen, Alexander & Shang, Han Lin, 2019. "Forecasting of density functions with an application to cross-sectional and intraday returns," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1304-1317.
    10. Menafoglio, Alessandra & Petris, Giovanni, 2016. "Kriging for Hilbert-space valued random fields: The operatorial point of view," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 84-94.
    11. Shang, Han Lin & Haberman, Steven & Xu, Ruofan, 2022. "Multi-population modelling and forecasting life-table death counts," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 239-253.
    12. Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
    13. Hsin‐wen Chang & Ian W. McKeague, 2022. "Empirical likelihood‐based inference for functional means with application to wearable device data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1947-1968, November.
    14. Epifanio, Irene & Ventura-Campos, Noelia, 2011. "Functional data analysis in shape analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2758-2773, September.
    15. Berrendero, José R. & Cuevas, Antonio & Pateiro-López, Beatriz, 2016. "Shape classification based on interpoint distance distributions," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 237-247.
    16. Calò, Daniela G. & Montanari, Angela & Viroli, Cinzia, 2014. "A hierarchical modeling approach for clustering probability density functions," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 79-91.
    17. ARATA Yoshiyuki, 2017. "A Functional Linear Regression Model in the Space of Probability Density Functions," Discussion papers 17015, Research Institute of Economy, Trade and Industry (RIETI).
    18. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    19. S. Barahona & P. Centella & X. Gual-Arnau & M. V. Ibáñez & A. Simó, 2020. "Supervised classification of geometrical objects by integrating currents and functional data analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 637-660, September.
    20. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.
    21. Berrendero, J.R. & Justel, A. & Svarc, M., 2011. "Principal components for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2619-2634, September.
    22. Genest, Christian & Hron, Karel & Nešlehová, Johanna G., 2023. "Orthogonal decomposition of multivariate densities in Bayes spaces and relation with their copula-based representation," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    23. Seo, Won-Ki & Beare, Brendan K., 2019. "Cointegrated linear processes in Bayes Hilbert space," Statistics & Probability Letters, Elsevier, vol. 147(C), pages 90-95.
    24. Talská, R. & Menafoglio, A. & Machalová, J. & Hron, K. & Fišerová, E., 2018. "Compositional regression with functional response," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 66-85.
    25. Angela Montanari & Daniela Calò, 2013. "Model-based clustering of probability density functions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 301-319, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van der Linde, Angelika, 2008. "Variational Bayesian functional PCA," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 517-533, December.
    2. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    3. Berrendero, J.R. & Justel, A. & Svarc, M., 2011. "Principal components for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2619-2634, September.
    4. Gheriballah, Abdelkader & Laksaci, Ali & Sekkal, Soumeya, 2013. "Nonparametric M-regression for functional ergodic data," Statistics & Probability Letters, Elsevier, vol. 83(3), pages 902-908.
    5. Laurent Delsol, 2013. "No effect tests in regression on functional variable and some applications to spectrometric studies," Computational Statistics, Springer, vol. 28(4), pages 1775-1811, August.
    6. Boj, Eva & Delicado, Pedro & Fortiana, Josep, 2010. "Distance-based local linear regression for functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 429-437, February.
    7. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    8. Heungsun Hwang & Hye Suk & Yoshio Takane & Jang-Han Lee & Jooseop Lim, 2015. "Generalized Functional Extended Redundancy Analysis," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 101-125, March.
    9. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    10. Germán Aneiros-Pérez & Philippe Vieu, 2013. "Testing linearity in semi-parametric functional data analysis," Computational Statistics, Springer, vol. 28(2), pages 413-434, April.
    11. Aneiros, Germán & Horová, Ivana & Hušková, Marie & Vieu, Philippe, 2022. "On functional data analysis and related topics," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    12. Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
    13. Wafaa Benyelles & Tahar Mourid, 2012. "On a minimum distance estimate of the period in functional autoregressive processes," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(8), pages 1703-1718, February.
    14. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    15. Germán Aneiros & Philippe Vieu, 2016. "Comments on: Probability enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 27-32, March.
    16. Zhou, Jie, 2011. "Maximum likelihood ratio test for the stability of sequence of Gaussian random processes," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2114-2127, June.
    17. Martínez-Camblor, Pablo & Corral, Norberto, 2011. "Repeated measures analysis for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3244-3256, December.
    18. Lydia Kara-Zaitri & Ali Laksaci & Mustapha Rachdi & Philippe Vieu, 2017. "Uniform in bandwidth consistency for various kernel estimators involving functional data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(1), pages 85-107, January.
    19. Aguilera, Ana M. & Escabias, Manuel & Valderrama, Mariano J., 2008. "Discussion of different logistic models with functional data. Application to Systemic Lupus Erythematosus," Computational Statistics & Data Analysis, Elsevier, vol. 53(1), pages 151-163, September.
    20. Germán Aneiros & Philippe Vieu, 2016. "Comments on: Probability enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 27-32, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:401-420. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.