IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v94y2016icp330-350.html
   My bibliography  Save this article

Simplicial principal component analysis for density functions in Bayes spaces

Author

Listed:
  • Hron, K.
  • Menafoglio, A.
  • Templ, M.
  • Hrůzová, K.
  • Filzmoser, P.

Abstract

Probability density functions are frequently used to characterize the distributional properties of large-scale database systems. As functional compositions, densities primarily carry relative information. As such, standard methods of functional data analysis (FDA) are not appropriate for their statistical processing. The specific features of density functions are accounted for in Bayes spaces, which result from the generalization to the infinite dimensional setting of the Aitchison geometry for compositional data. The aim is to build up a concise methodology for functional principal component analysis of densities. A simplicial functional principal component analysis (SFPCA) is proposed, based on the geometry of the Bayes space B2 of functional compositions. SFPCA is performed by exploiting the centred log-ratio transform, an isometric isomorphism between B2 and L2 which enables one to resort to standard FDA tools. The advantages of the proposed approach with respect to existing techniques are demonstrated using simulated data and a real-world example of population pyramids in Upper Austria.

Suggested Citation

  • Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
  • Handle: RePEc:eee:csdana:v:94:y:2016:i:c:p:330-350
    DOI: 10.1016/j.csda.2015.07.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947315001644
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2015.07.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kneip, Alois & Sickles, Robin C. & Song, Wonho, 2012. "A New Panel Data Treatment For Heterogeneity In Time Trends," Econometric Theory, Cambridge University Press, vol. 28(3), pages 590-628, June.
    2. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    3. Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.
    4. Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
    5. Pedro Delicado, 2007. "Functional k-sample problem when data are density functions," Computational Statistics, Springer, vol. 22(3), pages 391-410, September.
    6. Liebl, Dominik, 2013. "Modeling and Forecasting Electricity Spot Prices: A Functional Data Perspective," MPRA Paper 50881, University Library of Munich, Germany.
    7. Kneip A. & Utikal K. J, 2001. "Inference for Density Families Using Functional Principal Component Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 519-542, June.
    8. Nerini, David & Ghattas, Badih, 2007. "Classifying densities using functional regression trees: Applications in oceanology," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4984-4993, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Matteo Iacopini & Dominique Guégan, 2018. "Nonparametric Forecasting of Multivariate Probability Density Functions," Working Papers 2018:15, Department of Economics, University of Venice "Ca' Foscari".
    2. Won-Ki Seo, 2020. "Functional Principal Component Analysis for Cointegrated Functional Time Series," Papers 2011.12781, arXiv.org, revised Apr 2023.
    3. Genest, Christian & Hron, Karel & Nešlehová, Johanna G., 2023. "Orthogonal decomposition of multivariate densities in Bayes spaces and relation with their copula-based representation," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    4. ARATA Yoshiyuki, 2017. "A Functional Linear Regression Model in the Space of Probability Density Functions," Discussion papers 17015, Research Institute of Economy, Trade and Industry (RIETI).
    5. Tadao Hoshino, 2024. "Functional Spatial Autoregressive Models," Papers 2402.14763, arXiv.org.
    6. Pini, Alessia & Stamm, Aymeric & Vantini, Simone, 2018. "Hotelling’s T2 in separable Hilbert spaces," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 284-305.
    7. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    8. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    9. Jitka Machalová & Renáta Talská & Karel Hron & Aleš Gába, 2021. "Compositional splines for representation of density functions," Computational Statistics, Springer, vol. 36(2), pages 1031-1064, June.
    10. Marco Stefanucci & Stefano Mazzuco, 2022. "Analysing cause‐specific mortality trends using compositional functional data analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 61-83, January.
    11. Chao Zhang & Piotr Kokoszka & Alexander Petersen, 2022. "Wasserstein autoregressive models for density time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(1), pages 30-52, January.
    12. Dominique Guegan & Matteo Iacopini, 2018. "Nonparametric forecasting of multivariate probability density functions," Post-Print halshs-01821815, HAL.
    13. Seo, Won-Ki & Beare, Brendan K., 2019. "Cointegrated linear processes in Bayes Hilbert space," Statistics & Probability Letters, Elsevier, vol. 147(C), pages 90-95.
    14. Dominique Guégan & Matteo Iacopini, 2018. "Nonparameteric forecasting of multivariate probability density functions," Documents de travail du Centre d'Economie de la Sorbonne 18012, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    15. Talská, R. & Menafoglio, A. & Machalová, J. & Hron, K. & Fišerová, E., 2018. "Compositional regression with functional response," Computational Statistics & Data Analysis, Elsevier, vol. 123(C), pages 66-85.
    16. Dominique Guegan & Matteo Iacopini, 2018. "Nonparametric forecasting of multivariate probability density functions," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01821815, HAL.
    17. Karel Hron & Jitka Machalová & Alessandra Menafoglio, 2023. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation," Statistical Papers, Springer, vol. 64(5), pages 1629-1667, October.
    18. Kokoszka, Piotr & Miao, Hong & Petersen, Alexander & Shang, Han Lin, 2019. "Forecasting of density functions with an application to cross-sectional and intraday returns," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1304-1317.
    19. Łukasz Smaga & Jin‐Ting Zhang, 2020. "Linear hypothesis testing for weighted functional data with applications," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(2), pages 493-515, June.
    20. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
    2. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    3. S. Barahona & P. Centella & X. Gual-Arnau & M. V. Ibáñez & A. Simó, 2020. "Supervised classification of geometrical objects by integrating currents and functional data analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 637-660, September.
    4. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.
    5. Brenda López Cabrera & Franziska Schulz, 2017. "Forecasting Generalized Quantiles of Electricity Demand: A Functional Data Approach," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 127-136, January.
    6. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    7. Won-Ki Seo, 2020. "Functional Principal Component Analysis for Cointegrated Functional Time Series," Papers 2011.12781, arXiv.org, revised Apr 2023.
    8. Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.
    9. Santiago Gall n & Jorge Barrientos, 2021. "Forecasting the Colombian Electricity Spot Price under a Functional Approach," International Journal of Energy Economics and Policy, Econjournals, vol. 11(2), pages 67-74.
    10. van der Linde, Angelika, 2008. "Variational Bayesian functional PCA," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 517-533, December.
    11. Karel Hron & Jitka Machalová & Alessandra Menafoglio, 2023. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation," Statistical Papers, Springer, vol. 64(5), pages 1629-1667, October.
    12. Martínez-Camblor, Pablo & Corral, Norberto, 2011. "Repeated measures analysis for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3244-3256, December.
    13. ARATA Yoshiyuki, 2017. "A Functional Linear Regression Model in the Space of Probability Density Functions," Discussion papers 17015, Research Institute of Economy, Trade and Industry (RIETI).
    14. Alonso, Andrés M. & Casado, David & Romo, Juan, 2012. "Supervised classification for functional data: A weighted distance approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2334-2346.
    15. Maria Ruiz-Medina & Rosa Espejo & Elvira Romano, 2014. "Spatial functional normal mixed effect approach for curve classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 257-285, September.
    16. Kokoszka, Piotr & Miao, Hong & Petersen, Alexander & Shang, Han Lin, 2019. "Forecasting of density functions with an application to cross-sectional and intraday returns," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1304-1317.
    17. Menafoglio, Alessandra & Petris, Giovanni, 2016. "Kriging for Hilbert-space valued random fields: The operatorial point of view," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 84-94.
    18. Florian Ziel & Rick Steinert & Sven Husmann, 2015. "Forecasting day ahead electricity spot prices: The impact of the EXAA to other European electricity markets," Papers 1501.00818, arXiv.org, revised Dec 2015.
    19. Kim Huynh & David Jacho-Chávez & Robert Petrunia & Marcel Voia, 2015. "A nonparametric analysis of firm size, leverage and labour productivity distribution dynamics," Empirical Economics, Springer, vol. 48(1), pages 337-360, February.
    20. Florian Ziel & Rick Steinert & Sven Husmann, 2014. "Efficient Modeling and Forecasting of the Electricity Spot Price," Papers 1402.7027, arXiv.org, revised Oct 2014.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:94:y:2016:i:c:p:330-350. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.