IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i7p2234-2249.html
   My bibliography  Save this article

Functional density synchronization

Author

Listed:
  • Zhang, Zhen
  • Müller, Hans-Georg

Abstract

Estimating an overall density function from repeated observations on each of a sample of independent subjects or experimental units is of interest. An example is provided by biodemographic studies, where one observes age-at-death for several cohorts of flies. Cohorts are kept in separate cages, which form the experimental units. Time variation then is likely to exist between the cohort densities and hazard rates due to cage effects on aging. Given the densities of age-at-death for the individual cohorts, one aims to obtain an estimate for the underlying overall density and hazard rate. In microarray gene expression experiments, similar problems arise when addressing the need for normalization of probe-level data from different arrays. Conventional methods, such as the cross-sectional average density, ignore time variation and hence are often not representative for such data. We view densities as functional data and model individual densities as warped versions of an underlying overall density, where the observed densities are assumed to be realizations of an underlying stochastic process. Quantile-synchronized distribution functions are obtained from an inverse warping mapping, based on quantile synchronization, leading to quantile-synchronized density and hazard functions. Kernel type smoothing methods with plug-in bandwidth selection can be used for estimating the components of the model. Asymptotic properties of the synchronized density estimates are derived. Simulation results show that functional density synchronization is often advantageous when compared to conventional density averaging or simple time-shift warping. Our approach complements previous quantile normalization methods used for microarray expression data and is illustrated with both longevity data obtained for 54 cohorts of mexflies (Mexican fruit flies) and gene expression data of the Ts1Cje mouse study for Down syndrome.

Suggested Citation

  • Zhang, Zhen & Müller, Hans-Georg, 2011. "Functional density synchronization," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2234-2249, July.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:7:p:2234-2249
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(11)00009-0
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Daniel Gervini & Theo Gasser, 2004. "Self‐modelling warping functions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(4), pages 959-971, November.
    2. Pedro Delicado, 2007. "Functional k-sample problem when data are density functions," Computational Statistics, Springer, vol. 22(3), pages 391-410, September.
    3. Nerini, David & Ghattas, Badih, 2007. "Classifying densities using functional regression trees: Applications in oceanology," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4984-4993, June.
    4. Daniel Gervini & Theo Gasser, 2005. "Nonparametric maximum likelihood estimation of the structural mean of a sample of curves," Biometrika, Biometrika Trust, vol. 92(4), pages 801-820, December.
    5. C. Mante & J. P. Durbec & J. C. Dauvin, 2005. "A functional data-analytic approach to the classification of species according to their spatial dispersion. Application to a marine macrobenthic community from the Bay of Morlaix (Western English Chan," Journal of Applied Statistics, Taylor & Francis Journals, vol. 32(8), pages 831-840.
    6. Xueli Liu & Hans-Georg Muller, 2004. "Functional Convex Averaging and Synchronization for Time-Warped Random Curves," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 687-699, January.
    7. Antonio Cuevas & Manuel Febrero & Ricardo Fraiman, 2007. "Robust estimation and classification for functional data via projection-based depth notions," Computational Statistics, Springer, vol. 22(3), pages 481-496, September.
    8. Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.
    9. Alejandro Quintela-Del-Río, 2008. "Hazard function given a functional variable: Non-parametric estimation under strong mixing conditions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 20(5), pages 413-430.
    10. M. Jones, 1992. "Estimating densities, quantiles, quantile densities and density quantiles," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 44(4), pages 721-727, December.
    11. Kneip A. & Utikal K. J, 2001. "Inference for Density Families Using Functional Principal Component Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 519-542, June.
    12. Chiou, Jeng-Min & Müller, Hans-Georg, 2009. "Modeling Hazard Rates as Functional Data for the Analysis of Cohort Lifetables and Mortality Forecasting," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 572-585.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    2. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    3. Alonso, Andrés M. & Casado, David & Romo, Juan, 2012. "Supervised classification for functional data: A weighted distance approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2334-2346.
    4. Arribas-Gil, Ana & Müller, Hans-Georg, 2014. "Pairwise dynamic time warping for event data," Computational Statistics & Data Analysis, Elsevier, vol. 69(C), pages 255-268.
    5. Maria Ruiz-Medina & Rosa Espejo & Elvira Romano, 2014. "Spatial functional normal mixed effect approach for curve classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 257-285, September.
    6. Petersen, Alexander & Zhang, Chao & Kokoszka, Piotr, 2022. "Modeling Probability Density Functions as Data Objects," Econometrics and Statistics, Elsevier, vol. 21(C), pages 159-178.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hron, K. & Menafoglio, A. & Templ, M. & Hrůzová, K. & Filzmoser, P., 2016. "Simplicial principal component analysis for density functions in Bayes spaces," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 330-350.
    2. Fang Yao & Yichao Wu & Jialin Zou, 2016. "Probability-enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 1-22, March.
    3. S. Barahona & P. Centella & X. Gual-Arnau & M. V. Ibáñez & A. Simó, 2020. "Supervised classification of geometrical objects by integrating currents and functional data analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 637-660, September.
    4. Fang Yao & Yichao Wu & Jialin Zou, 2016. "Probability-enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 1-22, March.
    5. Jason Cleveland & Wei Wu & Anuj Srivastava, 2016. "Norm-preserving constraint in the Fisher--Rao registration and its application in signal estimation," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(2), pages 338-359, June.
    6. Boudaoud, S. & Rix, H. & Meste, O., 2010. "Core Shape modelling of a set of curves," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 308-325, February.
    7. Bongiorno, Enea G. & Goia, Aldo, 2019. "Describing the concentration of income populations by functional principal component analysis on Lorenz curves," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 10-24.
    8. Cleveland, Jason & Zhao, Weilong & Wu, Wei, 2018. "Robust template estimation for functional data with phase variability using band depth," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 10-26.
    9. Gerda Claeskens & Bernard W. Silverman & Leen Slaets, 2010. "A multiresolution approach to time warping achieved by a Bayesian prior–posterior transfer fitting strategy," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(5), pages 673-694, November.
    10. Kokoszka, Piotr & Miao, Hong & Petersen, Alexander & Shang, Han Lin, 2019. "Forecasting of density functions with an application to cross-sectional and intraday returns," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1304-1317.
    11. Liu, Xueli & Yang, Mark C.K., 2009. "Simultaneous curve registration and clustering for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1361-1376, February.
    12. Arribas-Gil, Ana & Müller, Hans-Georg, 2014. "Pairwise dynamic time warping for event data," Computational Statistics & Data Analysis, Elsevier, vol. 69(C), pages 255-268.
    13. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    14. Fabrizio Maturo & Rosanna Verde, 2023. "Supervised classification of curves via a combined use of functional data analysis and tree-based methods," Computational Statistics, Springer, vol. 38(1), pages 419-459, March.
    15. Mante, Claude & Yao, Anne-Francoise & Degiovanni, Claude, 2007. "Principal component analysis of measures, with special emphasis on grain-size curves," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4969-4983, June.
    16. Tucker, J. Derek & Wu, Wei & Srivastava, Anuj, 2013. "Generative models for functional data using phase and amplitude separation," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 50-66.
    17. A. K. S. Alshabani & I. L. Dryden & C. D. Litton & J. Richardson, 2007. "Bayesian analysis of human movement curves," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 56(4), pages 415-428, August.
    18. Delicado, P., 2011. "Dimensionality reduction when data are density functions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 401-420, January.
    19. Daniel Gervini & Patrick A. Carter, 2014. "Warped functional analysis of variance," Biometrics, The International Biometric Society, vol. 70(3), pages 526-535, September.
    20. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:7:p:2234-2249. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.