IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v70y2008i4p703-723.html
   My bibliography  Save this article

Modelling sparse generalized longitudinal observations with latent Gaussian processes

Author

Listed:
  • Peter Hall
  • Hans‐Georg Müller
  • Fang Yao

Abstract

Summary. In longitudinal data analysis one frequently encounters non‐Gaussian data that are repeatedly collected for a sample of individuals over time. The repeated observations could be binomial, Poisson or of another discrete type or could be continuous. The timings of the repeated measurements are often sparse and irregular. We introduce a latent Gaussian process model for such data, establishing a connection to functional data analysis. The functional methods proposed are non‐parametric and computationally straightforward as they do not involve a likelihood. We develop functional principal components analysis for this situation and demonstrate the prediction of individual trajectories from sparse observations. This method can handle missing data and leads to predictions of the functional principal component scores which serve as random effects in this model. These scores can then be used for further statistical analysis, such as inference, regression, discriminant analysis or clustering. We illustrate these non‐parametric methods with longitudinal data on primary biliary cirrhosis and show in simulations that they are competitive in comparisons with generalized estimating equations and generalized linear mixed models.

Suggested Citation

  • Peter Hall & Hans‐Georg Müller & Fang Yao, 2008. "Modelling sparse generalized longitudinal observations with latent Gaussian processes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 703-723, September.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:703-723
    DOI: 10.1111/j.1467-9868.2008.00656.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00656.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2008.00656.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fang Yao & Hans-Georg Müller & Andrew J. Clifford & Steven R. Dueker & Jennifer Follett & Yumei Lin & Bruce A. Buchholz & John S. Vogel, 2003. "Shrinkage Estimation for Functional Principal Component Scores with Application to the Population Kinetics of Plasma Folate," Biometrics, The International Biometric Society, vol. 59(3), pages 676-685, September.
    2. Cécile Proust & Hélène Jacqmin-Gadda & Jeremy M. G. Taylor & Julien Ganiayre & Daniel Commenges, 2006. "A Nonlinear Model with Latent Process for Cognitive Evolution Using Multivariate Longitudinal Data," Biometrics, The International Biometric Society, vol. 62(4), pages 1014-1024, December.
    3. Yao, Fang & Muller, Hans-Georg & Wang, Jane-Ling, 2005. "Functional Data Analysis for Sparse Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 577-590, June.
    4. Boente, Graciela & Fraiman, Ricardo, 2000. "Kernel-based functional principal components," Statistics & Probability Letters, Elsevier, vol. 48(4), pages 335-345, July.
    5. Patrick J. Heagerty, 1999. "Marginally Specified Logistic-Normal Models for Longitudinal Binary Data," Biometrics, The International Biometric Society, vol. 55(3), pages 688-698, September.
    6. Vandna Jowaheer, 2002. "Analysing longitudinal count data with overdispersion," Biometrika, Biometrika Trust, vol. 89(2), pages 389-399, June.
    7. Jeng‐Min Chiou & Hans‐Georg Müller, 2005. "Estimated estimating equations: semiparametric inference for clustered and longitudinal data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(4), pages 531-553, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kehui Chen & Xiaoke Zhang & Alexander Petersen & Hans-Georg Müller, 2017. "Quantifying Infinite-Dimensional Data: Functional Data Analysis in Action," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 582-604, December.
    2. Hao Ji & Hans-Georg Müller, 2017. "Optimal designs for longitudinal and functional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 859-876, June.
    3. Fang Yao & Yichao Wu & Jialin Zou, 2016. "Rejoinder on: Probability enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 52-58, March.
    4. Yifan Zhu & Chongzhi Di & Ying Qing Chen, 2019. "Clustering Functional Data with Application to Electronic Medication Adherence Monitoring in HIV Prevention Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(2), pages 238-261, July.
    5. Jia, Shengji & Zhang, Chunming & Lu, Haoran, 2022. "Covariance function versus covariance matrix estimation in efficient semi-parametric regression for longitudinal data analysis," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    6. Wei Wang, 2014. "Linear mixed function-on-function regression models," Biometrics, The International Biometric Society, vol. 70(4), pages 794-801, December.
    7. Cederbaum, Jona & Scheipl, Fabian & Greven, Sonja, 2018. "Fast symmetric additive covariance smoothing," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 25-41.
    8. Gertheiss, Jan & Goldsmith, Jeff & Staicu, Ana-Maria, 2017. "A note on modeling sparse exponential-family functional response curves," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 46-52.
    9. Qingzhi Zhong & Huazhen Lin & Yi Li, 2021. "Cluster non‐Gaussian functional data," Biometrics, The International Biometric Society, vol. 77(3), pages 852-865, September.
    10. Weishampel, Anthony & Staicu, Ana-Maria & Rand, William, 2023. "Classification of social media users with generalized functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    11. Zhong, Rou & Liu, Shishi & Li, Haocheng & Zhang, Jingxiao, 2022. "Robust functional principal component analysis for non-Gaussian longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    12. Fang, Xiaolei & Zhou, Rensheng & Gebraeel, Nagi, 2015. "An adaptive functional regression-based prognostic model for applications with missing data," Reliability Engineering and System Safety, Elsevier, vol. 133(C), pages 266-274.
    13. Mengfei Ran & Yihe Yang, 2022. "Optimal Estimation of Large Functional and Longitudinal Data by Using Functional Linear Mixed Model," Mathematics, MDPI, vol. 10(22), pages 1-28, November.
    14. Haocheng Li & John Staudenmayer & Raymond J. Carroll, 2014. "Hierarchical functional data with mixed continuous and binary measurements," Biometrics, The International Biometric Society, vol. 70(4), pages 802-811, December.
    15. Jia Chen & Degui Li & Yingcun Xia, 2015. "New Semiparametric Estimation Procedure for Functional Coefficient Longitudinal Data Models," Discussion Papers 15/17, Department of Economics, University of York.
    16. Li, Yehua & Qiu, Yumou & Xu, Yuhang, 2022. "From multivariate to functional data analysis: Fundamentals, recent developments, and emerging areas," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    17. Nicoleta Serban & Ana-Maria Staicu & Raymond J. Carroll, 2013. "Multilevel Cross-Dependent Binary Longitudinal Data," Biometrics, The International Biometric Society, vol. 69(4), pages 903-913, December.
    18. Chen, Jia & Li, Degui & Xia, Yingcun, 2019. "Estimation of a rank-reduced functional-coefficient panel data model with serial correlation," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 456-479.
    19. Kelava, Augustin & Kohler, Michael & Krzyżak, Adam & Schaffland, Tim Fabian, 2017. "Nonparametric estimation of a latent variable model," Journal of Multivariate Analysis, Elsevier, vol. 154(C), pages 112-134.
    20. Julia Wrobel & Vadim Zipunnikov & Jennifer Schrack & Jeff Goldsmith, 2019. "Registration for exponential family functional data," Biometrics, The International Biometric Society, vol. 75(1), pages 48-57, March.
    21. Jiang, Jiakun & Lin, Huazhen & Zhong, Qingzhi & Li, Yi, 2022. "Analysis of multivariate non-gaussian functional data: A semiparametric latent process approach," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    22. Hsin‐wen Chang & Ian W. McKeague, 2022. "Empirical likelihood‐based inference for functional means with application to wearable device data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1947-1968, November.
    23. Daniel Backenroth & Russell T. Shinohara & Jennifer A. Schrack & Jeff Goldsmith, 2020. "Nonnegative decomposition of functional count data," Biometrics, The International Biometric Society, vol. 76(4), pages 1273-1284, December.
    24. Jeff Goldsmith & Vadim Zipunnikov & Jennifer Schrack, 2015. "Generalized multilevel function-on-scalar regression and principal component analysis," Biometrics, The International Biometric Society, vol. 71(2), pages 344-353, June.
    25. Fang Yao & Yichao Wu & Jialin Zou, 2016. "Rejoinder on: Probability enhanced effective dimension reduction for classifying sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(1), pages 52-58, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Park, Juhyun & Gasser, Theo & Rousson, Valentin, 2009. "Structural components in functional data," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3452-3465, July.
    2. Şentürk, Damla & Ghosh, Samiran & Nguyen, Danh V., 2014. "Exploratory time varying lagged regression: Modeling association of cognitive and functional trajectories with expected clinic visits in older adults," Computational Statistics & Data Analysis, Elsevier, vol. 73(C), pages 1-15.
    3. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    4. Poskitt, D.S. & Sengarapillai, Arivalzahan, 2013. "Description length and dimensionality reduction in functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 58(C), pages 98-113.
    5. Tomáš Rubín & Victor M. Panaretos, 2020. "Functional lagged regression with sparse noisy observations," Journal of Time Series Analysis, Wiley Blackwell, vol. 41(6), pages 858-882, November.
    6. Beran, Jan & Liu, Haiyan, 2016. "Estimation of eigenvalues, eigenvectors and scores in FDA models with dependent errors," Journal of Multivariate Analysis, Elsevier, vol. 147(C), pages 218-233.
    7. Orlando Joaqui-Barandica & Diego F. Manotas-Duque, 2023. "How do Climate and Macroeconomic Factors Affect the Profitability of the Energy Sector?," International Journal of Energy Economics and Policy, Econjournals, vol. 13(4), pages 444-454, July.
    8. Kyunghee Han & Pantelis Z Hadjipantelis & Jane-Ling Wang & Michael S Kramer & Seungmi Yang & Richard M Martin & Hans-Georg Müller, 2018. "Functional principal component analysis for identifying multivariate patterns and archetypes of growth, and their association with long-term cognitive development," PLOS ONE, Public Library of Science, vol. 13(11), pages 1-18, November.
    9. J. Goldsmith & S. Greven & C. Crainiceanu, 2013. "Corrected Confidence Bands for Functional Data Using Principal Components," Biometrics, The International Biometric Society, vol. 69(1), pages 41-51, March.
    10. Yao, Fang, 2007. "Asymptotic distributions of nonparametric regression estimators for longitudinal or functional data," Journal of Multivariate Analysis, Elsevier, vol. 98(1), pages 40-56, January.
    11. van der Linde, Angelika, 2008. "Variational Bayesian functional PCA," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 517-533, December.
    12. Cederbaum, Jona & Scheipl, Fabian & Greven, Sonja, 2018. "Fast symmetric additive covariance smoothing," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 25-41.
    13. Rha, Hyungmin & Kao, Ming-Hung & Pan, Rong, 2020. "Design optimal sampling plans for functional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 146(C).
    14. Liebl, Dominik, 2010. "Modeling hourly Electricity Spot Market Prices as non stationary functional times series," MPRA Paper 25017, University Library of Munich, Germany.
    15. Caleb Weaver & Luo Xiao & Wenbin Lu, 2023. "Functional data analysis for longitudinal data with informative observation times," Biometrics, The International Biometric Society, vol. 79(2), pages 722-733, June.
    16. Lee, Keunbaik & Joo, Yongsung, 2019. "Marginalized models for longitudinal count data," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 47-58.
    17. Chiou, Jeng-Min & Muller, Hans-Georg, 2007. "Diagnostics for functional regression via residual processes," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4849-4863, June.
    18. Müller, Hans-Georg & Sen, Rituparna & Stadtmüller, Ulrich, 2011. "Functional data analysis for volatility," Journal of Econometrics, Elsevier, vol. 165(2), pages 233-245.
    19. Jiang, Jiakun & Lin, Huazhen & Zhong, Qingzhi & Li, Yi, 2022. "Analysis of multivariate non-gaussian functional data: A semiparametric latent process approach," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    20. Jimin Ding & Jane-Ling Wang, 2008. "Modeling Longitudinal Data with Nonparametric Multiplicative Random Effects Jointly with Survival Data," Biometrics, The International Biometric Society, vol. 64(2), pages 546-556, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:703-723. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley-Blackwell Digital Licensing or Christopher F. Baum (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.