IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v207y2025ics0047259x25000235.html
   My bibliography  Save this article

Graph-constrained analysis for multivariate functional data

Author

Listed:
  • Dey, Debangan
  • Banerjee, Sudipto
  • Lindquist, Martin A.
  • Datta, Abhirup

Abstract

The manuscript considers multivariate functional data analysis with a known graphical model among the functional variables representing their conditional relationships (e.g., brain region-level fMRI data with a prespecified connectivity graph among brain regions). Functional Gaussian graphical models (GGM) used for analyzing multivariate functional data customarily estimate an unknown graphical model, and cannot preserve knowledge of a given graph. We propose a method for multivariate functional analysis that exactly conforms to a given inter-variable graph. We first show the equivalence between partially separable functional GGM and graphical Gaussian processes (GP), proposed recently for constructing optimal multivariate covariance functions that retain a given graphical model. The theoretical connection helps to design a new algorithm that leverages Dempster’s covariance selection for obtaining the maximum likelihood estimate of the covariance function for multivariate functional data under graphical constraints. We also show that the finite term truncation of functional GGM basis expansion used in practice is equivalent to a low-rank graphical GP, which is known to oversmooth marginal distributions. To remedy this, we extend our algorithm to better preserve marginal distributions while respecting the graph and retaining computational scalability. The benefits of the proposed algorithms are illustrated using empirical experiments and a neuroimaging application.

Suggested Citation

  • Dey, Debangan & Banerjee, Sudipto & Lindquist, Martin A. & Datta, Abhirup, 2025. "Graph-constrained analysis for multivariate functional data," Journal of Multivariate Analysis, Elsevier, vol. 207(C).
  • Handle: RePEc:eee:jmvana:v:207:y:2025:i:c:s0047259x25000235
    DOI: 10.1016/j.jmva.2025.105428
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X25000235
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2025.105428?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tatiyana V. Apanasovich & Marc G. Genton & Ying Sun, 2012. "A Valid Matérn Class of Cross-Covariance Functions for Multivariate Random Fields With Any Number of Components," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 180-193, March.
    2. Abhirup Datta & Sudipto Banerjee & Andrew O. Finley & Alan E. Gelfand, 2016. "Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 800-812, April.
    3. Sudipto Banerjee & Alan E. Gelfand & Andrew O. Finley & Huiyan Sang, 2008. "Gaussian predictive process models for large spatial data sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 825-848, September.
    4. Richard J. Barker & William A. Link, 2013. "Bayesian Multimodel Inference by RJMCMC: A Gibbs Sampling Approach," The American Statistician, Taylor & Francis Journals, vol. 67(3), pages 150-156, August.
    5. Debangan Dey & Abhirup Datta & Sudipto Banerjee, 2022. "Graphical Gaussian process models for highly multivariate spatial data [Cross-covariance functions for multivariate random fields based on latent dimensions]," Biometrika, Biometrika Trust, vol. 109(4), pages 993-1014.
    6. Xiaoping Jin & Sudipto Banerjee & Bradley P. Carlin, 2007. "Order‐free co‐regionalized areal data models with application to multiple‐disease mapping," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 817-838, November.
    7. Xinghao Qiao & Shaojun Guo & Gareth M. James, 2019. "Functional Graphical Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(525), pages 211-222, January.
    8. Wenjing Yang & Hans‐Georg Müller & Ulrich Stadtmüller, 2011. "Functional singular component analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 303-324, June.
    9. Peter J. Green & Alun Thomas, 2013. "Sampling decomposable graphs using a Markov chain on junction trees," Biometrika, Biometrika Trust, vol. 100(1), pages 91-110.
    10. Bing Li & Eftychia Solea, 2018. "A Nonparametric Graphical Model for Functional Data With Application to Brain Networks Based on fMRI," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1637-1655, October.
    11. J Zapata & S Y Oh & A Petersen, 2022. "Partial separability and functional graphical models for multivariate Gaussian processes [Tests for separability in nonparametric covariance operators of random surfaces]," Biometrika, Biometrika Trust, vol. 109(3), pages 665-681.
    12. Choong-Wan Woo & Mathieu Roy & Jason T Buhle & Tor D Wager, 2015. "Distinct Brain Systems Mediate the Effects of Nociceptive Input and Self-Regulation on Pain," PLOS Biology, Public Library of Science, vol. 13(1), pages 1-14, January.
    13. Peng, Jie & Wang, Pei & Zhou, Nengfeng & Zhu, Ji, 2009. "Partial Correlation Estimation by Joint Sparse Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 735-746.
    14. Yao, Fang & Muller, Hans-Georg & Wang, Jane-Ling, 2005. "Functional Data Analysis for Sparse Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 577-590, June.
    15. Finley, Andrew O. & Sang, Huiyan & Banerjee, Sudipto & Gelfand, Alan E., 2009. "Improving the performance of predictive process modeling for large datasets," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2873-2884, June.
    16. Gneiting, Tilmann & Kleiber, William & Schlather, Martin, 2010. "Matérn Cross-Covariance Functions for Multivariate Random Fields," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1167-1177.
    17. Alberto Roverato, 2002. "Hyper Inverse Wishart Distribution for Non‐decomposable Graphs and its Application to Bayesian Inference for Gaussian Graphical Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 29(3), pages 391-411, September.
    18. Carlos M. Carvalho & Hélène Massam & Mike West, 2007. "Simulation of hyper-inverse Wishart distributions in graphical models," Biometrika, Biometrika Trust, vol. 94(3), pages 647-659.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    2. Kim, Kyongwon, 2022. "On principal graphical models with application to gene network," Computational Statistics & Data Analysis, Elsevier, vol. 166(C).
    3. Petersen, Alexander, 2024. "Mean and covariance estimation for discretely observed high-dimensional functional data: Rates of convergence and division of observational regimes," Journal of Multivariate Analysis, Elsevier, vol. 204(C).
    4. Isabelle Grenier & Bruno Sansó & Jessica L. Matthews, 2024. "Multivariate nearest‐neighbors Gaussian processes with random covariance matrices," Environmetrics, John Wiley & Sons, Ltd., vol. 35(3), May.
    5. Matthew J. Heaton & Abhirup Datta & Andrew O. Finley & Reinhard Furrer & Joseph Guinness & Rajarshi Guhaniyogi & Florian Gerber & Robert B. Gramacy & Dorit Hammerling & Matthias Katzfuss & Finn Lindgr, 2019. "A Case Study Competition Among Methods for Analyzing Large Spatial Data," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 24(3), pages 398-425, September.
    6. Sameh Abdulah & Yuxiao Li & Jian Cao & Hatem Ltaief & David E. Keyes & Marc G. Genton & Ying Sun, 2023. "Large‐scale environmental data science with ExaGeoStatR," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    7. Fangting Zhou & Kejun He & Kunbo Wang & Yanxun Xu & Yang Ni, 2023. "Functional Bayesian networks for discovering causality from multivariate functional data," Biometrics, The International Biometric Society, vol. 79(4), pages 3279-3293, December.
    8. Kuang‐Yao Lee & Lexin Li, 2022. "Functional structural equation model," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 600-629, April.
    9. Jialuo Liu & Tingjin Chu & Jun Zhu & Haonan Wang, 2022. "Large spatial data modeling and analysis: A Krylov subspace approach," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(3), pages 1115-1143, September.
    10. Yiping Hong & Yan Song & Sameh Abdulah & Ying Sun & Hatem Ltaief & David E. Keyes & Marc G. Genton, 2023. "The Third Competition on Spatial Statistics for Large Datasets," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(4), pages 618-635, December.
    11. Liu, Chunshan & Kowal, Daniel R. & Doss-Gollin, James & Vannucci, Marina, 2025. "Bayesian functional graphical models with change-point detection," Computational Statistics & Data Analysis, Elsevier, vol. 206(C).
    12. Zilber, Daniel & Katzfuss, Matthias, 2021. "Vecchia–Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    13. Codazzi, Laura & Colombi, Alessandro & Gianella, Matteo & Argiento, Raffaele & Paci, Lucia & Pini, Alessia, 2022. "Gaussian graphical modeling for spectrometric data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    14. Guhaniyogi, Rajarshi & Banerjee, Sudipto, 2019. "Multivariate spatial meta kriging," Statistics & Probability Letters, Elsevier, vol. 144(C), pages 3-8.
    15. Shinichiro Shirota & Andrew O. Finley & Bruce D. Cook & Sudipto Banerjee, 2023. "Conjugate sparse plus low rank models for efficient Bayesian interpolation of large spatial data," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    16. Qian Ren & Sudipto Banerjee, 2013. "Hierarchical Factor Models for Large Spatially Misaligned Data: A Low-Rank Predictive Process Approach," Biometrics, The International Biometric Society, vol. 69(1), pages 19-30, March.
    17. Guinness, Joseph, 2022. "Nonparametric spectral methods for multivariate spatial and spatial–temporal data," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    18. Moreno Bevilacqua & Alfredo Alegria & Daira Velandia & Emilio Porcu, 2016. "Composite Likelihood Inference for Multivariate Gaussian Random Fields," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(3), pages 448-469, September.
    19. Bakar, Khandoker Shuvo & Sahu, Sujit K., 2015. "spTimer: Spatio-Temporal Bayesian Modeling Using R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i15).
    20. Marchetti, Yuliya & Nguyen, Hai & Braverman, Amy & Cressie, Noel, 2018. "Spatial data compression via adaptive dispersion clustering," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 138-153.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:207:y:2025:i:c:s0047259x25000235. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.