IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v78y2022i3p1067-1079.html
   My bibliography  Save this article

Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction

Author

Listed:
  • Kai Deng
  • Xin Zhang

Abstract

In the form of multidimensional arrays, tensor data have become increasingly prevalent in modern scientific studies and biomedical applications such as computational biology, brain imaging analysis, and process monitoring system. These data are intrinsically heterogeneous with complex dependencies and structure. Therefore, ad‐hoc dimension reduction methods on tensor data may lack statistical efficiency and can obscure essential findings. Model‐based clustering is a cornerstone of multivariate statistics and unsupervised learning; however, existing methods and algorithms are not designed for tensor‐variate samples. In this article, we propose a tensor envelope mixture model (TEMM) for simultaneous clustering and multiway dimension reduction of tensor data. TEMM incorporates tensor‐structure‐preserving dimension reduction into mixture modeling and drastically reduces the number of free parameters and estimative variability. An expectation‐maximization‐type algorithm is developed to obtain likelihood‐based estimators of the cluster means and covariances, which are jointly parameterized and constrained onto a series of lower dimensional subspaces known as the tensor envelopes. We demonstrate the encouraging empirical performance of the proposed method in extensive simulation studies and a real data application in comparison with existing vector and tensor clustering methods.

Suggested Citation

  • Kai Deng & Xin Zhang, 2022. "Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction," Biometrics, The International Biometric Society, vol. 78(3), pages 1067-1079, September.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:1067-1079
    DOI: 10.1111/biom.13486
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13486
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13486?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. R. Dennis Cook & Xin Zhang, 2015. "Foundations for Envelope Models and Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 599-611, June.
    2. Yuqing Pan & Qing Mai & Xin Zhang, 2019. "Covariate-Adjusted Tensor Classification in High Dimensions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(527), pages 1305-1319, July.
    3. David J. Lockhart & Elizabeth A. Winzeler, 2000. "Genomics, gene expression and DNA arrays," Nature, Nature, vol. 405(6788), pages 827-836, June.
    4. Sergio E Baranzini & Parvin Mousavi & Jordi Rio & Stacy J Caillier & Althea Stillman & Pablo Villoslada & Matthew M Wyatt & Manuel Comabella & Larry D Greller & Roland Somogyi & Xavier Montalban & Jor, 2004. "Transcription-Based Prediction of Response to IFNβ Using Supervised Computational Methods," PLOS Biology, Public Library of Science, vol. 3(1), pages 1-1, December.
    5. Lexin Li & Xin Zhang, 2017. "Parsimonious Tensor Response Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1131-1146, July.
    6. Hua Zhou & Lexin Li & Hongtu Zhu, 2013. "Tensor Regression with Applications in Neuroimaging Data Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(502), pages 540-552, June.
    7. Will Wei Sun & Lexin Li, 2019. "Dynamic Tensor Clustering," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1894-1907, October.
    8. Yao, Weixin & Lindsay, Bruce G., 2009. "Bayesian Mixture Labeling by Highest Posterior Density," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 758-767.
    9. Witten, Daniela M. & Tibshirani, Robert, 2010. "A Framework for Feature Selection in Clustering," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 713-726.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lan Liu & Wei Li & Zhihua Su & Dennis Cook & Luca Vizioli & Essa Yacoub, 2022. "Efficient estimation via envelope chain in magnetic resonance imaging‐based studies," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(2), pages 481-501, June.
    2. Inkoo Lee & Debajyoti Sinha & Qing Mai & Xin Zhang & Dipankar Bandyopadhyay, 2023. "Bayesian regression analysis of skewed tensor responses," Biometrics, The International Biometric Society, vol. 79(3), pages 1814-1825, September.
    3. Zengchao Xu & Shan Luo & Zehua Chen, 2023. "A Portmanteau Local Feature Discrimination Approach to the Classification with High-dimensional Matrix-variate Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 441-467, February.
    4. Minji Lee & Zhihua Su, 2020. "A Review of Envelope Models," International Statistical Review, International Statistical Institute, vol. 88(3), pages 658-676, December.
    5. Yue Zhao & Ingrid Van Keilegom & Shanshan Ding, 2022. "Envelopes for censored quantile regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1562-1585, December.
    6. Giuseppe Brandi & T. Di Matteo, 2020. "A new multilayer network construction via Tensor learning," Papers 2004.05367, arXiv.org.
    7. Daniel Spencer & Rajarshi Guhaniyogi & Raquel Prado, 2020. "Joint Bayesian Estimation of Voxel Activation and Inter-regional Connectivity in fMRI Experiments," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 845-869, December.
    8. Monica Billio & Roberto Casarin & Matteo Iacopini & Sylvia Kaufmann, 2023. "Bayesian Dynamic Tensor Regression," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(2), pages 429-439, April.
    9. May, Paul & Biesecker, Matthew & Rekabdarkolaee, Hossein Moradi, 2022. "Response envelopes for linear coregionalization models," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    10. Bo Wei & Limin Peng & Ying Guo & Amita Manatunga & Jennifer Stevens, 2023. "Tensor response quantile regression with neuroimaging data," Biometrics, The International Biometric Society, vol. 79(3), pages 1947-1958, September.
    11. Ghannam, Mai & Nkurunziza, Sévérien, 2023. "Tensor Stein-rules in a generalized tensor regression model," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    12. Yeonhee Park & Zhihua Su & Hongtu Zhu, 2017. "Groupwise envelope models for imaging genetic analysis," Biometrics, The International Biometric Society, vol. 73(4), pages 1243-1253, December.
    13. Lin Liu, 2021. "Matrix‐based introduction to multivariate data analysis, by KoheiAdachi 2nd edition. Singapore: Springer Nature, 2020. pp. 457," Biometrics, The International Biometric Society, vol. 77(4), pages 1498-1500, December.
    14. Cui Guo & Jian Kang & Timothy D. Johnson, 2022. "A spatial Bayesian latent factor model for image‐on‐image regression," Biometrics, The International Biometric Society, vol. 78(1), pages 72-84, March.
    15. Daeyoung Kim & Bruce Lindsay, 2015. "Empirical identifiability in finite mixture models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(4), pages 745-772, August.
    16. Yao, Weixin & Wei, Yan & Yu, Chun, 2014. "Robust mixture regression using the t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 116-127.
    17. Yaeji Lim & Hee-Seok Oh & Ying Kuen Cheung, 2019. "Multiscale Clustering for Functional Data," Journal of Classification, Springer;The Classification Society, vol. 36(2), pages 368-391, July.
    18. Yujia Li & Xiangrui Zeng & Chien‐Wei Lin & George C. Tseng, 2022. "Simultaneous estimation of cluster number and feature sparsity in high‐dimensional cluster analysis," Biometrics, The International Biometric Society, vol. 78(2), pages 574-585, June.
    19. Dong Liu & Changwei Zhao & Yong He & Lei Liu & Ying Guo & Xinsheng Zhang, 2023. "Simultaneous cluster structure learning and estimation of heterogeneous graphs for matrix‐variate fMRI data," Biometrics, The International Biometric Society, vol. 79(3), pages 2246-2259, September.
    20. Jeffrey Andrews & Paul McNicholas, 2014. "Variable Selection for Clustering and Classification," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 136-153, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:1067-1079. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.