IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v173y2019icp26-37.html
   My bibliography  Save this article

The generalized degrees of freedom of multilinear principal component analysis

Author

Listed:
  • Tu, I-Ping
  • Huang, Su-Yun
  • Hsieh, Dai-Ni

Abstract

Tensor data, such as image set, movie data, gene-environment interactions, or gene–gene interactions, have become a popular data format in many fields. Multilinear Principal Component Analysis (MPCA) has been recognized as an efficient dimension reduction method for tensor data analysis. However, a gratifying rank selection method for a general application of MPCA is not yet available. For example, both the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), arguably two of the most commonly used model selection methods, require more strict model assumptions when applying on the rank selection in MPCA. In this paper, we propose a rank selection rule for MPCA based on the minimum risk criterion and Stein’s unbiased risk estimate (SURE). We derive a neat formula while using the minimum model assumptions for MPCA. It is composed of a residual sum of squares for model fitting and a penalty on the model complexity referred as the generalized degrees of freedom (GDF). We allocate each term in the GDF to either the number of parameters used in the model or the complexity in separating the signal from the noise. Compared with AIC and BIC and their modification methods, this criterion reaches higher accuracies in a thorough simulation study. Importantly, it has potential for more general application because it makes fewer model assumptions.

Suggested Citation

  • Tu, I-Ping & Huang, Su-Yun & Hsieh, Dai-Ni, 2019. "The generalized degrees of freedom of multilinear principal component analysis," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 26-37.
  • Handle: RePEc:eee:jmvana:v:173:y:2019:i:c:p:26-37
    DOI: 10.1016/j.jmva.2019.01.010
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X1730492X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2019.01.010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hung Hung & Peishien Wu & Iping Tu & Suyun Huang, 2012. "On multilinear principal component analysis of order-two tensors," Biometrika, Biometrika Trust, vol. 99(3), pages 569-583.
    2. Chen, Yin-Ping & Huang, Hsin-Cheng & Tu, I-Ping, 2010. "A new approach for selecting the number of factors," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2990-2998, December.
    3. Magnus, J.R. & Neudecker, H., 1979. "The commutation matrix : Some properties and applications," Other publications TiSEM d0b1e779-7795-4676-ac98-1, Tilburg University, School of Economics and Management.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yuangang Li & Maohua Sun & Guanghui Yuan & Yujing Liu, 2019. "Evaluation Methods of Water Environment Safety and Their Application to the Three Northeast Provinces of China," Sustainability, MDPI, vol. 11(18), pages 1-16, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang, Shih-Hao & Huang, Su-Yun, 2021. "On the asymptotic normality and efficiency of Kronecker envelope principal component analysis," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    2. Paulo M. D. C. Parente & Richard J. Smith, 2021. "Quasi‐maximum likelihood and the kernel block bootstrap for nonlinear dynamic models," Journal of Time Series Analysis, Wiley Blackwell, vol. 42(4), pages 377-405, July.
    3. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    4. Norman R. Swanson & Weiqi Xiong, 2018. "Big data analytics in economics: What have we learned so far, and where should we go from here?," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 51(3), pages 695-746, August.
    5. D.A. Turkington, 1997. "Some results in matrix calculus and an example of their application to econometrics," Economics Discussion / Working Papers 97-07, The University of Western Australia, Department of Economics.
    6. Loperfido, Nicola, 2021. "Some theoretical properties of two kurtosis matrices, with application to invariant coordinate selection," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    7. Joshua C. C. Chan & Liana Jacobi & Dan Zhu, 2019. "How Sensitive Are VAR Forecasts to Prior Hyperparameters? An Automated Sensitivity Analysis," Advances in Econometrics, in: Topics in Identification, Limited Dependent Variables, Partial Observability, Experimentation, and Flexible Modeling: Part A, volume 40, pages 229-248, Emerald Group Publishing Limited.
    8. Hong Lan & Alexander Meyer-Gohde, 2013. "Pruning in Perturbation DSGE Models - Guidance from Nonlinear Moving Average Approximations," SFB 649 Discussion Papers SFB649DP2013-024, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    9. Lan, Hong & Meyer-Gohde, Alexander, 2012. "Existence and Uniqueness of Perturbation Solutions in DSGE Models," Dynare Working Papers 14, CEPREMAP.
    10. Haas, Markus & Mittnik, Stefan & Paolella, Marc S., 2009. "Asymmetric multivariate normal mixture GARCH," Computational Statistics & Data Analysis, Elsevier, vol. 53(6), pages 2129-2154, April.
    11. O. J. Boxma & E. J. Cahen & D. Koops & M. Mandjes, 2019. "Linear Stochastic Fluid Networks: Rare-Event Simulation and Markov Modulation," Methodology and Computing in Applied Probability, Springer, vol. 21(1), pages 125-153, March.
    12. Loperfido, Nicola, 2014. "Linear transformations to symmetry," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 186-192.
    13. Liu, Shuangzhe & Leiva, Víctor & Zhuang, Dan & Ma, Tiefeng & Figueroa-Zúñiga, Jorge I., 2022. "Matrix differential calculus with applications in the multivariate linear model and its diagnostics," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    14. Johan Lyhagen, 2012. "A note on the representation of $${E\left({\textit{\textbf {x}}}\otimes {\textit{\textbf {xx}}}^{\prime}\right) }$$ and $${E\left({\textit{\textbf {xx}}}^{\prime }\otimes {\textit{\textbf {xx}}}^{\pri," Statistical Papers, Springer, vol. 53(3), pages 697-701, August.
    15. Arismendi, J.C., 2013. "Multivariate truncated moments," Journal of Multivariate Analysis, Elsevier, vol. 117(C), pages 41-75.
    16. Dante Amengual & Gabriele Fiorentini & Enrique Sentana, 2022. "Tests for Random Coefficient Variation in Vector Autoregressive Models," Advances in Econometrics, in: Essays in Honour of Fabio Canova, volume 44, pages 1-35, Emerald Group Publishing Limited.
    17. Lan, Hong & Meyer-Gohde, Alexander, 2013. "Solving DSGE models with a nonlinear moving average," Journal of Economic Dynamics and Control, Elsevier, vol. 37(12), pages 2643-2667.
    18. Reinaldo B. Arellano-Valle & Adelchi Azzalini, 2022. "Some properties of the unified skew-normal distribution," Statistical Papers, Springer, vol. 63(2), pages 461-487, April.
    19. Jiming Jiang & Matt P. Wand & Aishwarya Bhaskaran, 2022. "Usable and precise asymptotics for generalized linear mixed model analysis and design," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 55-82, February.
    20. Symeonides Spyridon D. & Karavias Yiannis & Tzavalis Elias, 2017. "Size corrected Significance Tests in Seemingly Unrelated Regressions with Autocorrelated Errors," Journal of Time Series Econometrics, De Gruyter, vol. 9(1), pages 1-41, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:173:y:2019:i:c:p:26-37. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.