IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v167y2018icp31-48.html
   My bibliography  Save this article

An expectation–maximization algorithm for the matrix normal distribution with an application in remote sensing

Author

Listed:
  • Glanz, Hunter
  • Carvalho, Luis

Abstract

Dramatic increases in the size and dimensionality of many modern datasets make crucial the need for sophisticated methods that can exploit inherent structure and handle missing values. In this article we derive an expectation–maximization (EM) algorithm for the matrix normal distribution, a distribution well-suited for naturally structured data such as spatio-temporal data. We review previously established maximum likelihood matrix normal estimates, and then consider the situation involving missing data. We apply our EM method in a simulation study exploring errors across different dimensions and proportions of missing data. We compare these errors to those from three alternative methods and show that our proposed EM method outperforms them in all scenarios. Finally, we implement the proposed EM method in a novel way on a satellite image dataset to investigate land-cover classification separability.

Suggested Citation

  • Glanz, Hunter & Carvalho, Luis, 2018. "An expectation–maximization algorithm for the matrix normal distribution with an application in remote sensing," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 31-48.
  • Handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:31-48
    DOI: 10.1016/j.jmva.2018.03.010
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X17300477
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.03.010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dayanand Naik & Shantha Rao, 2001. "Analysis of multivariate repeated measures data with a Kronecker product structured covariance matrix," Journal of Applied Statistics, Taylor & Francis Journals, vol. 28(1), pages 91-105.
    2. Roś, Beata & Bijma, Fetsje & de Munck, Jan C. & de Gunst, Mathisca C.M., 2016. "Existence and uniqueness of the maximum likelihood estimator for models with a Kronecker product covariance structure," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 345-361.
    3. Joseph G. Ibrahim & Ming-Hui Chen & Stuart R. Lipsitz & Amy H. Herring, 2005. "Missing-Data Methods for Generalized Linear Models: A Comparative Review," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 332-346, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kenneth Lange & Hua Zhou, 2022. "A Legacy of EM Algorithms," International Statistical Review, International Statistical Institute, vol. 90(S1), pages 52-66, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 803-825, June.
    2. Li Cai & Lijie Gu & Qihua Wang & Suojin Wang, 2021. "Simultaneous confidence bands for nonparametric regression with missing covariate data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(6), pages 1249-1279, December.
    3. McDonough, Ian K. & Millimet, Daniel L., 2017. "Missing data, imputation, and endogeneity," Journal of Econometrics, Elsevier, vol. 199(2), pages 141-155.
    4. J. Andrew Royle, 2009. "Analysis of Capture–Recapture Models with Individual Covariates Using Data Augmentation," Biometrics, The International Biometric Society, vol. 65(1), pages 267-274, March.
    5. Jiang, Depeng & Zhao, Puying & Tang, Niansheng, 2016. "A propensity score adjustment method for regression models with nonignorable missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 98-119.
    6. J. F. Lawless, 2018. "Two-phase outcome-dependent studies for failure times and testing for effects of expensive covariates," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(1), pages 28-44, January.
    7. el Bouhaddani, Said & Uh, Hae-Won & Hayward, Caroline & Jongbloed, Geurt & Houwing-Duistermaat, Jeanine, 2018. "Probabilistic partial least squares model: Identifiability, estimation and application," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 331-346.
    8. Hui Yao & Sungduk Kim & Ming-Hui Chen & Joseph G. Ibrahim & Arvind K. Shah & Jianxin Lin, 2015. "Bayesian Inference for Multivariate Meta-Regression With a Partially Observed Within-Study Sample Covariance Matrix," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 528-544, June.
    9. Yi Qian & Hui Xie, 2011. "No Customer Left Behind: A Distribution-Free Bayesian Approach to Accounting for Missing Xs in Marketing Models," Marketing Science, INFORMS, vol. 30(4), pages 717-736, July.
    10. Jiang, Wei & Josse, Julie & Lavielle, Marc, 2020. "Logistic regression with missing covariates—Parameter estimation, model selection and prediction within a joint-modeling framework," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    11. Baojiang Chen & Xiao-Hua Zhou, 2011. "Doubly Robust Estimates for Binary Longitudinal Data Analysis with Missing Response and Missing Covariates," Biometrics, The International Biometric Society, vol. 67(3), pages 830-842, September.
    12. Zhuoer Sun & Suojin Wang, 2019. "Semiparametric estimation in regression with missing covariates using single-index models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(5), pages 1201-1232, October.
    13. Yang, Ying & Kang, Jian, 2010. "Joint analysis of mixed Poisson and continuous longitudinal data with nonignorable missing values," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 193-207, January.
    14. Lei Wang, 2019. "Dimension reduction for kernel-assisted M-estimators with missing response at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(4), pages 889-910, August.
    15. Driezen, Kassandra & Adriaensen, Frank & Rondinini, Carlo & Doncaster, C. Patrick & Matthysen, Erik, 2007. "Evaluating least-cost model predictions with empirical dispersal data: A case-study using radiotracking data of hedgehogs (Erinaceus europaeus)," Ecological Modelling, Elsevier, vol. 209(2), pages 314-322.
    16. Peisong Han, 2016. "Combining Inverse Probability Weighting and Multiple Imputation to Improve Robustness of Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 246-260, March.
    17. Sean L Simpson & Lloyd J Edwards & Martin A Styner & Keith E Muller, 2014. "Kronecker Product Linear Exponent AR(1) Correlation Structures for Multivariate Repeated Measures," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-10, February.
    18. Guo, Xu & Song, Lianlian & Fang, Yun & Zhu, Lixing, 2019. "Model checking for general linear regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 1-12.
    19. Hui Peng & He Wang & Weijia Kong & Jinyan Li & Wilson Wen Bin Goh, 2024. "Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    20. Lee, Min Cherng & Mitra, Robin, 2016. "Multiply imputing missing values in data sets with mixed measurement scales using a sequence of generalised linear models," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 24-38.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:31-48. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.