Printed from https://ideas.repec.org/a/spr/metron/v80y2022i2d10.1007_s40300-021-00227-4.html

An EM algorithm for estimating the parameters of the multivariate skew-normal distribution with censored responses

Author

Listed:
  • Christian E. Galarza

    (Escuela Superior Politécnica del Litoral, ESPOL)

  • Larissa A. Matos

    (Universidade Estadual de Campinas)

  • Victor H. Lachos

    (University of Connecticut)

Abstract

Limited or censored data are collected in many studies. This occurs for many reasons in several practical situations, such as limitations in measuring equipment or from an experimental design. Consequently, the true value is recorded only if it falls within an interval range so that the responses can be either left, interval, or right-censored. Missing values can be seen just as a particular case. Linear and nonlinear regression models are routinely used to analyze these types of data. Most of these models are based on the normality assumption for the error term. However, such analyses might not provide robust inference when the normality assumption (or symmetry) is questionable. The need for asymmetric distributions for the random errors motivates us to develop a likelihood-based inference for linear models with censored responses based on the multivariate skew-normal distribution, where the missing/censoring mechanism is assumed to be “missing at random” (MAR). The proposed EM-type algorithm for maximum likelihood estimation uses closed-form expressions at the E-step based on formulas for the mean and variance of a truncated multivariate skew-normal distribution, available in the R package MomTrunc. Three datasets with censored and/or missing observations are analyzed and discussed.

Suggested Citation

  • Christian E. Galarza & Larissa A. Matos & Victor H. Lachos, 2022. "An EM algorithm for estimating the parameters of the multivariate skew-normal distribution with censored responses," METRON, Springer;Sapienza Università di Roma, vol. 80(2), pages 231-253, August.
  • Handle: RePEc:spr:metron:v:80:y:2022:i:2:d:10.1007_s40300-021-00227-4
    DOI: 10.1007/s40300-021-00227-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40300-021-00227-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Jun Li & Daniel R. Jeske, 2009. "Maximum likelihood estimators of clock offset and skew under exponential delays," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 25(4), pages 506-507, July.
    2. Reinaldo B. Arellano-Valle & Marc G. Genton, 2010. "Multivariate extended skew-t distributions and related families," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 201-234.
    3. Cabral, Celso Rômulo Barbosa & Lachos, Víctor Hugo & Prates, Marcos O., 2012. "Multivariate mixture modeling using skew-normal independent distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 126-142, January.
    4. A. Capitanio & A. Azzalini & E. Stanghellini, 2003. "Graphical models for skew‐normal variates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 30(1), pages 129-144, March.
    5. Lin, Tsung I. & Ho, Hsiu J. & Chen, Chiang L., 2009. "Analysis of multivariate skew normal models with incomplete data," Journal of Multivariate Analysis, Elsevier, vol. 100(10), pages 2337-2351, November.
    6. Jun Li & Daniel R. Jeske, 2009. "Maximum likelihood estimators of clock offset and skew under exponential delays," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 25(4), pages 445-459, July.
    7. A. Azzalini & A. Capitanio, 1999. "Statistical applications of the multivariate skew normal distribution," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 579-602.
    8. Christian E. Galarza & Tsung-I Lin & Wan-Lun Wang & Víctor H. Lachos, 2021. "On moments of folded and truncated multivariate Student-t distributions based on recurrence relations," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(6), pages 825-850, August.

    Citations



    Cited by:

    1. Valeriano, Katherine A.L. & Galarza, Christian E. & Matos, Larissa A. & Lachos, Victor H., 2023. "Likelihood-based inference for the multivariate skew-t regression with censored or missing responses," Journal of Multivariate Analysis, Elsevier, vol. 196(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Azzalini, Adelchi, 2022. "An overview on the progeny of the skew-normal family— A personal perspective," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Francisco H. C. Alencar & Christian E. Galarza & Larissa A. Matos & Victor H. Lachos, 2022. "Finite mixture modeling of censored and missing data using the multivariate skew-normal distribution," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 521-557, September.
    3. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew t-distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    4. Valeriano, Katherine A.L. & Galarza, Christian E. & Matos, Larissa A. & Lachos, Victor H., 2023. "Likelihood-based inference for the multivariate skew-t regression with censored or missing responses," Journal of Multivariate Analysis, Elsevier, vol. 196(C).
    5. Olcay Arslan, 2015. "Variance-mean mixture of the multivariate skew normal distribution," Statistical Papers, Springer, vol. 56(2), pages 353-378, May.
    6. Christopher J. Adcock, 2022. "Properties and Limiting Forms of the Multivariate Extended Skew-Normal and Skew-Student Distributions," Stats, MDPI, vol. 5(1), pages 1-42, March.
    7. Katherine Elizabeth Castellano & Andrew Dean Ho, 2013. "Contrasting OLS and Quantile Regression Approaches to Student “Growth” Percentiles," Journal of Educational and Behavioral Statistics, vol. 38(2), pages 190-215, April.
    8. Reinaldo B. Arellano-Valle & Marc G. Genton, 2010. "Multivariate extended skew-t distributions and related families," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3), pages 201-234.
    9. Anna Gottard & Simona Pacillo, 2007. "On the impact of contaminations in graphical Gaussian models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 15(3), pages 343-354, February.
    10. M. Teimourian & T. Baghfalaki & M. Ganjali & D. Berridge, 2015. "Joint modeling of mixed skewed continuous and ordinal longitudinal responses: a Bayesian approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(10), pages 2233-2256, October.
    11. David Mayston, 2015. "Analysing the effectiveness of public service producers with endogenous resourcing," Journal of Productivity Analysis, Springer, vol. 44(1), pages 115-126, August.
    12. Zinoviy Landsman & Udi Makov & Tomer Shushi, 2017. "Extended Generalized Skew-Elliptical Distributions and their Moments," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 79(1), pages 76-100, February.
    13. Zareifard, Hamid & Rue, Håvard & Khaledi, Majid Jafari & Lindgren, Finn, 2016. "A skew Gaussian decomposable graphical model," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 58-72.
    14. C. J. Adcock, 2023. "The Linear Skew-t Distribution and Its Properties," Stats, MDPI, vol. 6(1), pages 1-30, February.
    15. Libin Jin & Sung Nok Chiu & Jianhua Zhao & Lixing Zhu, 2023. "A constrained maximum likelihood estimation for skew normal mixtures," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 86(4), pages 391-419, May.
    16. Cheng, Qixiu & Lin, Yuqian & Zhou, Xuesong (Simon) & Liu, Zhiyuan, 2024. "Analytical formulation for explaining the variations in traffic states: A fundamental diagram modeling perspective with stochastic parameters," European Journal of Operational Research, Elsevier, vol. 312(1), pages 182-197.
    17. Cabral, Celso Rômulo Barbosa & Lachos, Víctor Hugo & Zeller, Camila Borelli, 2014. "Multivariate measurement error models using finite mixtures of skew-Student t distributions," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 179-198.
    18. Antonio Canale & Euloge Clovis Kenne Pagui & Bruno Scarpa, 2016. "Bayesian modeling of university first-year students' grades after placement test," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(16), pages 3015-3029, December.
    19. Timothy Opheim & Anuradha Roy, 2021. "Linear models for multivariate repeated measures data with block exchangeable covariance structure," Computational Statistics, Springer, vol. 36(3), pages 1931-1963, September.
