IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v90y2015icp61-73.html
   My bibliography  Save this article

Location and scale mixtures of Gaussians with flexible tail behaviour: Properties, inference and application to multivariate clustering

Author

Listed:
  • Wraith, Darren
  • Forbes, Florence

Abstract

The family of location and scale mixtures of Gaussians has the ability to generate a number of flexible distributional forms. The family nests as particular cases several important asymmetric distributions like the Generalized Hyperbolic distribution. The Generalized Hyperbolic distribution in turn nests many other well known distributions such as the Normal Inverse Gaussian. In a multivariate setting, an extension of the standard location and scale mixture concept is proposed into a so called multiple scaled framework which has the advantage of allowing different tail and skewness behaviours in each dimension with arbitrary correlation between dimensions. Estimation of the parameters is provided via an EM algorithm and extended to cover the case of mixtures of such multiple scaled distributions for application to clustering. Assessments on simulated and real data confirm the gain in degrees of freedom and flexibility in modelling data of varying tail behaviour and directional shape.

Suggested Citation

  • Wraith, Darren & Forbes, Florence, 2015. "Location and scale mixtures of Gaussians with flexible tail behaviour: Properties, inference and application to multivariate clustering," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 61-73.
  • Handle: RePEc:eee:csdana:v:90:y:2015:i:c:p:61-73
    DOI: 10.1016/j.csda.2015.04.008
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794731500105X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2015.04.008?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ferreira, Jose T.A.S. & Steel, Mark F.J., 2007. "Model comparison of coordinate-free multivariate skewed distributions with an application to stochastic frontiers," Journal of Econometrics, Elsevier, vol. 137(2), pages 641-673, April.
    2. Benaglia, Tatiana & Chauveau, Didier & Hunter, David R. & Young, Derek S., 2009. "mixtools: An R Package for Analyzing Mixture Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i06).
    3. Young, D.S. & Hunter, D.R., 2010. "Mixtures of regressions with predictor-dependent mixing proportions," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2253-2266, October.
    4. McLachlan, Geoff & Lee, Sharon X, 2013. "EMMIXuskew: An R Package for Fitting Mixtures of Multivariate Skew t Distributions via the EM Algorithm," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 55(i12).
    5. Sharon Lee & Geoffrey McLachlan, 2013. "Rejoinder to the discussion of “Model-based clustering and classification with non-normal mixture distributions”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 473-479, November.
    6. Sharon Lee & Geoffrey McLachlan, 2013. "Model-based clustering and classification with non-normal mixture distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 427-454, November.
    7. Kotz,Samuel & Nadarajah,Saralees, 2004. "Multivariate T-Distributions and Their Applications," Cambridge Books, Cambridge University Press, number 9780521826549.
    8. David Hunter & Derek Young, 2012. "Semiparametric mixtures of regressions," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(1), pages 19-38.
    9. Branco, Márcia D. & Dey, Dipak K., 2001. "A General Class of Multivariate Skew-Elliptical Distributions," Journal of Multivariate Analysis, Elsevier, vol. 79(1), pages 99-113, October.
    10. Basso, Rodrigo M. & Lachos, Víctor H. & Cabral, Celso Rômulo Barbosa & Ghosh, Pulak, 2010. "Robust mixture modeling based on scale mixtures of skew-normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2926-2941, December.
    11. Adelchi Azzalini & Antonella Capitanio, 2003. "Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 367-389, May.
    12. Bouveyron, C. & Girard, S. & Schmid, C., 2007. "High-dimensional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 502-519, September.
    13. Vilca, Filidor & Balakrishnan, N. & Zeller, Camila Borelli, 2014. "Multivariate Skew-Normal Generalized Hyperbolic distribution and its properties," Journal of Multivariate Analysis, Elsevier, vol. 128(C), pages 73-85.
    14. Kjersti Aas & Ingrid Hobaek Haff, 2006. "The Generalized Hyperbolic Skew Student's t-Distribution," Journal of Financial Econometrics, Oxford University Press, vol. 4(2), pages 275-309.
    15. Vilca, Filidor & Balakrishnan, N. & Zeller, Camila Borelli, 2014. "A robust extension of the bivariate Birnbaum–Saunders distribution and associated inference," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 418-435.
    16. Cabral, Celso Rômulo Barbosa & Lachos, Víctor Hugo & Prates, Marcos O., 2012. "Multivariate mixture modeling using skew-normal independent distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 126-142, January.
    17. Karlis, Dimitris & Xekalaki, Evdokia, 2003. "Choosing initial values for the EM algorithm for finite mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 577-590, January.
    18. Chang, George T. & Walther, Guenther, 2007. "Clustering with mixtures of log-concave distributions," Computational Statistics & Data Analysis, Elsevier, vol. 51(12), pages 6242-6251, August.
    19. Wang, Wan-Lun, 2015. "Mixtures of common t-factor analyzers for modeling high-dimensional data with missing values," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 223-235.
    20. Karlis, Dimitris, 2002. "An EM type algorithm for maximum likelihood estimation of the normal-inverse Gaussian distribution," Statistics & Probability Letters, Elsevier, vol. 57(1), pages 43-52, March.
    21. Schmidt, Rafael & Hrycej, Tomas & Stutzle, Eric, 2006. "Multivariate distribution models with generalized hyperbolic margins," Computational Statistics & Data Analysis, Elsevier, vol. 50(8), pages 2065-2096, April.
    22. Ryan Browne & Paul McNicholas, 2014. "Estimating common principal components in high dimensions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(2), pages 217-226, June.
    23. Lin, Tsung-I, 2014. "Learning from incomplete data via parameterized t mixture models through eigenvalue decomposition," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 183-195.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mohsen Maleki & Darren Wraith & Reinaldo B. Arellano-Valle, 2019. "A flexible class of parametric distributions for Bayesian linear mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 543-564, June.
    2. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "A mixture of SDB skew-t factor analyzers," Econometrics and Statistics, Elsevier, vol. 3(C), pages 160-168.
    3. Eckhard Liebscher & Wolf-Dieter Richter, 2016. "Estimation of Star-Shaped Distributions," Risks, MDPI, vol. 4(4), pages 1-37, November.
    4. Sladana Babic & Laetitia Gelbgras & Marc Hallin & Christophe Ley, 2019. "Optimal tests for elliptical symmetry: specified and unspecified location," Working Papers ECARES 2019-26, ULB -- Universite Libre de Bruxelles.
    5. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    6. Perthame, Emeline & Forbes, Florence & Deleforge, Antoine, 2018. "Inverse regression approach to robust nonlinear high-to-low dimensional mapping," Journal of Multivariate Analysis, Elsevier, vol. 163(C), pages 1-14.
    7. Lorenzo Ricci & David Veredas, 2012. "TailCoR," Working Papers 1227, Banco de España.
      • Sla{dj}ana Babi'c & Christophe Ley & Lorenzo Ricci & David Veredas, 2020. "TailCoR," Papers 2011.14817, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Antonio Parisi & B. Liseo, 2018. "Objective Bayesian analysis for the multivariate skew-t model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 277-295, June.
    3. Derek S. Young & Xi Chen & Dilrukshi C. Hewage & Ricardo Nilo-Poyanco, 2019. "Finite mixture-of-gamma distributions: estimation, inference, and model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(4), pages 1053-1082, December.
    4. Nicola Loperfido, 2019. "Finite mixtures, projection pursuit and tensor rank: a triangulation," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 145-173, March.
    5. Morris, Katherine & Punzo, Antonio & McNicholas, Paul D. & Browne, Ryan P., 2019. "Asymmetric clusters and outliers: Mixtures of multivariate contaminated shifted asymmetric Laplace distributions," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 145-166.
    6. McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Comment on “On nomenclature, and the relative merits of two formulations of skew distributions” by A. Azzalini, R. Browne, M. Genton, and P. McNicholas," Statistics & Probability Letters, Elsevier, vol. 116(C), pages 1-5.
    7. Lin, Tsung-I & McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Extending mixtures of factor models using the restricted multivariate skew-normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 398-413.
    8. Azzalini, Adelchi, 2022. "An overview on the progeny of the skew-normal family— A personal perspective," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    9. Wan-Lun Wang & Ahad Jamalizadeh & Tsung-I Lin, 2020. "Finite mixtures of multivariate scale-shape mixtures of skew-normal distributions," Statistical Papers, Springer, vol. 61(6), pages 2643-2670, December.
    10. Sharon Lee & Geoffrey McLachlan, 2013. "Model-based clustering and classification with non-normal mixture distributions," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(4), pages 427-454, November.
    11. Tsung-I Lin & Pal Wu & Geoffrey McLachlan & Sharon Lee, 2015. "A robust factor analysis model using the restricted skew- $$t$$ t distribution," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(3), pages 510-531, September.
    12. Marcel Wollschlager & Rudi Schafer, 2015. "Impact of non-stationarity on estimating and modeling empirical copulas of daily stock returns," Papers 1506.08054, arXiv.org.
    13. Zhu, Xuwen & Melnykov, Volodymyr, 2018. "Manly transformation in finite mixture modeling," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 190-208.
    14. Panagiotelis, Anastasios & Smith, Michael, 2010. "Bayesian skew selection for multivariate models," Computational Statistics & Data Analysis, Elsevier, vol. 54(7), pages 1824-1839, July.
    15. Azzalini, Adelchi & Browne, Ryan P. & Genton, Marc G. & McNicholas, Paul D., 2016. "On nomenclature for, and the relative merits of, two formulations of skew distributions," Statistics & Probability Letters, Elsevier, vol. 110(C), pages 201-206.
    16. Manabu Asai & Michael McAleer & Jun Yu, 2006. "Multivariate Stochastic Volatility," Microeconomics Working Papers 22058, East Asian Bureau of Economic Research.
    17. Koliai, Lyes, 2016. "Extreme risk modeling: An EVT–pair-copulas approach for financial stress tests," Journal of Banking & Finance, Elsevier, vol. 70(C), pages 1-22.
    18. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 141-156.
    19. Sharon Lee & Geoffrey McLachlan, 2013. "On mixtures of skew normal and skew $$t$$ -distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 241-266, September.
    20. Mohsen Maleki & Darren Wraith & Reinaldo B. Arellano-Valle, 2019. "A flexible class of parametric distributions for Bayesian linear mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 543-564, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:90:y:2015:i:c:p:61-73. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.