IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v70y2008i4p779-802.html
   My bibliography  Save this article

Robust estimation in the normal mixture model based on robust clustering

Author

Listed:
  • J. A. Cuesta‐Albertos
  • C. Matrán
  • A. Mayo‐Iscar

Abstract

Summary. We introduce a robust estimation procedure that is based on the choice of a representative trimmed subsample through an initial robust clustering procedure, and subsequent improvements based on maximum likelihood. To obtain the initial trimming we resort to the trimmed k‐means, a simple procedure designed for finding the core of the clusters under appropriate configurations. By handling the trimmed data as censored, maximum likelihood estimation provides in each step the location and shape of the next trimming. Data‐driven restrictions on the parameters, requiring that every distribution in the mixture must be sufficiently represented in the initial clustered region, allow singularities to be avoided and guarantee the existence of the estimator. Our analysis includes robustness properties and asymptotic results as well as worked examples.

Suggested Citation

  • J. A. Cuesta‐Albertos & C. Matrán & A. Mayo‐Iscar, 2008. "Robust estimation in the normal mixture model based on robust clustering," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 779-802, September.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:779-802
    DOI: 10.1111/j.1467-9868.2008.00657.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00657.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2008.00657.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Marianthi Markatou, 2000. "Mixture Models, Robustness, and the Weighted Likelihood Methodology," Biometrics, The International Biometric Society, vol. 56(2), pages 483-486, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Neykov, N.M. & Filzmoser, P. & Neytchev, P.N., 2012. "Robust joint modeling of mean and dispersion through trimming," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 34-48, January.
    2. L. A. García-Escudero & A. Gordaliza & C. Matrán & A. Mayo-Iscar, 2018. "Comments on “The power of monitoring: how to make the most of a contaminated multivariate sample”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(4), pages 605-608, December.
    3. Alessio Farcomeni & Luca Greco, 2015. "S-estimation of hidden Markov models," Computational Statistics, Springer, vol. 30(1), pages 57-80, March.
    4. Gao, Jinxin & Hitchcock, David B., 2010. "James-Stein shrinkage to improve k-means cluster analysis," Computational Statistics & Data Analysis, Elsevier, vol. 54(9), pages 2113-2127, September.
    5. Julien Hambuckers & Marie Kratz & Antoine Usseglio-Carleve, 2023. "Efficient Estimation In Extreme Value Regression Models Of Hedge Fund Tail Risks," Working Papers hal-04090916, HAL.
    6. Julien Hambuckers & Marie Kratz & Antoine Usseglio-Carleve, 2023. "Efficient Estimation in Extreme Value Regression Models of Hedge Fund Tail Risks," Papers 2304.06950, arXiv.org.
    7. Luis García-Escudero & Alfonso Gordaliza & Carlos Matrán & Agustín Mayo-Iscar, 2010. "A review of robust clustering methods," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(2), pages 89-109, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jorge I. Figueroa-Zúñiga & Cristian L. Bayes & Víctor Leiva & Shuangzhe Liu, 2022. "Robust beta regression modeling with errors-in-variables: a Bayesian approach and numerical applications," Statistical Papers, Springer, vol. 63(3), pages 919-942, June.
    2. Yao, Weixin & Wei, Yan & Yu, Chun, 2014. "Robust mixture regression using the t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 116-127.
    3. Atefeh Zarei & Zahra Khodadadi & Mohsen Maleki & Karim Zare, 2023. "Robust mixture regression modeling based on two-piece scale mixtures of normal distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(1), pages 181-210, March.
    4. Chalabi, Yohan / Y. & Wuertz, Diethelm, 2010. "Weighted trimmed likelihood estimator for GARCH models," MPRA Paper 26536, University Library of Munich, Germany.
    5. Salem Reyen & John Miller & Edward Wegman, 2009. "Separating a mixture of two normals with proportional covariances," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 70(3), pages 297-314, November.
    6. Neykov, N. & Filzmoser, P. & Dimova, R. & Neytchev, P., 2007. "Robust fitting of mixtures using the trimmed likelihood estimator," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 299-308, September.
    7. Bai, Xiuqin & Yao, Weixin & Boyer, John E., 2012. "Robust fitting of mixture regression models," Computational Statistics & Data Analysis, Elsevier, vol. 56(7), pages 2347-2359.
    8. John P. Saldanha & Bradley S. Price & Douglas J. Thomas, 2023. "A nonparametric approach for setting safety stock levels," Production and Operations Management, Production and Operations Management Society, vol. 32(4), pages 1150-1168, April.
    9. Qingguo Tang & R. J. Karunamuni, 2018. "Robust variable selection for finite mixture regression models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(3), pages 489-521, June.
    10. Zhan, Tingting & Chevoneva, Inna & Iglewicz, Boris, 2011. "Generalized weighted likelihood density estimators with application to finite mixture of exponential family distributions," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 457-465, January.
    11. Tang, Qingguo & Karunamuni, Rohana J., 2013. "Minimum distance estimation in a finite mixture regression model," Journal of Multivariate Analysis, Elsevier, vol. 120(C), pages 185-204.
    12. Fung, Tsz Chai, 2022. "Maximum weighted likelihood estimator for robust heavy-tail modelling of finite mixture models," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 180-198.
    13. Paul D. McNicholas, 2016. "Model-Based Clustering," Journal of Classification, Springer;The Classification Society, vol. 33(3), pages 331-373, October.
    14. Alessio Farcomeni & Luca Greco, 2015. "S-estimation of hidden Markov models," Computational Statistics, Springer, vol. 30(1), pages 57-80, March.
    15. Bagkavos, Dimitrios & Patil, Prakash N., 2023. "Goodness-of-fit testing for normal mixture densities," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    16. Chun Yu & Weixin Yao & Guangren Yang, 2020. "A Selective Overview and Comparison of Robust Mixture Regression Estimators," International Statistical Review, International Statistical Institute, vol. 88(1), pages 176-202, April.
    17. Luca Greco, 2022. "Robust fitting of mixtures of GLMs by weighted likelihood," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(1), pages 25-48, March.
    18. Luis García-Escudero & Alfonso Gordaliza & Carlos Matrán & Agustín Mayo-Iscar, 2010. "A review of robust clustering methods," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(2), pages 89-109, September.
    19. Karlis, Dimitris & Xekalaki, Evdokia, 2003. "Choosing initial values for the EM algorithm for finite mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 577-590, January.
    20. Reyen, Salem S. & Miller, John J., 2005. "The moment of inertia and the linear discriminant function," Statistics & Probability Letters, Elsevier, vol. 71(1), pages 39-46, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:779-802. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.