IDEAS home Printed from https://ideas.repec.org/a/anm/alpnmr/v4y2016i2p85-94.html
   My bibliography  Save this article

Robust Principal Component Analysis Based on Modified Minimum Covariance Determinant in the Presence of Outliers

Author

Listed:
  • B. Barış Alkan

Abstract

Principal component analysis (PCA) is not resistant to outliers existing in multivariate data sets. The results which are obtained by using classical PCA are far from real values in the presence of outliers. Therefore, using robust versions of PCA is favorable. The easiest way to obtain robust principal components is to replace classical estimates of the location and scale parameters with their robust versions. Robust estimations of location and scale parameters can be found with minimum covariance determinant (MCD) providing high breakdown point. In this study, algorithm of MCD is modified using Jackknife resampling approach and results of this modification are examined. Proposed robust principal component analysis (RPCA) based on modified MCD (MMCD) method that is modified using Jaccknife resampling are evaluated over two real data with different outlier ratios. In the light of obtained results, it can be said that RPCA based on MMCD is better than RPCA based on MCD in the presence of outliers.

Suggested Citation

  • B. Barış Alkan, 2016. "Robust Principal Component Analysis Based on Modified Minimum Covariance Determinant in the Presence of Outliers," Alphanumeric Journal, Bahadir Fatih Yildirim, vol. 4(2), pages 85-94, September.
  • Handle: RePEc:anm:alpnmr:v:4:y:2016:i:2:p:85-94
    DOI: http://dx.doi.org/10.17093/aj.2016.4.2.5000189525
    as

    Download full text from publisher

    File URL: https://www.alphanumericjournal.com/media/Issue/volume-4-issue-2-2016/aykiri-gozlemlerin-varliginda-uyarlanmis-en-kucuk-kovaryans-_5YcrjpT.pdf
    Download Restriction: no

    File URL: https://alphanumericjournal.com/article/aykiri-gozlemlerin-varliginda-uyarlanmis-en-kucuk-kovaryans-determinant-tahminine-dayali-dayanikli-temel-bilesenler-analizi/
    Download Restriction: no

    File URL: https://libkey.io/http://dx.doi.org/10.17093/aj.2016.4.2.5000189525?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Croux, Christophe & Ruiz-Gazen, Anne, 2005. "High breakdown estimators for principal components: the projection-pursuit approach revisited," Journal of Multivariate Analysis, Elsevier, vol. 95(1), pages 206-226, July.
    2. N. Locantore & J. Marron & D. Simpson & N. Tripoli & J. Zhang & K. Cohen & Graciela Boente & Ricardo Fraiman & Babette Brumback & Christophe Croux & Jianqing Fan & Alois Kneip & John Marden & Daniel P, 1999. "Robust principal component analysis for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 8(1), pages 1-73, June.
    3. B. Baris Alkan & Cemal Atakan & Nesrin Alkan, 2015. "A comparison of different procedures for principal component analysis in the presence of outliers," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(8), pages 1716-1722, August.
    4. Todorov, Valentin & Filzmoser, Peter, 2009. "An Object-Oriented Framework for Robust Multivariate Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i03).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cevallos-Valdiviezo, Holger & Van Aelst, Stefan, 2019. "Fast computation of robust subspace estimators," Computational Statistics & Data Analysis, Elsevier, vol. 134(C), pages 171-185.
    2. Sven Serneels, 2019. "Projection pursuit based generalized betas accounting for higher order co-moment effects in financial market analysis," Papers 1908.00141, arXiv.org.
    3. Debruyne, Michiel & Hubert, Mia & Van Horebeek, Johan, 2010. "Detecting influential observations in Kernel PCA," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3007-3019, December.
    4. Todorov, Valentin & Filzmoser, Peter, 2009. "An Object-Oriented Framework for Robust Multivariate Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i03).
    5. Bali, Juan Lucas & Boente, Graciela, 2015. "Influence function of projection-pursuit principal components for functional data," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 173-199.
    6. Lee, Seokho & Shin, Hyejin & Billor, Nedret, 2013. "M-type smoothing spline estimators for principal functions," Computational Statistics & Data Analysis, Elsevier, vol. 66(C), pages 89-100.
    7. Hyndman, Rob J. & Shahid Ullah, Md., 2007. "Robust forecasting of mortality and fertility rates: A functional data approach," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4942-4956, June.
    8. Hervé Cardot & Antoine Godichon-Baggioni, 2017. "Fast estimation of the median covariation matrix with application to online robust principal components analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(3), pages 461-480, September.
    9. Luca Greco & Alessio Farcomeni, 2016. "A plug-in approach to sparse and robust principal component analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 449-481, September.
    10. Dürre, Alexander & Vogel, Daniel & Fried, Roland, 2015. "Spatial sign correlation," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 89-105.
    11. Bali, Juan Lucas & Boente, Graciela, 2014. "Consistency of a numerical approximation to the first principal component projection pursuit estimator," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 181-191.
    12. Kondylis, Athanassios & Hadi, Ali S., 2006. "Derived components regression using the BACON algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 556-569, November.
    13. Graciela Boente & Matías Salibian-Barrera, 2015. "S -Estimators for Functional Principal Component Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1100-1111, September.
    14. Bali, Juan Lucas & Boente, Graciela, 2017. "Robust estimators under a functional common principal components model," Computational Statistics & Data Analysis, Elsevier, vol. 113(C), pages 424-440.
    15. Christian Acal & Manuel Escabias & Ana M. Aguilera & Mariano J. Valderrama, 2021. "COVID-19 Data Imputation by Multiple Function-on-Function Principal Component Regression," Mathematics, MDPI, vol. 9(11), pages 1-23, May.
    16. Langworthy, Benjamin W. & Stephens, Rebecca L. & Gilmore, John H. & Fine, Jason P., 2021. "Canonical correlation analysis for elliptical copulas," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    17. Choulakian, V. & Allard, J. & Almhana, J., 2006. "Robust centroid method," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 737-746, November.
    18. Guangxing Wang & Sisheng Liu & Fang Han & Chong‐Zhi Di, 2023. "Robust functional principal component analysis via a functional pairwise spatial sign operator," Biometrics, The International Biometric Society, vol. 79(2), pages 1239-1253, June.
    19. Pallavi Sawant & Nedret Billor & Hyejin Shin, 2012. "Functional outlier detection with robust functional principal component analysis," Computational Statistics, Springer, vol. 27(1), pages 83-102, March.
    20. Paula R. Bouzas & Ana M. Aguilera & Nuria Ruiz-Fuentes, 2012. "Functional Estimation of the Random Rate of a Cox Process," Methodology and Computing in Applied Probability, Springer, vol. 14(1), pages 57-69, March.

    More about this item

    Keywords

    Minimum Covariance Determinant; Outliers; Robust Principal Component Analysis;
    All these keywords.

    JEL classification:

    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:anm:alpnmr:v:4:y:2016:i:2:p:85-94. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Bahadir Fatih Yildirim (email available below). General contact details of provider: https://www.alphanumericjournal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.