IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v71y2009i3p549-592.html
   My bibliography  Save this article

Invariant co‐ordinate selection

Author

Listed:
  • David E. Tyler
  • Frank Critchley
  • Lutz Dümbgen
  • Hannu Oja

Abstract

Summary. A general method for exploring multivariate data by comparing different estimates of multivariate scatter is presented. The method is based on the eigenvalue–eigenvector decomposition of one scatter matrix relative to another. In particular, it is shown that the eigenvectors can be used to generate an affine invariant co‐ordinate system for the multivariate data. Consequently, we view this method as a method for invariant co‐ordinate selection. By plotting the data with respect to this new invariant co‐ordinate system, various data structures can be revealed. For example, under certain independent components models, it is shown that the invariant co‐ ordinates correspond to the independent components. Another example pertains to mixtures of elliptical distributions. In this case, it is shown that a subset of the invariant co‐ordinates corresponds to Fisher's linear discriminant subspace, even though the class identifications of the data points are unknown. Some illustrative examples are given.

Suggested Citation

  • David E. Tyler & Frank Critchley & Lutz Dümbgen & Hannu Oja, 2009. "Invariant co‐ordinate selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 549-592, June.
  • Handle: RePEc:bla:jorssb:v:71:y:2009:i:3:p:549-592
    DOI: 10.1111/j.1467-9868.2009.00706.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2009.00706.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2009.00706.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nordhausen, Klaus & Oja, Hannu & Paindaveine, Davy, 2009. "Signed-rank tests for location in the symmetric independent component model," Journal of Multivariate Analysis, Elsevier, vol. 100(5), pages 821-834, May.
    2. Annaliisa Kankainen & Sara Taskinen & Hannu Oja, 2007. "Tests of multinormality based on location vectors and scatter matrices," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 16(3), pages 357-379, November.
    3. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    4. Taskinen, S. & Sirkia, S. & Oja, H., 2007. "Independent component analysis based on symmetrised scatter matrices," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 5103-5111, June.
    5. Maronna, Ricardo A. & Stahel, Werner A. & Yohai, Victor J., 1992. "Bias-robust estimators of multivariate scatter based on projections," Journal of Multivariate Analysis, Elsevier, vol. 42(1), pages 141-161, July.
    6. Caussinus, H. & Fekri, M. & Hakam, S. & Ruiz-Gazen, A., 2003. "A monitoring display of multivariate outliers," Computational Statistics & Data Analysis, Elsevier, vol. 44(1-2), pages 237-252, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xin Dang & Hailin Sang & Lauren Weatherall, 2019. "Gini covariance matrix and its affine equivariant version," Statistical Papers, Springer, vol. 60(3), pages 641-666, June.
    2. Archimbaud, Aurore & Nordhausen, Klaus & Ruiz-Gazen, Anne, 2018. "ICS for multivariate outlier detection with application to quality control," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 184-199.
    3. Alashwali, Fatimah & Kent, John T., 2016. "The use of a common location measure in the invariant coordinate selection and projection pursuit," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 145-161.
    4. Nordhausen, Klaus & Ruiz-Gazen, Anne, 2022. "On the usage of joint diagonalization in multivariate statistics," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    5. Hannu Oja & Davy Paindaveine & Sara Taskinen, 2009. "Parametric and nonparametric test for multivariate independence in IC models," Working Papers ECARES 2009_018, ULB -- Universite Libre de Bruxelles.
    6. Archimbaud, Aurore & Boulfani, Fériel & Gendre, Xavier & Nordhausen, Klaus & Ruiz-Gazen, Anne & Virta, Joni, 2021. "ICS for multivariate functional anomaly detection with applications to predictive maintenance and quality control," TSE Working Papers 21-1182, Toulouse School of Economics (TSE), revised Mar 2022.
    7. Jin Wang & Weihua Zhou, 2015. "Effect of kurtosis on efficiency of some multivariate medians," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(3), pages 331-348, September.
    8. Joni Virta & Niko Lietzén & Henri Nyberg, 2024. "Robust signal dimension estimation via SURE," Statistical Papers, Springer, vol. 65(5), pages 3007-3038, July.
    9. Fischer, Daniel & Berro, Alain & Nordhausen, Klaus & Ruiz-Gazen, Anne, 2019. "REPPlab: An R package for detecting clusters and outliers using exploratory projection pursuit," TSE Working Papers 19-1001, Toulouse School of Economics (TSE).
    10. Nordhausen, Klaus & Oja, Hannu & Tyler, David E., 2022. "Asymptotic and bootstrap tests for subspace dimension," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    11. Ilmonen, Pauliina, 2013. "On asymptotic properties of the scatter matrix based estimates for complex valued independent component analysis," Statistics & Probability Letters, Elsevier, vol. 83(4), pages 1219-1226.
    12. Ruiz-Gazen, Anne & Thomas-Agnan, Christine & Laurent, Thibault & Mondon, Camille, 2022. "Detecting outliers in compositional data using Invariant Coordinate Selection," TSE Working Papers 22-1320, Toulouse School of Economics (TSE).
    13. Nicola Loperfido, 2019. "Finite mixtures, projection pursuit and tensor rank: a triangulation," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 145-173, March.
    14. Álvarez, Adolfo, 2013. "Recombining partitions via unimodality tests," DES - Working Papers. Statistics and Econometrics. WS ws130706, Universidad Carlos III de Madrid. Departamento de Estadística.
    15. Dümbgen, Lutz & Nordhausen, Klaus & Schuhmacher, Heike, 2016. "New algorithms for M-estimation of multivariate scatter and location," Journal of Multivariate Analysis, Elsevier, vol. 144(C), pages 200-217.
    16. Boente, Graciela & Salibián Barrera, Matías & Tyler, David E., 2014. "A characterization of elliptical distributions and some optimality properties of principal components for functional data," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 254-264.
    17. Thomas-Agnan, Christine & Mondon, Camille & Trinh, Thi-Huong & Ruiz-Gazen, Anne, 2024. "ICS for complex data with application to outlier detection for density data objects," TSE Working Papers 24_1585, Toulouse School of Economics (TSE).
    18. Ilmonen, Pauliina & Nevalainen, Jaakko & Oja, Hannu, 2010. "Characteristics of multivariate distributions and the invariant coordinate system," Statistics & Probability Letters, Elsevier, vol. 80(23-24), pages 1844-1853, December.
    19. Prieto, Francisco J. & Rendón, Carolina, 2014. "Independent components techniques based on kurtosis for functional data analysis," DES - Working Papers. Statistics and Econometrics. WS ws141006, Universidad Carlos III de Madrid. Departamento de Estadística.
    20. Dürre, Alexander & Vogel, Daniel & Tyler, David E., 2014. "The spatial sign covariance matrix with unknown location," Journal of Multivariate Analysis, Elsevier, vol. 130(C), pages 107-117.
    21. Virta, J., 2016. "One-step M-estimates of scatter and the independence property," Statistics & Probability Letters, Elsevier, vol. 110(C), pages 133-136.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nordhausen, Klaus & Ruiz-Gazen, Anne, 2022. "On the usage of joint diagonalization in multivariate statistics," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Ilmonen, Pauliina & Nevalainen, Jaakko & Oja, Hannu, 2010. "Characteristics of multivariate distributions and the invariant coordinate system," Statistics & Probability Letters, Elsevier, vol. 80(23-24), pages 1844-1853, December.
    3. G. Zioutas & C. Chatzinakos & T. D. Nguyen & L. Pitsoulis, 2017. "Optimization techniques for multivariate least trimmed absolute deviation estimation," Journal of Combinatorial Optimization, Springer, vol. 34(3), pages 781-797, October.
    4. Schmitt, Eric & Öllerer, Viktoria & Vakili, Kaveh, 2014. "The finite sample breakdown point of PCS," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 214-220.
    5. Loperfido, Nicola, 2021. "Some theoretical properties of two kurtosis matrices, with application to invariant coordinate selection," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    6. Lyócsa, Štefan & Výrost, Tomáš & Baumöhl, Eduard, 2012. "Breakdowns and revivals: the long-run relationship between the stock market and real economic activity in the G-7 countries," MPRA Paper 43306, University Library of Munich, Germany.
    7. Wu, Edmond H.C. & Yu, Philip L.H. & Li, W.K., 2009. "A smoothed bootstrap test for independence based on mutual information," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2524-2536, May.
    8. Juan, Jesús & Prieto, Francisco J., 1994. "A subsampling method for the computation of multivariate estimators with high breakdown point," DES - Working Papers. Statistics and Econometrics. WS 3952, Universidad Carlos III de Madrid. Departamento de Estadística.
    9. Thomas Triebs & Subal C. Kumbhakar, 2012. "Management Practice in Production," ifo Working Paper Series 129, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.
    10. Philip Dörr & Bruno Ebner & Norbert Henze, 2021. "Testing multivariate normality by zeros of the harmonic oscillator in characteristic function spaces," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(2), pages 456-501, June.
    11. Salem Reyen & John Miller & Edward Wegman, 2009. "Separating a mixture of two normals with proportional covariances," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 70(3), pages 297-314, November.
    12. Gelein, Brigitte & Haziza, David & Causeur, David, 2014. "Preserving relationships between variables with MIVQUE based imputation for missing survey data," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 197-208.
    13. Davy Paindaveine & Germain Van Bever, 2017. "Halfspace Depths for Scatter, Concentration and Shape Matrices," Working Papers ECARES ECARES 2017-19, ULB -- Universite Libre de Bruxelles.
    14. Junlong Zhao & Chao Liu & Lu Niu & Chenlei Leng, 2019. "Multiple influential point detection in high dimensional regression spaces," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 385-408, April.
    15. Van Aelst, S. & Vandervieren, E. & Willems, G., 2012. "A Stahel–Donoho estimator based on huberized outlyingness," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 531-542.
    16. Croux, Christophe & Haesbroeck, Gentiane, 1997. "An easy way to increase the finite-sample efficiency of the resampled minimum volume ellipsoid estimator," Computational Statistics & Data Analysis, Elsevier, vol. 25(2), pages 125-141, July.
    17. C. Chatzinakos & L. Pitsoulis & G. Zioutas, 2016. "Optimization techniques for robust multivariate location and scatter estimation," Journal of Combinatorial Optimization, Springer, vol. 31(4), pages 1443-1460, May.
    18. Chung, Hee Cheol & Ahn, Jeongyoun, 2021. "Subspace rotations for high-dimensional outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    19. Fekri, M. & Ruiz-Gazen, A., 2004. "Robust weighted orthogonal regression in the errors-in-variables model," Journal of Multivariate Analysis, Elsevier, vol. 88(1), pages 89-108, January.
    20. Shieh Albert D & Hung Yeung Sam, 2009. "Detecting Outlier Samples in Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-26, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:71:y:2009:i:3:p:549-592. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.