This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Robust linear clustering

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
L. A. García-Escudero
A. Gordaliza
R. San Martín
S. Van Aelst
R. Zamar
Abstract

Non-hierarchical clustering methods are frequently based on the idea of forming groups around 'objects'. The main exponent of this class of methods is the "k"-means method, where these objects are points. However, clusters in a data set may often be due to certain relationships between the measured variables. For instance, we can find linear structures such as straight lines and planes, around which the observations are grouped in a natural way. These structures are not well represented by points. We present a method that searches for linear groups in the presence of outliers. The method is based on the idea of impartial trimming. We search for the 'best' subsample containing a proportion 1 - "&agr;" of the data and the best "k" affine subspaces fitting to those non-discarded observations by measuring discrepancies through orthogonal distances. The population version of the sample problem is also considered. We prove the existence of solutions for the sample and population problems together with their consistency. A feasible algorithm for solving the sample problem is described as well. Finally, some examples showing how the method proposed works in practice are provided. Copyright (c) 2009 Royal Statistical Society.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.blackwell-synergy.com/doi/abs/10.1111/j.1467-9868.2008.00682.x
File Format: text/html
File Function: link to full text
Download Restriction: Access to full text is restricted to subscribers.

As the access to this document is restricted, you may want to look for a different version under "Related research" (further below) or search for a different version of it.

Publisher Info
Article provided by Royal Statistical Society in its journal Journal of the Royal Statistical Society: Series B (Statistical Methodology).

Volume (Year): 71 (2009)
Issue (Month): 1 ()
Pages: 301-318
Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Handle: RePEc:bla:jorssb:v:71:y:2009:i:1:p:301-318

Contact details of provider:
Web page: http://www.blackwellpublishing.com/journal.asp?ref=1369-7412

Order Information:
Web: http://www.blackwellpublishing.com/subs.asp?ref=1369-7412

For technical questions regarding this item, or to correct its listing, contact: (Christopher F. Baum).

Related research
Keywords:

Statistics
Access and download statistics

Did you know? IDEAS is also providing many rankings, for example of authors and institutions.

This page was last updated on 2009-12-19.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.