Robust Detection of Multiple Outliers in Grouped Multivariate Data
Many methods have been developed for detecting multiple outliers in a single multivariate sample, but very few for the case where there may be groups in the data. We propose a method of simultaneously determining groups (as in cluster analysis) and detecting outliers, which are points that are distant from every group. Our method is an adaptation of the BACON algorithm proposed by Billor, Hadi and Velleman for the robust detection of multiple outliers in a single group of multivariate data. There are two versions of our method, depending on whether or not the groups can be assumed to have equal covariance matrices. The effectiveness of the method is illustrated by its application to two real data sets and further shown by a simulation study for different sample sizes and dimensions for 2 and 3 groups, with and without planted outliers in the data. When the number of groups is not known in advance, the algorithm could be used as a robust method of cluster analysis, by running it for various numbers of groups and choosing the best solution.
Volume (Year): 34 (2007)
Issue (Month): 10 ()
|Contact details of provider:|| Web page: http://www.tandfonline.com/CJAS20|
|Order Information:||Web: http://www.tandfonline.com/pricing/journal/CJAS20|
When requesting a correction, please mention this item's handle: RePEc:taf:japsta:v:34:y:2007:i:10:p:1241-1250. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Michael McNulty)
If references are entirely missing, you can add them using this form.