Author
Listed:
- Vasudha Bhatnagar
- Sangeeta Ahuja
- Sharanjit Kaur
Abstract
The problem of instability and non-robustness in K-means clustering has been recognised as a serious problem in both scientific and business applications. Further, these problems get accentuated in the presence of outliers in data. Cluster ensemble technique has been recently developed to combat such problems and improve overall quality of clustering scheme. In this paper, we propose a cluster ensemble method based on discriminant analysis to obtain robust clustering and report noise to the user. Clustering schemes are generated by the partitional clustering algorithm (K-means) for constructing the ensemble. The proposed algorithm operates in three phases. During the first phase, input clustering schemes are reconciled by relabeling the clusters corresponding to an arbitrary reference scheme. This is accomplished using Hungarian algorithm, which is a well-known optimisation approach. The second phase uses discriminant analysis and constructs a label matrix that is used for generating consensus partition. In the final stage, clustering scheme is refined to deliver robust and stable clustering scheme. Empirical evaluation of the algorithm shows that the proposed method significantly improves the quality of resultant ensemble. Further, comparison with the cluster ensembles generated by package R for 20 public datasets demonstrated improved quality of ensembles generated by the proposed algorithm.
Suggested Citation
Vasudha Bhatnagar & Sangeeta Ahuja & Sharanjit Kaur, 2015.
"Discriminant analysis-based cluster ensemble,"
International Journal of Data Mining, Modelling and Management, Inderscience Enterprises Ltd, vol. 7(2), pages 83-107.
Handle:
RePEc:ids:ijdmmm:v:7:y:2015:i:2:p:83-107
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ids:ijdmmm:v:7:y:2015:i:2:p:83-107. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sarah Parker (email available below). General contact details of provider: http://www.inderscience.com/browse/index.php?journalID=342 .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.