Printed from https://ideas.repec.org/a/igg/jdwm00/v11y2015i3p26-48.html

Understanding the SNN Input Parameters and How They Affect the Clustering Results

Authors
  • Guilherme Moreira

    (ALGORITMI Research Centre, University of Minho, Guimarães, Portugal)

  • Maribel Yasmina Santos

    (ALGORITMI Research Centre, University of Minho, Guimarães, Portugal)

  • João Moura Pires

    (NOVA LINCS, Nova University of Lisbon, Lisbon, Portugal)

  • João Galvão

    (ALGORITMI Research Centre, University of Minho, Guimarães, Portugal)

Abstract

Today's organizations have huge amounts of data available for analysis, and they face several challenges when trying to analyze that data to extract useful information. This analytical capability needs to be enhanced with tools capable of dealing with big data sets without making the analytical process an arduous task. Clustering is commonly used in the data analysis process, as this technique does not require any prior knowledge about the data. However, clustering algorithms usually require one or more input parameters that influence the clustering process and the results that can be obtained. This work analyses the relations among the three input parameters of the SNN (Shared Nearest Neighbor) clustering algorithm, k, Eps and MinPts, providing a comprehensive understanding of the relationships identified among them. Moreover, this work proposes specific guidelines for defining appropriate input parameters, which reduces processing time because the number of trials needed to achieve appropriate results can be substantially reduced.
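To make the roles of k, Eps and MinPts concrete, here is a minimal sketch of an SNN-style clustering routine. It follows the commonly described form of the algorithm and is not the authors' implementation. Several details are assumptions made for illustration: the function names `knn_lists` and `snn_cluster`, the use of Euclidean distance, the rule that two points have non-zero similarity only when each appears in the other's k-nearest-neighbour list, and the choice not to count the two points themselves among their shared neighbours.

```python
from math import dist  # Euclidean distance, Python 3.8+


def knn_lists(points, k):
    """For each point, return the set of indices of its k nearest neighbours."""
    lists = []
    for i, p in enumerate(points):
        others = sorted((j for j in range(len(points)) if j != i),
                        key=lambda j: dist(p, points[j]))
        lists.append(set(others[:k]))
    return lists


def snn_cluster(points, k, eps, min_pts):
    """Hypothetical SNN sketch: returns one label per point, -1 meaning noise."""
    n = len(points)
    nn = knn_lists(points, k)

    def sim(i, j):
        # Assumption: similarity is the number of shared neighbours, counted
        # only when i and j are in each other's k-nearest-neighbour lists.
        if i in nn[j] and j in nn[i]:
            return len(nn[i] & nn[j])
        return 0

    # Eps: minimum similarity for a link between two points to count as strong.
    strong = [[j for j in range(n) if j != i and sim(i, j) >= eps]
              for i in range(n)]
    # MinPts: minimum number of strong links for a point to be a core point.
    core = [len(links) >= min_pts for links in strong]

    labels = [-1] * n
    cluster = 0
    for i in range(n):
        if not core[i] or labels[i] != -1:
            continue
        # Grow a cluster through strong links between core points.
        labels[i] = cluster
        stack = [i]
        while stack:
            u = stack.pop()
            for v in strong[u]:
                if core[v] and labels[v] == -1:
                    labels[v] = cluster
                    stack.append(v)
        cluster += 1

    # Non-core points join the cluster of their most similar core point, if any.
    for i in range(n):
        if labels[i] == -1:
            best = max((j for j in strong[i] if core[j]),
                       key=lambda j: sim(i, j), default=None)
            if best is not None:
                labels[i] = labels[best]
    return labels


# Two well-separated groups of five points each (made-up illustration data).
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
          (10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5)]
labels = snn_cluster(points, k=4, eps=2, min_pts=2)
all_noise = snn_cluster(points, k=4, eps=4, min_pts=2)
```

Under these assumptions two points can share at most k - 1 neighbours, so with k=4 the call using eps=4 finds no strong links and marks every point as noise, while eps=2 recovers the two groups. This is one simple way the three parameters constrain each other.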

Suggested Citation

  • Guilherme Moreira & Maribel Yasmina Santos & João Moura Pires & João Galvão, 2015. "Understanding the SNN Input Parameters and How They Affect the Clustering Results," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 11(3), pages 26-48, July.
  • Handle: RePEc:igg:jdwm00:v:11:y:2015:i:3:p:26-48

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJDWM.2015070102
    Download Restriction: no
