IDEAS home Printed from https://ideas.repec.org/a/igg/jdwm00/v7y2011i4p43-63.html
   My bibliography  Save this article

Data Field for Hierarchical Clustering

Author

Listed:
  • Shuliang Wang

    (The University of Pittsburgh, USA and Wuhan University, China)

  • Wenyan Gan

    (Nanjing University of Science and Technology, China)

  • Deyi Li

    (Tsinghua University, China)

  • Deren Li

    (Wuhan University, China)

Abstract

In this paper, data field is proposed to group data objects via simulating their mutual interactions and opposite movements for hierarchical clustering. Enlightened by the field in physical space, data field to simulate nuclear field is presented to illuminate the interaction between objects in data space. In the data field, the self-organized process of equipotential lines on many data objects discovers their hierarchical clustering-characteristics. During the clustering process, a random sample is first generated to optimize the impact factor. The masses of data objects are then estimated to select core data object with nonzero masses. Taking the core data objects as the initial clusters, the clusters are iteratively merged hierarchy by hierarchy with good performance. The results of a case study show that the data field is capable of hierarchical clustering on objects varying size, shape or granularity without user-specified parameters, as well as considering the object features inside the clusters and removing the outliers from noisy data. The comparisons illustrate that the data field clustering performs better than K-means, BIRCH, CURE, and CHAMELEON.

Suggested Citation

  • Shuliang Wang & Wenyan Gan & Deyi Li & Deren Li, 2011. "Data Field for Hierarchical Clustering," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 7(4), pages 43-63, October.
  • Handle: RePEc:igg:jdwm00:v:7:y:2011:i:4:p:43-63
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/jdwm.2011100103
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Malang, Kanokwan & Wang, Shuliang & Phaphuangwittayakul, Aniwat & Lv, Yuanyuan & Yuan, Hanning & Zhang, Xiuzhen, 2020. "Identifying influential nodes of global terrorism network: A comparison for skeleton network extraction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    2. Disheng Yi & Yusi Liu & Jiahui Qin & Jing Zhang, 2020. "Identifying Urban Traveling Hotspots Using an Interaction-Based Spatio-Temporal Data Field and Trajectory Data: A Case Study within the Sixth Ring Road of Beijing," Sustainability, MDPI, vol. 12(22), pages 1-20, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jdwm00:v:7:y:2011:i:4:p:43-63. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.