
Dynamic frequency based parallel k-bat algorithm for massive data clustering (DFBPKBA)

Author

Listed:
  • Ashish Kumar Tripathi

    (Delhi Technological University)

  • Kapil Sharma

    (Delhi Technological University)

  • Manju Bala

    (IP College of Women)

Abstract

Over the past decade, the volume of digital data has grown significantly, which makes effective data mining techniques important for better decision making. Clustering is one of the key elements of data mining. K-means is a popular and widely used clustering algorithm. However, it tends to get stuck in local optima because it depends on the random initialization of the initial cluster centers. This paper introduces a novel variant of the Bat algorithm based on dynamic frequency. The proposed variant is then hybridized with K-means to give a new approach for clustering in a distributed environment. Because evolutionary computation is computationally intensive, traditional sequential algorithms cannot provide satisfactory results within a reasonable amount of time for large-scale data problems. To mitigate this problem, the proposed variant is parallelized using the MapReduce model in the Hadoop framework. The experimental results show that the proposed algorithm outperforms K-means, PSO and the Bat algorithm on eighty percent of the benchmark datasets in terms of intra-cluster distance. DFBPKBA also achieves significant speedup on massive datasets as the number of nodes increases.
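
The abstract uses intra-cluster distance as its quality measure and MapReduce as its parallel model. The sketch below is only an illustration of how those two ideas can fit together, not the authors' implementation: it assumes intra-cluster distance means the sum of Euclidean distances from each point to its nearest center (a common definition that the abstract does not spell out), and it imitates map and reduce phases with plain Python functions rather than Hadoop. The names `map_partition`, `intra_cluster_distance`, and the toy data are hypothetical.

```python
import math
from functools import reduce


def nearest_distance(point, centers):
    """Euclidean distance from one point to its closest center."""
    return min(math.dist(point, c) for c in centers)


def map_partition(points, centers):
    """Map phase (assumed): partial sum of distances for one data partition."""
    return sum(nearest_distance(p, centers) for p in points)


def intra_cluster_distance(partitions, centers):
    """Reduce phase (assumed): add up the partial sums from every partition."""
    partials = [map_partition(part, centers) for part in partitions]
    return reduce(lambda a, b: a + b, partials, 0.0)


# Toy data split into two partitions, as if stored on two nodes.
partitions = [
    [(0.0, 0.0), (0.0, 2.0)],
    [(10.0, 0.0), (10.0, 2.0)],
]

# One center per natural group versus both centers placed in the same group.
good_centers = [(0.0, 1.0), (10.0, 1.0)]
poor_centers = [(0.0, 0.0), (0.0, 2.0)]

good = intra_cluster_distance(partitions, good_centers)
poor = intra_cluster_distance(partitions, poor_centers)
print(good, poor)
```

The second set of centers scores worse on the same data, which is the kind of poor starting point that a random initialization can produce and that a search over candidate centers is meant to avoid.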

Suggested Citation

  • Ashish Kumar Tripathi & Kapil Sharma & Manju Bala, 2018. "Dynamic frequency based parallel k-bat algorithm for massive data clustering (DFBPKBA)," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 9(4), pages 866-874, August.
  • Handle: RePEc:spr:ijsaem:v:9:y:2018:i:4:d:10.1007_s13198-017-0665-x
    DOI: 10.1007/s13198-017-0665-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-017-0665-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-017-0665-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Smith, Andrew N. & Fischer, Eileen & Yongjian, Chen, 2012. "How Does Brand-related User-generated Content Differ across YouTube, Facebook, and Twitter?," Journal of Interactive Marketing, Elsevier, vol. 26(2), pages 102-113.
    2. Xuan Yang & Xiao Li & Daning Hu & Harry Jiannan Wang, 2021. "Differential impacts of social influence on initial and sustained participation in open source software projects," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(9), pages 1133-1147, September.
    3. Bertrand Jayles & Clément Sire & Ralf H J M Kurvers, 2021. "Crowd control: Reducing individual estimation bias by sharing biased social information," PLOS Computational Biology, Public Library of Science, vol. 17(11), pages 1-28, November.
    4. Jalees, Tariq & Tariq, Huma & Zaman, Syed Imran & Alam Kazmi, Syed Hasnain, 2015. "Social Media in Virtual Marketing," MPRA Paper 69868, University Library of Munich, Germany, revised 10 Apr 2015.
    5. Langley, David J. & Hoeve, Maarten C. & Ortt, J. Roland & Pals, Nico & van der Vecht, Bob, 2014. "Patterns of Herding and their Occurrence in an Online Setting," Journal of Interactive Marketing, Elsevier, vol. 28(1), pages 16-25.
    6. Ines Küster & Asuncion Hernández, 2012. "Brand impact on purchase intention. An approach in social networks channel," Economics and Business Letters, Oviedo University Press, vol. 1(2), pages 1-9.
    7. Aleksandar Bradic, 2012. "The Role of Social Feedback in Financing of Technology Ventures," Papers 1301.2196, arXiv.org.
    8. Lashgari, Maryam, 2014. "Social Media Technology Deployment in B2B: A Case Study," INDEK Working Paper Series 2014/9, Royal Institute of Technology, Department of Industrial Economics and Management.
    9. Xuzhen Zhu & Jinming Ma & Xin Su & Hui Tian & Wei Wang & Shimin Cai, 2019. "Information Spreading on Weighted Multiplex Social Network," Complexity, Hindawi, vol. 2019, pages 1-15, November.
    10. Li, Xin & Xie, Qianqian & Jiang, Jiaojiao & Zhou, Yuan & Huang, Lucheng, 2019. "Identifying and monitoring the development trends of emerging technologies using patent analysis and Twitter data mining: The case of perovskite solar cell technology," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 687-705.
    11. Moro, Sérgio & Rita, Paulo & Vala, Bernardo, 2016. "Predicting social media performance metrics and evaluation of the impact on brand building: A data mining approach," Journal of Business Research, Elsevier, vol. 69(9), pages 3341-3351.
    12. Kim Holmberg & Mike Thelwall, 2014. "Disciplinary differences in Twitter scholarly communication," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(2), pages 1027-1042, November.
    13. Shu-Hsun Ho & Yu-Ling Lin & Robert Carlson Patrick, 2015. "Participant Motivations In A Social Media Community Page," Global Journal of Business Research, The Institute for Business and Finance Research, vol. 9(4), pages 67-75.
    14. Mohammed Abdul-Rahman & Wale Alade & Shahnawaz Anwer, 2023. "A Composite Resilience Index (CRI) for Developing Resilience and Sustainability in University Towns," Sustainability, MDPI, vol. 15(4), pages 1-27, February.
    15. Yang Xiao & Beiqun Li & Zaiwu Gong, 2018. "Real-time identification of urban rainstorm waterlogging disasters based on Weibo big data," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 94(2), pages 833-842, November.
    16. Pablo Gomez‐Carrasco & Giovanna Michelon, 2017. "The Power of Stakeholders' Voice: The Effects of Social Media Activism on Stock Markets," Business Strategy and the Environment, Wiley Blackwell, vol. 26(6), pages 855-872, September.
    17. Charitha Harshani Perera & Rajkishore Nayak & Long Thang Van Nguyen, 2019. "Role of social word-of-mouth on emotional brand attachment and brand choice intention: A study on private educational institutes in Vietnam," Proceedings of Business and Management Conferences 8611115, International Institute of Social and Economic Sciences.
    18. Geoffrey Barbier & Reza Zafarani & Huiji Gao & Gabriel Fung & Huan Liu, 2012. "Maximizing benefits from crowdsourced data," Computational and Mathematical Organization Theory, Springer, vol. 18(3), pages 257-279, September.
    19. I Made Wijaya Kusuma & I Gusti Ayu Wimba & Putu Yudy Wijaya, 2022. "The role of brand image and brand trust through electronic word of mouth in creating parent’s interest to sending children to school," Technium Social Sciences Journal, Technium Science, vol. 35(1), pages 477-489, September.
    20. Suddaby, Roy & Saxton, Gregory D. & Gunz, Sally, 2015. "Twittering change: The institutional work of domain change in accounting expertise," Accounting, Organizations and Society, Elsevier, vol. 45(C), pages 52-68.
