IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i19p3517-d926004.html
   My bibliography  Save this article

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

Author

Listed:
  • Elham Azhir

    (Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran 1477893855, Iran
    Research and Development Center, Mobile Telecommunication Company of Iran, Tehran 1991954168, Iran)

  • Mehdi Hosseinzadeh

    (Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
    Faculty of Natural Sciences, Duy Tan University, Da Nang 550000, Vietnam
    Computer Science, University of Human Development, Sulaymaniyah 0778-6, Iraq)

  • Faheem Khan

    (Department of Computer Engineering, Gachon University, Seongnam 13120, Korea)

  • Amir Mosavi

    (Faculty of Civil Engineering, Technische Universität Dresden, 01069 Dresden, Germany
    John von Neumann Faculty of Informatics, Obuda University, 1034 Budapest, Hungary
    Institute of Information Engineering, Automation and Mathematics, Slovak University of Technology in Bratislava, 81243 Bratislava, Slovakia)

Abstract

Access plan recommendation is a query optimization approach that executes new queries using prior created query execution plans (QEPs). The query optimizer divides the query space into clusters in the mentioned method. However, traditional clustering algorithms take a significant amount of execution time for clustering such large datasets. The MapReduce distributed computing model provides efficient solutions for storing and processing vast quantities of data. Apache Spark and Apache Hadoop frameworks are used in the present investigation to cluster different sizes of query datasets in the MapReduce-based access plan recommendation method. The performance evaluation is performed based on execution time. The results of the experiments demonstrated the effectiveness of parallel query clustering in achieving high scalability. Furthermore, Apache Spark achieved better performance than Apache Hadoop, reaching an average speedup of 2x.

Suggested Citation

  • Elham Azhir & Mehdi Hosseinzadeh & Faheem Khan & Amir Mosavi, 2022. "Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark," Mathematics, MDPI, vol. 10(19), pages 1-11, September.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3517-:d:926004
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/19/3517/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/19/3517/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nahla Mohammed Elzein & Mazlina Abdul Majid & Ibrahim Abaker Targio Hashem & Ashraf Osman Ibrahim & Anas W. Abulfaraj & Faisal Binzagr, 2023. "JQPro:Join Query Processing in a Distributed System for Big RDF Data Using the Hash-Merge Join Technique," Mathematics, MDPI, vol. 11(5), pages 1-20, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3517-:d:926004. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.