IDEAS home Printed from https://ideas.repec.org/a/igg/jdsst0/v8y2016i4p50-69.html
   My bibliography  Save this article

Extracting-Transforming-Loading Modeling Approach for Big Data Analytics

Author

Listed:
  • Mahfoud Bala

    (Department of Informatics, Saad Dahleb University, Blida 1, Algeria)

  • Omar Boussaid

    (Laboratory ERIC, University of Lyon 2, Lyon, France)

  • Zaia Alimazighi

    (Department of Computer Science, University of Science and Technology Houari Boumediene, Bab Ezzouar, Algeria)

Abstract

Due to their widespread use, Internet, Web 2.0 and digital sensors create data in non-traditional volumes (at terabytes and petabytes scale). The big data characterized by the four V's has brought with it new challenges given the limited capabilities of traditional computing systems. This paper aims to provide solutions which can cope with very large data in Decision-Support Systems (DSSs). In the data integration phase, specifically, the authors propose a conceptual modeling approach for parallel and distributed Extracting-Transforming-Loading (ETL) processes. Among the complexity dimensions of big data, this study focuses on the volume of data to ensure a good performance for ETL processes. The authors' approach allows anticipating on the parallelization/distribution issues at the early stage of Data Warehouse (DW) projects. They have implemented an ETL platform called Parallel-ETL (P-ETL for short) and conducted some experiments. Their performance analysis reveals that the proposed approach enables to speed up ETL processes by up to 33% with the improvement rate being linear.

Suggested Citation

  • Mahfoud Bala & Omar Boussaid & Zaia Alimazighi, 2016. "Extracting-Transforming-Loading Modeling Approach for Big Data Analytics," International Journal of Decision Support System Technology (IJDSST), IGI Global, vol. 8(4), pages 50-69, October.
  • Handle: RePEc:igg:jdsst0:v:8:y:2016:i:4:p:50-69
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJDSST.2016100104
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Asma Dhaouadi & Khadija Bousselmi & Mohamed Mohsen Gammoudi & Sébastien Monnet & Slimane Hammoudi, 2022. "Data Warehousing Process Modeling from Classical Approaches to New Trends: Main Features and Comparisons," Data, MDPI, vol. 7(8), pages 1-38, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jdsst0:v:8:y:2016:i:4:p:50-69. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.