IDEAS home Printed from
   My bibliography  Save this article

Big Data Ingestion and Preparation Tools


  • Jaber Alwidian
  • Sana Abdel Rahman
  • Maram Gnaim
  • Fatima Al-Taharwah


Developing in Big Data applications become very important in the last few years, many organizations and industries are aware that data analysis is becoming an important factor to be more competitive and discover new trends and insights. Data ingestion and preparation step is the starting point for developing any Big Data project. This paper is a review for some of the most widely used Big Data ingestion and preparation tools, it discusses the main features, advantages and usage for each tool. The purpose of this paper is to help users to select the right ingestion and preparation tool according to their needs and applications’ requirements.

Suggested Citation

  • Jaber Alwidian & Sana Abdel Rahman & Maram Gnaim & Fatima Al-Taharwah, 2020. "Big Data Ingestion and Preparation Tools," Modern Applied Science, Canadian Center of Science and Education, vol. 14(9), pages 1-12, September.
  • Handle: RePEc:ibn:masjnl:v:14:y:2020:i:9:p:12

    Download full text from publisher

    File URL:
    Download Restriction: no

    File URL:
    Download Restriction: no

    References listed on IDEAS

    1. Rehman, Muhammad Habib ur & Chang, Victor & Batool, Aisha & Wah, Teh Ying, 2016. "Big data reduction framework for value creation in sustainable enterprises," International Journal of Information Management, Elsevier, vol. 36(6), pages 917-928.
    2. Andreea MATACUTA & Catalina POPA, 2018. "Big Data Analytics: Analysis of Features and Performance of Big Data Ingestion Tools," Informatica Economica, Academy of Economic Studies - Bucharest, Romania, vol. 22(2), pages 25-34.
    Full references (including those not matched with items on IDEAS)


    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

    Cited by:

    1. Jaber Alwidian & Tariq Khasawneh & Mahmoud Alsahlee & Ali Safia, 2022. "An Online Machine Learning Approach to Sentiment Analysis in Social Media," Modern Applied Science, Canadian Center of Science and Education, vol. 16(4), pages 1-29, November.
    2. Jaber A. Alwidian, 2023. "An Intelligent Technique to Predict the Autism Spectrum Disorder Using Big Data Platform," Modern Applied Science, Canadian Center of Science and Education, vol. 17(1), pages 1-28, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maniyassouwe Amana & Pingfeng Liu & Mona Alariqi, 2022. "Value Creation and Capture with Big Data in Smart Phones Companies," Sustainability, MDPI, vol. 14(23), pages 1-22, November.
    2. Acharya, Abhilash & Singh, Sanjay Kumar & Pereira, Vijay & Singh, Poonam, 2018. "Big data, knowledge co-creation and decision making in fashion industry," International Journal of Information Management, Elsevier, vol. 42(C), pages 90-101.
    3. Sehrish Atif, 2023. "Mapping circular economy principles and servitisation approach in business model canvas: an integrated literature review," Future Business Journal, Springer, vol. 9(1), pages 1-21, December.
    4. Qinglan Liu & Adriana Hofmann Trevisan & Miying Yang & Janaina Mascarenhas, 2022. "A framework of digital technologies for the circular economy: Digital functions and mechanisms," Business Strategy and the Environment, Wiley Blackwell, vol. 31(5), pages 2171-2192, July.
    5. Micus, Christian & Schramm, Simon & Boehm, Markus & Krcmar, Helmut, 2023. "Methods to analyze customer usage data in a product decision process:A systematic literature review," Operations Research Perspectives, Elsevier, vol. 10(C).
    6. Piera Centobelli & Roberto Cerchione & Eugenio Oropallo & Wael Hassan El‐Garaihy & Tamer Farag & Khalid Hassan Al Shehri, 2022. "Towards a sustainable development assessment framework to bridge supply chain practices and technologies," Sustainable Development, John Wiley & Sons, Ltd., vol. 30(4), pages 647-663, August.
    7. Mihalis Giannakis & Rameshwar Dubey & Shishi Yan & Konstantina Spanaki & Thanos Papadopoulos, 2022. "Social media and sensemaking patterns in new product development: demystifying the customer sentiment," Annals of Operations Research, Springer, vol. 308(1), pages 145-175, January.
    8. Hani Bani-Salameh & Mona Al-Qawaqneh & Salah Taamneh, 2021. "Investigating the Adoption of Big Data Management in Healthcare in Jordan," Data, MDPI, vol. 6(2), pages 1-16, February.
    9. Gupta, Shivam & Kar, Arpan Kumar & Baabdullah, Abdullah & Al-Khowaiter, Wassan A.A., 2018. "Big data with cognitive computing: A review for the future," International Journal of Information Management, Elsevier, vol. 42(C), pages 78-89.
    10. Yang, Li & Zou, Haobo & Shang, Chao & Ye, Xiaoming & Rani, Pratibha, 2023. "Adoption of information and digital technologies for sustainable smart manufacturing systems for industry 4.0 in small, medium, and micro enterprises (SMMEs)," Technological Forecasting and Social Change, Elsevier, vol. 188(C).
    11. Arash Najmaei & Zahra Sadeghinejad, 2023. "Green and sustainable business models: historical roots, growth trajectory, conceptual architecture and an agenda for future research—A bibliometric review of green and sustainable business models," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 957-999, February.
    12. Bimba, Andrew Thomas & Idris, Norisma & Al-Hunaiyyan, Ahmed & Mahmud, Rohana Binti & Abdelaziz, Ahmed & Khan, Suleman & Chang, Victor, 2016. "Towards knowledge modeling and manipulation technologies: A survey," International Journal of Information Management, Elsevier, vol. 36(6), pages 857-871.
    13. Jimenez-Marquez, Jose Luis & Gonzalez-Carrasco, Israel & Lopez-Cuadrado, Jose Luis & Ruiz-Mezcua, Belen, 2019. "Towards a big data framework for analyzing social media content," International Journal of Information Management, Elsevier, vol. 44(C), pages 1-12.
    14. Lim, Chiehyeon & Kim, Ki-Hun & Kim, Min-Jun & Heo, Jun-Yeon & Kim, Kwang-Jae & Maglio, Paul P., 2018. "From data to value: A nine-factor framework for data-based value creation in information-intensive services," International Journal of Information Management, Elsevier, vol. 39(C), pages 121-135.

    More about this item

    JEL classification:

    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General
    • Z0 - Other Special Topics - - General


    Access and download statistics


    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ibn:masjnl:v:14:y:2020:i:9:p:12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Canadian Center of Science and Education (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.