IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v3y2016i1d10.1007_s40745-016-0069-9.html
   My bibliography  Save this article

A Detailed Review on the Prominent Compression Methods Used for Reducing the Data Volume of Big Data

Author

Listed:
  • D. Anuradha

    (Pondicherry University)

  • S. Bhuvaneswari

    (Pondicherry University)

Abstract

The volume of Big data is the primary challenge faced by today’s electronic world. Compressing data should be an important aspect of the huge volume to improve the overall performance of the Big data management system and Big data analytics. There is a quiet few compression methods that can reduce the cost of data management and data transfer and improve efficiency of data analysis. Adaptive data compression approach finds out the suitable data compression technique and the location of the data compression. De-duplication removes duplicate data from the Big data store. Resemblance detection and elimination algorithm uses two techniques namely, Dup-Adj and improved super-feature approach. Using them the similar data chunks are separated from non-similar data chunks. The Delta compression is also used to compress the data before storage. The general compression algorithms are computationally complex and also degrade the application response time. To address this application-specific ZIP-IO framework for FPGA accelerated compression is studied. In this framework a simple instruction trace entropy compression algorithm is implemented in FPGA substrate. The Record-aware Compression (RaC) technique guarantees that the splitting of compressed data blocks does not contain partial records in the data blocks and it is implemented in Hadoop MapReduce.

Suggested Citation

  • D. Anuradha & S. Bhuvaneswari, 2016. "A Detailed Review on the Prominent Compression Methods Used for Reducing the Data Volume of Big Data," Annals of Data Science, Springer, vol. 3(1), pages 47-62, March.
  • Handle: RePEc:spr:aodasc:v:3:y:2016:i:1:d:10.1007_s40745-016-0069-9
    DOI: 10.1007/s40745-016-0069-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-016-0069-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-016-0069-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:3:y:2016:i:1:d:10.1007_s40745-016-0069-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.