IDEAS home Printed from https://ideas.repec.org/a/hin/jnlmpe/6402274.html
   My bibliography  Save this article

Big Data Privacy Preservation Using Principal Component Analysis and Random Projection in Healthcare

Author

Listed:
  • Ritu Ratra
  • Preeti Gulia
  • Nasib Singh Gill
  • Jyotir Moy Chatterjee
  • Tahir Mehmood

Abstract

With the rising usage of technology, a tremendous volume of data is being produced in the current scenario. This data contains a lot of personal data and may be given to third parties throughout the data mining process. Individual privacy is extremely difficult for the data owner to protect. Privacy-Preservation in Data Mining (PPDM) offers a solution to this problem. Encryption or anonymization have been recommended to preserve privacy in existing research. But encryption has high computing costs, and anonymization may drastically decrease the utility of data. This paper proposed a privacy-preserving strategy based on dimensionality reduction and feature selection. The proposed strategy is based on dimensionality reduction and feature selection that is difficult to reverse. The objective of this paper is to propose a perturbation-based privacy-preserving technique. Here, random projection and principal component analysis are utilized to alter the data. The main reason for this is that the dimension reduction combined with feature selection would cause the records to be perturbed more efficiently. The hybrid approach picks relevant features, decreases data dimensionality, and reduces training time, resulting in improved classification performance as measured by accuracy, kappa statistics, mean absolute error and other metrics. The proposed technique outperforms all other approaches in terms of classification accuracy increasing from 63.13 percent to 68.34 percent, proving its effectiveness in detecting cardiovascular illness. Even in its reduced form, the approach proposed here ensures that the dataset's classification accuracy is improved.

Suggested Citation

  • Ritu Ratra & Preeti Gulia & Nasib Singh Gill & Jyotir Moy Chatterjee & Tahir Mehmood, 2022. "Big Data Privacy Preservation Using Principal Component Analysis and Random Projection in Healthcare," Mathematical Problems in Engineering, Hindawi, vol. 2022, pages 1-12, August.
  • Handle: RePEc:hin:jnlmpe:6402274
    DOI: 10.1155/2022/6402274
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/mpe/2022/6402274.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/mpe/2022/6402274.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2022/6402274?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:jnlmpe:6402274. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.