IDEAS home Printed from https://ideas.repec.org/a/spr/infotm/v25y2024i2d10.1007_s10799-022-00378-4.html
   My bibliography  Save this article

DRAPE: optimizing private data release under adjustable privacy-utility equilibrium

Author

Listed:
  • Qingyue Xiong

    (Hunan University)

  • Qiujun Lan

    (Hunan University)

  • Jiaqi Ma

    (Hunan University)

  • Huiling Zhou

    (Hunan University)

  • Gang Li

    (Deakin University)

  • Zheng Yang

    (Hunan Tianhe Blockchain Research Institute)

Abstract

Data releasing and sharing between several fields has became inevitable tendency in the context of big data. Unfortunately, this situation has clearly caused enormous exposure of sensitive and private information. Along with massive privacy breaches, privacy-preservation issues were brought into sharp focus and privacy concerns may prevent people from providing their personal data. To meet the requirements of privacy protection, such a problem has been extensively studied. However, privacy protection of sensitive information should not prevent data users from conducting valid analyses of the released data. We propose a novel algorithm in this paper, named Data Release under Adjustable Privacy-utility Equilibrium (DRAPE), to address this problem. We handle the privacy versus utility tradeoff in the data release problem by breaking sensitive associations among variables while maintaining the correlations of nonsensitive variables. Furthermore, we quantify the impact of the proposed privacy-preserving method in terms of correlation preservation and privacy level, and thereby develop an optimization model to fulfil data privacy and data utility constraints. The proposed approach is not only able to provide a better privacy levels control scheme for data publishers, but also provides personalized service for data requesters with different utility requirements. We conduct experiments on one simulated dataset and two real datasets, and the simulation results show that DRAPE efficiently achieves a guaranteed privacy level while simultaneously effectively preserving data utility.

Suggested Citation

  • Qingyue Xiong & Qiujun Lan & Jiaqi Ma & Huiling Zhou & Gang Li & Zheng Yang, 2024. "DRAPE: optimizing private data release under adjustable privacy-utility equilibrium," Information Technology and Management, Springer, vol. 25(2), pages 199-217, June.
  • Handle: RePEc:spr:infotm:v:25:y:2024:i:2:d:10.1007_s10799-022-00378-4
    DOI: 10.1007/s10799-022-00378-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10799-022-00378-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10799-022-00378-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rathindra Sarathy & Krishnamurty Muralidhar, 2002. "The Security of Confidential Numerical Data in Databases," Information Systems Research, INFORMS, vol. 13(4), pages 389-403, December.
    2. Xiao-Bai Li & Sumit Sarkar, 2013. "Class-Restricted Clustering and Microperturbation for Data Privacy," Management Science, INFORMS, vol. 59(4), pages 796-812, April.
    3. Luvai Motiwalla & Xiao-Bai Li, 2013. "Developing privacy solutions for sharing and analysing healthcare data," International Journal of Business Information Systems, Inderscience Enterprises Ltd, vol. 13(2), pages 199-216.
    4. Krishnamurty Muralidhar & Rahul Parsa & Rathindra Sarathy, 1999. "A General Additive Data Perturbation Method for Database Security," Management Science, INFORMS, vol. 45(10), pages 1399-1415, October.
    5. Baak, M. & Koopman, R. & Snoek, H. & Klous, S., 2020. "A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Haibing Lu & Jaideep Vaidya & Vijayalakshmi Atluri & Yingjiu Li, 2015. "Statistical Database Auditing Without Query Denial Threat," INFORMS Journal on Computing, INFORMS, vol. 27(1), pages 20-34, February.
    2. Heng Xu & Nan Zhang, 2022. "Implications of Data Anonymization on the Statistical Evidence of Disparity," Management Science, INFORMS, vol. 68(4), pages 2600-2618, April.
    3. Xiao-Bai Li & Sumit Sarkar, 2006. "Privacy Protection in Data Mining: A Perturbation Approach for Categorical Data," Information Systems Research, INFORMS, vol. 17(3), pages 254-270, September.
    4. Cosimo Russo & Alberto Castro & Andrea Gioia & Vito Iacobellis & Angela Gorgoglione, 2023. "A Stormwater Management Framework for Predicting First Flush Intensity and Quantifying its Influential Factors," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(3), pages 1437-1459, February.
    5. P. Daniel Wright & Matthew J. Liberatore & Robert L. Nydick, 2006. "A Survey of Operations Research Models and Applications in Homeland Security," Interfaces, INFORMS, vol. 36(6), pages 514-529, December.
    6. Risto Silvola & Janne Harkonen & Olli Vilppola & Hanna Kropsu-Vehkapera & Harri Haapasalo, 2016. "Data quality assessment and improvement," International Journal of Business Information Systems, Inderscience Enterprises Ltd, vol. 22(1), pages 62-81.
    7. Trottini, Mario & Muralidhar, Krish & Sarathy, Rathindra, 2011. "Maintaining tail dependence in data shuffling using t copula," Statistics & Probability Letters, Elsevier, vol. 81(3), pages 420-428, March.
    8. Rathindra Sarathy & Krishnamurty Muralidhar & Rahul Parsa, 2002. "Perturbing Nonnormal Confidential Attributes: The Copula Approach," Management Science, INFORMS, vol. 48(12), pages 1613-1627, December.
    9. Shi, Dehua & Xu, Han & Wang, Shaohua & Hu, Jia & Chen, Long & Yin, Chunfang, 2024. "Deep reinforcement learning based adaptive energy management for plug-in hybrid electric vehicle with double deep Q-network," Energy, Elsevier, vol. 305(C).
    10. Syam Menon & Sumit Sarkar & Shibnath Mukherjee, 2005. "Maximizing Accuracy of Shared Databases when Concealing Sensitive Patterns," Information Systems Research, INFORMS, vol. 16(3), pages 256-270, September.
    11. Tianqi Zhang & Yue Zhou & Ming Li & Haoran Zhang & Tong Wang & Yu Tian, 2022. "Impacts of Urbanization on Drainage System Health and Sustainable Drainage Recommendations for Future Scenarios—A Small City Case in China," Sustainability, MDPI, vol. 14(24), pages 1-24, December.
    12. Templ, Matthias & Kowarik, Alexander & Meindl, Bernhard, 2015. "Statistical Disclosure Control for Micro-Data Using the R Package sdcMicro," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i04).
    13. Leng, Lijian & Li, Tanghao & Zhan, Hao & Rizwan, Muhammad & Zhang, Weijin & Peng, Haoyi & Yang, Zequn & Li, Hailong, 2023. "Machine learning-aided prediction of nitrogen heterocycles in bio-oil from the pyrolysis of biomass," Energy, Elsevier, vol. 278(PB).
    14. Yi Qian & Hui Xie, 2013. "Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases," NBER Working Papers 19586, National Bureau of Economic Research, Inc.
    15. Cesar de Lima Nogueira, Silvio & Och, Stephan Hennings & Moura, Luis Mauro & Domingues, Eric & Coelho, Leandro dos Santos & Mariani, Viviana Cocco, 2023. "Prediction of the NOx and CO2 emissions from an experimental dual fuel engine using optimized random forest combined with feature engineering," Energy, Elsevier, vol. 280(C).
    16. Jialiang Cui & Vanessa Hoi Mei Cheung & Wenjie Huang & Wan Sang Kan, 2022. "Mental Distress during the COVID-19 Pandemic: A Cross-Sectional Study of Women Receiving the Comprehensive Social Security Allowance in Hong Kong," IJERPH, MDPI, vol. 19(16), pages 1-13, August.
    17. Cimpoeru Smaranda & Roman Monica & Kobeissi Amira & Mohammad Heba, 2020. "How are European Migrants from the MENA Countries Affected by COVID-19? Insights from an Online Survey," Journal of Social and Economic Statistics, Sciendo, vol. 9(1), pages 128-143, August.
    18. Zhou, Yu & Chen, Ben & Meng, Kai & Zhou, Haoran & Chen, Wenshang & Zhang, Ning & Deng, Qihao & Yang, Guanghua & Tu, Zhengkai, 2023. "Optimal design of a cathode flow field for performance enhancement of PEM fuel cell," Applied Energy, Elsevier, vol. 343(C).
    19. Meghanath Macha & Natasha Zhang Foutz & Beibei Li & Anindya Ghose, 2024. "Personalized Privacy Preservation in Consumer Mobile Trajectories," Information Systems Research, INFORMS, vol. 35(1), pages 249-271, March.
    20. Yuan Liu & Chuyao Liao & Li Zhuo & Haiyan Tao, 2022. "Evaluating Effects of Dynamic Interventions to Control COVID-19 Pandemic: A Case Study of Guangdong, China," IJERPH, MDPI, vol. 19(16), pages 1-17, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infotm:v:25:y:2024:i:2:d:10.1007_s10799-022-00378-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.