IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i15p2509-d1717406.html

Privacy-Aware Table Data Generation by Adversarial Gradient Boosting Decision Tree

Author

Listed:
  • Shuai Jiang

    (Graduate School of Advanced Science and Engineering, Hiroshima University, Kagamiyama 1-7-1, Higashi-Hiroshima 739-8521, Japan)

  • Naoto Iwata

    (Graduate School of Advanced Science and Engineering, Hiroshima University, Kagamiyama 1-7-1, Higashi-Hiroshima 739-8521, Japan)

  • Sayaka Kamei

    (Graduate School of Advanced Science and Engineering, Hiroshima University, Kagamiyama 1-7-1, Higashi-Hiroshima 739-8521, Japan)

  • Kazi Md. Rokibul Alam

    (Department of Computer Science and Engineering, Khulna University of Engineering and Technology, Khulna 9203, Bangladesh)

  • Yasuhiko Morimoto

    (Graduate School of Advanced Science and Engineering, Hiroshima University, Kagamiyama 1-7-1, Higashi-Hiroshima 739-8521, Japan)

Abstract

Privacy preservation poses significant challenges in third-party data sharing, particularly when handling table data containing personal information such as demographic and behavioral records. Synthetic table data generation has emerged as a promising solution to enable data analysis while mitigating privacy risks. While Generative Adversarial Networks (GANs) are widely used for this purpose, they exhibit limitations in modeling table data due to challenges in handling mixed data types (numerical/categorical), non-Gaussian distributions, and imbalanced variables. To address these limitations, this study proposes a novel adversarial learning framework integrating gradient boosting trees for synthesizing table data, called Adversarial Gradient Boosting Decision Tree (AGBDT). Experimental evaluations on several datasets demonstrate that our method outperforms representative baseline models regarding statistical similarity and machine learning utility. Furthermore, we introduce a privacy-aware adaptation of the framework by incorporating k-anonymization constraints, effectively reducing overfitting to source data while maintaining practical usability. The results validate the balance between data utility and privacy preservation achieved by our approach.

Suggested Citation

  • Shuai Jiang & Naoto Iwata & Sayaka Kamei & Kazi Md. Rokibul Alam & Yasuhiko Morimoto, 2025. "Privacy-Aware Table Data Generation by Adversarial Gradient Boosting Decision Tree," Mathematics, MDPI, vol. 13(15), pages 1-17, August.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2509-:d:1717406

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/15/2509/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/15/2509/
    Download Restriction: no


    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.