IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v18y2026i1p41-d1836492.html

AnonymAI: An Approach with Differential Privacy and Intelligent Agents for the Automated Anonymization of Sensitive Data

Author

Listed:
  • Marcelo Nascimento Oliveira Soares

    (Institute of Informatics, Federal University of Goiás (UFG), Goiânia 74001-970, GO, Brazil)

  • Leonardo Barbosa Oliveira

    (Department of Computer Science, Federal University of Minas Gerais (UFMG), Belo Horizonte 31270-901, MG, Brazil)

  • Antonio João Gonçalves Azambuja

    (Technological Institute of Aeronautics (ITA), São José dos Campos 12228-900, SP, Brazil)

  • Jean Phelipe de Oliveira Lima

    (Technological Institute of Aeronautics (ITA), São José dos Campos 12228-900, SP, Brazil)

  • Anderson Silva Soares

    (Institute of Informatics, Federal University of Goiás (UFG), Goiânia 74001-970, GO, Brazil)

Abstract

Data governance for responsible AI systems remains challenged by the lack of automated tools that can apply robust privacy-preserving techniques without destroying analytical value. We propose AnonymAI, a novel methodological framework that integrates LLM-based intelligent agents, the mathematical guarantees of differential privacy, and an automated workflow to generate anonymized datasets for analytical applications. This framework produces data tables with formally verifiable privacy protection, dramatically reducing the need for manual classification and the risk of human error. Focusing on the protection of tabular data containing sensitive personal information, AnonymAI is designed as a generalized, replicable pipeline adaptable to different regulations (e.g., General Data Protection Regulation) and use-case scenarios. The novelty lies in combining the contextual classification capabilities of LLMs with the mathematical rigor of differential privacy, enabling an end-to-end pipeline from raw data to a protected, analysis-ready dataset. The efficiency and formal guarantees of this approach offer significant advantages over conventional anonymization methods, which are often manual, inconsistent, and lack the verifiable protections of differential privacy. Validation studies, covering both controlled experiments on four types of synthetic datasets and broader tests on 19 real-world public tables from various domains, confirmed the applicability of the framework, with the agent-based classifier achieving high overall accuracy in identifying confidential columns. The results demonstrate that the protected data maintains high value for statistical analysis and machine learning models, highlighting AnonymAI’s potential to advance responsible data sharing. This work paves the way for trustworthy and scalable data governance in AI through a rigorously engineered automated anonymization pipeline.

Suggested Citation

  • Marcelo Nascimento Oliveira Soares & Leonardo Barbosa Oliveira & Antonio João Gonçalves Azambuja & Jean Phelipe de Oliveira Lima & Anderson Silva Soares, 2026. "AnonymAI: An Approach with Differential Privacy and Intelligent Agents for the Automated Anonymization of Sensitive Data," Future Internet, MDPI, vol. 18(1), pages 1-18, January.
  • Handle: RePEc:gam:jftint:v:18:y:2026:i:1:p:41-:d:1836492
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/18/1/41/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/18/1/41/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:18:y:2026:i:1:p:41-:d:1836492. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.