IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v18y2026i1p36-d1835663.html

Heterogeneous Federated Learning via Knowledge Transfer Guided by Global Pseudo Proxy Data

Author

Listed:
  • Wenhao Sun

    (Information Center, Dalian Party Institute of Communist Party of China, No. 75 Binhai West Road, Xigang District, Dalian 116016, China)

  • Xiaoxuan Guo

    (School of Computer Science and Artificial Intelligence, Liaoning Normal University, No. 1 Liushu South Street, Ganjingzi District, Dalian 116081, China)

  • Wenjun Liu

    (School of Computer Science and Artificial Intelligence, Liaoning Normal University, No. 1 Liushu South Street, Ganjingzi District, Dalian 116081, China)

  • Fang Sun

    (School of Computer Science and Artificial Intelligence, Liaoning Normal University, No. 1 Liushu South Street, Ganjingzi District, Dalian 116081, China)

Abstract

Federated learning with data free knowledge distillation enables effective and privacy-preserving knowledge aggregation by employing generators to produce local pseudo samples during client-side model migration. However, in practical applications, data distributions across different institutions are often non-independent and identically distributed (Non-IID), which introduces bias in local models and consequently impedes the effective transfer of knowledge to the global model. In addition, insufficient local training can further exacerbate model bias, undermining overall performance. To address these challenges, we propose a heterogeneous federated learning framework that enhances knowledge transfer through guidance from global proxy data. Specifically, a noise filter is incorporated into the training of local generators to mitigate the negative impact of low-quality pseudo proxy samples on local knowledge distillation. Furthermore, a global generator is introduced to produce global pseudo proxy samples, which, together with local pseudo proxy data, are used to construct a cross attention matrix. This design effectively alleviates overfitting and underfitting issues in local models caused by data heterogeneity. Extensive experiments on publicly available datasets with heterogeneous data distributions demonstrate the superiority of the proposed framework. Results show that when the Dirichlet distribution coefficient is 0.05, our method achieves an average accuracy improvement of 5.77% over popular baselines; when the coefficient is 0.1, the improvement reaches 6.54%. Even under uniformly distributed sample classes, our model still achieves an average accuracy improvement of 7.07% compared to other methods.

Suggested Citation

  • Wenhao Sun & Xiaoxuan Guo & Wenjun Liu & Fang Sun, 2026. "Heterogeneous Federated Learning via Knowledge Transfer Guided by Global Pseudo Proxy Data," Future Internet, MDPI, vol. 18(1), pages 1-23, January.
  • Handle: RePEc:gam:jftint:v:18:y:2026:i:1:p:36-:d:1835663
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/18/1/36/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/18/1/36/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:18:y:2026:i:1:p:36-:d:1835663. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.