IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i8p1972-d1129808.html
   My bibliography  Save this article

A Semi-Federated Active Learning Framework for Unlabeled Online Network Data

Author

Listed:
  • Yuwen Zhou

    (College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
    Science and Technology on Information Systems Engineering Laboratory, Changsha 410073, China)

  • Yuhan Hu

    (Science and Technology on Information Systems Engineering Laboratory, Changsha 410073, China)

  • Jing Sun

    (College of Intelligence and Computing, Tianjin University, Tianjin 300350, China)

  • Rui He

    (College of Intelligence and Computing, Tianjin University, Tianjin 300350, China)

  • Wenjie Kang

    (Hunan Provincial Key Laboratory of Network Investigational Technology, Hunan Police Academy, Changsha 410125, China)

Abstract

Federated Learning (FL) is a newly emerged federated optimization technique for distributed data in a federated network. The participants in FL that train the model locally are classified into client nodes. The server node assumes the responsibility to aggregate local models from client nodes without data moving. In this regard, FL is an ideal solution to protect data privacy at each node of the network. However, the raw data generated on each node are unlabeled, making it impossible for FL to apply these data directly to train a model. The large volume of data annotating work prevents FL from being widely applied in the real world, especially for online scenarios, where the data are generated continuously. Meanwhile, the data generated on different nodes tend to be differently distributed. It has been proved theoretically and experimentally that non-independent and identically distributed (non-IID) data harm the performance of FL. In this article, we design a semi-federated active learning (semi-FAL) framework to tackle the annotation and non-IID problems jointly. More specifically, the server node can provide (i) a pre-trained model to help each client node annotate the local data uniformly and (ii) an estimation of the global gradient to help correct the local gradient. The evaluation results demonstrate our semi-FAL framework can efficiently handle unlabeled online network data and achieves high accuracy and fast convergence.

Suggested Citation

  • Yuwen Zhou & Yuhan Hu & Jing Sun & Rui He & Wenjie Kang, 2023. "A Semi-Federated Active Learning Framework for Unlabeled Online Network Data," Mathematics, MDPI, vol. 11(8), pages 1-13, April.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:8:p:1972-:d:1129808
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/8/1972/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/8/1972/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:8:p:1972-:d:1129808. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.