IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i10p2385-d1151734.html
   My bibliography  Save this article

FedISM: Enhancing Data Imbalance via Shared Model in Federated Learning

Author

Listed:
  • Wu-Chun Chung

    (Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan)

  • Yan-Hui Lin

    (Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan)

  • Sih-Han Fang

    (Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan)

Abstract

Considering the sensitivity of data in medical scenarios, federated learning (FL) is suitable for applications that require data privacy. Medical personnel can use the FL framework for machine learning to assist in analyzing large-scale data that are protected within the institution. However, not all clients have the same distribution of datasets, so data imbalance problems occur among clients. The main challenge is to overcome the performance degradation caused by low accuracy and the inability to converge the model. This paper proposes a FedISM method to enhance performance in the case of Non-Independent Identically Distribution (Non-IID). FedISM exploits a shared model trained on a candidate dataset before performing FL among clients. The Candidate Selection Mechanism (CSM) was proposed to effectively select the most suitable candidate among clients for training the shared model. Based on the proposed approaches, FedISM not only trains the shared model without sharing any raw data, but it also provides an optimal solution through the selection of the best shared model. To evaluate performance, the proposed FedISM was applied to classify coronavirus disease (COVID), pneumonia, normal, and viral pneumonia in the experiments. The Dirichlet process was also used to simulate a variety of imbalanced data distributions. Experimental results show that FedISM improves accuracy by up to 25% when privacy concerns regarding patient data are rising among medical institutions.

Suggested Citation

  • Wu-Chun Chung & Yan-Hui Lin & Sih-Han Fang, 2023. "FedISM: Enhancing Data Imbalance via Shared Model in Federated Learning," Mathematics, MDPI, vol. 11(10), pages 1-22, May.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:10:p:2385-:d:1151734
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/10/2385/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/10/2385/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Saad I. Nafisah & Ghulam Muhammad & M. Shamim Hossain & Salman A. AlQahtani, 2023. "A Comparative Evaluation between Convolutional Neural Networks and Vision Transformers for COVID-19 Detection," Mathematics, MDPI, vol. 11(6), pages 1-20, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:10:p:2385-:d:1151734. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.