IDEAS home Printed from https://ideas.repec.org/a/igg/jaec00/v12y2021i1p1-17.html
   My bibliography  Save this article

Effects of Class Imbalance Using Machine Learning Algorithms: Case Study Approach

Author

Listed:
  • Swati V. Narwane

    (Datta Meghe College of Engineering, India)

  • Sudhir D. Sawarkar

    (Datta Meghe College of Engineering, India)

Abstract

Class imbalance is the major hurdle for machine learning-based systems. Data set is the backbone of machine learning and must be studied to handle the class imbalance. The purpose of this paper is to investigate the effect of class imbalance on the data sets. The proposed methodology determines the model accuracy for class distribution. To find possible solutions, the behaviour of an imbalanced data set was investigated. The study considers two case studies with data set divided balanced to unbalanced class distribution. Testing of the data set with trained and test data was carried out for standard machine learning algorithms. Model accuracy for class distribution was measured with the training data set. Further, the built model was tested with individual binary class. Results show that, for the improvement of the system performance, it is essential to work on class imbalance problems. The study concludes that the system produces biased results due to the majority class. In the future, the multiclass imbalance problem can be studied using advanced algorithms.

Suggested Citation

  • Swati V. Narwane & Sudhir D. Sawarkar, 2021. "Effects of Class Imbalance Using Machine Learning Algorithms: Case Study Approach," International Journal of Applied Evolutionary Computation (IJAEC), IGI Global, vol. 12(1), pages 1-17, January.
  • Handle: RePEc:igg:jaec00:v:12:y:2021:i:1:p:1-17
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJAEC.2021010101
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yifan Qin & Jinlong Wu & Wen Xiao & Kun Wang & Anbing Huang & Bowen Liu & Jingxuan Yu & Chuhao Li & Fengyu Yu & Zhanbing Ren, 2022. "Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type," IJERPH, MDPI, vol. 19(22), pages 1-16, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jaec00:v:12:y:2021:i:1:p:1-17. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.