Printed from https://ideas.repec.org/a/gam/jeners/v18y2025i9p2364-d1649853.html

Improving Transformer Health Index Prediction Performance Using Machine Learning Algorithms with a Synthetic Minority Oversampling Technique

Authors
  • Muhammad Akmal A. Putra

    (School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung 40132, Indonesia)

  • Suwarno

    (School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung 40132, Indonesia)

  • Rahman Azis Prasojo

    (Department of Electrical Engineering, Politeknik Negeri Malang, Malang 65141, Indonesia)

Abstract

Machine learning (ML) has emerged as a powerful tool in transformer condition assessment, enabling more accurate diagnostics by leveraging historical test data. However, imbalanced datasets, which typically contain few samples from transformers in poor condition, pose significant challenges to model performance. This study investigates oversampling techniques for improving the accuracy of ML models that predict the transformer Health Index. The dataset comprises 3850 transformer tests collected from utilities across Indonesia. Key parameters, including oil quality, dissolved gas analysis, and paper condition factors, serve as inputs for ML modeling. To address the class imbalance, four SMOTE-based methods were implemented and compared: the Synthetic Minority Oversampling Technique (SMOTE), Borderline-SMOTE, SMOTE-Tomek, and SMOTE-ENN. Their impact on model performance is assessed in terms of classification accuracy, precision, recall, and F1-score. The results show that all SMOTE-based methods improved model performance, with SMOTE-ENN yielding the best outcomes: it significantly reduced classification errors, particularly for minority classes, and so improved predictive reliability. These findings underscore the importance of advanced oversampling techniques in improving transformer diagnostics. By addressing the challenges posed by imbalanced datasets, this research provides a framework for applying ML in transformer condition monitoring and in other domains with similar data constraints.
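
The core idea behind SMOTE, as named in the abstract, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `smote`, the toy feature values, and the choice of `k` are all assumptions made for illustration, and the sketch uses only the Python standard library. Each synthetic sample is placed at a random point on the line segment between a minority-class sample and one of its k nearest minority-class neighbours. SMOTE-ENN additionally applies an edited-nearest-neighbours cleaning step after oversampling, which is not shown here.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority-class neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest neighbours of the base point among the other minority samples
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data (two features per test record), not from the study.
good = [[0.1 * i, 0.2 * i] for i in range(20)]            # majority class
poor = [[5.0, 5.0], [5.5, 4.8], [6.0, 5.2], [5.2, 5.9]]   # minority class

# Oversample the minority class until both classes have the same size.
extra = smote(poor, n_new=len(good) - len(poor), k=3)
balanced_poor = poor + extra
print(len(good), len(balanced_poor))  # prints "20 20"
```

Because every synthetic point lies on a segment between two real minority samples, the new points stay inside the region already occupied by the minority class rather than duplicating existing records. In a real workflow, resampling of this kind is normally applied to the training split only, so that the test set keeps its original class distribution.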

Suggested Citation

  • Muhammad Akmal A. Putra & Suwarno & Rahman Azis Prasojo, 2025. "Improving Transformer Health Index Prediction Performance Using Machine Learning Algorithms with a Synthetic Minority Oversampling Technique," Energies, MDPI, vol. 18(9), pages 1-44, May.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:9:p:2364-:d:1649853

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/9/2364/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/9/2364/
    Download Restriction: no
