
Improving Transformer Health Index Prediction Performance Using Machine Learning Algorithms with a Synthetic Minority Oversampling Technique

Authors

  • Muhammad Akmal A. Putra

    (School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung 40132, Indonesia)

  • Suwarno

    (School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung 40132, Indonesia)

  • Rahman Azis Prasojo

    (Department of Electrical Engineering, Politeknik Negeri Malang, Malang 65141, Indonesia)

Abstract

Machine learning (ML) has emerged as a powerful tool in transformer condition assessment, enabling more accurate diagnostics by leveraging historical test data. However, imbalanced datasets, often characterized by limited samples in poor transformer conditions, pose significant challenges to model performance. This study investigates the application of oversampling techniques to enhance ML model accuracy in predicting the Health Index of transformers. A dataset comprising 3850 transformer tests collected from utilities across Indonesia was used. Key parameters, including oil quality, dissolved gas analysis, and paper condition factors, were employed as inputs for ML modeling. To address the class imbalance, various oversampling methods, such as the Synthetic Minority Oversampling Technique (SMOTE), Borderline-SMOTE, SMOTE-Tomek, and SMOTE-ENN, were implemented and compared. This study explores the impact of these techniques on model performance, focusing on classification accuracy, precision, recall, and F1-score. The results reveal that all SMOTE-based methods improved model performance, with SMOTE-ENN yielding the best outcomes. It significantly reduced classification errors, particularly for minority classes, ensuring better predictive reliability. These findings underscore the importance of advanced oversampling techniques in improving transformer diagnostics. By effectively addressing the challenges posed by imbalanced datasets, this research provides a robust framework for applying ML in transformer condition monitoring and other domains with similar data constraints.
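The core idea behind SMOTE, as named in the abstract, is to create new minority-class samples by interpolating between an existing minority sample and one of its nearest minority neighbours. The sketch below illustrates only that interpolation step in plain Python. It is not the authors' implementation: the function name `smote_sketch`, the two-feature sample values, and the choice to pick base samples at random (rather than looping over every minority sample, as the original algorithm does) are all assumptions made for illustration.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    Hypothetical helper for illustration; expects at least two minority samples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base sample (excluding itself)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Made-up, already-scaled feature pairs standing in for a "poor condition" class.
poor_condition = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
new_samples = smote_sketch(poor_condition, n_new=10, k=2, seed=1)
print(len(new_samples), "synthetic samples generated")
```

Because every synthetic point lies on a segment between two real minority samples, it stays inside the region the minority class already occupies. Variants such as SMOTE-Tomek and SMOTE-ENN add a cleaning step after oversampling, which this sketch does not attempt.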

Suggested Citation

  • Muhammad Akmal A. Putra & Suwarno & Rahman Azis Prasojo, 2025. "Improving Transformer Health Index Prediction Performance Using Machine Learning Algorithms with a Synthetic Minority Oversampling Technique," Energies, MDPI, vol. 18(9), pages 1-44, May.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:9:p:2364-:d:1649853

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/9/2364/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/9/2364/
    Download Restriction: no

    References listed on IDEAS

    1. Azmi, A. & Jasni, J. & Azis, N. & Kadir, M.Z.A. Ab., 2017. "Evolution of transformer health index in the form of mathematical equation," Renewable and Sustainable Energy Reviews, Elsevier, vol. 76(C), pages 687-700.
    2. Yong Sun & Huakun Que & Qianqian Cai & Jingming Zhao & Jingru Li & Zhengmin Kong & Shuai Wang, 2022. "Borderline SMOTE Algorithm and Feature Selection-Based Network Anomalies Detection Strategy," Energies, MDPI, vol. 15(13), pages 1-13, June.
    3. Alhaytham Alqudsi & Ayman El-Hag, 2019. "Application of Machine Learning in Transformer Health Index Prediction," Energies, MDPI, vol. 12(14), pages 1-13, July.
    4. Emran Jawad Kadim & Norhafiz Azis & Jasronita Jasni & Siti Anom Ahmad & Mohd Aizam Talib, 2018. "Transformers Health Index Assessment Based on Neural-Fuzzy Network," Energies, MDPI, vol. 11(4), pages 1-14, March.
    5. Sergio Bustamante & Mario Manana & Alberto Arroyo & Raquel Martinez & Alberto Laso, 2020. "A Methodology for the Calculation of Typical Gas Concentration Values and Sampling Intervals in the Power Transformers of a Distribution System Operator," Energies, MDPI, vol. 13(22), pages 1-18, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alhaytham Alqudsi & Ayman El-Hag, 2019. "Application of Machine Learning in Transformer Health Index Prediction," Energies, MDPI, vol. 12(14), pages 1-13, July.
    2. Georgi Ivanov & Anelia Spasova & Valentin Mateev & Iliana Marinova, 2023. "Applied Complex Diagnostics and Monitoring of Special Power Transformers," Energies, MDPI, vol. 16(5), pages 1-24, February.
    3. Mohammed El Amine Senoussaoui & Mostefa Brahami & Issouf Fofana, 2021. "Transformer Oil Quality Assessment Using Random Forest with Feature Engineering," Energies, MDPI, vol. 14(7), pages 1-15, March.
    4. David L. Alvarez & Diego F. Rodriguez & Alben Cardenas & F. Faria da Silva & Claus Leth Bak & Rodolfo García & Sergio Rivera, 2021. "Optimal Decision Making in Electrical Systems Using an Asset Risk Management Framework," Energies, MDPI, vol. 14(16), pages 1-25, August.
    5. Raji Murugan & Ramasamy Raju, 2021. "Evaluation of in-service power transformer health condition for Inspection, Repair, and Replacement (IRR) maintenance planning in electric utilities," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 12(2), pages 318-336, April.
    6. Changzhi Li & Dandan Liu & Mao Wang & Hanlin Wang & Shuai Xu, 2023. "Detection of Outliers in Time Series Power Data Based on Prediction Errors," Energies, MDPI, vol. 16(2), pages 1-19, January.
    7. Olga Melnikova & Alexandr Nazarychev & Konstantin Suslov, 2022. "Enhancement of the Technique for Calculation and Assessment of the Condition of Major Insulation of Power Transformers," Energies, MDPI, vol. 15(4), pages 1-13, February.
    8. Ahmad Nayyar Hassan & Ayman El-Hag, 2020. "Two-Layer Ensemble-Based Soft Voting Classifier for Transformer Oil Interfacial Tension Prediction," Energies, MDPI, vol. 13(7), pages 1-11, April.
    9. Sergio Bustamante & Mario Manana & Alberto Arroyo & Raquel Martinez & Alberto Laso, 2020. "A Methodology for the Calculation of Typical Gas Concentration Values and Sampling Intervals in the Power Transformers of a Distribution System Operator," Energies, MDPI, vol. 13(22), pages 1-18, November.
    10. Hyeseon Lee & Byungsung Lee & Gyurim Han & Yuri Kim & Yongha Kim, 2023. "Development of Methods for an Overhead Cable Health Index Evaluation That Considers Economic Feasibility," Energies, MDPI, vol. 16(20), pages 1-13, October.
    11. Alexander S. Karandaev & Igor M. Yachikov & Andrey A. Radionov & Ivan V. Liubimov & Nikolay N. Druzhinin & Ekaterina A. Khramshina, 2022. "Fuzzy Algorithms for Diagnosis of Furnace Transformer Insulation Condition," Energies, MDPI, vol. 15(10), pages 1-21, May.
    12. Patryk Bohatyrewicz & Andrzej Mrozik, 2021. "The Analysis of Power Transformer Population Working in Different Operating Conditions with the Use of Health Index," Energies, MDPI, vol. 14(16), pages 1-14, August.
    13. Bustamante, Sergio & Manana, Mario & Arroyo, Alberto & Laso, Alberto & Martinez, Raquel, 2024. "Evolution of graphical methods for the identification of insulation faults in oil-immersed power transformers: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 199(C).
    14. Muhammad Sharil Yahaya & Norhafiz Azis & Amran Mohd Selva & Mohd Zainal Abidin Ab Kadir & Jasronita Jasni & Mohd Hendra Hairi & Young Zaidey Yang Ghazali & Mohd Aizam Talib, 2018. "Effect of Pre-Determined Maintenance Repair Rates on the Health Index State Distribution and Performance Condition Curve Based on the Markov Prediction Model for Sustainable Transformers Asset Managem," Sustainability, MDPI, vol. 10(10), pages 1-13, September.
