IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v15y2024i7d10.1007_s13198-024-02333-8.html
   My bibliography  Save this article

Research on FCM-LR cross electricity theft detection based on big data user profile

Author

Listed:
  • Ronghui Hu

    (Henan Vocational College of Information and Statistics)

  • Tong Zhen

    (Henan University of Technology)

Abstract

Data-driven electricity theft detection (ETD) based on machine learning and deep learning has the advantages of automation, real-time performance, and efficiency while requiring a large amount of labeled data to train models. However, the imbalance ratio between positive and unlabeled samples has reached 1:200, which significantly limits the accuracy of the ETD model. In cases like this, we refer to it as positive-unlabeled learning. Down-sampling wastes a large amount of negative samples, while up-sampling will result in the ETD model not being robust. Both can lead to ETD models performing well in experimental environments but poorly in production environments. In this context, this paper proposes a semi-supervised electricity theft detection algorithm based on fuzzy c-means and logistic regression cross detection (FCM-LR). Firstly, a statistical feature set based on business data and load data is proposed to depict the profile of electricity users, which can achieve the effect of reducing the complexity of data structure. Furthermore, by using the FCM-LR method, the utilization of unlabeled data can be maximized, and new electricity theft patterns can be discovered. The simulation results show that the theft detection effect of this method is significant, with Precision, Recall, F1, and Area under Curve all approaching 99%.

Suggested Citation

  • Ronghui Hu & Tong Zhen, 2024. "Research on FCM-LR cross electricity theft detection based on big data user profile," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 15(7), pages 3251-3265, July.
  • Handle: RePEc:spr:ijsaem:v:15:y:2024:i:7:d:10.1007_s13198-024-02333-8
    DOI: 10.1007/s13198-024-02333-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-024-02333-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-024-02333-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Savian, Fernando de Souza & Siluk, Julio Cezar Mairesse & Garlet, Taís Bisognin & do Nascimento, Felipe Moraes & Pinheiro, José Renes & Vale, Zita, 2021. "Non-technical losses: A systematic contemporary article review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 147(C).
    2. Kong, Jun & Jiang, Wen & Tian, Qing & Jiang, Min & Liu, Tianshan, 2023. "Anomaly detection based on joint spatio-temporal learning for building electricity consumption," Applied Energy, Elsevier, vol. 334(C).
    3. Xuejiao Gong & Bo Tang & Ruijin Zhu & Wenlong Liao & Like Song, 2020. "Data Augmentation for Electricity Theft Detection Using Conditional Variational Auto-Encoder," Energies, MDPI, vol. 13(17), pages 1-14, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang, Kaixiang & Chen, Wuxing & Bi, Jichao & Wang, Mengzhi & Luo, Fengji, 2023. "Multi-view broad learning system for electricity theft detection," Applied Energy, Elsevier, vol. 352(C).
    2. Gao, Bixuan & Kong, Xiangyu & Li, Shangze & Chen, Yi & Zhang, Xiyuan & Liu, Ziyu & Lv, Weijia, 2024. "Enhancing anomaly detection accuracy and interpretability in low-quality and class imbalanced data: A comprehensive approach," Applied Energy, Elsevier, vol. 353(PB).
    3. Farooq, Asma & Shahid, Kamal & Olsen, Rasmus Løvenstein, 2024. "Securing the green grid: A data anomaly detection method for mitigating cyberattacks on smart meter measurements," International Journal of Critical Infrastructure Protection, Elsevier, vol. 46(C).
    4. Turowski, M. & Heidrich, B. & Weingärtner, L. & Springer, L. & Phipps, K. & Schäfer, B. & Mikut, R. & Hagenmeyer, V., 2024. "Generating synthetic energy time series: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 206(C).
    5. Han, Yongming & Hao, Yuhang & Feng, Mingfei & Chen, Kai & Xing, Rumeng & Liu, Yuandong & Lin, Xiaoyong & Ma, Bo & Fan, Jinzhen & Geng, Zhiqiang, 2024. "Novel STAttention GraphWaveNet model for residential household appliance prediction and energy structure optimization," Energy, Elsevier, vol. 307(C).
    6. Benish Kabir & Umar Qasim & Nadeem Javaid & Abdulaziz Aldegheishem & Nabil Alrajeh & Emad A. Mohammed, 2022. "Detecting Nontechnical Losses in Smart Meters Using a MLP-GRU Deep Model and Augmenting Data via Theft Attacks," Sustainability, MDPI, vol. 14(22), pages 1-19, November.
    7. Nsabimana, René & Perelman, Sergio & Walheer, Barnabé & Mapapa, Mbangala, 2024. "Effectiveness and efficiency in access to reliable electricity: The case of East African countries," Socio-Economic Planning Sciences, Elsevier, vol. 93(C).
    8. Elinor Ginzburg-Ganz & Eden Dina Horodi & Omar Shadafny & Uri Savir & Ram Machlev & Yoash Levron, 2025. "Statistical Foundations of Generative AI for Optimal Control Problems in Power Systems: Comprehensive Review and Future Directions," Energies, MDPI, vol. 18(10), pages 1-54, May.
    9. Klug, Thomas W. & Beyene, Abebe D. & Meles, Tensay H. & Toman, Michael A. & Hassen, Sied & Hou, Michael & Klooss, Benjamin & Mekonnen, Alemu & Jeuland, Marc, 2022. "A review of impacts of electricity tariff reform in Africa," Energy Policy, Elsevier, vol. 170(C).
    10. Stracqualursi, Erika & Rosato, Antonello & Di Lorenzo, Gianfranco & Panella, Massimo & Araneo, Rodolfo, 2023. "Systematic review of energy theft practices and autonomous detection through artificial intelligence methods," Renewable and Sustainable Energy Reviews, Elsevier, vol. 184(C).
    11. Mahdi Khodayar & Jacob Regan, 2023. "Deep Neural Networks in Power Systems: A Review," Energies, MDPI, vol. 16(12), pages 1-38, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:15:y:2024:i:7:d:10.1007_s13198-024-02333-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.