IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v36y2025i1d10.1007_s10845-023-02245-7.html
   My bibliography  Save this article

Natural language processing (NLP) and association rules (AR)-based knowledge extraction for intelligent fault analysis: a case study in semiconductor industry

Author

Listed:
  • Zhiqiang Wang

    (Research Center)

  • Kenneth Ezukwoke

    (Mines Saint-Étienne, Univ. Clermont Auvergne, CNRS UMR 6158 LIMOS
    Henri FAYOL Institute)

  • Anis Hoayek

    (Mines Saint-Étienne, Univ. Clermont Auvergne, CNRS UMR 6158 LIMOS
    Henri FAYOL Institute)

  • Mireille Batton-Hubert

    (Mines Saint-Étienne, Univ. Clermont Auvergne, CNRS UMR 6158 LIMOS
    Henri FAYOL Institute)

  • Xavier Boucher

    (Mines Saint-Étienne, Univ. Clermont Auvergne, CNRS UMR 6158 LIMOS
    Center for Biomedical and Healthcare Engineering)

Abstract

Fault analysis (FA) is the process of collecting and analyzing data to determine the cause of a failure. It plays an important role in ensuring the quality in manufacturing process. Traditional FA techniques are time-consuming and labor-intensive, relying heavily on human expertise and the availability of failure inspection equipment. In semiconductor industry, a large amount of FA reports are generated by experts to record the fault descriptions, fault analysis path and fault root causes. With the development of Artificial Intelligence, it is possible to automate the industrial FA process while extracting expert knowledge from the vast FA report data. The goal of this research is to develop a complete expert knowledge extraction pipeline for FA in semiconductor industry based on advanced Natural Language Processing and Machine Learning. Our research aims at automatically predicting the fault root cause based on the fault descriptions. First, the text data from the FA reports are transformed into numerical data using Sentence Transformer embedding. The numerical data are converted into latent spaces using Generalized-Controllable Variational AutoEncoder. Then, the latent spaces are classified by Gaussian Mixture Model. Finally, Association Rules are applied to establish the relationship between the labels in the latent space of the fault descriptions and that of the fault root cause. The proposed algorithm has been evaluated with real data of semiconductor industry collected over three years. The average correctness of the predicted label achieves 97.8%. The method can effectively reduce the time of failure identification and the cost during the inspection stage.

Suggested Citation

  • Zhiqiang Wang & Kenneth Ezukwoke & Anis Hoayek & Mireille Batton-Hubert & Xavier Boucher, 2025. "Natural language processing (NLP) and association rules (AR)-based knowledge extraction for intelligent fault analysis: a case study in semiconductor industry," Journal of Intelligent Manufacturing, Springer, vol. 36(1), pages 357-372, January.
  • Handle: RePEc:spr:joinma:v:36:y:2025:i:1:d:10.1007_s10845-023-02245-7
    DOI: 10.1007/s10845-023-02245-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-023-02245-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-023-02245-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xu, Zhaoyi & Saleh, Joseph Homer, 2021. "Machine learning for reliability engineering and safety applications: Review of current status and future opportunities," Reliability Engineering and System Safety, Elsevier, vol. 211(C).
    2. Hahsler, Michael & Grün, Bettina & Hornik, Kurt, 2005. "arules - A Computational Environment for Mining Association Rules and Frequent Item Sets," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 14(i15).
    3. Chia-Yu Hsu & Wei-Chen Liu, 2021. "Multiple time-series convolutional neural network for fault detection and diagnosis and empirical study in semiconductor manufacturing," Journal of Intelligent Manufacturing, Springer, vol. 32(3), pages 823-836, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pan, Yongjun & Sun, Yu & Li, Zhixiong & Gardoni, Paolo, 2023. "Machine learning approaches to estimate suspension parameters for performance degradation assessment using accurate dynamic simulations," Reliability Engineering and System Safety, Elsevier, vol. 230(C).
    2. Jesus Crespo Cuaresma & Bettina Grün & Paul Hofmarcher & Stefan Humer & Mathias Moser, 2015. "A Comprehensive Approach to Posterior Jointness Analysis in Bayesian Model Averaging Applications," Department of Economics Working Papers wuwp193, Vienna University of Economics and Business, Department of Economics.
    3. Li, Yuanfu & Chen, Yao & Hu, Zhenchao & Zhang, Huisheng, 2023. "Remaining useful life prediction of aero-engine enabled by fusing knowledge and deep learning models," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    4. Phan, Hieu Chi & Dhar, Ashutosh Sutra & Bui, Nang Duc, 2023. "Reliability assessment of pipelines crossing strike-slip faults considering modeling uncertainties using ANN models," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
    5. Liu, Lu & Song, Xiao & Zhou, Zhetao, 2022. "Aircraft engine remaining useful life estimation via a double attention-based data-driven architecture," Reliability Engineering and System Safety, Elsevier, vol. 221(C).
    6. Liu, Jiale & Wang, Huan, 2024. "A brain-inspired energy-efficient Wide Spiking Residual Attention Framework for intelligent fault diagnosis," Reliability Engineering and System Safety, Elsevier, vol. 243(C).
    7. Yuan, Zixia & Xiong, Guojiang & Fu, Xiaofan & Mohamed, Ali Wagdy, 2023. "Improving fault tolerance in diagnosing power system failures with optimal hierarchical extreme learning machine," Reliability Engineering and System Safety, Elsevier, vol. 236(C).
    8. Chen, Edward & Bao, Han & Dinh, Nam, 2024. "Evaluating the reliability of machine-learning-based predictions used in nuclear power plant instrumentation and control systems," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
    9. Costa, Nahuel & Sánchez, Luciano, 2022. "Variational encoding approach for interpretable assessment of remaining useful life estimation," Reliability Engineering and System Safety, Elsevier, vol. 222(C).
    10. Wang, Weicheng & Chen, Jinglong & Zhang, Tianci & Liu, Zijun & Wang, Jun & Zhang, Xinwei & He, Shuilong, 2023. "An asymmetrical graph Siamese network for one-classanomaly detection of engine equipment with multi-source fusion," Reliability Engineering and System Safety, Elsevier, vol. 235(C).
    11. Hieu T. T. L. Pham & Mahdi Rafieizonooz & SangUk Han & Dong-Eun Lee, 2021. "Current Status and Future Directions of Deep Learning Applications for Safety Management in Construction," Sustainability, MDPI, vol. 13(24), pages 1-37, December.
    12. Bakeer, Tammam, 2023. "General partial safety factor theory for the assessment of the reliability of nonlinear structural systems," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    13. Wang, Chu & Dou, Manfeng & Li, Zhongliang & Outbib, Rachid & Zhao, Dongdong & Zuo, Jian & Wang, Yuanlin & Liang, Bin & Wang, Peng, 2023. "Data-driven prognostics based on time-frequency analysis and symbolic recurrent neural network for fuel cells under dynamic load," Reliability Engineering and System Safety, Elsevier, vol. 233(C).
    14. Liu, Ruonan & Xie, Yunfei & Lin, Di & Zhang, Weidong & Ding, Steven X., 2024. "Information-based Gradient enhanced Causal Learning Graph Neural Network for fault diagnosis of complex industrial processes," Reliability Engineering and System Safety, Elsevier, vol. 252(C).
    15. Lewis, Austin D. & Groth, Katrina M., 2022. "Metrics for evaluating the performance of complex engineering system health monitoring models," Reliability Engineering and System Safety, Elsevier, vol. 223(C).
    16. Yoichi Matsumoto, 2013. "Heterogeneous Combinations of Knowledge Elements: How the Knowledge Base Structure Impacts Knowledge-related Outcomes of a Firm," Discussion Paper Series DP2013-15, Research Institute for Economics & Business Administration, Kobe University.
    17. Bo, Yimin & Bao, Minglei & Ding, Yi & Hu, Yishuang, 2024. "A DNN-based reliability evaluation method for multi-state series-parallel systems considering semi-Markov process," Reliability Engineering and System Safety, Elsevier, vol. 242(C).
    18. Zhang, Qing & Tang, Lv & Xuan, Jianping & Shi, Tielin & Li, Rui, 2023. "An uncertainty relevance metric-based domain adaptation fault diagnosis method to overcome class relevance caused confusion," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    19. Alkhaleel, Basem A., 2024. "Machine learning applications in the resilience of interdependent critical infrastructure systems—A systematic literature review," International Journal of Critical Infrastructure Protection, Elsevier, vol. 44(C).
    20. Ma, Yulin & Li, Lei & Yang, Jun, 2022. "Convolutional kernel aggregated domain adaptation for intelligent fault diagnosis with label noise," Reliability Engineering and System Safety, Elsevier, vol. 227(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:36:y:2025:i:1:d:10.1007_s10845-023-02245-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.