IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v36y2025i3d10.1007_s10845-024-02323-4.html
   My bibliography  Save this article

Technical language processing for Prognostics and Health Management: applying text similarity and topic modeling to maintenance work orders

Author

Listed:
  • Sarvesh Sundaram

    (Northeastern University)

  • Abe Zeid

    (Northeastern University)

Abstract

Modern manufacturing paradigms have incorporated Prognostics and Health Management (PHM) to implement data-driven methods for fault detection, failure prediction, and assessment of system health. The maintenance operation has similarly benefitted from these advancements, and predictive maintenance is now being used across the industry. Despite these developments, most of the approaches in maintenance rely on numerical data from sensors and field devices for any sort of analysis. Text data from Maintenance Work Orders (MWOs) contain some of the most crucial information pertaining to the functioning of systems and components, but are still regarded as ‘black holes’, i.e., they store valuable data without being used in decision-making. The analysis of this data can help save time and costs in maintenance. While Natural Language Processing (NLP) methods have been very successful in understanding and examining text data from non-technical sources, progress in the analysis of technical text data has been limited. Non-technical text data are usually structured and consist of standardized vocabularies allowing the use of out-of-the-box language processing methods in their analysis. On the other hand, records from MWOs are often semi-structured or unstructured; and consist of complicated terminologies, technical jargon, and industry-specific abbreviations. Deploying traditional NLP to such data can result in an imprecise and flawed analysis which can be very costly. Owing to these challenges, we propose a Technical Language Processing (TLP) framework for PHM. To illustrate its capabilities, we use text data from MWOs of aircraft to address two scenarios. First, we predict corrective actions for new maintenance problems by comparing them with existing problems using syntactic and semantic textual similarity matching and evaluate the results with cosine similarity scores. In the second scenario, we identify and extract the most dominant topics and salient terms from the data using Latent Dirichlet Allocation (LDA). Using the results, we are able to successfully link maintenance problems to standardized maintenance codes used in the aviation industry.

Suggested Citation

  • Sarvesh Sundaram & Abe Zeid, 2025. "Technical language processing for Prognostics and Health Management: applying text similarity and topic modeling to maintenance work orders," Journal of Intelligent Manufacturing, Springer, vol. 36(3), pages 1637-1657, March.
  • Handle: RePEc:spr:joinma:v:36:y:2025:i:3:d:10.1007_s10845-024-02323-4
    DOI: 10.1007/s10845-024-02323-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-024-02323-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-024-02323-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Juan José Montero Jiménez & Rob Vingerhoeds & Bernard Grabot & Sébastien Schwartz, 2023. "An ontology model for maintenance strategy selection and assessment," Journal of Intelligent Manufacturing, Springer, vol. 34(3), pages 1369-1387, March.
    2. Witteman, Max & Deng, Qichen & Santos, Bruno F., 2021. "A bin packing approach to solve the aircraft maintenance task allocation problem," European Journal of Operational Research, Elsevier, vol. 294(1), pages 365-376.
    3. Madhusudanan Navinchandran & Michael E. Sharp & Michael P. Brundage & Thurston B. Sexton, 2022. "Discovering critical KPI factors from natural language in maintenance work orders," Journal of Intelligent Manufacturing, Springer, vol. 33(6), pages 1859-1877, August.
    4. Rose, Rodrigo L. & Puranik, Tejas G. & Mavris, Dimitri N. & Rao, Arjun H., 2022. "Application of structural topic modeling to aviation safety data," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
    5. Cuixia Jiang & Hao Chen & Qifa Xu & Xiangxiang Wang, 2023. "Few-shot fault diagnosis of rotating machinery with two-branch prototypical networks," Journal of Intelligent Manufacturing, Springer, vol. 34(4), pages 1667-1681, April.
    6. Gerardo Emanuel Granados & Loïc Lacroix & Kamal Medjaher, 2020. "Condition monitoring and prediction of solution quality during a copper electroplating process," Journal of Intelligent Manufacturing, Springer, vol. 31(2), pages 285-300, February.
    7. Rui Zhang & Na Zhao & Liuhu Fu & Xiaolu Bai & Jianghui Cai, 2023. "Recognizing defects in stainless steel welds based on multi-domain feature expression and self-optimization," Journal of Intelligent Manufacturing, Springer, vol. 34(3), pages 1293-1309, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Changjiu Li & Yong Zhang & Xichao Su & Xinwei Wang, 2022. "An Improved Optimization Algorithm for Aeronautical Maintenance and Repair Task Scheduling Problem," Mathematics, MDPI, vol. 10(20), pages 1-25, October.
    2. Sciau, Jean-Baptiste & Goyon, Agathe & Sarazin, Alexandre & Bascans, Jérémy & Prud’homme, Charles & Lorca, Xavier, 2024. "Using constraint programming to address the operational aircraft line maintenance scheduling problem," Journal of Air Transport Management, Elsevier, vol. 115(C).
    3. Wang, Hui & Zheng, Junkang & Xiang, Jiawei, 2023. "Online bearing fault diagnosis using numerical simulation models and machine learning classifications," Reliability Engineering and System Safety, Elsevier, vol. 234(C).
    4. Wang, Mengyao & Zhou, Chenhao & Wang, Aihu, 2022. "A cluster-based yard template design integrated with yard crane deployment using a placement heuristic," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 160(C).
    5. Zhou, Di & Zhuang, Xiao & Zuo, Hongfu & Cai, Jing & Zhao, Xufeng & Xiang, Jiawei, 2022. "A model fusion strategy for identifying aircraft risk using CNN and Att-BiLSTM," Reliability Engineering and System Safety, Elsevier, vol. 228(C).
    6. Marko Orošnjak & Dragoljub Šević, 2023. "Benchmarking Maintenance Practices for Allocating Features Affecting Hydraulic System Maintenance: A West-Balkan Perspective," Mathematics, MDPI, vol. 11(18), pages 1-30, September.
    7. Tao Dai & Xiangqi Fan, 2021. "Multi-Stove Scheduling for Sustainable On-Demand Food Delivery," Sustainability, MDPI, vol. 13(23), pages 1-13, November.
    8. Li, Jiawen & Meng, Lu & Zhang, Zelin & Yang, Kejia, 2023. "Low-frequency, high-impact: Discovering important rare events from UGC," Journal of Retailing and Consumer Services, Elsevier, vol. 70(C).
    9. Li, Libo & Yu, Huan & Kunc, Martin, 2024. "The impact of forum content on data science open innovation performance: A system dynamics-based causal machine learning approach," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    10. Epstein, Leah, 2024. "Tighter bounds for the harmonic bin packing algorithm," European Journal of Operational Research, Elsevier, vol. 316(1), pages 72-84.
    11. Xiao Zhang & Weiguo Huang & Rui Wang & Jun Wang & Changqing Shen, 2025. "Dual prototypical contrastive network: a novel self-supervised method for cross-domain few-shot fault diagnosis," Journal of Intelligent Manufacturing, Springer, vol. 36(1), pages 475-490, January.
    12. Valcamonico, Dario & Baraldi, Piero & Zio, Enrico & Decarli, Luca & Crivellari, Anna & Rosa, Laura La, 2024. "Combining natural language processing and bayesian networks for the probabilistic estimation of the severity of process safety events in hydrocarbon production assets," Reliability Engineering and System Safety, Elsevier, vol. 241(C).
    13. Jinping Liu & Jie Wang & Xianfeng Liu & Tianyu Ma & Zhaohui Tang, 2022. "MWRSPCA: online fault monitoring based on moving window recursive sparse principal component analysis," Journal of Intelligent Manufacturing, Springer, vol. 33(5), pages 1255-1271, June.
    14. van Kessel, Paul J. & Freeman, Floris C. & Santos, Bruno F., 2023. "Airline maintenance task rescheduling in a disruptive environment," European Journal of Operational Research, Elsevier, vol. 308(2), pages 605-621.
    15. Park, So-Young & Wang, Xinchen & Oh, Yeeun & Hong, Soo-Min & Woo, Su-Han, 2025. "Application of structural topic modeling in a literature review of air transport," Journal of Air Transport Management, Elsevier, vol. 122(C).
    16. Alotaibi, Naif M. & Scarf, Philip & Cavalcante, Cristiano A.V. & Lopes, Rodrigo S. & de Oliveira e Silva, André Luiz & Rodrigues, Augusto J.S. & Alyami, Salem A., 2023. "Modified-opportunistic inspection and the case of remote, groundwater well-heads," Reliability Engineering and System Safety, Elsevier, vol. 237(C).
    17. Li, Guolong & Li, Yanjun & Fang, Chengyue & Su, Jian & Wang, Haotong & Sun, Shengdi & Zhang, Guolei & Shi, Jianxin, 2023. "Research on fault diagnosis of supercharged boiler with limited data based on few-shot learning," Energy, Elsevier, vol. 281(C).
    18. Polyakovskiy, Sergey & M’Hallah, Rym, 2022. "A lookahead matheuristic for the unweighed variable-sized two-dimensional bin packing problem," European Journal of Operational Research, Elsevier, vol. 299(1), pages 104-117.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:36:y:2025:i:3:d:10.1007_s10845-024-02323-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.