IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0281519.html
   My bibliography  Save this article

Application of multivariate time-series model for high performance computing (HPC) fault prediction

Author

Listed:
  • Xiangdong Pei
  • Min Yuan
  • Guo Mao
  • Zhengbin Pang

Abstract

Aiming at the high reliability demand of increasingly large and complex supercomputing systems, this paper proposes a multidimensional fusion CBA-net (CNN-BiLSTAM-Attention) fault prediction model based on HDBSCAN clustering preprocessing classification data, which can effectively extract and learn the spatial and temporal features in the predecessor fault log. The model can effectively extract and learn the spatial and temporal features from the predecessor fault logs, and has the advantages of high sensitivity to time series features and sufficient extraction of local features, etc. The RMSE of the model for fault occurrence time prediction is 0.031, and the prediction accuracy of node location for fault occurrence is 93% on average, as demonstrated by experiments. The model can achieve fast convergence and improve the fine-grained and accurate fault prediction of large supercomputers.

Suggested Citation

  • Xiangdong Pei & Min Yuan & Guo Mao & Zhengbin Pang, 2023. "Application of multivariate time-series model for high performance computing (HPC) fault prediction," PLOS ONE, Public Library of Science, vol. 18(10), pages 1-18, October.
  • Handle: RePEc:plo:pone00:0281519
    DOI: 10.1371/journal.pone.0281519
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0281519
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0281519&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0281519?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mandakini Behera & Archana Sarangi & Debahuti Mishra & Pradeep Kumar Mallick & Jana Shafi & Parvathaneni Naga Srinivasu & Muhammad Fazal Ijaz, 2022. "Automatic Data Clustering by Hybrid Enhanced Firefly and Particle Swarm Optimization Algorithms," Mathematics, MDPI, vol. 10(19), pages 1-29, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Naveed Ahmed Malik & Naveed Ishtiaq Chaudhary & Muhammad Asif Zahoor Raja, 2023. "Firefly Optimization Heuristics for Sustainable Estimation in Power System Harmonics," Sustainability, MDPI, vol. 15(6), pages 1-20, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0281519. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.