IDEAS home Printed from https://ideas.repec.org/a/eee/reensy/v114y2013icp45-51.html
   My bibliography  Save this article

A new fault detection method for computer networks

Author

Listed:
  • Lu, Lu
  • Xu, Zhengguo
  • Wang, Wenhai
  • Sun, Youxian

Abstract

Over the past few years, fault detection for computer networks has attracted extensive attentions for its importance in network management. Most existing fault detection methods are based on active probing techniques which can detect the occurrence of faults fast and precisely. But these methods suffer from the limitation of traffic overhead, especially in large scale networks. To relieve traffic overhead induced by active probing based methods, a new fault detection method, whose key is to divide the detection process into multiple stages, is proposed in this paper. During each stage, only a small region of the network is detected by using a small set of probes. Meanwhile, it also ensures that the entire network can be covered after multiple detection stages. This method can guarantee that the traffic used by probes during each detection stage is small sufficiently so that the network can operate without severe disturbance from probes. Several simulation results verify the effectiveness of the proposed method.

Suggested Citation

  • Lu, Lu & Xu, Zhengguo & Wang, Wenhai & Sun, Youxian, 2013. "A new fault detection method for computer networks," Reliability Engineering and System Safety, Elsevier, vol. 114(C), pages 45-51.
  • Handle: RePEc:eee:reensy:v:114:y:2013:i:c:p:45-51
    DOI: 10.1016/j.ress.2012.12.015
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0951832013000045
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ress.2012.12.015?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bartlett, L.M. & Hurdle, E.E. & Kelly, E.M., 2009. "Integrated system fault diagnostics utilising digraph and fault tree-based approaches," Reliability Engineering and System Safety, Elsevier, vol. 94(6), pages 1107-1115.
    2. Rocco S., Claudio M. & Ramirez-Marquez, José Emmanuel, 2011. "Vulnerability metrics and analysis for communities in complex networks," Reliability Engineering and System Safety, Elsevier, vol. 96(10), pages 1360-1366.
    3. Kim, Kyungmee O. & Zuo, Ming J., 2007. "Two fault classification methods for large systems when available data are limited," Reliability Engineering and System Safety, Elsevier, vol. 92(5), pages 585-592.
    4. Doguc, Ozge & Emmanuel Ramirez-Marquez, Jose, 2012. "An automated method for estimating reliability of grid systems using Bayesian networks," Reliability Engineering and System Safety, Elsevier, vol. 104(C), pages 96-105.
    5. Chi Zhang & José Ramirez-Marquez & Claudio Sanseverino, 2011. "A holistic method for reliability performance assessment and critical components detection in complex networks," IISE Transactions, Taylor & Francis Journals, vol. 43(9), pages 661-675.
    6. Assaf, T. & Dugan, J.B., 2008. "Diagnosis based on reliability analysis using monitors and sensors," Reliability Engineering and System Safety, Elsevier, vol. 93(4), pages 509-521.
    7. Wilson, Alyson G. & Huzurbazar, Aparna V., 2007. "Bayesian networks for multilevel system reliability," Reliability Engineering and System Safety, Elsevier, vol. 92(10), pages 1413-1420.
    8. Doguc, Ozge & Ramirez-Marquez, Jose Emmanuel, 2009. "A generic method for estimating system reliability using Bayesian networks," Reliability Engineering and System Safety, Elsevier, vol. 94(2), pages 542-550.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yi-Kuei Lin & Lance Fiondella & Ping-Chen Chang, 2022. "Reliability of time-constrained multi-state network susceptible to correlated component faults," Annals of Operations Research, Springer, vol. 311(1), pages 239-254, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Babaleye, Ahmed O. & Kurt, Rafet Emek & Khan, Faisal, 2019. "Safety analysis of plugging and abandonment of oil and gas wells in uncertain conditions with limited data," Reliability Engineering and System Safety, Elsevier, vol. 188(C), pages 133-141.
    2. Kondakci, Suleyman, 2015. "Analysis of information security reliability: A tutorial," Reliability Engineering and System Safety, Elsevier, vol. 133(C), pages 275-299.
    3. Cai, Baoping & Liu, Yonghong & Liu, Zengkai & Tian, Xiaojie & Dong, Xin & Yu, Shilin, 2012. "Using Bayesian networks in reliability evaluation for subsea blowout preventer control system," Reliability Engineering and System Safety, Elsevier, vol. 108(C), pages 32-41.
    4. Zhong, X. & Ichchou, M. & Saidi, A., 2010. "Reliability assessment of complex mechatronic systems using a modified nonparametric belief propagation algorithm," Reliability Engineering and System Safety, Elsevier, vol. 95(11), pages 1174-1185.
    5. Zhang, Chi & Ramirez-Marquez, José Emmanuel & Wang, Jianhui, 2015. "Critical infrastructure protection using secrecy – A discrete simultaneous game," European Journal of Operational Research, Elsevier, vol. 242(1), pages 212-221.
    6. Wen, Tao & Gao, Qiuya & Chen, Yu-wang & Cheong, Kang Hao, 2022. "Exploring the vulnerability of transportation networks by entropy: A case study of Asia–Europe maritime transportation network," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
    7. Yan-Feng Li & Jinhua Mi & Yu Liu & Yuan-Jian Yang & Hong-Zhong Huang, 2015. "Dynamic fault tree analysis based on continuous-time Bayesian networks under fuzzy numbers," Journal of Risk and Reliability, , vol. 229(6), pages 530-541, December.
    8. Hiba Baroud & Jose E. Ramirez‐Marquez & Kash Barker & Claudio M. Rocco, 2014. "Stochastic Measures of Network Resilience: Applications to Waterway Commodity Flows," Risk Analysis, John Wiley & Sons, vol. 34(7), pages 1317-1335, July.
    9. Baroud, Hiba & Barker, Kash & Ramirez-Marquez, Jose E. & Rocco S., Claudio M., 2014. "Importance measures for inland waterway network resilience," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 62(C), pages 55-67.
    10. Li, Yapeng & Qiao, Shun & Deng, Ye & Wu, Jun, 2019. "Stackelberg game in critical infrastructures from a network science perspective," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 705-714.
    11. Claudio M. Rocco & Kash Barker & Jose Moronta, 2022. "Determining the best algorithm to detect community structures in networks: application to power systems," Environment Systems and Decisions, Springer, vol. 42(2), pages 251-264, June.
    12. Li, Gong & Shi, Jing, 2012. "Applications of Bayesian methods in wind energy conversion systems," Renewable Energy, Elsevier, vol. 43(C), pages 1-8.
    13. Chi Zhang & Jose Ramirez-Marquez, 2013. "Protecting critical infrastructures against intentional attacks: a two-stage game with incomplete information," IISE Transactions, Taylor & Francis Journals, vol. 45(3), pages 244-258.
    14. Li, Y.F. & Sansavini, G. & Zio, E., 2013. "Non-dominated sorting binary differential evolution for the multi-objective optimization of cascading failures protection in complex networks," Reliability Engineering and System Safety, Elsevier, vol. 111(C), pages 195-205.
    15. Claudio M Rocco & Kash Barker & Jose Moronta & Jose E Ramirez-Marquez, 2018. "Community detection and resilience in multi-source, multi-terminal networks," Journal of Risk and Reliability, , vol. 232(6), pages 616-626, December.
    16. Dui, Hongyan & Meng, Xueyu & Xiao, Hui & Guo, Jianjun, 2020. "Analysis of the cascading failure for scale-free networks based on a multi-strategy evolutionary game," Reliability Engineering and System Safety, Elsevier, vol. 199(C).
    17. Marhavilas, P.K. & Koulouriotis, D.E., 2012. "A combined usage of stochastic and quantitative risk assessment methods in the worksites: Application on an electric power provider," Reliability Engineering and System Safety, Elsevier, vol. 97(1), pages 36-46.
    18. Iamsumang, Chonlagarn & Mosleh, Ali & Modarres, Mohammad, 2018. "Monitoring and learning algorithms for dynamic hybrid Bayesian network in on-line system health management applications," Reliability Engineering and System Safety, Elsevier, vol. 178(C), pages 118-129.
    19. Ramirez-Marquez, Jose E. & Rocco, Claudio M. & Barker, Kash & Moronta, Jose, 2018. "Quantifying the resilience of community structures in networks," Reliability Engineering and System Safety, Elsevier, vol. 169(C), pages 466-474.
    20. Farzin Salehpour-Oskouei & Mohammad Pourgol-Mohammad, 2018. "Sensor placement determination in system health monitoring process based on dual information risk and uncertainty criteria," Journal of Risk and Reliability, , vol. 232(1), pages 65-81, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:114:y:2013:i:c:p:45-51. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.