Surprise Bug Report Prediction Utilizing Optimized Integration with Imbalanced Learning Strategy

Author

Listed:
  • Hui Li
  • Yang Qu
  • Shikai Guo
  • Guofeng Gao
  • Rong Chen
  • Guo Chen

Abstract

In software projects, large numbers of bugs are reported to bug repositories. Because budget and workforce are limited, developers often lack the time and capacity to inspect every reported bug, so they focus on inspecting and repairing high-impact bugs. Among high-impact bugs, surprise bugs are reported to be a fatal threat to software systems, although they account for only a small proportion of all bugs. Identifying surprise bugs is therefore an important task in practice. In recent years, researchers have proposed several methods to identify surprise bugs, but the performance of these methods is still unsatisfactory for software projects. The main reason is that surprise bugs make up only a small percentage of all bugs, which makes them difficult to identify in such an imbalanced distribution. To overcome the imbalanced category distribution of bugs, this paper presents a machine learning method for predicting surprise bugs. The method takes into account the textual features of bug reports and employs an imbalanced learning strategy to balance the bug report datasets. The balanced datasets are then used to train three classifiers, each built with a different classification algorithm, which predict the reports of unknown type. In particular, an ensemble method named optimization integration is proposed to generate a single best result from the results produced by the three classifiers. This ensemble method can adjust each classifier's ability to detect different categories according to the characteristics of different projects, and it integrates the advantages of the three classifiers. Experiments on datasets from 4 software projects show that this method performs better than previous methods at detecting surprise bugs.
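
The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not name the balancing technique, the three classification algorithms, or the objective that optimization integration maximizes, so everything below is an assumption made for illustration: random oversampling stands in for the imbalanced learning strategy, the three classifiers are represented only by their predicted surprise probabilities on a validation set, and the ensemble is a weighted vote whose weights are grid-searched to maximize F1 on the surprise class. The function names (`random_oversample`, `combine`, `search_weights`) are hypothetical and do not come from the paper.

```python
import random
from itertools import product


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples until every class has the same size.

    Assumption: the paper's balancing strategy is not specified in the
    abstract; random oversampling is used here only as a stand-in.
    """
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)
    target = max(len(xs) for xs in by_class.values())
    out_x, out_y = [], []
    for y, xs in by_class.items():
        extra = [rng.choice(xs) for _ in range(target - len(xs))]
        for x in xs + extra:
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y


def f1_for_positive(y_true, y_pred):
    """F1 score of the positive (surprise = 1) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def combine(prob_lists, weights, threshold=0.5):
    """Weighted average of per-classifier surprise probabilities, thresholded."""
    total = sum(weights)
    preds = []
    for probs in zip(*prob_lists):
        score = sum(w * p for w, p in zip(weights, probs)) / total
        preds.append(1 if score >= threshold else 0)
    return preds


def search_weights(prob_lists, y_true, steps=5):
    """Grid-search classifier weights that maximise F1 on the surprise class.

    Assumption: a hypothetical stand-in for 'optimization integration';
    the real objective and search procedure may differ.
    """
    grid = [i / steps for i in range(steps + 1)]
    best_w, best_f1 = None, -1.0
    for w in product(grid, repeat=len(prob_lists)):
        if sum(w) == 0:
            continue  # all-zero weights define no vote
        score = f1_for_positive(y_true, combine(prob_lists, w))
        if score > best_f1:
            best_w, best_f1 = w, score
    return best_w, best_f1


if __name__ == "__main__":
    # Made-up validation probabilities from three hypothetical classifiers.
    y_val = [1, 0, 0, 1, 0, 0]
    probs = [
        [0.9, 0.6, 0.2, 0.4, 0.1, 0.3],
        [0.4, 0.2, 0.7, 0.8, 0.3, 0.1],
        [0.6, 0.4, 0.4, 0.6, 0.6, 0.2],
    ]
    print(search_weights(probs, y_val))
```

Because the weight grid includes the settings where a single classifier receives all the weight, the searched ensemble can never score below the best individual classifier on the validation data. Searching the weights separately for each project is one way such an ensemble could adapt to project-specific characteristics, as the abstract describes.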

Suggested Citation

  • Hui Li & Yang Qu & Shikai Guo & Guofeng Gao & Rong Chen & Guo Chen, 2020. "Surprise Bug Report Prediction Utilizing Optimized Integration with Imbalanced Learning Strategy," Complexity, Hindawi, vol. 2020, pages 1-14, February.
  • Handle: RePEc:hin:complx:8509821
    DOI: 10.1155/2020/8509821

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/8503/2020/8509821.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/8503/2020/8509821.xml
    Download Restriction: no


    Citations

    Cited by:

    1. Su, Zhan & Zheng, Xiliang & Ai, Jun & Shen, Yuming & Zhang, Xuanxiong, 2020. "Link prediction in recommender systems based on vector similarity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 560(C).
