IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v13y2021i8p216-d618376.html
   My bibliography  Save this article

Cross-Project Defect Prediction Method Based on Manifold Feature Transformation

Author

Listed:
  • Yu Zhao

    (School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China)

  • Yi Zhu

    (School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China
    Key Laboratory of Safety-Critical Software, Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)

  • Qiao Yu

    (School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China)

  • Xiaoying Chen

    (School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China)

Abstract

Traditional research methods in software defect prediction use part of the data in the same project to train the defect prediction model and predict the defect label of the remaining part of the data. However, in the practical realm of software development, the software project that needs to be predicted is generally a brand new software project, and there is not enough labeled data to build a defect prediction model; therefore, traditional methods are no longer applicable. Cross-project defect prediction uses the labeled data of the same type of project similar to the target project to build the defect prediction model, so as to solve the problem of data loss in traditional methods. However, the difference in data distribution between the same type of project and the target project reduces the performance of defect prediction. To solve this problem, this paper proposes a cross-project defect prediction method based on manifold feature transformation. This method transforms the original feature space of the project into a manifold space, then reduces the difference in data distribution of the transformed source project and the transformed target project in the manifold space, and finally uses the transformed source project to train a naive Bayes prediction model with better performance. A comparative experiment was carried out using the Relink dataset and the AEEEM dataset. The experimental results show that compared with the benchmark method and several cross-project defect prediction methods, the proposed method effectively reduces the difference in data distribution between the source project and the target project, and obtains a higher F1 value, which is an indicator commonly used to measure the performance of the two-class model.

Suggested Citation

  • Yu Zhao & Yi Zhu & Qiao Yu & Xiaoying Chen, 2021. "Cross-Project Defect Prediction Method Based on Manifold Feature Transformation," Future Internet, MDPI, vol. 13(8), pages 1-18, August.
  • Handle: RePEc:gam:jftint:v:13:y:2021:i:8:p:216-:d:618376
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/13/8/216/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/13/8/216/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    cross-project defect prediction; manifold feature transformation; naive Bayes prediction model; F1;
    All these keywords.

    JEL classification:

    • F1 - International Economics - - Trade

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:13:y:2021:i:8:p:216-:d:618376. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.