Author
Listed:
- Mahmoud Abdelmohsen Ahmed
(Department of Computer Science, Faculty of Graduate Studies for Statistical Research, Cairo University, Giza 12613, Egypt)
- Shahira Shaaban Azab
(Department of Computer Science, Faculty of Graduate Studies for Statistical Research, Cairo University, Giza 12613, Egypt)
- Hesham Ahmed Hefny
(Department of Computer Science, Faculty of Graduate Studies for Statistical Research, Cairo University, Giza 12613, Egypt)
Abstract
Software defect prediction is an important activity in every software firms, so the Software defects have severe consequences, especially in mission-critical arrangements developed by organizations like NASA. Effective techniques for early detection and prediction of defective software modules are crucial for ensuring reliability and qualifying risks. This study investigates the application of machine learning models for predicting software defects using datasets from NASA’s Metrics Data Program depending on 10 datasets. Four different classification models – Support Vector Machines, Random Forests, Logistic Regression, and Ensemble model – were evaluated on their ability to classify software modules as defective or non-defective based on software metrics. The datasets exhibited significant class imbalance, with defective modules being the minority class. To address this, the Synthetic Minority Over-sampling Technique was employed, which generated synthetic examples of the minority class, leading to improved performance across all models. Also, two feature selection procedures, Recursive Feature Elimination with Cross-Validation and Information Gain, were applied and compared. RFECV generally resulted in higher accuracy and precision, while the results for recall and F1-score were mixed. Among the assessed models, the Random Forest model demonstrated the highest overall accuracy after applying SMOTE and feature selection. The research highlights the potential of machine learning, particularly ensemble methods like Random Forests, for automating software defect prediction in critical systems. By addressing trials such as class imbalance and feature selection, the performance of these models can be significantly enhanced. This study contributes to the rising field of machine learning applications in software engineering, providing insights and methods for improving the reliability and quality of software systems developed by NASA and other organizations working on mission-critical application.
Suggested Citation
Mahmoud Abdelmohsen Ahmed & Shahira Shaaban Azab & Hesham Ahmed Hefny, 2026.
"An -Improved Intelligent Model for Software Defect Prediction,"
Scientific Review, Academic Research Publishing Group, vol. 12(1), pages 6-22, 03-2026.
Handle:
RePEc:arp:srarsr:2026:p:6-22
DOI: doi.org/10.32861/sr.121.6.22
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arp:srarsr:2026:p:6-22. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Managing Editor (email available below). General contact details of provider: http://arpgweb.com/index.php?ic=journal&journal=10&info=aims .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.