Authors
- Al Mahmud Siam
- Pankaj Bhowmik
- Md Palash Uddin
Abstract
Electronic payment methods are increasingly prevalent worldwide, facilitating both in-person and online transactions. As credit card usage for online payments grows, fraud and payment defaults have also risen, resulting in significant financial losses. Detecting fraudulent transactions is challenging because transaction datasets are highly imbalanced: fraudulent activities constitute only a small fraction of the data. To address this, we propose a novel hybrid feature selection framework designed to enhance the performance of machine learning models in credit card fraud detection. Our framework integrates three complementary feature selection techniques: Pearson correlation, information gain (IG), and random forest importance (RFI), each optimized for the dataset's characteristics. Pearson correlation eliminates redundancy by removing highly correlated features, while IG and RFI evaluate the relevance of the remaining features. A union operation combines the most informative features from these methods, ensuring comprehensive and efficient feature selection. To validate the proposed approach, we test it on five diverse datasets with varying characteristics and imbalance levels, employing five state-of-the-art machine learning algorithms: Random Forest (RF), Extra Trees (ET), XGBoost (XGBC), AdaBoost, and CatBoost. We propose this work primarily for PCA-transformed datasets, but to validate it we also apply it to a real-world dataset. The results demonstrate that our methodology outperforms existing baseline approaches, achieving superior fraud detection performance across all datasets. Our findings highlight the robustness and adaptability of the proposed framework, offering a practical solution for real-world fraud detection systems.
Additionally, we believe that our proposed framework can serve as a decision support system for the real-time detection of fraudulent credit card transactions, with the potential to make a substantial contribution to the business industry.
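The selection pipeline described in the abstract (a Pearson redundancy filter, then IG and RFI relevance rankings merged by a union) can be illustrated with a minimal sketch. This is not the authors' implementation. The 0.9 correlation threshold, the "keep the first feature of a correlated group" rule, the equal-width binning used for information gain, the top-k cutoff, the toy data, and all function names are assumptions made for illustration. The random forest importances are passed in as hypothetical precomputed scores rather than computed here.

```python
import math
from collections import Counter


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no linear relationship
    return cov / math.sqrt(vx * vy)


def drop_correlated(columns, threshold=0.9):
    """Keep a feature only if it is not highly correlated with one already kept.

    Assumption: the first feature of each correlated group survives.
    """
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(values, labels, bins=4):
    """Information gain of a numeric feature after equal-width binning (assumed)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 for a constant feature
    binned = [min(int((v - lo) / width), bins - 1) for v in values]
    n = len(labels)
    conditional = 0.0
    for b in set(binned):
        subset = [lab for v, lab in zip(binned, labels) if v == b]
        conditional += len(subset) / n * entropy(subset)
    return entropy(labels) - conditional


def top_k(scores, k):
    """Names of the k highest-scoring features."""
    return set(sorted(scores, key=scores.get, reverse=True)[:k])


def hybrid_select(columns, labels, rf_importance, threshold=0.9, k=1):
    """Pearson filter, then the union of the top-k IG and top-k RFI features."""
    kept = drop_correlated(columns, threshold)
    ig = {f: information_gain(columns[f], labels) for f in kept}
    rfi = {f: rf_importance[f] for f in kept}
    return sorted(top_k(ig, k) | top_k(rfi, k))


# Toy data (fabricated for illustration only): v2 is an exact multiple of v1.
v1 = [0.1, 0.2, 0.3, 0.4, 0.9, 1.0, 1.1, 1.2]
columns = {
    "v1": v1,
    "v2": [2 * x for x in v1],
    "v3": [3, 1, 4, 1, 5, 9, 2, 6],
}
labels = [0, 0, 0, 0, 1, 1, 1, 1]  # 1 marks a fraudulent transaction
# Hypothetical precomputed random forest importances.
rf_importance = {"v1": 0.3, "v2": 0.3, "v3": 0.4}

selected = hybrid_select(columns, labels, rf_importance)
print(selected)
```

In this toy run the redundant `v2` is removed by the correlation filter before either relevance ranking is computed, and the union keeps any feature that at least one ranking places in its top k. In practice the importances would come from a fitted random forest, and the threshold and k would be tuned per dataset.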
Suggested Citation
Al Mahmud Siam & Pankaj Bhowmik & Md Palash Uddin, 2025.
"Hybrid feature selection framework for enhanced credit card fraud detection using machine learning models,"
PLOS ONE, Public Library of Science, vol. 20(7), pages 1-34, July.
Handle:
RePEc:plo:pone00:0326975
DOI: 10.1371/journal.pone.0326975