Author
Listed:
- Yongtao Shi
- Yuefeng Zheng
- Xiaotong Bai
Abstract
Recently, hybrid feature selection methods have demonstrated excellent performance on high-dimensional data, but many of these methods tend to yield relatively homogeneous feature subsets. To address this, we propose a novel hybrid feature selection algorithm called the Hybrid Multiple Filter-Wrapper algorithm. This algorithm employs a dual-module structure: Module 1 utilizes the random forest feature importance method to achieve significant dimensionality reduction of the original feature set, resulting in the candidate feature subset F1. In Module 2, we first propose a bivariate filter algorithm: the minimum Spearman-Maximum Mutual Information method. This method assesses both the correlation and redundancy of F1, whose results are then fed into the wrapper algorithm for further exploration. Furthermore, we integrate two swarm intelligence algorithms to develop the Hybrid Grey Wolf and Chaotic Dung Beetle Wrapper Algorithm. This algorithm incorporates chaos theory to enhance the position update mechanism of the Dung Beetle Algorithm, then embeds Dung Beetle Algorithm into the Grey Wolf Algorithm, thereby balancing exploration and exploitation capabilities. Finally, a process optimization mechanism based on the theory of random laser intensity fluctuations dynamically monitors the optimization process. Upon convergence of the wrapper algorithm to a local optimum, the filter algorithm is restarted, and chaos theory is used to reset the population. This process enhances the diversity of both the candidate feature subset and the population, effectively avoiding local optima. We extensively compare our method with ten hybrid algorithms from the past three years across ten public benchmark datasets from MGE. Experimental results show that our algorithm outperforms the most other algorithms: on all datasets, it achieves an average classification accuracy that is at 1.3% least higher, an average feature subset length that is at least 8 units shorter, and a dimensionality reduced to less than 0.45% of the original. The results are statistically significant.
Suggested Citation
Yongtao Shi & Yuefeng Zheng & Xiaotong Bai, 2025.
"A multiple filter-wrapper feature selection algorithm based on process optimization mechanism for high-dimensional omics data analysis,"
PLOS ONE, Public Library of Science, vol. 20(12), pages 1-44, December.
Handle:
RePEc:plo:pone00:0338051
DOI: 10.1371/journal.pone.0338051
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0338051. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.