IDEAS home Printed from https://ideas.repec.org/a/dbk/datame/v3y2024ip.420id1056294dm2024420.html
   My bibliography  Save this article

Novel HGDBO: A Hybrid Genetic and Dung Beetle Optimization Algorithm for Microarray Gene Selection and Efficient Cancer Classification

Author

Listed:
  • Vijaya Lakshmi Alluri
  • Karteeka Pavan Kanadam
  • Helen Josephine Vincent Lawrence

Abstract

Introduction: Ovarian cancer ranks as the seventh most frequently diagnosed cancer and stands as the eighth leading cause of cancer-related mortality among women globally. Early detection significantly improves survival rates and outcomes, highlighting the need for enhanced screening methods and increased awareness to facilitate early diagnosis and treatment. Microarray gene data, characterized by its high dimensionality, includes the expression levels of thousands of genes across numerous samples, posing both opportunities and challenges in the analysis of gene functions and disease mechanisms. Method: This paper presents a novel hybrid gene feature selection method called HGDBO, which combines the Dung Beetle Optimization (DBO) algorithm with the Genetic Algorithm (GA) to increase the effectiveness of microarray data analysis. The proposed HGDBO method utilizes the exploratory capabilities of DBO and the exploitative strengths of GA to identify the most relevant genes for disease classification. Experimental results on multiple microarray datasets demonstrate that the hybrid approach offers superior classification performance, stability, and computational efficiency compared to traditional and state-of-the-art methods. To classify ovarian cancer, Naïve-Bayes (NB) and Random-Forest (RF) classification algorithms were employed. Results and Discussion: The proposed Random Forest model outperforms the Naive Bayes model across all metrics, achieving better accuracy (0.96 vs. 0.91), precision (0.95 vs. 0.91), recall (0.97 vs. 0.90), F-1 score (0.95 vs. 0.91), and specificity (0.97 vs. 0.86). Conclusion: These results underscore the effectiveness of the HGDBO method and the Random Forest classifier in enhancing the analysis and classification of ovarian cancer using microarray gene data.

Suggested Citation

Handle: RePEc:dbk:datame:v:3:y:2024:i::p:.420:id:1056294dm2024420
DOI: 10.56294/dm2024.420
as

Download full text from publisher

To our knowledge, this item is not available for download. To find whether it is available, there are three options:
1. Check below whether another version of this item is available online.
2. Check on the provider's web page whether it is in fact available.
3. Perform a
for a similarly titled item that would be available.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:dbk:datame:v:3:y:2024:i::p:.420:id:1056294dm2024420. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Javier Gonzalez-Argote (email available below). General contact details of provider: https://dm.ageditor.ar/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.